I give permission for my final project to be made available through the LIS Learning Showcase web server. Thanks.

Andrew Chanse
May 5th, 2004
Final--Option #2      

Bibliometric Study: Remote Sensing of Environment



1.  Introduction
 

            The journal Remote Sensing of Environment (0034-4257) began publication by Elsevier Inc. in 1969 under the currently used title.  Remote Sensing of Environment was first published on a quarterly basis, but is now published 20 times a year.

            The current editor-in-chief is M.E. Bauer.  W. Gregg currently serves as the Associate Editor.  The editorial board currently consists of  24 members; 20 members associate themselves with the United States, 2 with Canada, and one a piece for France and Australia.  Six of the editors (including the editor-in-chief and associate editor) are associated with government entities while the remaining editors are affiliated with an an academic institution.

            Elsevier identifies the journals audience as a wide variety of  researchers.  These researchers work in such fields as: agriculture, ecology, environment, forestry, geology, hydrology, meteorology, oceanography, and remote sensing.  The intellectual scope of the journal strives to serve the remote sensing community as a publisher of results on science, technology, theory and applications of remote sensing of the Earth's resources and environment.  Elsevier describes the journal as interdisciplinary because it publishes material on terrestrial, oceanic, and atmospheric sensing with an emphasis of biophysical and quantitative approaches to remote sensing. 

            Remote Sensing of Environment
is available in both print, micro, and electronic formats.  Current issues are also available in both print and electronic formats via subscription.  The Arizona State University library has a full run of the journal in a tangible format since it's commencement in 1969 through 2002.  Print copies are available for browsing for volumes 1-83.  Volumes 2-4 are also available on a micro format.  None of the print issues nor the micro format are available to circulate outside of the library.  ASU also has access to all volumes from volume 2 through the present issues of the publication in electronic format.  The electronic issues can be accessed from library terminals and by remote access by affiliates of Arizona State University.

Methodology

            Remote Sensing of Environment publishes articles, corrections, editorial materials, notes, and reviews.  Utilizing ISI's Web of Science and choosing the period of 1994-1998, the frequency of each of these document types was collected.  The most cited authors within the journal and the mean and median number of references per article within a single issue (Volume 66, Issue 1) were also recorded.  Lastly, the impact factor and immediacy index for 2002 were also determined using ISI's Journal Citation Reports.

2.  Findings 

Document Types

            According to Science Citation Index, the document types that Remote Sensing of Environment publishes are articles, corrections, editorial materials, notes, and reviews.  It should be noted in searching for "All Documents" for the year 1996, 97 total documents were returned instead of the 96 retrieved through searching by document types and adding the total.  A chart detailing the breakdown of such document types follows:

Table 2a
DOCUMENT TYPE
Articles
Corrections Editorial Materials
Notes
Reviews
Total Number of Documents Per Year
Number of Occurrences for 1998
97
3
0
0
1
101
1997
134
1
0
0
2
137
1996
95
1
0
0
0
96
1995
99
1
3
2
0
105
1994
125
0
3
3
0
131
Total Number of Occurrences of by Document Types for 5 Year Period
550
6
6
5
3
570=Total Number of Document Types for 5 Year Period

   
        The vast majority of document types published in Remote Sensing of Environment  as indexed by ISI is that of articles.  A curiously low occurance of corrections, editorials, and reviews occur over the analyzed 5 year period.  This calls into question the completenessof ISI's indexing.

Cited Authors

            The top five most cited article's for the 1994-1998 time period follow.  Succeeding the citations for each article is the number of citations the article received.

Holben, BN; Eck, TF; Slutsker, I; Tanre, D; Buis, JP; Setzer, A; Vermote, E; Reagan, JA; Kaufman, YJ; Nakajima, T;        
    Lavenu, F; Jankowiak, I; Smirnov, A. (1998).  AERONET - A federated instrument network and data archive for aerosol
    characterization.  Remote Sensing of Environment.  66(1): 1-16.          (315 times)

Field, CB; Randerson, JT; and Malmstrom, CM. (1995).  Global net primary production--combining ecology and    
    remote-sensing.  Remote Sensing of Environment.  51(1): 74-88.         (101 times)

Qi, J; Chehbouni, A; Huete, AR; Kerr, YH; Sorooshian, S. (1994).  A modified soil adjusted vegetation index.  Remote
    Sensing of Environment. 
48(2): 119-126.         (81 times)

Chen, JM and Cihlar, J.  (1996).  Retrieving leaf area index of boreal conifer forests using landsat TM images.  Remote Sensing
     of Environment. 
55(2): 153-162.        (77 times)

Carlson, TN and Ripley, DA. (1997).  On the relation between NDVI, fractional vegetation cover, and leaf area index. 
    Remote Sensing of Environment. 
62(3): 241-252.        (77 times)

            From the above articles, an author search was conducted within the 1994-1998 time period and also within the Remote Sensing of Environment journal in order to determine how many articles each author of each article has published.  A breakdown of the data follows in the table.

Table 2b
AUTHOR NAME
DOCUMENTS PUBLISHED (1994-1998) IN REMOTE SENSING OF ENVIRONMENT
ARTICLES PUBLISHED (1994-1998) IN REMOTE SENSING OF ENVIRONMENT
Holben BN
3
3
Eck TF
3
3
Slutsker I
1
1
Tanre D 4
4
Buis JP 1
1
Setzer A 1
1
Vermote E 1
1
Reagan JA
1
1
Kaufman YJ
4
4
Nakajima T 1
1
Lavenu F
1
1
Jankowiak I 1
1
Smirnov A 1
1
Field CB 4
4
Randerson JT 1
1
Malmstrom CM 1
1
Qi J 5
5
Chehbouni A 2
2
Huete AR 5
5
Kerr YH 2
2
Sorooshian S
1
1
Chen JM 2
2
Cihlar J 6
6
Carlson TN 2
2
Ripley DA 1
1

            According to the search conducted in Science Citation Index's database, the following authors have the most documents (the number of documents is in parenthesis following each author's name) published in Remote Sensing of Environment for the years 1994-1998: J Cihlar (6), J Qi (5), A.R. Huete (5), C.B Field (4), Y.J. Kaufman (4), and D Tanre (4).  The authors with the highest number of articles published in the journal is the same because the numbers did not change for any of the authors.  It should also be noted that variations of authors' names were not searched in the database.

Citivity

Volume 66, Issue 1 (October 1998) of Remote Sensing of Environment has 9 total documents in the issue. 

Table 2c
Article #
Number of Cited References
1
46
2
57
3
25
4
24
5
45
6
26
7
37
8
19
9
1

The mean number of cited references for 9 documents is 31.11.
The median number of cited references for the 9 documents is 26.

Journal Citation Reports

The impact factor of the journal for 2002 is 1.992.

The immediacy index for the journal in 2002 is 0.799.


Conclusion

            The number of articles published in Remote Sensing of Environment for the 1994-1998 period is very  inconsistent.  More specifically, the difference in the number of articles published between each successive year is staggering.  Between 1994 and 1995 there was a decrease of 26 articles--that's a 20% decrease between the two years.  The results are similar for the publication years of 1996 to 1997 and 1997 to 1998.  1996 to 1997 saw a 30 increase in articles published, while 1997 to 1998's number of articles published decreased again by 27%.

            The 5 most cited articles for the 1994-1998 period has a representative from each year analyzed.  The number of citations for each article would be fairly consistent, if it weren't for the most highly cited article (315 citations).  The high level of citations for this article could be due to the popularity or controversy surrounding the instrument the article discusses, AERONET. 

            No one author stands out with a exceedingly large number of articles published, so a "Matthew Effect" doesn't seem to exist for authors within this journal.
The citivity analysis for Volume 66, Issue 1 (October 1998) indicates a consistency in the citivity of articles published with a mean number of cited references for the 9 documents at 31.11 and the median at 26
.
            In conclusion, the amount of articles published on a yearly basis is not consistent for the journal. The large breadth of topics covered within the GIS literature, the diversity of authors and the consistent citability of the articles within issues all lend to the journals reliable production to the field of GIS.

3. Personal Bibliometric Profile

Methodology

            Using ISI's Web of Science and Howard White's (2001) study as a guide, data compromising Chehbouni's citation identity was collected.  Co-authors and cited authors were collected by conducting a search on ISI's Person Search for all articles authored by Chehbouni, marking the articles, and having an email sent to an email account in which the authors and cited references for each article were listed in the text of the email.  This data was then separately organized by author counts for co-authors and cited authors.  Citing authors' data was collected in a similar manner.  A search was conducted using ISI to find the articles in ISI's database that have cited Chehbouni's work.  These articles were marked and an email was sent that included every authors' name of each article.  Co-sitees were determined using Dialog and following the methodology laid out by (White 2001).

            A difference that exists in the data collected in this study is the counts for the cited authors.  This data is a count of each cited reference within Chehbouni's paper, not a count of the publications that name a particluar citee in their reference list--as the Dialog methodology set forth by White accomplishes.  This was done because an actual count of the times cited might better indicate the examined author's citation characteristics.  It should also be noted that adjustments were not made to correct author homilies--a limitation discussed further in this section.

Chehbouni's Bibliometric Profile

           
ISI indexes 40 articles in which Chehbouni was the author or co-author.  ISI also indicated there are a total of 355 articles that cite Chehbouni's articles.  Table 3a shows a breakdown of the top 30 entries for Chehbouni's co-authors, citing authors, cited authors and co-sitees.

Table 3a
Top Co-authors
Top Citing Authors (Citation Set)
Top Cited Authors (Citation Identity)
Top Co-sitees (Citation Image)


14 Kerr YH 
11 Lhomme JP
9 Goodrich DC
9 Qi J
8 Boulet G  
8 Dedieu G
8 Moran MS
7 Watts C 
6 Huete AR
6 Monteny B
5 Nouvellon Y
5 Lo Seen D
4 Cayrol P
4 Rodriguez JC
4 Shuttleworth WJ
4 Sorooshian S
3 Bonnefond JM
3 Chaouki J
3 Guy C
3 Hartogensis O 
3 Kerr Y
3 Klvana D
3 Lagouarde JP
3 Njoku EG 
3 Rambal S
3 Santiago F
3 Scott R
3 Troufleau D
2 Begue A
2 Braud I 

     
            35 Chehbouni, A
            30 Kustas, WP
            29 Moran, MS
            21 Kerr, YH

            18 Goodrich, DC
            16 Lhomme, JP
            13 Dedieu, G
            13 Humes, KS
            12 Chaouki, J
            12 Shuttleworth, WJ
            10 Qi, J

             9 De Bruin, HAR
             9 Huete, AR
             9 Schmugge, TJ
             9 Watts, C
             8 Braud, I
             8 Guy, C
             8 Ma, YM
             7 Boulet, G
             7 Klvana, D
             7 Menenti, M
             7 Monteny, B
             7 Norman, JM
             7 Nouvellon, Y
             7 Weltz, MA
             6 Cayrol, P
             6 Jackson, TJ
             6 Lo Seen, D
             6 Olioso, A
             6 Ottle, C


54 Chehbouni A
45 Shuttleworth WJ
44 Lhomme JP
37 Kustas WP
30 Moran MS
23 Choudhury BJ
21 Goodrich DC
18 Raupach MR
17 Huete AR
16 Nouvellon Y

15 Jackson RD
13 Monteith JL
13 Stewart JB
14 Kimes DS
12 Blyth EM
11 Avissar R
11 Brutsaert W
11 Goutorbe JP
10 Boulet G
10 Lagouarde JP
10 Noilhan J
10 Qi J
 9  Debruin Har
 9  Hall FG
 9  Kalma JD
 9  Taconet O
 8  Eagleson PS
 8  Kerr YH
 8  Mcnaughton KG
 8  Monteny BA


38 Kustas WP
35 Shuttleworth WJ
32 Lhomme JP
28 Aiche J
28 Grace JR
27 Moran MS
27 Sellers PJ
25 Bi HT
24 Raupach MR
23 Norman JM
21 Brutsaert W
20 Choudhury BJ
20 Yerushalmi J
19 Auissar R
19 Brereton CMH
18 Cai P
18 Hall FG
18 Stewart JB
18 Werther J
17 Troufleau D
16 Lagouarde JP
16 Lee GS
15 Horio M
15 Mori S
14 Debruin HAR
14 Goodrich DC
14 Mahrt L
14 Monteith JL
14 Rhodes MJ
13 Lbyth EM



            There are a number of glaring limitations to ISI coverage and the data mined from it.  For example, ISI databases draw citations only from a set of several thousand journals, the coverage for certain journals is only limited to a short time frame (an inconsistancy of coverage from journal to journal), and the entry of author names can be complicated if a single author is listed with multiple names (spellings, marriage, etc.).  As White (2001) notes, the problem can be nearly eliminated if a background of the field's culture is known.  If authors publish items in journals outside the set contained by ISI databases, those references will not be a readily retrievable part of that author’s citation identity.

            As expected, Chehbouni is the top cited author within the articles in which he was an author.  Chehbouni is also the biggest citer of his own work.  Again, this is not a big surprise.  Interestingly enough, Kustas, the next most frequent citer (set) of Chehbouni's work and the top co-sitee (image) of Chehbouni, does not make the top 30 list for co-authors.  In fact, according to ISI, Kustas has only co-authored one article with Chehbouni.  According to White, citation set, identity, and image lists commonly contain duplicate names and works, but generally have different frequencies in the rankings (White, 2001). The fact that Chehbouni is most commonly cited (excluding Chehbouni) by Kustas (citation set), most commonly co-cited with Kustas (citation image), and that Kustas is near the top of cited author's by Chehbouni indicates that the pair is probably engaged in similar research.

Frequency of Names within other Section

            
Table 3b shows how many times author's names appear in other sections.  Under each category a value is indicated of how many times authors in that set appeared in another set. 

Table 3b
Occurances of Author
Names in Other Sections
Top Co-Authors
Top Citing Authors (citation set)
Top Citing Authors (citation identity)
Top Co-sitees (citation image)
0
12
9
9
20
1
10
10
10
4
2
4
7
7
2
3
4
4
4
4

            Authors which appear in all sections do so near the top of each list.  This trend would indicate a tightly knit research community whose focus is narrow.  The section with the least correlation of top authors (those with the greatest amount of 0's or 1's) to the other sections is the citation image and the top co-authors.  The lesser correlation for top co-authors may indicate that there isn't  necessarily an obligation to cite someone just because they have worked with an author.  While the much lesser correlation for citation image may indicate the application of the research done by the author's into other fields. 

4.  Subject Keywords

Table 4
Article #
Subject Keywords Taken From ISI's KeyWords Plus
1
SKY BRIGHTNESS MEASUREMENTS, VAPOR COLUMN ABUNDANCE, WATER-VAPOR, OPTICAL-PROPERTIES, SUN PHOTOMETRY, AVHRR DATA, SOLAR, EXTINCTION, RADIATION, RETRIEVAL
2
SPECTRAL REFLECTANCE, TM DATA, LANDSAT, VEGETATION, INDEXES, SENSORS, BANDS, IMAGE
3
IMAGING RADAR OBSERVATIONS, FOREST BIOMASS, SAND DUNES, BACKSCATTER, MODEL, DESERTIFICATION, SENSITIVITY, IMAGES
4
REFLECTANCE SPECTROSCOPY, SPECTRA, LEAVES, DIFFERENTIATION, FLUORESCENCE, ALGORITHM, GREEN
5
SEA-SURFACE, GRAVITY-WAVES, OIL FILMS, SLICKS, SCATTERING
6
HIGH-RESOLUTION RADIOMETER, BIDIRECTIONAL REFLECTANCES, REFLECTION PROPERTIES, VEGETATED SURFACES, DERIVATION, CANOPY, MODELS
7
PHOTOSYNTHETICALLY ACTIVE RADIATION, CONTRASTING FIELD CONDITIONS, DRY-MATTER PRODUCTION, SOLAR-RADIATION, USE EFFICIENCY, SEMI-LEAFLESS, SPECTRAL REFLECTANCE, LIGHT INTERCEPTION, ABSORBED RADIATION, CORN CANOPIES
8
SOLAR-RADIATION MEASUREMENTS, EFFECTIVE PARTICLE RADIUS, MICROPHYSICAL PROPERTIES, OPTICAL-THICKNESS
9
[NO KEYWORDS GIVEN]

            A subject analysis of the subject keywords displays the breadth of the topics covered within the articles.  Little duplication of keywords between the articles can be identified.  The only duplication that occurs is of model/models.  There are a number of keywords that seem to describe a system or application of a system.  These keywords are: sky brightness measurements, AVHRR data, retrieval, reflectence spectroscopy, optical spectroscopy, sun photometry, vapor column abundance, TM data, LANDSAT, model, algorithm, high-resolution radiometer, solar-radiation measurements, dry-matter production, use efficiency, and effective particle radius.  The fact there are so many keywords that seem to be an application of a tool or system seems to indicate that Remote Sensing of Environment is a technically orientated journal.  This may have an effect on the frequency of co-authors, citation sets, citation identities, and citation images of authors that work, research, and write within the field of GIS because technical areas of study tend to have more teams or research groups that produce data than those of more general, individual studies.

5.  Essay

The On-line Medical Dictionary defines bibliometrics as “the use of statistical methods in the analysis of a body of literature to reveal the historical development of subject fields and patterns of authorship, publication, and use.”  Thus, bibliometric methods can provide a description of the publication patterns within a subject field or other body of literature.

Bibliometric methods such as those that analyze the publication history of a journal, or more accurately the journals within a subject area, are useful in determining the growth patterns of the subject area itself.  If patterns exist with many journals within a subject area such as gains in size or changes in scope throughout their history—assumptions can be made about the scope, growth, and importance of the subject area.  If many journals split or combine into different publications over a period of time, then assumptions can also be made about the narrowing nature of the subject area.

Analyzing the authors of a journal or even the authors within the specified subject area can be used to make similar comparisons and correlations within the data sets.  If many authors are published within the genre, a researcher could come to the conclusion that the field isn’t dominated by one group of elite people.  If papers are commonly written with a large amount of co-authors, assumptions can also be made about the socialization of the researchers in the subject area.  Many co-authors would indicate a large amount of teamwork or research teams within the subject area.  A good mix of different authors from different geographic areas would indicate that the journal and the subject area are of global concern.

Of course, there are limitations to bibliometric studies and their usefulness in gaining an understanding of the patterns of scholarly communications.  Maybe the biggest limitation to citation studies is that the thoroughness in the collection of data is limited to the database used to collect the data.  In an ideal situation, this could be looked at as sample data (much like the sample data collected for the Census), but the database representations for journals in relation to subject area and geographic affiliation (of both authors and country of publication) is not always a good representation of what is actually published.  Complete runs of journals are also not always available.  This lack of coverage can distort the actual publication history of the journal, authors, and publishing countries.  Correcting the distortion would be a lot of work if done hand.  Problems also exist with author names.  Misspellings, name changes, and the use or non-use of the middle initials can affect the retrieval results for the authors.  This is especially disadvantageous to women researchers who elect to take their husband’s last name at some point during their career as a researcher.  Multiple searches need to be done for those who are not consistently listed with the same name throughout their career.  Knowledge of the actual subject area is almost definitely needed to accurately perform the gathering of much of the bibliometric data.  By knowing the history and current culture of the subject area, the collector of data could make educated guesses as to the relevancy of some of these inconsistencies within the data.

Another potential problem with practiced bibliometric methods is that they study how many times an article is cited.  This count does not take into account how the author or article is cited.  If an article or author is cited in a negative manner (i.e. people disagree or refute what the author or study is reporting), then the actual importance or prominence of the author is misrepresented.  An author could have been cited hundreds of times for a piece of faulty information that other researchers are analyzing (and citing).  This bad citation could misrepresent not only the prominence of the author, but also effect the author’s citation identity, set and image—falsely grouping the author with (hopefully) more credible researchers.  This counting of citations does not even always accurately depict the importance of an author within a field.  There is the rare occasion in which someone as prominent as a Nobel laureate has a low amount of citations compared to a peer whose work wasn’t as important or influential (Cole 2000).  Although one would think this would change, post-honor, over time.

            Webometrics attempts to apply the principles of bibliometrics to the hyperlinks of the World Wide Web, in order to examine how the Internet and its pages are searched by users.  Currently, webometrics is capable of being useful in tracking many aspects of scholarly communications.  Webometrics can track the popularity or unpopularity of particular links.  This tracking could indicate subject trends in research--this would allow for certain hot topics to be expanded.  For academic journals, the editors may look at this data and solicit (or at least pay closer attention to) papers related to this area.  This would allow for a more user-centered demand publication.

            Webometrics is also subject to some of the same reliability risks as traditional bibliometric methods.  Coverage would be a major concern to those conducting webometric studies.  In the case of the web, different search engines do not index the same pages.  In addition, different engines need to be searched differently (Borgman 2000).  This is much like the coverage and author issues that exist in searching ISI, except ISI dilemmas would seem to be easier to fix.

            It seems for webometrics to be really useful in gathering information about patterns of scholarly communications; it needs to be invasive upon the users utilizing the resources.  The gathering of usage logs is useful in measuring the traffic of a website, but doesn't give much insight towards the understanding of the patron’s behavior patterns.  This type of information gathering could be comparable to gathering in house use statistics within a library--just because a book is out of place (on a table, on a re-shelving cart, etc.) doesn't mean it has been utilized.  The book could have actually been used as a prop for a laptop.  Future webometrics data could track a multitude of behaviors such as whether or not the accessed resource was saved to a disk, printed from a workstation, emailed, or if nothing at all was done to the resource after it was accessed.  Furthermore, it could show for how long the resource was accessed.  The length of access could show that the scholar read the article at the time and place of access.  The use of monitoring tools could provide such details by gathering the necessary webometric data.

            This sort of data--how people physically use the information they access could then be examined in correlation and/or comparison with citation data.  Researchers could examine if the information accessed is the same information that is cited within the data's literature.  Researchers would then be able to better assess what kind of literature is used as merely a jumping off point for scholarly studies and what kind of resources are utilized as more of the backbone of the literature.

            The digital environment provides for many opportunities to apply bibliometric and webometric methods so a better understanding of scholarly communication can be attained.  These methods and studies all have their limitations in accuracy and reliability.  It is important to consider these limitations while gaining the unique insight that these methods have to offer.



References

Borgman, C.L. (2000).  Scholarly communication and bibliometrics revisited.  n B. Cronin & H.B. Atkins (Eds.).  The Web of Knowledge: A Festschrift in Honor
             of Eugene Garfield (pp143-155).  
Medford, NJ: Information Today.

Cole, J.R.  (2000).  A short history of the use of citations as a measure of the impact of scientific and scholarly work.  In B. Cronin & H.B. Atkins (Eds.).  The Web
            of Knowledge: A Festschrift in Honor of Eugene Garfield
(pp. 281-300).  Medford, NJ: Information Today.

White, H. D. (2001). Authors as citers over time. Journal of the American Society for Information Science and Technology, 52         (2), 87-108.