Smita Joshipura

IRLS 501

Final Project

12/09/03

 

I give permission for my work to be published in the SIRLS LIS learning showcase.

 

Intellectual Organization of Information: Principles and Challenges

 

Abstract:

 

Information is a product of human mind and is meant for use by others for various purposes. Institutions like libraries, museums, archives etc. organizes information so that it can be retrieved easily and used effectively by everyone. The article discusses one of the most important retrieval tools, “Catalog” which is a collection of surrogate records. It is considered to be a mirror of the library since it reflects the holding of the library to its users. The Catalog is a vital access mechanism, which began centuries ago with handwritten entries of manuscripts. In 1970’s digital catalogs, also known as machine-based bibliographic records or Online Public Access Catalog (OPAC) emerged. The latter form has potential to overcome the limitations of the earlier forms of library catalog. The article highlights the first codification of library catalog rules by Charles Cutter as well as “Paris Principle” which has provided the solid basis for the development of International Cataloging code. It covers the core concepts of Cataloging including method of organizing the Cataloging/Metadata records, objectives, principles, rules, information organization processes like descriptive and subject cataloging as well as importance of relatively “new” descriptive catalog standard called Dublin Core (DC) and the concept of “Universal Bibliographic Control” (UBC) to the users. It also analyzes the relationship of Cataloging principles and rules as well as highlights the differences between papers based catalog with OPAC. Thus this article provides an overview of the growth of OPAC along with various organizing tools like Cataloging codes and standards, Authority files, Classification for subject description and access.

           

Finally, the article compares the functions of the digital libraries with the traditional library. It explains how the new techniques are more complex for the user as well as for the staff, who requires additional training, assistance and help.  The article summarizes the role of libraries and information professionals in a digital library environment, key challenges confronted by libraries and the future of Cataloging with respect to ongoing developments in digital technologies.

 

Introduction:

 

Information is all around us. It is a product of human mind, and is meant for use by others for various purposes. Productive transfer of information and absorption of the same by a user is dependent on intellectual organization of information. Information and knowledge are the lifeblood of society and to make it more useful to the society, it needs to be structured and organized. According to Konigner and Janowitz, 1995, “Information is only valuable to the extent that it is structured. Because of a lack of structure in the creation, distribution and reception of information, the information does not arrive where it is needed and therefore, is useless.” To make the information more valuable to the users and society in general, it needs to be organized systematically. There are various institutions like Libraries, Museums, and Archives etc., which organize information so that it can be retrieved easily, and used effectively by everyone. Moreover, there are various retrieval tools in the library like Bibliographies, Catalogs, Indices, Finding Aids, Registers, etc which help in retrieving recorded information. Retrieval tools are surrogate records, which provide description of individual pieces of information.

 

Catalog:

 

“Catalog” is a collection of surrogate records and is one of the most important retrieval tools. It is a “list of the documents in a library, with the entries representing the documents arranges for access in some systematic order”. (Rowley, P.246) It is very vital to create a catalog for organizing a larger collection, which helps in the effective retrieval process. Thus, Library Cataloging is a practice of organizing a collection of bibliographic items, which help to identify, locate, access and use of recorded information. It is considered to be a mirror of the library as it reflects the holdings of the library to its users.

 

In a library, each entry in the catalog is the representation of a bibliographic record. A bibliographic record is a transcription of the complete cataloging information for any item. The process of bibliographic record is:

·        To provide all the information necessary to describe an item accurately both physically and intellectually, which distinguish it from other items.

·        To provide its location in the collection.

 

Thus, in a Catalog any user can find two important pieces of information i.e. whether the library has the item required, and, if so, where it is located in the collection.

 

Functions:

 

Charles Ammi Cutter systematically defined the functions of the Catalog. Charles Cutter’s Rules for a Printed Dictionary Catalog (1876) is regarded as the first codification of library cataloging rules. This code is remarkable since it has covered not only the objectives (objects) but also the methods (means) of creating catalog entries to provide easy access of library materials. Cutter’s objectives and methods were:

 

Objects

 

1.      To enable a person to find a book of which, any one of the followings is known

·        (A) the author

·        (B) the title

·        (C) the subject

2.      To show what the library has

·        (D) by a given author

·        (E) on a given subject

·        (F) in a given kind of literature

3.      To assist in the choice of a book

·        (G) as to its edition (bibliographically)

·        (H) as to its character (literary or topical)

 

Means

 

1.      Author-entry with the necessary references (for A and D)

2.      Title-entry or title-references (for B)

3.      Subject-entry, cross references, and classed subject table (for C and E)

4.      Form-entry (for F)

5.      Giving edition and imprint, with notes when necessary (for G)

6.      Notes (for H)

 

This code has extended “Corporate authorship” and “Uniformity” in cataloging. He also made rules for “Subject Cataloging”, “Filing” of entries as well as “Description” of items. Cutter’s classic analysis is still widely accepted and these principles are the foundation of best cataloging practice.

 

Thus, according to Cutter’s objects, the Catalog acts as an efficient instrument and has provided the basis of the Cataloging functions such as:

1.      Identification (Cutter’s Object: to enable a person to find a book when a title or author or subject is known)

2.      Collocation: (Cutter’s Object: to show what the library has by a given author, on a given subject, and in a given kind of literature)

3.      Evaluation: (Cutter’s Object: to assist in a choice of a book as to the edition and as to it’s character (literary or topical)

4.      Location

 

 

Code of Cataloging:

 

In the 20th century, American Library Association (ALA) published a Code of Cataloging in two   parts, which was a complex code and raised criticism in the library world. The code was based on the Legalistic theory, the most important theory among other theories of cataloging like the Perfectionist, the Bibliographic and the Pragmatic. According to the Legalistic theory, there must be rules and definitions for issues as well as there must be authority to settle these issues. Due to a few weaknesses in this theory, Osborn criticized ALA code in 1941. Lubetzky again carried out a critical study of cataloging rules in 1951 and he submitted a draft code to an International conference on cataloging principles in Paris in 1961. The draft crafted by Lubetzky was accepted internationally and is also known as Paris Principles” or “IFLA (International Federation of Library Association) Principles” which has provided the solid basis for the development of International cataloging. In 1967, Americans and the British co-operatively published the new set of cataloging rules known as “Anglo-American Cataloging Rules” (AACR). It is a set of rules, for producing the descriptive and name-and-title access points part of a surrogate record of an information. In 1974, IFLA issued “International Standard Bibliographic Description” (ISBD).It is a standard for the international communication of bibliographic information and in 1978, AACR2 was published which incorporated ISBD.

 

Cataloging Process:

 

Over and above the functions of the library catalog, which was based on Paris Principles like identification, location, finding and collocation, “Bibliographic Control”, is equally important function of the library catalog. It is a process of describing the information and providing name, title, and subject access to the description. The process of bibliographic control includes Descriptive cataloging, Subject cataloging, Authority control and Physical Processing of an item.

 

                                   

 

Cataloging Process

                                                   

                        │─────────── │────────│                     

Descriptive Cataloging                                             Subject Cataloging

                                                                                            

                │─Description                                                         │─Classification

                                                                                            

                │─Choice of entry                                                  │─Subject heading

               

                │─Form of entry

 

 

Descriptive Cataloging is a process where the cataloger’s primary task is

1.      To describe and identify library material.

2.      To provide name and title access points to the bibliographic record.

 

                              Access Points

                                                                      Bibliographic Description

025.3          Kao, Mary Liu.                                                                        
 Cataloging and classification for library technicians / Mary L.  Kao. –

                           2nd ed.—New York: Haworth Press, C2001   

                       145 p.; 22 cm. – (Haworth cataloging & classification)

                        Includes bibliographic references (p.139-149) and index.

                       ISBN 0789010631 (pbk: alk.paper)

 

                        1. Cataloging—United States. 2. Classification- Books.                        
                        I. Title. II. Series.

 

There are various tools used to process the Descriptive Cataloging like AACR2R, 1998; Library of Congress Rule Interpretations (LCRI); Library of Congress Name Authority File (NAF) etc. Moreover, AACR2R is a set of principles used for descriptive cataloging.

 

Structure of a Catalog

 

To carry out these functions, the structure of the catalog should contain

·        At least one entry for each book cataloged, and

·        More than one entry relating to any book, whenever it is necessary from the user’s perspective or because of the characteristics of the book, for example when the author is known by more than one name or whether the name of the author is not on the title-page of the book or when there are several authors sharing the creation of the book.

·        The entries should be of different kinds such as main entries, added entries or references.

 

e.g.    Main entry record of a book:

 

658.054      O’Brien, James A., 1936-

O13c         Computers in business management: an intro-         

1979                      -duction / James A. O’Brien. – Rev. ed.-

Homewood, Ill. : R.D. Irwin, 1979

Xiv, 497 p. :ill. ; 25 cm. – (The Irwin Series in information and decision sciences)

bibliography: p. 488-489.

Includes index.

ISBN 0-256-02121-X

1.      Business-Data Processing. I. Title

 

Added Entry

 

BUSINESS-DATA PROCESSING

 

658.054      O’Brien, James A., 1936-

                        O13c             Computers in business management: an intro-     

1980                      duction / James A. O’Brien. – Rev. ed.-

Homewood, Ill.: R.D. Irwin, 1979

Xiv, 497 p.: ill. ; 25 cm. – (The Irwin Series in information and decision sciences)

Bibliography: p. 488-489.

Includes index.

ISBN 0-256-02121-X

Business-Data Processing. I. Title

 

The structure of a bibliographic record has two major components: Bibliographic Description and Access Points. Bibliographic Description contains description and identification of the item, and is divided into eight areas like title, edition, type of publication, etc. Once the Bibliographic Description is identified, cataloger provides access points to the bibliographic description. Names, titles, and name/titles are considered to be Access Points of an item. Mainly there are two types of Access points, Main Entry and Added Entries. Main Entry is the primary Access Points to a bibliographic record while Added Entries are secondary Access Points to the bibliographic record. There are also AACR2R principles based on Paris Principles for determining the Main Entry which are as follows:

 

·         Personal Main entry:

 

1.      When there is a single author, Main Entry is under a single author.

2.      If it is a collection of works by a single author, Main Entry is under single author.

3.      If 2 or 3 authors are responsible for the work, and if the principal author is indicated, Main Entry will be under the principal author.

4.      If the principal author is not indicated, the first name indicated will be considered as the principal author.

 

e.g.

Principal author indicated

 

Haynes, Connie.

    Speed, strength, and stamina: conditioning for tennis/ by Connie Haynes, Eve Kraft and John Conroy; illustrated by George James.

 

I. Kraft, Eve. II. Conroy, John, 1908- III. Title

 

·        Choice of Corporate body Main Entry: When the work involves corporate body, the Main Entry is under a corporate body i.e. when the work is the expression of the corporate thought or activity of the body (e.g. official reports etc.) (Taylor, P.116)

 

e.g.

 

AIAA Communications Satellite Systems Conference

   (7th: 1978: San Diego, Calif.)

   A collection of technical papers: AIAA 7th

Communications Satellite Systems Conference, San Diego, California, April 24-27, 1978.

I. Title.

 

·        Choice of Title Main Entry: Main Entry will be under the title when the personal author is unknown or work does not identify by corporate body or consists of collection of independent works by different persons.

e.g.

 

Europe reborn: the story of Renaissance civilization/

    Contributors, Julian Mates… [et al.]

I. Mates, Julian, 1927-

 

 

·        Choice of Main Entry under Uniform Title: When the work is published under a different title, Main Entry will be under the uniform title.

e.g.

 

Arabian nights. English.

   More fairy tales from the Arabian nights /

Edited and arranges by E. Dixon; illustrated by

J.D. Batten.

 

Over and above the Main Entry, other Access Points are called, “Added Entries”. “Added Entries” consist of personal name, personal name and title, corporate body, corporate body and title, uniform title, etc. These are the cardinal principles of Cataloging though “Main Entry” has raised controversy in modern times.

 

Subject Cataloging is a process where the Cataloger’s primary task is:

 

1.      To analyze an item for primary and secondary subject content.

2.      To assign standardized subject headings to classify the item by assigning standardized class number.

 

 

Classification (Numerical Subject Access)

025.3         Kao,Mary
 Cataloging and classification for library technicians / Mary L.  Kao. --   

                           2nd ed.—New York: Haworth Press 
 145 p.; 22 cm. – (
Haworth cataloging & classification)

                      Includes bibliographical references (p.139-140) and index.

          ISBN 0789010631 (pbk: alk. paper)

                      1. Cataloging—United States. 2. Classification—Books.

                        I. Title. II. Series.          

                                                Subject Headings (Verbal Subject Headings/Analysis)

 

Some of the Subject Cataloging tools are Library of Congress Subject Headings; Classification system like Library of Congress (LC) and Dewey Decimal Classification (DDC), Library of Congress Subject Authority File etc.

 

 

There are two processes involved in Subject Cataloging: Classification and Subject Headings.

Classification (Numerical Subject Access) scheme is an orderly distribution of the universe into groups of main subjects and classes. Some of the classification schemes commonly used among the libraries is Dewey Decimal Classification (DDC), Universal Decimal Classification (UDC), and Library of Congress Classification (LCC).

 

Subject Headings or Verbal Subject Headings/Analysis is a process of assigning Subject Headings of a document. To carry out this process a master list of preferred term is used to assign to the documents. For example, Library of Congress Subject Headings (LCSH) is one such list.

 

“Classification” is a very important function of Information Organization. Collection in libraries is arranged according to some system, and this arrangement is referred as, “Classification”. It provides a formal, and orderly access to the shelves. The ultimate aim of any classification system is to lead the user to the item required. It is a part of the process of Cataloging, which is usually the first process following receipt of the materials in the library. Thus, the Subject Cataloging is one of the processes of Information Organization, which can be carried out by Verbal Subject Heading/Analysis and Library Classification.

 

The principles of Subject Cataloging are based on Cutter’s rules of subject headings in a dictionary catalog. On the basis of identification and collocation functions for the library Cataloging, Cutter set forth his principles for subject entry which includes:

 

1.      The user and Usage: Based on the principles, the cataloger should consider the convenience of the user, and terms should be selected accordingly.

 

2.      Uniform and Unique Headings: Based on this principle, for a subject with more than one name e.g. Gas and Petrol, one term must be selected.  Under the principle of unique headings, some term should not be used for more than one subject, and classification must be made when the same term is used in different disciplines e.g. Pyrethrum (Insecticide) and Pyrethrum (Plant).

 

3.      Specific and Direct Entry: According to this principle, subject heading formed must be the most specific as possible e.g. American religions: Literary sources and documents will be under Religion and not under Literature or History.

 

4.      Consistent and Current Terminology: Based on this principle, heading should be both consistent and current.

 

5.      Cross References: Three types of cross references (Use, See also, General Reference) should be used.

·        Use term guides user to the preferred term e.g. ALA use American Library Association

·        See also: This type of cross reference guides user to broad term, narrow term, and related term

e.g. Access to Information

      Narrow term: Freedom of Information

       Related term: Information Literacy,

                             Scholarly Communication

·        The General Reference: It directs the user to a group or category of headings

 

Authority Control

 

Along with the Descriptive and Subject Cataloging, “Authority Control” is also an important process of Cataloging. It is concerned with the maintenance and application of standard access points of terms. Thus, its purpose is to standardize and control a library’s use of non-subject entries, subject headings, and their cross references. All Access Points need to be under Authority/Access control, and is considered to be one of the basic principles in Information Organization.

 

 

Major Development:

 

In 1960s two major developments took place for bibliographic control: Machine-Readable Cataloging record (MARC), and Online Computer Library Center (OCLC). MARC, a communication format was created at Library of Congress, which enables the machine/computer to read the bibliographic records. It has also resulted into “Metadata” record, which was previously known as bibliographic record. Metadata is a structured data about data. It organizes information and provides the efficient, accurate retrieval and precision of information to the user. A catalog record commonly known as bibliographic record is now known as metadata record or surrogate record. Thus, AACR2 (incorporating ISBD) and MARC are the content and encoding standards widely used in the libraries. They help in creating an important bibliographic retrieval tool-library catalog. Along with the development of MARC format, OCLC, which is the largest and most comprehensive bibliographic utility in the world, enables to start shared cataloging for its members. 

 

The concept of shared, online cataloging system led the development of standards for information exchange in the early 1970’s, such as ISO 2709 and the MARC format. Along with the standards and format the agreement on rules such as AACR and ISBD series, libraries became source of cataloging and metadata for the world’s information. With the development of the MARC format three major benefits for the libraries were identified.

1.      Libraries were able to reduce their amount of original cataloging as they started sharing records with each other.

2.      Vendors could create online catalogs.

3.      With the development of Z39.50 standard for the electronic sharing of data, users could search web-based library catalog from any part of the world. 

 

This development has resulted into the concept of “Universal Bibliographic Control” (UBC) to the users. The primary purpose of UBC was to organize and analyze documents. It refers to all possible means to have access to surrogate records for all the world’s information packages. It is a program under IFLA, which was expanded to UBCIM (Universal Bibliographic Control and International MARC) to facilitate the exchange of bibliographic records among the countries. It is a very important concept for bibliographers and catalogers, which prevents duplication of work; allow libraries to better share bibliographic resources and enables libraries to acquire reliable cataloging data.

 

Forms of Library Catalog:

 

The cataloging is a vital access mechanism, which began centuries ago with the handwritten entries of manuscripts. Later on, various forms of the Library Catalog emerged like Book, Card, and Computer Output Microform (COM). Catalogs, today, are held as a computer database; usually called Digital Catalogs or Machine based bibliographic records or Online Public Access Catalog (OPAC).

 

Online Public Access Catalog (OPAC):

 

OPAC is considered to be a new form of Cataloging. The records within the earlier Catalogs are arranged in some order to make it more usable. There were mainly two basic arrangements, classified and alphabetical order. As the filing of the Cataloging becomes more complicated due to increase in size, divided catalog came into existence. The divided catalog was easier and less expensive but with the development of OPACs in 1980s, the divided catalog was converted to online catalog. Online Catalogs are surrogate records, which are commonly encoded in MARC format, a standard that permits sharing among other institutions. Now, OPACs are predominant form of Catalog in the United States as well as in other countries. It has potential to overcome the limitations of the earlier forms of library catalog. The various features of OPAC are interactivity, flexibility in display, infinitely expandable and public. It can dynamically communicate with its users as well as can be responsive and informative at a given time to a given need. The access points can be added and redefined at the same time search, display, and support functions can be added or modified easily. It also allows linkage between other information systems. Thus, it has become a popular option for access to the resources of the collection of libraries, library consortia and other remote collections of information resources. Online Catalogs offer potentially better retrieval and display than card catalog.

 

Currently, OPACs are in the third generation. The first generation was created in 1980s, where card catalogs were automated e.g. DIALOG or MEDLINE. In 1992, the second generation of OPAC came into existence, where access points, search capabilities, and display options were improved. The third generation OPAC which began in 1996, has web interfaces, hypertext browsing, graphical user interfaces, and Z 39.50 compliant communication protocol. Thus, OPAC has facilitated rapid and effective access to the desired items in the collection where the user knows the author, title, or subject headings of the item. However, it has also been realized that the present generation of online catalog in most library do not provide topical or subject access to the collection.

 

 

 

 

Study on OPAC:  

 

The article on the “Research on the use of Online Catalogs and its implication for library practice” by David Lewis highlights studies of card catalog use in 1960s and early 1970s. The studies were compared with the use of OPAC studies and were concluded to create a more effective tool for library users. The Council on Library Research (CLR) in 1981-83 founded a nationwide study of online catalogs in Public, University and College Libraries. Some of the significant results were:

 

·        Users have a strong preference for OPAC.

·        Users adapt themselves to the capabilities and limitations of the OPAC.

·        Users need and would like to use better subject access.

 

The research on both card and OPAC concludes that

 

·        Users have little knowledge about the structure of the catalog and find it complex.

·        Inconsistencies in cataloging practices create confusion among users.

·        Users are successful in conducting known-item searches.

·        Most of the time, users have incomplete information while searching catalog, which creates more frustration.

 

Thus, the research on OPAC has provided wealth of information on user attitudes and behaviors. The Online Catalog is difficult to use because of their complex design for searching. The studies have indicated that user prefers online catalogs, but they show inadequacies in the areas of subject access. They would like to enhance the subject access. The Catalog, whether online or card, is considered to be a primary tool for finding information in the library collection. Though, OPAC overcomes the limitations of the earlier forms of Catalog, and provide more techniques for searching the data, it also adds a layer of complexity to the process, which directs the librarians to create a more effective tool for the library users.

 

Digital Library:

 

 Since 1990 the information world is expanding dramatically, and the use of Internet as well as the World Wide Web is becoming more popular among the users. The technological progress has changed the functions of traditional library, which has resulted into the birth of Digital Library, or Virtual Library. It has also impacted the changing role of libraries and librarians in organizing digital information as well as challenges associated with it. Librarians organize knowledge through the process of Subject Analysis and Cataloging. There is a major challenge to develop a consistent method, which uniquely identifies and retrieves the information in different formats.

 

Digital libraries focus more on access rather than collections. The organization of a digital library will reflect the user’s needs by providing support for searching and acquiring information. The new technique used is more complex to the user, and requires extra training, assistance, and help.

 

In the paper-based information environment, Information Organization is based on physical objects, while in networked digital world the information is virtually endless. It is overwhelming for the librarians and bibliographic catalogers to control the digital library. One of the major challenges in digital libraries is cataloging of constantly changing Internet resources. With the inception of the MARC record, librarians have created records faster. However, with the growth of web sites, users expect cataloging of electronic resources and websites. A major challenge in organizing and managing the websites has resulted into the new standard called, Dublin Core” (DC). It is the latest technique in information organization and considered to be new descriptive catalog standard that can be used for all types of materials. The new standard is more useful compared to MARC/AACR2 since it allows more flexibility in data elements. It is a “Creator” driven but at the same time the quality of Cataloging may become inconsistent as it does not have any rules for creation.  The following crosswalk would help to know the similarities and difference among the element found in both MARC/AACR2R and DC format.  

 

 

 

MARC/AACR2R

DC Elements

MARC is an encoding standard for AACR2R

It’s content has been encoded in HTML tags, XML, RDF etc.

It consists of 8 areas for identifying information package.