Integration and Implementation Sciences Thesaurus

A developing vocubulary for an emerging specialization
Caryn Anderson and Kyle Nicholls

 
 
 

Annotated Bibliography

On Thesaurus Construction and Indexing Languages

Aitchison, J., A. Gilchrist, and David Bawden. 1997. Thesaurus construction and use: A practical manual. London, England: Aslib.
• This book is the current bible for thesaurus construction. Incorporating all the updated relevant standards (ANSI, BSI, ISO, NISO), Aitchison’s book is designed as more of a “workbook,” outlining a step by step process that is easy to follow. It will remain always open as a ready reference point throughout the project.

Aitchison, Jean. 1969. Thesaurofacet: A thesaurus & faceted classification for engineering & related subjects. Leicester, England: English Electric Co. Ltd.
• Aitchison's seminal work product for the electrical engineering field is the key reference example for integrating a faceted classification scheme with a thesaurus (rather than just attaching an alphabetical index to the classification scheme). While some elements of faceted classification were valuable for the IIS Thesaurus, the development of a full scheme is beyond the scope of this project in Spring 2004. This reference will be of greater use in later phases of the project.

American National Standards Institute. 1980. American National Standard Guidelines for Thesaurus Structure, Construction, and Use. New York:
• This resource is relevant and demands inclusion, but is in essence covered by Aitchison, Gilchrist and Bawden.

Anderson, James D., Jose Perez-Carballo. 2001. The nature of indexing: How humans and machines analyze messages and texts for retrieval. Part I: Research, and the nature of human indexing. Information Processing & Management 37: 231-254.
• The most useful part of this article was the summaries of the views of leading experts on the human analysis skills of indexing. Scholars reviewed included Mulvaney, Chan, Fugmann, Soergel, Lancaster, Fairthorne, Wellisch, Wilson, Taylor, and Hjxrland. This outline provided a defining fence around the slippery nature of indexing and conceptual analysis. As the process of indexing is the action served by the thesaurus tool, it was valuable to obtain a general sense of the nature of the activity.

Bates, Marcia. 2002. After the dot-bomb:getting Web information retrieval right this time. First Monday 7, no. 7: Internet on-line. Available from <http://firstmonday.dk/issues/issue7_7/bates/index.html> [8 February 2004].
• By identifying what went wrong in information retrieval in the early, heady days of the Web, Bates lays out a map of the mine field for anyone constructing an information retrieval system. The reverse of her list of pitfalls constitutes a useful checklist: 1. Use faceted classification, not hierarchical; 2. Forget “ontologies” – they mislead into believing things can be organized definitively; 3. Roget’s is a stylistic writer’s thesaurus, info retrieval thesauri have different purposes and must be structured differently; 4. Bradford distributions/power laws operate in full force in info retrieval – prepare for and work with them; 5. Design for scalability or you will be sorry; 6. User interface design of indexing tools is as important as (or more than) the structure/logic of the tool itself; 7. Take advantage of existing experts – rest assured, every wheel has been invented – you are only making a new kind of cart. Good advice, all of it.

Batty, D. 1998. WWW - wealth, weariness or waste: Controlled vocabulary and thesauri in support of online information access. D-Lib Magazine. (Electronic version available via Simmons Libraries)
• Written as an introduction to controlled vocabulary (and as an advocacy piece for controlled vocabulary on the Web) the treatment is somewhat cursory, but it is a clear, quick reference and reminder. In particular, Batty reminds us that although modern keyword searching seems wonderful, the increasing volume of items, and therefore the increasingly overwhelming retrieval results, demand a return to controlled vocabulary.

Berners-Lee, Tim, James Hendler, and Ora Lassila. 2001. The Semantic Web. Scientific American 284, no. 5: 34-43 Av. Available from <http://www.sciam.com/article.cfm?articleID=00048144-10D2-1C70-84A9809EC588EF21> [8 February 2004].
• It appears that this article is cited frequently less because of its direct application to the various subjects within which it is referenced, and more because it serves as an outstanding point on the horizon against which all information retrieval development activities can navigate and measure their progress. It is almost a little reminder to developers, "just remember, whatever you do is ultimately going to have to fit with this model of the world..." Useful vision, if not specifically relevant to the heavy lifting in thesauri construction.

Booth, Pat F. 1987. Thesauri: Their uses for indexers. The Indexer 15, no. 3: 141-144.
• This article expands awareness about the variety of uses of a thesaurus. Booth reminds us that thesauri are useful for: 1. subject familiarization (Cliff notes for self-education); 2. meaning and authenticity checks (better than dictionaries because provide hierarchical context and more frequently updated because in active use); 3. vocabulary control (associated with indexing activity-most commonly understood use), and; 4. standard vocabulary for successive indexes (quality control). It is valuable to keep these four applications in mind in order to create a fully developed, mature and effective tool. These applications are also good measures for evaluation.

British Standards Institution. 1987. Guidelines for the Establishment and Development of Monolingual Thesauri. London
• This resource is relevant and demands inclusion, but is in essence covered by Aitchison, Gilchrist and Bawden.

Cleveland, Donald B., and Ana D. Cleveland. 2001. Introduction to Indexing and Abstracting. Littleton, CO: Libraries Unlimited.
• Situated as it is within a volume focused primarily on activities rather than tools, the chapter on Vocabulary Control is introductory in nature yet surprisingly comprehensive and dense. Not only does it define and describe thesauri, but it does a good job of outlining construction procedures, including evaluation. It is most valuable for it’s easily comprehensible lists (checklists in some cases) describing characteristics, activities, rules, and criteria. A good reference document for ongoing guidance.

Craven, Tim. Freeware. 21 January 2004. Internet on-line. Available from <http://publish.uwo.ca/~craven/freeware.htm>. [8 February, 2004].
•This freeware was very easy to download and quick to learn. The interface/display is not fancy, but is very simple to use, with simple instructions and help. The fact that it is free allows for the option of users of the IIS thesaurus to actually download the software and the IIS Thesaurus file for easier access than scrolling a static Web page.

Craven, Tim. Thesaurus Construction [Tutorial]. 19 February 2002. Internet on-line. Available from <http://instruct.uwo.ca/gplis/677/thesaur/main00.htm>. [8 February, 2004].
• This was the first resource consulted for this project. It appeared deceptively simple at the time, but after further review of more complex documents, it is clear that this resource is an outstanding introduction and a good reference when more complex literature and/or activity generate theoretical or logical confusion.

Foskett, A.C. 1996. The subject approach to information. London, England: Library Association.
• Foskett’s book is a thorough textbook on subject access but does not provide the in-depth coverage of thesaurus construction that other dedicated resources have. Nevertheless, the chapters on “Assigned Indexing” cover the key theoretical/linguistic concepts of semantic and syntactic relationships.

Foskett, D.M. 1980. Thesaurus. In Encyclopedia of library and information science. New York: Marcel Dekker.
• This encyclopedia entry provided a useful overview of the nature, structure and operation of thesauri. Of greatest interest were Foskett's seven major purposes of a thesaurus which were used to reinforce the evaluation design of this project and were particularly relevant to IIS. Also of use was a visually effective chart outlining the conditions under which thesauri terms should be in the singular or plural form. A good checklist for this component of thesaurus construction.

Gilchrist, A. 2003. Thesauri, taxonomies and ontologies - an etymological note. Journal of Documentation 59, no. 1: 7-18.

Haas, Stephanie W. 1996. Sublanguages and the automatic identification of sublanguage terms. In Encyclopedia of library and information science.
• Haas focuses primarily on research to automatically identify sub-language terms for specific domains, but the value of this article to this project is as a reminder to pay special attention to the unique “jargon” of the field and identify the boundaries between the sub-language and the parent domain of which it is a part. Additionally, it is useful to note the various types of terms including those that are invented and those situations where “the meanings of existing words may be modified to become more specialized, such as the world linear. In general language, it describes something having the characteristics of a line; in mathematics, it refers to a particular kind of function; and in botany, it describes a specific arrangement of pollen grains.” It is through the hierarchical and faceted relationships of the thesaurus that these invented terms and modified commons terms will define their significance in the sub-language of IIS.

International Organization for Standardization. 1986. Guidelines for the Establishment and Development of Multilingual Thesauri. Geneva.
• This resource is relevant and demands inclusion, but is in essence covered by Aitchison, Gilchrist and Bawden.

Jones, S. 1993. A thesaurus data model for an intelligent retrieval system. Journal of Information Science 19: 167-178.
• Not only does this article outline the structure of thesauri in a way that is easily understood by a student of Database Management (complete with Entity-Relationship diagramming), but it provides an outstanding framework for integrating the thesaurus into the database containing all of the Integration and Implementation resources. This document will continue to serve as an active reference tool like Aitchison’s Thesaurus construction and use.

Krooks, David A., F.W. Lancaster. 1993. The evolution of guidelines for thesaurus construction. Libri 43, no. 4: 326-342.
• It was expected that this article might emphasize particular elements of the thesaurus construction process by identifying when and why they entered the “standards.” The article was more concerned, however, with the individuals and organizations involved and provided a much more historical than functional view of the standards. Students pursuing doctoral work specifically in thesaurus construction will no doubt be interested, and the article deserves a place in this Bibliography, but the work will not play an active role in this project.

Lancaster, F.W. 1998. Indexing and abstracting in theory and practice. Champaign, IL: University of Illinois, Graduate School of Library & Information Science.
• In this volume Lancaster is primarily concerned with the theoretical, philosophical and practical elements of indexing and abstracting activities and considers thesauri only in their role as a tool. The value of this resource lies in its full treatment of the challenges of indexing, thus providing a context to hold in mind during the tool-building process.

Lancaster, F.W. 1986. Vocabulary control for information retrieval. Washington, DC: IRP.
• This volume is a critical resource for thesaurus development treating all elements of construction carefully and thoroughly. It is an older kin to the Aitchison, Gilchrist and Bawden bible, though certain sections stood out as more memorable or useful. The discussion of gathering raw material and term selection based on both literary and user warrant provided some key direction in the early stages of this project. The section on Vocabulary Failures provided a reasonably extensive treatment of the term choices that cause failures and how to balance the complex elements involved in term selection. Highly valuable was Lancaster’s discussion of quantitative evaluation measures including: equivalence ratio, reciprocity ratio, definition, flexibility, pre-coordination level, and size of term groups. Some of these measures are included in the evaluation design for this project.

Michel, Dee. Appendix A: Report to the ALCTS/CCS Subject Analysis Committee July 1996.Internet on-line. Available from <http://www.ala.org/ala/alctscontent/catalogingsection/catcommittees/subjectanalysis/subjectrelations/appendix.htm>. [8 February, 2004].
• This Web page serves primarily as an explanatory and contextual note for Dee Michel’s Taxonomy of subject relationships (listed below). In and of itself it did not contribute much to the project but must be listed independently to reflect it’s unique URL.

Michel, Dee. Appendix B (part 1): Taxonomy of subject relationships.Internet on-line. Available from <http://archive.ala.org/alcts/organization/ccs/sac/msrscu2.pdf>. [8 February, 2004].
• This document is an un-annotated list, presented with BT, NT, RT, and USE references, of Michel’s analysis of the many different types of subject relationships. It is fascinating, and great risk is run that many hours will be wiled away using Michel’s taxonomy to investigate, ponder and debate the nature of unique relationships between individual terms as this IIS thesaurus is constructed. This list may prove useful since Integration and Implementation Sciences is a field characterized by the re-combination of existing philosophies, principles and practices in different relationships. It is anticipated that Michel’s relationship definitions will provide the foundation of highly instructive scope notes as well as facilitating more effective use of facets and BT, NT, RT for IIS terminology.

Michel, Dee. Appendix B (part 2): Taxonomy of subject relationships.Internet on-line. Available from <http://www.ala.org/ala/alctscontent/catalogingsection/catcommittees/subjectanalysis/subjectrelations/appendixbpartii.htm>. [8 February, 2004].
• This page represents the short list of subject relationships (no BT, NT, RT, etc.), as identified by Michel, grouped generally into the familiar key categories of Associative, Equivalence and Hierarchical relationships.

Middleton, Michael. Controlled Vocabularies. Queensland University of Technology. Internet on-line. Available from <http://sky.fit.qut.edu.au/~middletm/cont_voc.html>. [2 March, 2004].
• This is an amazing gateway providing a fabulous first step in the journey into thesaurus construction. Resources are arranged into the following groupings: Thesaurus building & use guidance, Thesaurus online sites, Thesaurus software, Databases of thesauri, Bibliography, Classification schemes online, and Website compilations from elsewhere. In addition to Candy’s pages and the Will’s (see below), it is an excellent place to start for beginning students of thesaurus construction.

Milstead, Jessica L. About Thesauri. 2000. Internet on-line. Available from <http://www.bayside-indexing.com/Milstead/about.htm>. [8 February, 2004].
• Milstead succinctly hits all the key points a key decision-maker would want to know about thesauri. The page appears designed for the person that has had someone say to them, “You need a thesaurus…” and then says to themselves, “What? Why? How? Ugh.” Once they’ve hit this page they will move on to other resources, but Milstead excels at outlining the key elements of the essential processes. The software links she provides are not relevant to this project, but the process outlines are a good reference for a double check in the development process.

Milstead, Jessica L. 1993. Thesaurus managment software. In Encyclopedia of library and information science.
•Once again, Milstead achieves maximum density. While the purchase of thesaurus software is not a part of this project, Milstead's checklist of things to consider when purchasing thesaurus software forms an outstanding outline of things to remember in the construction process.

Milstead, Jessica L. 1984. Subject access systems: Alternatives in design. Orlando: Academic Press.
• This volume treats subject access from the whole systems perspective. Milstead discusses thesauri construction, classification, indexing and cataloguing issues in turn, but her entire effort considers the complex and interconnected affects of one on the other with the intent of providing the reader/developer with the ability to make informed decisions in their design process. As she sets out in her introduction, “This work treats the design of a system for subject access as a set of interacting choices, describing the choices together with the ways in which one choice may constrain others. Every aspect of a subject access system is the result of a choice between alternatives.” This resource is being read cover to cover as the thesaurus construction process evolves to ensure an honorable product with healthy karma.

National Information Standards Organization. Guidelines for the Construction, Format and Management of Monolingual Thesauri. 2003. Internet on-line. Available from <http://www.niso.org/standards/resources/Z39-19.pdf>. [8 February, 2004].
• This resource is relevant and demands inclusion, but is in essence covered by Aitchison, Gilchrist and Bawden.

Pollard, Richard. 1993. A hypertext-based thesaurus as a subject browsing aid for bibliographic databases. Information Processing & Management 29, no. 3: 345-357.
• The title of this article is precise. Pollard discusses the advantages of using a hypertext-based thesaurus to provide structural cues to a subject domain, improving user browsing experiences. The article is fascinating, but slightly beyond the scope of this project as it currently stands. It is anticipated that this resource will be invaluable at Phase Two of this thesaurus where the focus shifts to interactive Web-based user interfaces for the product.

Priss, Uta, Elin Jacob. 1999. Utilizing faceted structures for information systems design. In ASIS '99: Proceedings of the 62nd ASIS Annual Meeting203-212.
• Priss and Elin have written directly to the broader endeavor of which this thesaurus construction project is a part – a full database, and ultimately dynamic Web site, linking all the elements of the budding Integration and Implementation Sciences field. “This paper argues for the use of a faceted thesaurus as the basis for organizing a small-scale institutional website.” While IIS aims to be large-scale in the future, and the paper is by no means comprehensive, the proposals for organizing information set forth here are very useful.

Ross, Jan. 2000. A new way of information retrieval: 3-D indexing and concept mapping. Learned Publishing 13, no. 2: 119-123.

Schwartz, Candy. 2001. Sorting out the Web: Approaches to subject access. Westport, CT: Ablex.
• Controlled vocabularies form a small component of this volume, but the components of an indexing language were laid out very clearly and concisely for easy review. Notably, Schwartz clearly includes the component of "Application Policies" - a subject absent from, or hidden in, many other discussions of controlled vocabulary.

Shiri, A.A., C. Revie. 2000. Thesauri on the Web: current developments and trends. Online Information Review 24, no. 4: 273-280.
•This brief article provides a reasonable assessment of the state of thesauri on the Web in 2000, and adequately identifies future activities, including the next stage of thesauri standards development: standards for electronic thesauri. There was little here, however, to contribute additional knowledge to this project. To be fair, it does serve as a relevant milestone and reinforces knowledge gathered elsewhere and is thus included.

Smith, Linda C. 1992. UNISIST revisited: Compatibility in the context of collaboratories. In Classification research for knowledge representation and organization. Amsterdam: Elsevier.

Soergel, Dagobert. 1974. Indexing languages and thesauri: Construction and maintenance. Los Angeles: Melville.
• Ugh. This was the bible in 1974, and remains a seminal reference in any article or volume discussing thesaurus construction. Soergel is comprehensive, detailed and honorably precise. Unfortunately, the sheer size and scope of this volume makes it an unwieldy resource for this project. It will be used primarily as a reference source to double check points of debate and to ensure that critical activities are not overlooked. In this regard, the inclusion of both a condensed and exploded Table of Contents, as well as an Index, is extremely valuable.

Svenonius, Elaine. 1990. Design of controlled vocabularies. In Encyclopedia of library and information science.
• Svenonious considers the myriad details of the narrow activity of term selection and relationship definition in this encyclopedia article. Her treatment of Vocabulary, Syntax and Semantics provide detailed perspectives of well-covered ground. Of particular interest is her unique coverage of issues like scholarly usage, context independence, structural warrant, and her thorough discussions of binding vs. synthesizing syntax and category vs. referential vs. relational semantics. This is the document for the deep logic of it all - though not quite so voluminous as Soergel.

Taylor, Arlene G. 2004. The organization of information. Littleton, CO: Libraries Unlimited.
• Taylor's treatment is basically introductory, as a chapter within a book with a much larger subject. The sub-section on the Mechanics of Controlled Vocabularies, however, provided a diverse treatment of the various types of relationships. She covers the traditional whole/part, genus/species, class/member, topic/instance types of relationships most commonly found in thesauri. She also describes relationships as outlined in WordNet (synonyms, coordinate terms, hypernyms, hyponyms, holonyms, meronyms, and antonyms), and in the OWL Web Ontology Language (subClassOf, one of, equivalentClass, intersectionOf, disjointWith). The juxtaposition of these different articulations of relationships complicates the subject while simultaneously illustrating the need for clarity and commitment in determining how relationships will be expressed in a newly constructed controlled vocabulary.

Vickery, B.C. 1997. Ontologies. Journal of Information Science 23, no. 4: 277-286.
Weinberg, B.H. 1995. Library classification and information retrieval thesauri: comparison and contrast. Cataloguing & Classification Quarterly 19, no. 3/4: 23-44.
• There is little of direct practical application in this article effectively outlining the convergent course of thesauri and classification, particularly in the Web environment. However, Weinberg highlights a critical point. Greater classification structure in thesauri will enable them to serve as more effective tools at the searching level (as opposed to just the indexing level). This is an important observation to keep in mind while designing a new thesaurus for a growing field.

Wellisch, Hans H. 2000. Glossary of terminology in abstracing, indexing, classification, and thesaurus construction. Medford, NJ: Information Today, in association with the American Society of Indexers.
• This thin volume is an important reference work in deciphering much of the sub-language pervasive in the literature on thesaurus construction. Not an especially active resource for this project, but a representative one that must remain on the list.

Wellisch, Hans H. 1995. Indexing from A to Z. New York: H.W. Wilson Co.

Will, Leonard D., Sheena E. Will. Thesaurus principles and practice. 13 February 1998Internet on-line. Available from <http://www.willpower.demon.co.uk/thesprin.htm>. [8 February 2004].
• This introduction to thesauri is similar to that of Tim Craven, but perhaps even more focused on the beginner. One advantage of this tutorial, however is that all chapters are on one page and are more easily scrolled, linked and printed. While it had little additional instructional value for this project, it remains in the bibliography as a good introductory reference. And it does provide some introductory treatment of thesauri principles for museum objects – a topic not always addressed by other treatments.

Will, Leonard D., Sheena E. Will. Publications on thesaurus construction and use. 13 August 2003Internet on-line. Available from <http://www.willpower.demon.co.uk/thesbibl.htm>. [2 March, 2004].
• The Wills maintain quite a phenomenal gateway bibliography of resources grouped according to the broad categories of thesaurus structure and use, facet analysis, search interfaces that allow for combining terms from multiple facets of a thesaurus, lists of thesauri, taxonomies, ontologies, and topic maps. Many of the resources reviewed for this project, and included in this Bibliography, are listed and/or linked from the Will’s Publications page. Neck and neck with Michael Middleton for coverage (and they link to each other).

Williamson, Nancy J. Deriving a thesaurus from a restructured UDC. In 4th International Conference of the International Society for Knowledge Organization370-377.
• This article is useful for the phase of the project concerned with developing classification. Some classification schemes are being reviewed in relation to the key concepts of Integration and Implementation Sciences. If there is a classification scheme that serves the field more effectively than the hierarchical structure of a particular thesaurus, it may prove effective to derive some terms from the classification. At that point, this resource will prove its value.

Zeng, Lei. 1993. Compatibility of indexing languages in an online access environment: A review of the approaches. Advances in Classficiation Research 3: 161-181.
• Zeng discusses the importance of compatibility and interoperability among thesauri in anticipation of the meta-searching systems that are just now becoming a prevalent reality. In searching automatically across databases, discrepancies and differences in indexing terms can cause both precision and recall problems. While Zeng investigates this issue in far more detail than is necessary for this project, compatibility and interoperability are important reasons why this project is structured to make the greatest use of existing thesauri prior to developing new terminology. It is hoped that existing terminology may be able to be “re-used” by establishing different relationships that will make the terms relevant for IIS. Zeng’s analysis may prove helpful as this work comes to the fore.

Top

On Faceted Classification and Classification Theory

Ranganathan, S.R.. Elements of Library Classification. 3rd ed. New York: Asia Publishing House, 1962.
• This book is based on a series of Ranganathan lectures on classification. The first part of the book focuses on general classification using an example of 108 “Subjects in Unhelpful Alphabetical Sequence”. These 108 subjects are analyzed and grouped according to Ranganathan’s Canons of Classification. This is an excellent introduction to developing a classification scheme, but can be difficult to decipher at times because of the syntax of the language. The lectures on facet analysis are divided into four parts: facet and focus, fundamental categories, rounds, and levels. The five fundamental categories (personality, matter, energy, space and time) are applied in facet analysis examples where facet symbols are applied to 18 subjects. This was particularly useful in learning how to analyze and deconstruct a complex subject.

Classification Research Group. “The Need for a Faceted Classification as the Basis of All Methods of Information Retrieval.” Theory of Subject Analysis: A Sourcebook, edited by Lois Mai Chan, Phyllis A. Richmond pp 154-67 Littleton, Colo Libraries Unlimited, 1985
• This is an interesting report presented by the Classification Research Group (CRG) in the early 1950’s. It focuses on the need for improved classification systems to aid in two steps of information retrieval: identification of the search subject, and locating the subject through a referral to one or more subjects. The CRG argues that generic relations cannot be displayed in enumerative tree structures and that traditional schemes are rigid and inflexible. It is suggested that faceted classification schemes are the best mechanism to allow given genus to be subdivided in more than one way allowing for flexibility and a better understanding of generic relationships. The report goes on to identify ten problems with classification schedules, including misplacement of subjects, distorted relationships between terms, insufficient detail and lengthy and complex notation. This report defines reasons for using faceted classification schemes, but does not go into much detail about the construction of such schemes.

Vickery, B.C. Faceted classification: a guide to construction and use of special schemes, prepared for the Classification Research Group. London, Aslib, 1960
• Vickery, a member of the Classification Research Group, provides a solid introduction to faceted classification schemes. The book includes a history of facet analysis, indexing term definitions, as well as examples of facet construction, and comparison of faceted classifications. Vickery uses interesting examples to demonstrate the creation of facets, the ordering, and the notation. The book also focuses on how systems retrieve information using pre-coordinate and post-coordinate language. This book is an excellent how-to guide for facet classification creation.

Top

Classification and the Internet

High level subject access tools and techniques in Internet cataloging / Judith R. Ahronheim ed. Binghamton, NY: Haworth Information Press, c2002

Dahlberg, Ingetraut. The future of classification in libraries and networks: a theoretical point of view. Cataloging & Classification Quarterly, 21(2), 23-36 (1995)

Denton, William. How to Make a Faceted Classification Scheme and Put it on the Web. (2003) Available:
http://www.miskatonic.org/library/facet-web-howto.html

• This is a fantastic paper on faceted classification creation and its use on the Web. Denton clearly explains how (and when) to create facets within the context of the web. This paper is not limited to the Web however. It is an excellent guide to creating a faceted scheme, and draws upon many of the original texts on facet creation such as S.R. Ranganathan and the Classification Research Group. This resource has been invaluable. Includes how to store information in relational databases, faceted-based navigation, as well as keyword searching.

--. Putting Facets on the Web: An Annotated Bibliography. (2003) Available:
http://www.miskatonic.org/library/facet-biblio.html

• A superb bibliography not only for Web and facet analysis, but also facet analysis theory and its history.

Faceted classifications and thesauri. Barbara Perles. http://is.gseis.ucla.edu/impact/f95/Papers-projects/Papers/perles.html

Doyle, Bob. The Classification & Evaluation of Content Management Systems. The Gilbane Report 11 (2), 2-13 (2003) Available: http://www.gilbane.com/artpdf/GR11.2.pdf.
• This report was not particularly useful for this project, as it focuses on the actual classification of content management systems and no the process of classification. This paper does however serve as a terrific tool in helping one to evaluate the currently available content management systems.

Fast, Karl, Fred Leise, Mike Steckel. All About Facets and Controlled Vocabularies Boxes and Arrows, (2002) Available: http://www.boxesandarrows.com/archives/all_about_facets_controlled_vocabularies.php.
• This series of articles explains the basic concept of facet creation as well as controlled vocabularies. It is directed towards information architects and contains excellent, clearly written definitions and examples. This is a nice supplement to the more complicated and involved texts of Ranganathan and Vickery

Koch, Traugott. The Role of Classification Schemes in Internet Resource Description and Discovery. Available: http://www.ukoln.ac.uk/metadata/desire/classification/class_tc.htm
• Classification schemes lend themselves quite well to information retrieval, browsing, and navigation on the Internet. Traugott had compiled a comprehensive report investigating how efficient classification schemes are in aiding information retrieval in Internet environments. The Universal Decimal Classification (UDC), Dewey Decimal Classification (DDC), Library of Congress Classification (LCC) as well as some international, subject specific and home-grown schemes are all evaluated in terms of their Web-adaptability. Evaluation topics include Internet service availability, multilingual capabilities, classification scheme integration with controlled vocabulary / subject headings, digital availability, copyright, and the strengths and weakness of the scheme. This is an excellent resource for anyone who is interested in adapting and or creating a classification scheme for a body of knowledge in the Internet domain.

Kwasnik, Barbara. (1999). The Role of Classification in Knowledge Representation and Discovery. Library Trends, 48, 22-47.

McKiernan, Gerry. Beyond Bookmarks: Schemes for Organizing the Web. Available: http://www.iastate.edu/~CYBERSTACKS/CTW.htm
• This is a clearinghouse of links to Web sites that have applied classification schemes and or controlled vocabularies to organize their sites. Some of the classification schemes used include: Dewey Decimal, Universal Decimal Classification, Library of Congress, and National Library of Medicine. Some of the controlled vocabularies used in Web site organization include: Library of Congress Subject Headings (LCSH), National Library of Medicine (NLM), and Medical Subject Headings (MeSH). Of interesting note, there are quite a few Finnish Web sites using thesauri as an aid in organizing their sites.

Noy, Natalya F. et al. Ontology Development: A Guide to Creating Your First Ontology. (2002) Available:
http://ksl.stanford.edu/people/dlm/papers/ontology101/ontology101-noy-mcguinness.html

Spiteri, Louise. A Simplified Model for Facet Analysis: Ranganathan 101 Canadian Journal of Information and Library Science 23 (1/2),1-30 (April-July 1998) Available: http://aifia.org/pg/a_simplified_model_for_facet_analysis.php.
• Spiteri proposes a simplified model for facet analysis that includes elements from both S.R. Ranganathan and the Classification Research Group’s (CRG) models. Spiteri reviews Ranganathan’s 46 canons in the context of his three Planes of Work: Idea, Verbal, and Notational. She then goes on to compare and contrast CRG principles of facet analysis with Ranganathan. The final portion of Spiteri’s paper is her simplified model. This is a fantastic primer for reading Ranganathan and CRG works on facet analysis. Its comprehensive overview of Ranganathan’s canons is unique and extremely helpful.

Top

Indexing Languages Reviewed

California Environmental Resources Evaluation System. CERES.Internet on-line. Available from <http://ceres.ca.gov/thesaurus/>.
• This environmental thesaurus was initially considered because often issues in the environmental area have a tendency to be multidisciplinary - particularly those that engage in Sustainable Development. It was not multidisciplinary enough, however, to be of use for the IIS Thesaurus.

Cambridge Scientific Abstracts. Thesaurus of Sociological Indexing Terms.Internet on-line. Available from <http://www.csa.com/edit/sociothes.html>.
• CSA's sociological thesaurus covered a great deal of terms relevant to IIS. It presented a formidable challenge to the ERIC thesaurus. CSA contained relevant terms that ERIC did not, and vice versa. Ultimately ERIC was chosen over CSA because ERIC had more terms relevant to academia (which were relevant considering that IIS is a new "specialization" among fields of study). The final key disadvantage for CSA was that access to its thesaurus required a subscription to its services. As such, it would not be freely and easily available to a range of users.

ELDIS, Institute for Development Studies. Internet on-line. Available from <http://www.ids.ac.uk/eldis/thessea.htm>.
• Because much of the application of Integration and Implementation Sciences related research focuses on complex problems like International or Sustainable Development, the ELDIS thesaurus seemed a logical source to review. It had too narrow of a focus, however, and was not useful for the IIS thesaurus.

Global Environmental Information Exchange Network. Envoc thesaurus.Internet on-line. Available from <http://p5uni.ii.pw.edu.pl/envoc/>.
• Similar to the CERES thesaurus, the Envoc thesaurus was considered for its potential to represent multidisciplinary concepts. It was missing important pieces of academia and sociology, however, and was quickly discarded.

HASSET/SOSIG implementation. HASSET thesaurus.Internet on-line. Available from <http://sosig.ac.uk/roads/cgi/thesaurus.pl>.
• The HASSET sociological thesaurus is cumbersome to use and was too broad to be of use for the IIS thesaurus.

Organization for Economic Co-operation and Development. OECD Macrothesaurus. 1991. Internet on-line. Available from <http://info.uibk.ac.at/info/oecd-macroth/>. [8 February, 2004].
• The OECD thesaurus considered because its international focus might provide coverage of a range of complex, large scale problems. It was also the only thesaurus with a category devoted to research and theory, which was the most appropriate home for Integration and Implementation Sciences as a discipline. But the OECD thesaurus was entirely to narrowly focused on economics to be of use to IIS.

UNESCO. UNESCO thesaurus. 1995. Internet on-line. Available from <http://www.ulcc.ac.uk/unesco/>. [8 February, 2004].
• The UNESCO thesaurus contained a great many terms applicable to IIS resources, but ultimately it was to general and did not provide enough terms related to academic study. A portion of the UNESCO thesaurus, however, was incorporated into the IIS system and referenced as the authority file for geographic place names.

United States Geological Survey. USGS Thesaurus: Gateway to the Earth Draft 9 28-Jun-2002. 2004 2002. Internet on-line. Available from <http://geo-nsdi.er.usgs.gov/term>. , 8 February].
• It was clear from the beginning that a geological thesaurus would not provide appropriate terms for indexing IIS resources. Instead it was reviewed as a model for its faceted structure and excellent presentation (both layout and functionality).

US Department of Education. ERIC Thesaurus (professional version).Internet on-line. Available from <http://www.ericfacility.net/extra/pub/thessearch.cfm>.
• ERIC contained the most terms relevant to IIS resources and provided a reasonably functional and flexible interface that was freely available to all users over the Web. As such it was selected as the primary thesaurus for the IIS thesaurus system.

USAID. USAID Thesaurus. 1998. Internet on-line. Available from <http://www.dec.org/pdf_docs/PNACD400.pdf [8 February, 2004].
• The USAID Thesaurus was available in a PDF form which was ungainly to use. It was considered for the same reasons that the UNESCO and ELDIS thesauri were considered - the broad multidisciplinary scope of international development seemed a likely area to contain the kinds of terms needed by IIS. But USAID did not provide enough detail in the six key IIS areas and lacked the necessary academic terms. It did provide, however, an excellent model for how to present an introduction to a thesaurus.

Top