Annotated Bibliography
On Thesaurus Construction and Indexing Languages
Aitchison, J., A. Gilchrist, and David Bawden. 1997. Thesaurus construction
and use: A practical manual. London, England: Aslib.
• This book is the current bible for thesaurus construction. Incorporating
all the updated relevant standards (ANSI, BSI, ISO, NISO), Aitchison’s
book is designed as more of a “workbook,” outlining a step by step
process that is easy to follow. It will remain always open as a ready reference
point throughout the project.
Aitchison, Jean. 1969. Thesaurofacet: A thesaurus & faceted classification
for engineering & related subjects. Leicester, England: English Electric
Co. Ltd.
• Aitchison's seminal work product for the electrical engineering field is the key reference example for integrating a faceted classification scheme with a thesaurus (rather than just attaching an alphabetical index to the classification scheme). While some elements of faceted classification were valuable for the IIS Thesaurus, the development of a full scheme is beyond the scope of this project in Spring 2004. This reference will be of greater use in later phases of the project.
American National Standards Institute. 1980. American National Standard Guidelines
for Thesaurus Structure, Construction, and Use. New York:
• This resource is relevant and demands inclusion, but is in essence
covered by Aitchison, Gilchrist and Bawden.
Anderson, James D., Jose Perez-Carballo. 2001. The nature of indexing: How humans
and machines analyze messages and texts for retrieval. Part I: Research, and
the nature of human indexing. Information Processing & Management 37: 231-254.
• The most useful part of this article was the summaries of the views of leading experts on the human analysis skills of indexing. Scholars reviewed included Mulvaney, Chan, Fugmann, Soergel, Lancaster, Fairthorne, Wellisch, Wilson, Taylor, and Hjxrland. This outline provided a defining fence around the slippery nature of indexing and conceptual analysis. As the process of indexing is the action served by the thesaurus tool, it was valuable to obtain a general sense of the nature of the activity.
Bates, Marcia. 2002. After the dot-bomb:getting Web information retrieval right
this time. First Monday 7, no. 7: Internet on-line. Available from <http://firstmonday.dk/issues/issue7_7/bates/index.html>
[8 February 2004].
• By identifying what went wrong in information retrieval in the early,
heady days of the Web, Bates lays out a map of the mine field for anyone constructing
an information retrieval system. The reverse of her list of pitfalls constitutes
a useful checklist: 1. Use faceted classification, not hierarchical; 2. Forget
“ontologies” – they mislead into believing things can be organized
definitively; 3. Roget’s is a stylistic writer’s thesaurus, info
retrieval thesauri have different purposes and must be structured differently;
4. Bradford distributions/power laws operate in full force in info retrieval
– prepare for and work with them; 5. Design for scalability or you will
be sorry; 6. User interface design of indexing tools is as important as (or
more than) the structure/logic of the tool itself; 7. Take advantage of existing
experts – rest assured, every wheel has been invented – you are
only making a new kind of cart. Good advice, all of it.
Batty, D. 1998. WWW - wealth, weariness or waste: Controlled vocabulary and
thesauri in support of online information access. D-Lib Magazine. (Electronic
version available via Simmons Libraries)
• Written as an introduction to controlled vocabulary (and as an advocacy
piece for controlled vocabulary on the Web) the treatment is somewhat cursory,
but it is a clear, quick reference and reminder. In particular, Batty reminds
us that although modern keyword searching seems wonderful, the increasing volume
of items, and therefore the increasingly overwhelming retrieval results, demand
a return to controlled vocabulary.
Berners-Lee, Tim, James Hendler, and Ora Lassila. 2001. The Semantic Web. Scientific
American 284, no. 5: 34-43 Av. Available from <http://www.sciam.com/article.cfm?articleID=00048144-10D2-1C70-84A9809EC588EF21>
[8 February 2004].
•
It appears that this article is cited frequently less because of its direct application to the various subjects within which it is referenced, and more because it serves as an outstanding point on the horizon against which all information retrieval development activities can navigate and measure their progress. It is almost a little reminder to developers, "just remember, whatever you do is ultimately going to have to fit with this model of the world..." Useful vision, if not specifically relevant to the heavy lifting in thesauri construction.
Booth, Pat F. 1987. Thesauri: Their uses for indexers. The Indexer 15, no. 3: 141-144.
• This article expands awareness about the variety of uses of a thesaurus. Booth reminds us that thesauri are useful for: 1. subject familiarization (Cliff notes for self-education); 2. meaning and authenticity checks (better than dictionaries because provide hierarchical context and more frequently updated because in active use); 3. vocabulary control (associated with indexing activity-most commonly understood use), and; 4. standard vocabulary for successive indexes (quality control). It is valuable to keep these four applications in mind in order to create a fully developed, mature and effective tool. These applications are also good measures for evaluation.
British Standards Institution. 1987. Guidelines for the Establishment and Development
of Monolingual Thesauri. London
• This resource is relevant and demands inclusion, but is in essence
covered by Aitchison, Gilchrist and Bawden.
Cleveland, Donald B., and Ana D. Cleveland. 2001. Introduction to Indexing and
Abstracting. Littleton, CO: Libraries Unlimited.
• Situated as it is within a volume focused primarily on activities rather
than tools, the chapter on Vocabulary Control is introductory in nature yet
surprisingly comprehensive and dense. Not only does it define and describe thesauri,
but it does a good job of outlining construction procedures, including evaluation.
It is most valuable for it’s easily comprehensible lists (checklists in
some cases) describing characteristics, activities, rules, and criteria. A good
reference document for ongoing guidance.
Craven, Tim. Freeware. 21 January 2004. Internet on-line. Available from <http://publish.uwo.ca/~craven/freeware.htm>.
[8 February, 2004].
•This freeware was very easy to download and quick to learn. The interface/display is not fancy, but is very simple to use, with simple instructions and help. The fact that it is free allows for the option of users of the IIS thesaurus to actually download the software and the IIS Thesaurus file for easier access than scrolling a static Web page.
Craven, Tim. Thesaurus Construction [Tutorial]. 19 February 2002. Internet on-line.
Available from <http://instruct.uwo.ca/gplis/677/thesaur/main00.htm>.
[8 February, 2004].
•
This was the first resource consulted for this project. It appeared deceptively simple at the time, but after further review of more complex documents, it is clear that this resource is an outstanding introduction and a good reference when more complex literature and/or activity generate theoretical or logical confusion.
Foskett, A.C. 1996. The subject approach to information. London, England: Library
Association.
• Foskett’s book is a thorough textbook on subject access but does
not provide the in-depth coverage of thesaurus construction that other dedicated
resources have. Nevertheless, the chapters on “Assigned Indexing”
cover the key theoretical/linguistic concepts of semantic and syntactic relationships.
Foskett, D.M. 1980. Thesaurus. In Encyclopedia of library and information science.
New York: Marcel Dekker.
•
This encyclopedia entry provided a useful overview of the nature, structure and operation of thesauri. Of greatest interest were Foskett's seven major purposes of a thesaurus which were used to reinforce the evaluation design of this project and were particularly relevant to IIS. Also of use was a visually effective chart outlining the conditions under which thesauri terms should be in the singular or plural form. A good checklist for this component of thesaurus construction.
Gilchrist, A. 2003. Thesauri, taxonomies and ontologies - an etymological note.
Journal of Documentation 59, no. 1: 7-18.
Haas, Stephanie W. 1996. Sublanguages and the automatic identification of sublanguage
terms. In Encyclopedia of library and information science.
• Haas focuses primarily on research to automatically identify sub-language
terms for specific domains, but the value of this article to this project is
as a reminder to pay special attention to the unique “jargon” of
the field and identify the boundaries between the sub-language and the parent
domain of which it is a part. Additionally, it is useful to note the various
types of terms including those that are invented and those situations where
“the meanings of existing words may be modified to become more specialized,
such as the world linear. In general language, it describes something having
the characteristics of a line; in mathematics, it refers to a particular kind
of function; and in botany, it describes a specific arrangement of pollen grains.”
It is through the hierarchical and faceted relationships of the thesaurus that
these invented terms and modified commons terms will define their significance
in the sub-language of IIS.
International Organization for Standardization. 1986. Guidelines for the Establishment
and Development of Multilingual Thesauri. Geneva.
• This resource is relevant and demands inclusion, but is in essence
covered by Aitchison, Gilchrist and Bawden.
Jones, S. 1993. A thesaurus data model for an intelligent retrieval system. Journal of Information Science 19: 167-178.
• Not only does this article outline the structure of thesauri in a way
that is easily understood by a student of Database Management (complete with
Entity-Relationship diagramming), but it provides an outstanding framework for
integrating the thesaurus into the database containing all of the Integration
and Implementation resources. This document will continue to serve as an active
reference tool like Aitchison’s Thesaurus construction and use.
Krooks, David A., F.W. Lancaster. 1993. The evolution of guidelines for thesaurus construction. Libri 43, no. 4: 326-342.
• It was expected that this article might emphasize particular elements
of the thesaurus construction process by identifying when and why they entered
the “standards.” The article was more concerned, however, with the
individuals and organizations involved and provided a much more historical than
functional view of the standards. Students pursuing doctoral work specifically
in thesaurus construction will no doubt be interested, and the article deserves
a place in this Bibliography, but the work will not play an active role in this
project.
Lancaster, F.W. 1998. Indexing and abstracting in theory and practice. Champaign, IL: University of Illinois, Graduate School of Library & Information Science.
• In this volume Lancaster is primarily concerned with the theoretical,
philosophical and practical elements of indexing and abstracting activities
and considers thesauri only in their role as a tool. The value of this resource
lies in its full treatment of the challenges of indexing, thus providing a context
to hold in mind during the tool-building process.
Lancaster, F.W. 1986. Vocabulary control for information retrieval. Washington, DC: IRP.
• This volume is a critical resource for thesaurus development treating
all elements of construction carefully and thoroughly. It is an older kin to
the Aitchison, Gilchrist and Bawden bible, though certain sections stood out
as more memorable or useful. The discussion of gathering raw material and term
selection based on both literary and user warrant provided some key direction
in the early stages of this project. The section on Vocabulary Failures provided
a reasonably extensive treatment of the term choices that cause failures and
how to balance the complex elements involved in term selection. Highly valuable
was Lancaster’s discussion of quantitative evaluation measures including:
equivalence ratio, reciprocity ratio, definition, flexibility, pre-coordination
level, and size of term groups. Some of these measures are included in the evaluation
design for this project.
Michel, Dee. Appendix A: Report to the ALCTS/CCS Subject Analysis Committee July 1996.Internet on-line. Available from <http://www.ala.org/ala/alctscontent/catalogingsection/catcommittees/subjectanalysis/subjectrelations/appendix.htm>. [8 February, 2004].
• This Web page serves primarily as an explanatory and contextual note
for Dee Michel’s Taxonomy of subject relationships (listed below). In
and of itself it did not contribute much to the project but must be listed independently
to reflect it’s unique URL.
Michel, Dee. Appendix B (part 1): Taxonomy of subject relationships.Internet on-line. Available from <http://archive.ala.org/alcts/organization/ccs/sac/msrscu2.pdf>. [8 February, 2004].
• This document is an un-annotated list, presented with BT, NT, RT, and
USE references, of Michel’s analysis of the many different types of subject
relationships. It is fascinating, and great risk is run that many hours will
be wiled away using Michel’s taxonomy to investigate, ponder and debate
the nature of unique relationships between individual terms as this IIS thesaurus
is constructed. This list may prove useful since Integration and Implementation
Sciences is a field characterized by the re-combination of existing philosophies,
principles and practices in different relationships. It is anticipated that
Michel’s relationship definitions will provide the foundation of highly
instructive scope notes as well as facilitating more effective use of facets
and BT, NT, RT for IIS terminology.
Michel, Dee. Appendix B (part 2): Taxonomy of subject relationships.Internet on-line. Available from <http://www.ala.org/ala/alctscontent/catalogingsection/catcommittees/subjectanalysis/subjectrelations/appendixbpartii.htm>.
[8 February, 2004].
• This page represents the short list of subject relationships (no BT,
NT, RT, etc.), as identified by Michel, grouped generally into the familiar
key categories of Associative, Equivalence and Hierarchical relationships.
Middleton, Michael. Controlled Vocabularies. Queensland University of Technology. Internet on-line. Available from <http://sky.fit.qut.edu.au/~middletm/cont_voc.html>. [2 March, 2004].
• This is an amazing gateway providing a fabulous first step in the journey
into thesaurus construction. Resources are arranged into the following groupings:
Thesaurus building & use guidance, Thesaurus online sites, Thesaurus software,
Databases of thesauri, Bibliography, Classification schemes online, and Website
compilations from elsewhere. In addition to Candy’s pages and the Will’s
(see below), it is an excellent place to start for beginning students of thesaurus
construction.
Milstead, Jessica L. About Thesauri. 2000. Internet on-line. Available from <http://www.bayside-indexing.com/Milstead/about.htm>. [8 February, 2004].
• Milstead succinctly hits all the key points a key decision-maker would
want to know about thesauri. The page appears designed for the person that has
had someone say to them, “You need a thesaurus…” and then
says to themselves, “What? Why? How? Ugh.” Once they’ve hit
this page they will move on to other resources, but Milstead excels at outlining
the key elements of the essential processes. The software links she provides
are not relevant to this project, but the process outlines are a good reference
for a double check in the development process.
Milstead, Jessica L. 1993. Thesaurus managment software. In Encyclopedia of library and information science.
•Once again, Milstead achieves maximum density. While the purchase of thesaurus software is not a part of this project, Milstead's checklist of things to consider when purchasing thesaurus software forms an outstanding outline of things to remember in the construction process.
Milstead, Jessica L. 1984. Subject access systems: Alternatives in design. Orlando: Academic Press.
• This volume treats subject access from the whole systems perspective.
Milstead discusses thesauri construction, classification, indexing and cataloguing
issues in turn, but her entire effort considers the complex and interconnected
affects of one on the other with the intent of providing the reader/developer
with the ability to make informed decisions in their design process. As she
sets out in her introduction, “This work treats the design of a system
for subject access as a set of interacting choices, describing the choices together
with the ways in which one choice may constrain others. Every aspect of a subject
access system is the result of a choice between alternatives.” This resource
is being read cover to cover as the thesaurus construction process evolves to
ensure an honorable product with healthy karma.
National Information Standards Organization. Guidelines for the Construction, Format and Management of Monolingual Thesauri. 2003. Internet on-line. Available from <http://www.niso.org/standards/resources/Z39-19.pdf>. [8 February, 2004].
• This resource is relevant and demands inclusion, but is in essence
covered by Aitchison, Gilchrist and Bawden.
Pollard, Richard. 1993. A hypertext-based thesaurus as a subject browsing aid for bibliographic databases. Information Processing & Management 29, no. 3: 345-357.
• The title of this article is precise. Pollard discusses the advantages
of using a hypertext-based thesaurus to provide structural cues to a subject
domain, improving user browsing experiences. The article is fascinating, but
slightly beyond the scope of this project as it currently stands. It is anticipated
that this resource will be invaluable at Phase Two of this thesaurus where the
focus shifts to interactive Web-based user interfaces for the product.
Priss, Uta, Elin Jacob. 1999. Utilizing faceted structures for information systems design. In ASIS '99: Proceedings of the 62nd ASIS Annual Meeting203-212.
• Priss and Elin have written directly to the broader endeavor of which
this thesaurus construction project is a part – a full database, and ultimately
dynamic Web site, linking all the elements of the budding Integration and Implementation
Sciences field. “This paper argues for the use of a faceted thesaurus
as the basis for organizing a small-scale institutional website.” While
IIS aims to be large-scale in the future, and the paper is by no means comprehensive,
the proposals for organizing information set forth here are very useful.
Ross, Jan. 2000. A new way of information retrieval: 3-D indexing and concept mapping. Learned Publishing 13, no. 2: 119-123.
Schwartz, Candy. 2001. Sorting out the Web: Approaches to subject access. Westport, CT: Ablex.
•
Controlled vocabularies form a small component of this volume, but the components of an indexing language were laid out very clearly and concisely for easy review. Notably, Schwartz clearly includes the component of "Application Policies" - a subject absent from, or hidden in, many other discussions of controlled vocabulary.
Shiri, A.A., C. Revie. 2000. Thesauri on the Web: current developments and trends. Online Information Review 24, no. 4: 273-280.
•This brief article provides a reasonable assessment of the state of thesauri on the Web in 2000, and adequately identifies future activities, including the next stage of thesauri standards development: standards for electronic thesauri. There was little here, however, to contribute additional knowledge to this project. To be fair, it does serve as a relevant milestone and reinforces knowledge gathered elsewhere and is thus included.
Smith, Linda C. 1992. UNISIST revisited: Compatibility in the context of collaboratories. In Classification research for knowledge representation and organization. Amsterdam: Elsevier.
Soergel, Dagobert. 1974. Indexing languages and thesauri: Construction and maintenance. Los Angeles: Melville.
• Ugh. This was the bible in 1974, and remains a seminal reference in any article or volume discussing thesaurus construction. Soergel is comprehensive, detailed and honorably precise. Unfortunately, the sheer size and scope of this volume makes it an unwieldy resource for this project. It will be used primarily as a reference source to double check points of debate and to ensure that critical activities are not overlooked. In this regard, the inclusion of both a condensed and exploded Table of Contents, as well as an Index, is extremely valuable.
Svenonius, Elaine. 1990. Design of controlled vocabularies. In Encyclopedia of library and information science.
• Svenonious considers the myriad details of the narrow activity of term selection and relationship definition in this encyclopedia article. Her treatment of Vocabulary, Syntax and Semantics provide detailed perspectives of well-covered ground. Of particular interest is her unique coverage of issues like scholarly usage, context independence, structural warrant, and her thorough discussions of binding vs. synthesizing syntax and category vs. referential vs. relational semantics. This is the document for the deep logic of it all - though not quite so voluminous as Soergel.
Taylor, Arlene G. 2004. The organization of information. Littleton, CO: Libraries Unlimited.
• Taylor's treatment is basically introductory, as a chapter within a book with a much larger subject. The sub-section on the Mechanics of Controlled Vocabularies, however, provided a diverse treatment of the various types of relationships. She covers the traditional whole/part, genus/species, class/member, topic/instance types of relationships most commonly found in thesauri. She also describes relationships as outlined in WordNet (synonyms, coordinate terms, hypernyms, hyponyms, holonyms, meronyms, and antonyms), and in the OWL Web Ontology Language (subClassOf, one of, equivalentClass, intersectionOf, disjointWith). The juxtaposition of these different articulations of relationships complicates the subject while simultaneously illustrating the need for clarity and commitment in determining how relationships will be expressed in a newly constructed controlled vocabulary.
Vickery, B.C. 1997. Ontologies. Journal of Information Science 23, no. 4: 277-286.
Weinberg, B.H. 1995. Library classification and information retrieval thesauri: comparison and contrast. Cataloguing & Classification Quarterly 19, no. 3/4: 23-44.
• There is little of direct practical application in this article effectively
outlining the convergent course of thesauri and classification, particularly
in the Web environment. However, Weinberg highlights a critical point. Greater
classification structure in thesauri will enable them to serve as more effective
tools at the searching level (as opposed to just the indexing level). This is
an important observation to keep in mind while designing a new thesaurus for
a growing field.
Wellisch, Hans H. 2000. Glossary of terminology in abstracing, indexing, classification,
and thesaurus construction. Medford, NJ: Information Today, in association with
the American Society of Indexers.
• This thin volume is an important reference work in deciphering much
of the sub-language pervasive in the literature on thesaurus construction. Not
an especially active resource for this project, but a representative one that
must remain on the list.
Wellisch, Hans H. 1995. Indexing from A to Z. New York: H.W. Wilson Co.
Will, Leonard D., Sheena E. Will. Thesaurus principles and practice. 13 February 1998Internet on-line. Available from <http://www.willpower.demon.co.uk/thesprin.htm>. [8 February 2004].
• This introduction to thesauri is similar to that of Tim Craven, but
perhaps even more focused on the beginner. One advantage of this tutorial, however
is that all chapters are on one page and are more easily scrolled, linked and
printed. While it had little additional instructional value for this project,
it remains in the bibliography as a good introductory reference. And it does
provide some introductory treatment of thesauri principles for museum objects
– a topic not always addressed by other treatments.
Will, Leonard D., Sheena E. Will. Publications on thesaurus construction and use. 13 August 2003Internet on-line. Available from <http://www.willpower.demon.co.uk/thesbibl.htm>. [2 March, 2004].
• The Wills maintain quite a phenomenal gateway bibliography of resources
grouped according to the broad categories of thesaurus structure and use, facet
analysis, search interfaces that allow for combining terms from multiple facets
of a thesaurus, lists of thesauri, taxonomies, ontologies, and topic maps. Many
of the resources reviewed for this project, and included in this Bibliography,
are listed and/or linked from the Will’s Publications page. Neck and neck
with Michael Middleton for coverage (and they link to each other).
Williamson, Nancy J. Deriving a thesaurus from a restructured UDC. In 4th International Conference of the International Society for Knowledge Organization370-377.
• This article is useful for the phase of the project concerned with
developing classification. Some classification schemes are being reviewed in
relation to the key concepts of Integration and Implementation Sciences. If
there is a classification scheme that serves the field more effectively than
the hierarchical structure of a particular thesaurus, it may prove effective
to derive some terms from the classification. At that point, this resource will
prove its value.
Zeng, Lei. 1993. Compatibility of indexing languages in an online access environment: A review of the approaches. Advances in Classficiation Research 3: 161-181.
• Zeng discusses the importance of compatibility and interoperability
among thesauri in anticipation of the meta-searching systems that are just now
becoming a prevalent reality. In searching automatically across databases, discrepancies
and differences in indexing terms can cause both precision and recall problems.
While Zeng investigates this issue in far more detail than is necessary for
this project, compatibility and interoperability are important reasons why this
project is structured to make the greatest use of existing thesauri prior to
developing new terminology. It is hoped that existing terminology may be able
to be “re-used” by establishing different relationships that will
make the terms relevant for IIS. Zeng’s analysis may prove helpful as
this work comes to the fore.
Top
On Faceted Classification and Classification Theory
Ranganathan, S.R.. Elements of Library Classification. 3rd ed. New York: Asia
Publishing House, 1962.
• This book is based on a series of Ranganathan lectures on classification.
The first part of the book focuses on general classification using an example
of 108 “Subjects in Unhelpful Alphabetical Sequence”. These 108
subjects are analyzed and grouped according to Ranganathan’s Canons of
Classification. This is an excellent introduction to developing a classification
scheme, but can be difficult to decipher at times because of the syntax of the
language. The lectures on facet analysis are divided into four parts: facet
and focus, fundamental categories, rounds, and levels. The five fundamental
categories (personality, matter, energy, space and time) are applied in facet
analysis examples where facet symbols are applied to 18 subjects. This was particularly
useful in learning how to analyze and deconstruct a complex subject.
Classification Research Group. “The Need for a Faceted Classification
as the Basis of All Methods of Information Retrieval.” Theory of Subject
Analysis: A Sourcebook, edited by Lois Mai Chan, Phyllis A. Richmond pp 154-67
Littleton, Colo Libraries Unlimited, 1985
• This is an interesting report presented by the Classification Research
Group (CRG) in the early 1950’s. It focuses on the need for improved classification
systems to aid in two steps of information retrieval: identification of the
search subject, and locating the subject through a referral to one or more subjects.
The CRG argues that generic relations cannot be displayed in enumerative tree
structures and that traditional schemes are rigid and inflexible. It is suggested
that faceted classification schemes are the best mechanism to allow given genus
to be subdivided in more than one way allowing for flexibility and a better
understanding of generic relationships. The report goes on to identify ten problems
with classification schedules, including misplacement of subjects, distorted
relationships between terms, insufficient detail and lengthy and complex notation.
This report defines reasons for using faceted classification schemes, but does
not go into much detail about the construction of such schemes.
Vickery, B.C. Faceted classification: a guide to construction and use of special
schemes, prepared for the Classification Research Group. London, Aslib, 1960
• Vickery, a member of the Classification Research Group, provides a solid
introduction to faceted classification schemes. The book includes a history
of facet analysis, indexing term definitions, as well as examples of facet construction,
and comparison of faceted classifications. Vickery uses interesting examples
to demonstrate the creation of facets, the ordering, and the notation. The book
also focuses on how systems retrieve information using pre-coordinate and post-coordinate
language. This book is an excellent how-to guide for facet classification creation.
Top
Classification and the Internet
High level subject access tools and techniques in Internet cataloging / Judith
R. Ahronheim ed. Binghamton, NY: Haworth Information Press, c2002
Dahlberg, Ingetraut. The future of classification in libraries and networks:
a theoretical point of view. Cataloging & Classification Quarterly, 21(2),
23-36 (1995)
Denton, William. How to Make a Faceted Classification Scheme and Put it on the
Web. (2003) Available:
http://www.miskatonic.org/library/facet-web-howto.html
• This is a fantastic paper on faceted classification creation and its
use on the Web. Denton clearly explains how (and when) to create facets within
the context of the web. This paper is not limited to the Web however. It is
an excellent guide to creating a faceted scheme, and draws upon many of the
original texts on facet creation such as S.R. Ranganathan and the Classification
Research Group. This resource has been invaluable. Includes how to store information
in relational databases, faceted-based navigation, as well as keyword searching.
--. Putting Facets on the Web: An Annotated Bibliography. (2003) Available:
http://www.miskatonic.org/library/facet-biblio.html
• A superb bibliography not only for Web and facet analysis, but also
facet analysis theory and its history.
Faceted classifications and thesauri. Barbara Perles. http://is.gseis.ucla.edu/impact/f95/Papers-projects/Papers/perles.html
Doyle, Bob. The Classification & Evaluation of Content Management Systems.
The Gilbane Report 11 (2), 2-13 (2003) Available: http://www.gilbane.com/artpdf/GR11.2.pdf.
• This report was not particularly useful for this project, as it focuses
on the actual classification of content management systems and no the process
of classification. This paper does however serve as a terrific tool in helping
one to evaluate the currently available content management systems.
Fast, Karl, Fred Leise, Mike Steckel. All About Facets and Controlled Vocabularies
Boxes and Arrows, (2002) Available: http://www.boxesandarrows.com/archives/all_about_facets_controlled_vocabularies.php.
• This series of articles explains the basic concept of facet creation
as well as controlled vocabularies. It is directed towards information architects
and contains excellent, clearly written definitions and examples. This is a
nice supplement to the more complicated and involved texts of Ranganathan and
Vickery
Koch, Traugott. The Role of Classification Schemes in Internet Resource Description
and Discovery. Available: http://www.ukoln.ac.uk/metadata/desire/classification/class_tc.htm
• Classification schemes lend themselves quite well to information retrieval,
browsing, and navigation on the Internet. Traugott had compiled a comprehensive
report investigating how efficient classification schemes are in aiding information
retrieval in Internet environments. The Universal Decimal Classification (UDC),
Dewey Decimal Classification (DDC), Library of Congress Classification (LCC)
as well as some international, subject specific and home-grown schemes are all
evaluated in terms of their Web-adaptability. Evaluation topics include Internet
service availability, multilingual capabilities, classification scheme integration
with controlled vocabulary / subject headings, digital availability, copyright,
and the strengths and weakness of the scheme. This is an excellent resource
for anyone who is interested in adapting and or creating a classification scheme
for a body of knowledge in the Internet domain.
Kwasnik, Barbara. (1999). The Role of Classification in Knowledge Representation and Discovery. Library Trends, 48, 22-47.
McKiernan, Gerry. Beyond Bookmarks: Schemes for Organizing the Web. Available:
http://www.iastate.edu/~CYBERSTACKS/CTW.htm
• This is a clearinghouse of links to Web sites that have applied classification
schemes and or controlled vocabularies to organize their sites. Some of the
classification schemes used include: Dewey Decimal, Universal Decimal Classification,
Library of Congress, and National Library of Medicine. Some of the controlled
vocabularies used in Web site organization include: Library of Congress Subject
Headings (LCSH), National Library of Medicine (NLM), and Medical Subject Headings
(MeSH). Of interesting note, there are quite a few Finnish Web sites using thesauri
as an aid in organizing their sites.
Noy, Natalya F. et al. Ontology Development: A Guide to Creating Your First
Ontology. (2002) Available:
http://ksl.stanford.edu/people/dlm/papers/ontology101/ontology101-noy-mcguinness.html
Spiteri, Louise. A Simplified Model for Facet Analysis: Ranganathan 101 Canadian
Journal of Information and Library Science 23 (1/2),1-30 (April-July 1998) Available:
http://aifia.org/pg/a_simplified_model_for_facet_analysis.php.
• Spiteri proposes a simplified model for facet analysis that includes
elements from both S.R. Ranganathan and the Classification Research Group’s
(CRG) models. Spiteri reviews Ranganathan’s 46 canons in the context of
his three Planes of Work: Idea, Verbal, and Notational. She then goes on to
compare and contrast CRG principles of facet analysis with Ranganathan. The
final portion of Spiteri’s paper is her simplified model. This is a fantastic
primer for reading Ranganathan and CRG works on facet analysis. Its comprehensive
overview of Ranganathan’s canons is unique and extremely helpful.
Top
Indexing Languages Reviewed
California Environmental Resources Evaluation System. CERES.Internet on-line.
Available from <http://ceres.ca.gov/thesaurus/>.
•
This environmental thesaurus was initially considered because often issues in the environmental area have a tendency to be multidisciplinary - particularly those that engage in Sustainable Development. It was not multidisciplinary enough, however, to be of use for the IIS Thesaurus.
Cambridge Scientific Abstracts. Thesaurus of Sociological Indexing Terms.Internet
on-line. Available from <http://www.csa.com/edit/sociothes.html>.
•
CSA's sociological thesaurus covered a great deal of terms relevant to IIS. It presented a formidable challenge to the ERIC thesaurus. CSA contained relevant terms that ERIC did not, and vice versa. Ultimately ERIC was chosen over CSA because ERIC had more terms relevant to academia (which were relevant considering that IIS is a new "specialization" among fields of study). The final key disadvantage for CSA was that access to its thesaurus required a subscription to its services. As such, it would not be freely and easily available to a range of users.
ELDIS, Institute for Development Studies. Internet on-line. Available from <http://www.ids.ac.uk/eldis/thessea.htm>.
•
Because much of the application of Integration and Implementation Sciences related research focuses on complex problems like International or Sustainable Development, the ELDIS thesaurus seemed a logical source to review. It had too narrow of a focus, however, and was not useful for the IIS thesaurus.
Global Environmental Information Exchange Network. Envoc thesaurus.Internet
on-line. Available from <http://p5uni.ii.pw.edu.pl/envoc/>.
•
Similar to the CERES thesaurus, the Envoc thesaurus was considered for its potential to represent multidisciplinary concepts. It was missing important pieces of academia and sociology, however, and was quickly discarded.
HASSET/SOSIG implementation. HASSET thesaurus.Internet on-line. Available from
<http://sosig.ac.uk/roads/cgi/thesaurus.pl>.
•
The HASSET sociological thesaurus is cumbersome to use and was too broad to be of use for the IIS thesaurus.
Organization for Economic Co-operation and Development. OECD Macrothesaurus.
1991. Internet on-line. Available from <http://info.uibk.ac.at/info/oecd-macroth/>.
[8 February, 2004].
•
The OECD thesaurus considered because its international focus might provide coverage of a range of complex, large scale problems. It was also the only thesaurus with a category devoted to research and theory, which was the most appropriate home for Integration and Implementation Sciences as a discipline. But the OECD thesaurus was entirely to narrowly focused on economics to be of use to IIS.
UNESCO. UNESCO thesaurus. 1995. Internet on-line. Available from <http://www.ulcc.ac.uk/unesco/>.
[8 February, 2004].
•
The UNESCO thesaurus contained a great many terms applicable to IIS resources, but ultimately it was to general and did not provide enough terms related to academic study. A portion of the UNESCO thesaurus, however, was incorporated into the IIS system and referenced as the authority file for geographic place names.
United States Geological Survey. USGS Thesaurus: Gateway to the Earth Draft
9 28-Jun-2002. 2004 2002. Internet on-line. Available from <http://geo-nsdi.er.usgs.gov/term>.
, 8 February].
•
It was clear from the beginning that a geological thesaurus would not provide appropriate terms for indexing IIS resources. Instead it was reviewed as a model for its faceted structure and excellent presentation (both layout and functionality).
US Department of Education. ERIC Thesaurus (professional version).Internet on-line.
Available from <http://www.ericfacility.net/extra/pub/thessearch.cfm>.
•
ERIC contained the most terms relevant to IIS resources and provided a reasonably functional and flexible interface that was freely available to all users over the Web. As such it was selected as the primary thesaurus for the IIS thesaurus system.
USAID. USAID Thesaurus. 1998. Internet on-line. Available from <http://www.dec.org/pdf_docs/PNACD400.pdf
[8 February, 2004].
•
The USAID Thesaurus was available in a PDF form which was ungainly to use. It was considered for the same reasons that the UNESCO and ELDIS thesauri were considered - the broad multidisciplinary scope of international development seemed a likely area to contain the kinds of terms needed by IIS. But USAID did not provide enough detail in the six key IIS areas and lacked the necessary academic terms. It did provide, however, an excellent model for how to present an introduction to a thesaurus.
Top
|