
Collocative Integrity and Our Many Varied Subjects: What the Metric of Alignment between Classification Scheme and Indexer Tells Us About Langridge's Theory of Indexing

Joseph T. Tennis
University of Washington Information School

As the universe of knowledge and its subjects change over time, indexing languages such as classification schemes accommodate that change by restructuring. Restructuring indexing languages affects indexer and cataloguer work. Subjects may split or lump together. They may disappear only to reappear later. And new subjects may emerge that were assumed to be already present, but not clearly articulated (Miksa, 1998). In this context we have the complex relationship between the indexing language, the text being described, and the already described collection (Tennis, 2007). It is possible to imagine indexers placing a document into an outdated class, because it is the one they have already used for their collection. However, doing this erases the semantics in the present indexing language.

Given this range of choice in the context of indexing language change, the question arises: what does this look like in practice? How often does this occur? Further, what does this phenomenon tell us about subjects in indexing languages? Does the practice we observe in the reaction to indexing language change provide us evidence of conceptual models of subjects and subject creation? If that evidence is incomplete, but gets us close, what evidence do we still require?

To address these questions we documented how different subjects changed over time in the Dewey Decimal Classification (DDC). For example, we marked where one could class the topic EUGENICS. In 1911 it is a biological science. However, it can no longer be classed in 575.6, which is now the class for REPRODUCTIVE PARTS OF PLANTS. We collected this data for the years 1876 to 2003. Then, using the Z39.50 protocol, we downloaded bibliographic records from 665 libraries using the DDC.
The libraries were chosen from a list of those offering Z39.50 access to their catalogues that also used the DDC. The libraries are in North America, South Africa, Taiwan, and Europe.

We arranged the records according to Library of Congress Control Number (LCCN). This number has a date built into it: the leading digits give the year, as the examples in Table 1 show. Before the year 2000 LCCN dates were two digits; after 2000 they were four-digit years.

LCCN          Date Inferred
68098003      1968
2004049123    2004

Table 1. LCCN Dates

The date signifies when the bibliographic description was done at the Library of Congress. This gives us an approximate time of cataloguing. We chose records that have our subject term in the MARC 650 field. So in our example above we arranged bibliographic records that had EUGENICS in the first MARC 650 field, with the assumption that we would see cataloguers class this where EUGENICS was available in the DDC.
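
The date inference shown in Table 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used in the study: the function name infer_lccn_year is hypothetical, the LCCN is assumed to be a digits-only string, an 8-digit number is assumed to carry a two-digit year and a 10-digit number a four-digit year, and every two-digit year is read as 19xx (a two-digit prefix cannot by itself distinguish centuries).

```python
import re


def infer_lccn_year(lccn: str) -> int:
    """Infer the year embedded at the start of a digits-only LCCN.

    Assumptions (not stated in the paper): 8 digits means a two-digit
    year prefix, read here as 19xx; 10 digits means a four-digit year.
    """
    digits = lccn.strip()
    if not re.fullmatch(r"\d+", digits):
        raise ValueError(f"expected a digits-only LCCN, got {lccn!r}")
    if len(digits) == 10:
        # Four-digit year prefix, e.g. 2004049123
        return int(digits[:4])
    if len(digits) == 8:
        # Two-digit year prefix, e.g. 68098003; century is assumed
        return 1900 + int(digits[:2])
    raise ValueError(f"unexpected LCCN length: {len(digits)}")


# The two examples from Table 1
for lccn in ("68098003", "2004049123"):
    print(lccn, infer_lccn_year(lccn))
# prints:
# 68098003 1968
# 2004049123 2004
```

Raising an error on unexpected lengths, rather than guessing, keeps records with malformed control numbers out of the chronological arrangement until they are inspected.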

From here we can ask how many cataloguers agreed with the DDC and how many did not. We can also ask how many documents were classed in outdated numbers. Furthermore, we can observe trends in agreement and disagreement. These observations can be quantified, and that is what gives us the concept and metrics of collocative integrity.

The measures of collocative integrity are given in the table and figure below. Here we show where books on EUGENICS are classed. They are either in a possible class (In), outside of a possible class (Out), or in an outdated class number (Old). Overall, only around 28% of the books on EUGENICS were classed where the DDC provided explicit semantic matches.

Eugenics     In      Out     Old
1899-2003    244     623     14
Percent      ~28%    ~71%    ~1%

Table 2. Counts and Percentages of Eugenics Books Classed In, Out, and in Old DDC Numbers

[Figure 1: percentage bars on a 0% to 100% axis; series: Old, Out, In]

Figure 1. Visualization of the Percentage of Eugenics Books Classed In, Out, and in Old DDC Numbers (NB: 1899 has no data so there is no bar)

The measure of collocative integrity can be used in a number of ways. For example, we can see, at one point in time, where subject headings match well with the subjects in the scheme. We can see when, and how often, old classes are used. We might also be able to help indexers improve their practice. It might be that a subject with low collocative integrity can be flagged for potential reassignment. We might also be able to help designers of indexing languages plan revisions based on an optimized conceptualization of collocative integrity. Perhaps there is a benchmark they want to exceed in the alignment of indexer and indexing language. We might postulate that in an ideal sense, as schemes change, the integrity of the collocation of documents on topics remains intact; that is, we do not jeopardize collocative integrity when we revise and restructure schemes.
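
As a worked illustration, the In, Out, and Old shares can be recomputed from raw counts. The sketch below uses the EUGENICS counts from Table 2; the function name collocative_integrity and the dictionary layout are assumptions made for this example, not part of the study. Computed this way, the Old share comes to roughly 1.6%, which the table reports as ~1%.

```python
def collocative_integrity(in_count: int, out_count: int, old_count: int) -> dict:
    """Return the percentage of books classed In, Out, and Old.

    The 'In' share is the figure the text calls collocative integrity.
    """
    total = in_count + out_count + old_count
    if total == 0:
        raise ValueError("no classed books to measure")
    return {
        "In": 100 * in_count / total,
        "Out": 100 * out_count / total,
        "Old": 100 * old_count / total,
    }


# Counts for EUGENICS, 1899-2003, taken from Table 2
eugenics = collocative_integrity(in_count=244, out_count=623, old_count=14)
for label, share in eugenics.items():
    print(f"{label}: {share:.1f}%")
```

The same function can be applied to the counts reported for any other subject, which makes it easy to compare subjects against a chosen benchmark.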
As for the potential research impact of measuring collocative integrity, we may be able to explore the conceptions of types of subjects and types of subject emergence. Langridge, in analyzing the DDC, talks about forms of knowledge, topics, specializations, forms of writing, forms of thought, and forms of text (Langridge, 1989). The first three mentioned by Langridge are his conceptualization of how something is studied, the object of study, and the combination of the two. So PHILOSOPHY is a form of knowledge and can be applied to a range of topics. THE MIND or DESCRIPTORS IN THE MLS BIBLIOGRAPHY are topics that can be studied from various disciplinary points of view. Specializations are the intersections of these two, commonly called disciplines, but Langridge does not see the term as useful because it conflates specializations and forms of knowledge, where he sees them as distinct. An example of a specialization is the PHILOSOPHY OF MORALS, commonly called ETHICS.

We can see in our data what might be evidence of these distinctions. For example, when we chart the collocative integrity of ANATOMY we see a high level of integrity over time. This is because ANATOMY is described solidly as a MEDICAL SCIENCE or an ART in the DDC, since we are commonly writing about the practice of anatomy or the drawing of anatomy. The indexers and the scheme agree based on the structure of this specialization. Table 3 and Figure 2 illustrate this.

Anatomy      In      Out     Old
1899-2003    1219    666     195
Percent      ~59%    ~32%    ~9%

Table 3. Counts and Percentages of Anatomy Books Classed In, Out, and in Old DDC Numbers

[Figure 2: percentage bars on a 0% to 100% axis; series: Old, Out, In]

Figure 2. Visualization of the Percentage of Anatomy Books Classed In, Out, and in Old DDC Numbers

We have close to 60% collocative integrity for ANATOMY compared with the 28% of EUGENICS. As a topic, EUGENICS is, and can be, studied from a number of forms of knowledge, so there is no consistent specialization over time. We can show this scatter visually by showing a timeline of classes possible in DDC and showing where cataloguers placed books in or out of those classes. See Figure 3 below.

[Figure 3: scatter plot titled "Eugenics 1870-2010". Horizontal axis: Years (1870 to 2010). Vertical axis: Range of Classes in the DDC (0 to 1000). Legend: DDC Assigned by Cataloguers; Discontinued Classes in DDC; Classes Possible in DDC; See Alsos in DDC]

Figure 3. Eugenics classes and books arranged in chronological order and in DDC class number order

Above we see squares that indicate the classes possible in DDC for EUGENICS. The large x's show where the editors of DDC explicitly removed a class from the schedules. The circles are see-also references, and the small diamonds are unique cataloguer decisions. There are 891 unique decisions presented here. Unique decisions for our purposes were records that had the same subject heading, different title, different publication year (if title was the same), and different class number. We can see a range of forms of knowledge represented both in the classes possible in DDC and in cataloguer decisions. The 100s are philosophy and psychology, 200s are religion, 300s are social sciences, 500s are life sciences, 600s are applied sciences and useful arts, 700s are fine arts, 800s are literature, and 900s are history and geography. We also see, perhaps, a range of topics and specializations in, for example, the 300s and 500s.
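
To make the scatter across forms of knowledge concrete, cataloguer decisions can be tallied by hundreds band. The following is a minimal sketch, assuming DDC numbers arrive as decimal strings; the labels are the ones listed in the text above, the helper names main_class and main_class_label are hypothetical, and apart from 575.6 (mentioned earlier for EUGENICS) the sample numbers are made up for illustration and are not data from the study.

```python
from collections import Counter

# Labels as given in the text; bands the text does not mention are omitted
MAIN_CLASSES = {
    100: "philosophy and psychology",
    200: "religion",
    300: "social sciences",
    500: "life sciences",
    600: "applied sciences and useful arts",
    700: "fine arts",
    800: "literature",
    900: "history and geography",
}


def main_class(ddc_number: str) -> int:
    """Map a DDC number such as '575.6' to its hundreds band (500)."""
    return int(float(ddc_number)) // 100 * 100


def main_class_label(ddc_number: str) -> str:
    """Return the label for the band, or a marker if the text lists none."""
    return MAIN_CLASSES.get(main_class(ddc_number), "not listed in the text")


# Hypothetical class numbers standing in for cataloguer decisions
sample_decisions = ["575.6", "301.3", "613.94", "176"]
counts = Counter(main_class_label(n) for n in sample_decisions)
for label, n in counts.items():
    print(f"{label}: {n}")
```

A tally like this gives a rough numeric counterpart to the timeline plot: a subject concentrated in one or two bands suggests a stable specialization, while a flat spread suggests a topic studied from many forms of knowledge.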

We can compare this graph to the ones derived from decisions made about books on ANATOMY and GYPSIES.

[Figure 4: scatter plot titled "Anatomy 1900-2010". Horizontal axis: Years (1870 to 2010). Vertical axis: Range of Classes in the DDC (0 to 1000). Legend text: DDC, class possible, discontinued]

Figure 4. Anatomy classes and books arranged in chronological order and in DDC class number order

The above figure reinforces, in visual form, the measure of collocative integrity present in the cataloguing practice of anatomy books. Cataloguers consistently agree with the DDC and place books on ANATOMY in either applied sciences (600s) or fine arts (700s). Even when DDC introduces natural sciences classes, with few exceptions, cataloguers agree with the editors of the classification scheme. The other notable difference here is with the scheme and its lack of see-also references for this particular subject.

When we consider GYPSIES as a subject, or rather, as Langridge would describe it, as a topic, we see a different set of considerations surface. It looks like both ANATOMY and EUGENICS. As a topic GYPSIES can be considered from different forms of knowledge. This makes it similar to EUGENICS. We see this over time as displayed in Figure 5. However, we also see a disagreement between cataloguers and the prescriptions of DDC. They do not agree as to where GYPSIES belong in the range of classes. This is in part due to the way people are handled in DDC as topics. From 1965 (17th Edition) onward we get both classes in the schedules for people (and their languages, for instance), but then we also see the editors of DDC move area, ethnicity, race, and nationality, as well as language, to the tables for synthesis to forms of knowledge. There is a similarity between GYPSIES and ANATOMY in the stability of forms of knowledge over time, with the only change coming from the way DDC treats people from the 1960s onward. And we do see some collections privileging older conceptions of GYPSIES as a cultural group with distinct art forms, language, and social customs. We also see at least one echo class that resurfaces in 2003 in response to cataloguer work in the 900s.

[Figure 5: scatter plot titled "Gypsies 1870-2010". Horizontal axis: Years (1870 to 2010). Vertical axis: Range of Classes in DDC (0 to 1000). Legend: Class Assigned; Classes Possible]

Figure 5. Gypsies classes and books arranged in chronological order and in DDC class number order

The collocative integrity measure for GYPSIES is problematic because from 1965 onward much of the semantics of GYPSIES is derived from number synthesis based on Tables. Our first attempt at calculating this measure can be seen in the table and figure below.

Gypsies      In      Out     Old
1899-2003    17      339     52
Percent      ~4%     ~83%    ~13%

Table 4. Counts and Percentages of Gypsies Books Classed In, Out, and in Old DDC Numbers

[Figure 6: percentage bars on a 0% to 100% axis; series: Old, Out, In]

Figure 6. Visualization of the Percentage of Gypsies Books Classed In, Out, and in Old DDC Numbers (NB: years with no data show no bar)

We have to treat with skepticism the percentage of books Out of classes possible from 1965 onward because of the use of Tables from that point. However, this exploratory data analysis allows us to reflect on the nature of subject in the DDC and, following Langridge's analysis, to gain a clearer understanding of the ramifications of forms of knowledge and topics as analyzed in subject analysis and representation.

Langridge

In his 1989 work Subject Analysis: Principles and Procedures, Langridge outlines an analytical rubric for interpreting a text for its subject matter. He specifically addresses what he sees as the confusion between forms of knowledge, like PHILOSOPHY or NATURAL SCIENCE, and topics, such as HORSES. He outlines a robust set of interpretation guides for the cataloguer and indexer. His work is useful, but is based on anecdotal evidence. With the data we are collecting using the Z39.50 protocol we can begin to lay data next to his work. In this case we see the topics EUGENICS, ANATOMY, and GYPSIES, each with varying degrees of valence. That is, ANATOMY seems to demonstrate a strong valence with art and applied science. We do not get as strong a valence with the other two topics throughout their ontogeny. We see a kind of ambivalence with EUGENICS and GYPSIES. Further, these latter two topics are different in kind. One is a kind of research and practice. The other is a group of people. Perhaps we need to consider these topics differently in the context of a forms of knowledge classification scheme?
Emerging research on the treatment of people in classification schemes has problematized the position of different groups, and here we see the ramifications of scheme change on the positioning of GYPSIES. And as with EUGENICS, we lose the historical context of the term in a long-lived collection based on updates to the semantics to reflect our contemporary vision of people and science. Given this, perhaps we can extend Langridge's atemporal conception of subject analysis to account for this time-sensitive valence of topics to forms of knowledge, and begin to craft policy, practice, and technological innovations to classification work.

References

Langridge, D. W. (1989). Subject analysis: principles and procedures. Bowker.

Miksa, F. (1998). The DDC, the universe of knowledge, and the post-modern library. Albany, NY: Forest Press.

Tennis, J. T. (2007). "Scheme Versioning in the Semantic Web." In Cataloging and Classification Quarterly. 43(3/4): 85-104.