FACET ANALYSIS IN UDC
Questions of structure, functionality and formality
Aida Slavic, UDC Consortium, The Netherlands
Sylvie Davies, Robert Gordon University, Aberdeen, UK
CONTENT
  Statement of the problem(s)
  Introduction to Otlet's classification system
  Fundamentals of synthesis in UDC
  Commitment to facet analysis
  Faceted classification rationale
  Types of faceted structure introduced in UDC
  Discussion of issues
  Conclusion
STATEMENT OF THE PROBLEM(S)
Lack of connection throughout UDC development between:
  requirements of an overarching facet analytical theory
and
  requirements of computerised systems in terms of:
    notational representation
    data structure
CONCEPT-BASED APPROACH AS DESIGNED BY OTLET IN THE 1900s
[Figure: drawing, c. 1910 (courtesy of Mundaneum)]
NOTATION
It should reflect class structure, but it is also an indexing language comprising:
  vocabulary, with labels as index terms
  syntax, with rules for building complex expressions
It brings together semantics and syntax
It is the main source of structural data and the element most relevant to classification data modelling, automation and management
It is important to represent all components of synthesised notation
It should not be treated as a text string in database design
FACETED CLASSIFICATION
Based on facet analysis, which can be either:
  intuitive, guided by common sense and principles of formal logic
  based on facet analytical theory (FAT)
Notation tends to be viewed separately from the classification structure (BC2)
We do not have a data model for fully faceted classification
ANALYTICO-SYNTHETIC CLASSIFICATION
It enables notation to be composed and decomposed in the indexing/retrieval process
All discrete component concepts of synthesized notation need to be coded to ensure full control of data elements
It is often linked to the notion of facets because its main purpose is to enable the combination of mutually exclusive properties that may need to be combined in describing a subject
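The compose/decompose idea can be sketched in a few lines of Python. This is a minimal illustration only, not the UDC Consortium's parser: the token grammar below covers just a small subset of UDC signs, and the names `Component`, `decompose` and `compose` are hypothetical. The sample notation is the combination 821.111(417)-1(082.2) used elsewhere in these slides.

```python
import re
from typing import List, NamedTuple


class Component(NamedTuple):
    kind: str       # which table the element comes from (assumed labels)
    notation: str   # the element exactly as written in the synthesized number


# Assumed, simplified grammar: only a handful of UDC signs are recognised.
_TOKEN = re.compile(
    r"""
      (?P<form>\(0[\d.]*\))          # common auxiliary of form, e.g. (082.2)
    | (?P<place>\([1-9][\d.]*\))     # common auxiliary of place, e.g. (417)
    | (?P<language>=[\d.]+)          # common auxiliary of language, e.g. =111
    | (?P<special>-[\d.]+)           # hyphen special auxiliary, e.g. -1
    | (?P<relation>:)                # relation sign between two numbers
    | (?P<main>\d[\d.]*)             # main table number, e.g. 821.111
    """,
    re.VERBOSE,
)


def decompose(notation: str) -> List[Component]:
    """Split a synthesized notation into its coded components."""
    components = []
    pos = 0
    while pos < len(notation):
        match = _TOKEN.match(notation, pos)
        if match is None:
            raise ValueError(f"unrecognised sign at position {pos}: {notation!r}")
        components.append(Component(match.lastgroup, match.group()))
        pos = match.end()
    return components


def compose(components: List[Component]) -> str:
    """Rebuild the notation string from its components."""
    return "".join(c.notation for c in components)


parts = decompose("821.111(417)-1(082.2)")
for part in parts:
    print(part.kind, part.notation)
```

Storing the list of components rather than the flat string is what keeps each element individually searchable; the string can always be regenerated with `compose`.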
FACET REPRESENTATION AND SYNTHESIS IN UDC
BROAD FACET CATEGORIES
  MAIN TABLES: Discipline 1, Discipline 2, Discipline 3
  COMMON AUXILIARIES: Persons, Language, Time, Form, Place
MAIN FACET
  81 Linguistics and languages
  811 Languages
  811.111 English language
  811.112 West Germanic languages (other than English)
  811.112.2 German language
  811.112.5 Dutch
  811.113 North Germanic languages
  811.113.4 Danish language
  811.113.5 Norwegian language
  811.113.6 Swedish language
  811.12 Italic languages
Place (1/9)
  (4/9) Countries and places of the modern world
  (4) Europe
  (410) United Kingdom
  (430) Germany
  (436) Austria
  (437.3) Czech Republic
  (437.5) Slovakia
  (438) Poland
Special Auxiliary Subdivision 81-1 Schools and methods in linguistics
  81-11 Schools and trends in linguistics
  81-112 Diachronic linguistics
  81-114 Synchronic linguistics
  81-13 Methodology of linguistics
  81-132 Method of string analysis
  81-139 Other methods
Special Auxiliary Subdivision 81'01/'08 Origins and periods of languages
  81'01 Old period. Archaic period
  81'02 Classical period
  81'04 Middle period
  81'06 Modern period
  81'08 Revived language
Special Auxiliary Subdivision 81'1/'4 Subject fields and facets of linguistics
  81'1 General linguistics
  81'2 Theory of signs. Theory of translation. Standardization
  81'3 Mathematical and applied linguistics. Phonetics. Graphemics. Grammar. Semantics. Stylistics
  81'4 Text linguistics. Discourse analysis. Typological linguistics
811.133.1'276.6:34 Language of French lawyers
COMMITMENT TO FACETING
Decentralized development of UDC (international special subject committees) leads to a multiplicity of approaches in class presentation and synthesis
1960s: FID Central Classification Committee (CCC) endorsed facet analysis as a model for structuring the scheme (without a specific theory or procedure in place)
CCC continues its search for an appropriate theoretical framework and a more formal model that would enable better management of UDC growth
RATIONALE
Shorter and more rigorously structured schedules, with a logical and predictable principle of organization of facets and their notational presentation in all areas of knowledge
Tables containing simple concepts that would support their unique identification, less repetition and compositionality
Expressive notational system that fully supports automation (parsing) and seamless linking of notational elements and natural language
TWO APPROACHES TO FACETIZATION
1980s: facets as relational tables (81 Linguistics and 82 Literature)
  Facet analysis logical but not based on any specific theoretical principle
  Facets with simple concepts, no enumeration of combinations (compound concepts appear in examples of combination only)
  All needed combinations appear in the process of indexing
1990s: facets with enumerated compound and complex classes (2 Religion)
  Structure based on facet analytical theory developed for Bliss (BC2)
  Table of simple concepts followed by a selection of enumerated compound subjects listed as a main hierarchy
  Schedules offer a rich selection of useful combinations ready to be used
EXAMPLE OF A RELATIONAL TABLE APPROACH
MAIN CLASS
  82 Literature
  821 Literatures of individual languages
  821.1/.9 =1/=9 [Divide as language table!]
  821.111 English literature
  Example of combination:
    821.111(417)-1(082.2) English literature (Ireland) poetry anthology
SPECIAL AUXILIARY FACETS
  82.02/.09 Theory, study and technique of literature
  82.02 Literary schools, trends and movements
  82.09 Literary criticism. Literary studies...
  82-1/-9 Literary forms, genres
  82-1 Poetry. Poems. Verse
  82-2 Drama. Plays
  82-3 Fiction. Prose narrative
  82-4 Essays
  82-5 Oratory. Speeches...
COMMON AUXILIARY FACETS
  PLACE
    (4) Places of the modern world
    (41) Countries of the British Isles
    (417) Republic of Ireland...
  FORM
    (08) Collected and polygraphic works
    (082) Collections of works by several authors
    (082.2) Anthologies. Selections. Excerpts...
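The relational-table approach can be illustrated with a small Python sketch in which each facet is a separate lookup table of simple concepts and a combination is produced only at indexing time. The table contents are copied from the slide above; the function name `synthesize`, the variable names and the fixed order main class, place, literary form, form are assumptions made for this illustration (the order simply mirrors the slide's example).

```python
from typing import Dict, List, Optional, Tuple

# Each facet is its own table of simple concepts (entries from the slide).
MAIN: Dict[str, str] = {"821.111": "English literature"}
PLACE: Dict[str, str] = {"(417)": "Republic of Ireland"}
GENRE: Dict[str, str] = {"-1": "Poetry. Poems. Verse", "-2": "Drama. Plays"}
FORM: Dict[str, str] = {"(082.2)": "Anthologies. Selections. Excerpts"}


def synthesize(
    main: str,
    place: Optional[str] = None,
    genre: Optional[str] = None,
    form: Optional[str] = None,
) -> Tuple[str, List[str]]:
    """Combine simple concepts into one notation plus its component captions.

    A KeyError is raised if a code is not present in its facet table,
    so only concepts that actually exist in the tables can be combined.
    """
    notation = ""
    captions: List[str] = []
    # Assumed combination order, following the example on the slide.
    for table, code in ((MAIN, main), (PLACE, place), (GENRE, genre), (FORM, form)):
        if code is None:
            continue
        captions.append(table[code])
        notation += code
    return notation, captions


notation, captions = synthesize("821.111", place="(417)", genre="-1", form="(082.2)")
print(notation)
print(captions)
```

No compound class is stored anywhere in the tables; the combination exists only as the result of the call.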
UDC MASTER REFERENCE FILE DATABASE (CREATED IN 1992)
UDC MRF assumed a relational table presentation of facets
The main field in a UDC MRF database record is the UDC number:
  this field should contain only simple UDC notation
  each UDC notation comes from a certain table (hence the "table code" field)
  the UDC notation in combination with the table code would be sufficient to produce automatic filing and ordering of UDC classes
  the UDC notation is a unique identifier of a class
Pre-combined (compound and complex) UDC numbers would appear only in the field "example(s) of combination", where elements of the combination can be managed to a certain extent
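How a notation plus a table code could drive filing can be sketched as follows. This is not the actual MRF record layout: the class `MrfRecord`, its field names, the table codes and the ranking in `TABLE_RANK` are all hypothetical. The only rule taken as given is that UDC numbers file as decimal fractions, so within one table the digits are compared left to right and the dots are ignored.

```python
import re
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple

# Hypothetical table codes and their relative filing rank (an assumption).
TABLE_RANK = {"place": 0, "form": 1, "main": 2}


@dataclass(frozen=True)
class MrfRecord:
    notation: str                      # simple notation only; the unique identifier
    table_code: str                    # which table the notation comes from
    caption: str
    examples: Tuple[str, ...] = ()     # pre-combined numbers live only here


def filing_key(record: MrfRecord) -> Tuple[int, str]:
    """Order by table first, then by the digits read as a decimal fraction."""
    digits = re.sub(r"\D", "", record.notation)
    return (TABLE_RANK[record.table_code], digits)


def build_index(records: Iterable[MrfRecord]) -> Dict[str, MrfRecord]:
    """Index records by notation, rejecting duplicate identifiers."""
    index: Dict[str, MrfRecord] = {}
    for record in records:
        if record.notation in index:
            raise ValueError(f"duplicate notation: {record.notation}")
        index[record.notation] = record
    return index


records = [
    MrfRecord("811.12", "main", "Italic languages"),
    MrfRecord("(410)", "place", "United Kingdom"),
    MrfRecord("811.113.6", "main", "Swedish language"),
    MrfRecord("(4)", "place", "Europe"),
    MrfRecord("811.111", "main", "English language"),
]
for record in sorted(records, key=filing_key):
    print(record.notation, record.caption)
```

Because the key is built from the table code and the notation alone, no separate sequence number has to be maintained when classes are added or cancelled.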
EXAMPLE OF FACETING ALLOWING ENUMERATION (1990s)
MAIN TABLE
  2 Religion. Theology
  27 Christianity
  271 Eastern Christianity
  271.2 Orthodox Church
  271.2-1 The Orthodox Tradition
  271.2-284.7-247 The Gospel Book
    Example(s) of combinations:
    271.2-284.7-247-536.36 Prostration before the Gospel Book
  271.2-284 Doctrinal statements. Symbolical Books
  271.2-472-022.43 The Longer Catechism
  271.2-523.46 Side rooms, chambers...
UDC MRF was designed for this kind of presentation:
  27 Christianity
  271 Eastern Christianity
  271.2 Orthodox Church
    Example(s) of combinations:
    271.2-1 The Orthodox Tradition
    271.2-284.7-247 The Gospel Book
    271.2-284.7-247-536.36 Prostration before the Gospel Book
    271.2-284 Doctrinal statements. Symbolical Books
    271.2-472-022.43 The Longer Catechism
    271.2-523.46 Side rooms, chambers
SPECIAL AUXILIARY FACETS
  2-1 Theory and philosophy of religion. Nature of religion
  2-2 Evidences of religion, e.g.
    2-23 Sacred books. Scriptures. Religious texts
    2-25 Secondary literature. Pseudo-canonical works
    2-27 Critical works
    2-28 Other religious texts
  2-3 Persons in religion
  2-4 Religious activities. Religious practice
  2-5 Worship broadly. Cult. Rites and ceremonies
  2-6 Processes in religion
  2-7 Religious organization and administration
  2-8 Religions characterised by various properties
  2-9 History of the faith, religion, denomination
COMMON AUXILIARIES
  -02 Common auxiliaries of properties
  -021 Properties of existence
  -022 Properties of magnitude, degree, quantity, number, temporal values...
FURTHER ISSUES (1)
Contracted captions
  2-23 Sacred books. Scriptures [special auxiliary]
  27 Christianity
  27-23 Bible
  rather than:
  27-23 Christianity -- Sacred books -- Bible
Compound class used as basis for hierarchical subdivision
  233 Hinduism
  233-13 The Holy. Brahma Absolute being
  233-14 God(s) and goddess(es)
  233-158D Devi
  233-158G Ganesh
  233-158K Kali
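One way to see what a contracted caption leaves out is to rebuild the full verbal form from the captions of the components. The sketch below assumes a hypothetical helper `full_caption`, assumes that special auxiliaries are recorded under the root class 2, and assumes that only the first phrase of each component caption is carried into the chain, as in the slide's own expanded form.

```python
from typing import Dict

# Captions as given on the slide.
CAPTIONS: Dict[str, str] = {
    "27": "Christianity",
    "2-23": "Sacred books. Scriptures",   # special auxiliary
    "27-23": "Bible",                     # contracted caption of the compound
}


def full_caption(
    main: str,
    auxiliary: str,
    captions: Dict[str, str],
    aux_root: str = "2",
) -> str:
    """Expand a contracted caption into main -- auxiliary -- compound."""
    def first_phrase(caption: str) -> str:
        # Keep only the leading phrase, e.g. "Sacred books" (an assumption).
        return caption.split(". ")[0]

    chain = [
        first_phrase(captions[main]),
        first_phrase(captions[aux_root + auxiliary]),
        captions[main + auxiliary],
    ]
    return " -- ".join(chain)


print(full_caption("27", "-23", CAPTIONS))
```

The expansion is only possible when the components of the compound are coded separately, which is the point made earlier about not treating notation as a text string.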
FURTHER ISSUES (2)
Differential facets (i.e. introducing further class specifications under particular subjects) not distinguished from the rest of the hierarchy
From:
  2-265.3 Epics and sagas [special auxiliary]
To:
  233 Hinduism
  233-265 [omitted combination]
  233-265.3 Itihasa. Epics and sagas
  233-265.32 Ramayana
  233-265.33 Mahabharata
  233-265.34 Bhagavadgita
  233-265.35 Puranas
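A data model could flag differential facets explicitly by checking each enumerated class against the table of special auxiliaries. The following is a rough sketch under stated assumptions: the function name `origin`, the labels "auxiliary", "differential" and "unknown", and the rule of walking up the notation one character at a time are illustrative choices, not part of the UDC MRF.

```python
from typing import Dict, Optional, Tuple

# Special auxiliary from the slide, keyed by the part that follows the class number.
SPECIAL_AUXILIARIES: Dict[str, str] = {"-265.3": "Epics and sagas"}


def origin(
    compound: str,
    main_class: str,
    special_auxiliaries: Dict[str, str],
) -> Tuple[str, Optional[str]]:
    """Say whether a compound is a plain combination or a differential extension.

    Returns ("auxiliary", suffix) when the suffix is itself a special auxiliary,
    ("differential", stem) when the suffix extends one, else ("unknown", None).
    """
    if not compound.startswith(main_class + "-"):
        raise ValueError(f"{compound} is not a hyphen subdivision of {main_class}")
    suffix = compound[len(main_class):]
    if suffix in special_auxiliaries:
        return ("auxiliary", suffix)
    stem = suffix
    while len(stem) > 1:
        # Drop the last character, and a trailing dot if one is left behind.
        stem = stem[:-1].rstrip(".")
        if stem in special_auxiliaries:
            return ("differential", stem)
    return ("unknown", None)


for number in ("233-265", "233-265.3", "233-265.32"):
    print(number, origin(number, "233", SPECIAL_AUXILIARIES))
```

With such a flag stored in the record, 233-265.32 Ramayana could be told apart from an ordinary combination of 233 with a general special auxiliary.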
OVERALL...
The relational model of facet analysis provides schedules that are easier to manage online; however, it was introduced without:
  an established theoretical framework and guidelines for facet analysis that should be applied to the UDC as a whole
  proper consideration of the lack of searching access points in a scheme which provides only a few examples of pre-combined concepts
The Bliss-based approach offers a much needed theoretical framework, but it was introduced:
  ignoring the existing UDC MRF data model
  without guidelines about the principles and extent of enumerated combinations to be presented in faceted schedules of this type
FURTHER ISSUES
Lack of guidelines on:
  facet analysis procedure
  management and modelling of differential facets in the UDC MRF
  standardization of notational representation
  semantic analysis/factoring of compound subjects (e.g. when a compound should be represented as a simple class, such as Eastern religions or Ancient religions, and when as a combination)
Also how the revised classes relate/combine with the rest of the UDC vocabulary:
  when to introduce a concept and when to use an existing concept from the UDC schedules?
  indicate how, when and why to use examples of combination
  indicate rules for verbal representation of pre-combined classes
CONCLUSION
Facet analysis ought to be based on a theoretical framework valid for the system as a whole:
  to impose predictability in the schedule composition and organization
  to impose rigorous principles for hierarchical subdivisions
Notational system should be formalized in relation to faceted structure:
  to formalize syntax rules
  to support synthesis
  to improve semantic linking
  to support syndetic structure
We should provide:
  adequate data management system
  documentation on revision policy, guidelines and procedures
Thank you