TEXT ENCODING INITIATIVE

Text Encoding Initiative
Background and Context

Edited by

Nancy Ide
Department of Computer Science, Vassar College, Poughkeepsie, NY, USA

and

Jean Veronis
Laboratoire Parole et Langage, CNRS & Université de Provence, Aix-en-Provence, France

Reprinted from Computers and the Humanities, Volume 29, Nos. 1, 2 & 3 (1995), edited by Glyn Holmes
(With the addition of an SGML/TEI Bibliography)

Springer Science+Business Media, B.V.

Library of Congress Cataloging-in-Publication Data

Text encoding initiative: background and contexts / edited by Nancy Ide and Jean Veronis.
p. cm.
"Reprinted from Computers and the humanities 29: 1-3, 1995."
ISBN 978-0-7923-3704-1
ISBN 978-94-011-0325-1 (ebook)
DOI 10.1007/978-94-011-0325-1
1. Text processing (Computer science) 2. Coding theory. I. Ide, Nancy M. II. Veronis, Jean. III. Computers and the humanities.
QA76.9.T48T47 1995
005.7'2--dc20
95-31289

ISBN 978-0-7923-3704-1

Printed on acid-free paper

All Rights Reserved
© 1995 Springer Science+Business Media Dordrecht
Originally published by Kluwer Academic Publishers in 1995
Softcover reprint of the hardcover 1st edition 1995

No part of the material protected by this copyright notice may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying, recording or by any information storage and retrieval system, without written permission from the copyright owner.

Table of Contents

CHARLES GOLDFARB / Preface
NANCY IDE and JEAN VERONIS / Introduction

PART I: GENERAL TOPICS
NANCY IDE and C.M. SPERBERG-McQUEEN / The Text Encoding Initiative: Its History, Goals, and Future Development
C.M. SPERBERG-McQUEEN and LOU BURNARD / The Design of the TEI Encoding Scheme
LOU BURNARD / What is SGML and How Does It Help?

PART II: DOCUMENT-WIDE ENCODING ISSUES
HARRY GAYLORD / Character Representation
RICHARD GIORDANO / The TEI Header and the Documentation of Electronic Texts
DOMINIC DUNLOP / Practical Considerations in the Use of TEI Headers in Large Corpora

PART III: ENCODING SPECIFIC TEXT TYPES
DAVID CHISHOLM and DAVID ROBEY / Encoding Verse Texts
JOHN LAVAGNINO and ELLI MYLONAS / The Show Must Go On: Problems of Tagging Performance Texts
ROBIN COVER and PETER ROBINSON / Textual Criticism
DANIEL GREENSTEIN and LOU BURNARD / Speaking with One Voice: Encoding Standards and the Prospects for an Integrated Approach to Computing in History
STIG JOHANSSON / The Encoding of Spoken Texts
ALAN MELBY / E-TIF: An Electronic Terminology Interchange Format
NANCY IDE and JEAN VERONIS / Encoding Dictionaries

PART IV: SPECIAL ENCODING MECHANISMS
STEVEN J. DeROSE and DAVID DURAND / The TEI Hypertext Guidelines
D. TERENCE LANGENDOEN and GARY F. SIMONS / Rationale for the TEI Recommendations for Feature-Structure Markup
DAVID BARNARD, LOU BURNARD, JEAN-PIERRE GASPART, LYNNE A. PRICE, C.M. SPERBERG-McQUEEN, GIOVANNI BATTISTA VARILE / Hierarchical Encoding of Text: Technical Problems and SGML Solutions

SGML/TEI Bibliography by Robin C. Cover

Computers and the Humanities 29: 1, 1995.

Preface

Charles F. Goldfarb
Saratoga, California

If asked for a sure recipe for chaos I would propose a project in which several thousand impassioned specialists in scores of disciplines from a dozen or more countries would be given five years to produce some 1300 pages of guidelines for representing the information models of their specialties in a rigorous, machine-verifiable notation. Clearly, it would be sociologically and technologically impossible for such a group even to agree on the subject matter of such guidelines, let alone the coding details. But just as clearly as the bumblebee flies despite the laws of aerodynamics, the Text Encoding Initiative has actually succeeded in such an effort.

The TEI Guidelines are extraordinary. Even if they were never adopted they would stand as a significant contribution to scholarship for their detailed analysis of the information sets of a huge range of complex text types. But in fact they have already been implemented, both by scholars for research and interchange and by commercial publishers for the publication of linguistic and humanistic works.

I am delighted that my invention, the Standard Generalized Markup Language, was able to play a role in the TEI's magnificent accomplishment, particularly because almost all of the original applications of SGML were in the commercial and technological realms. It is reasonable, of course, that organizations with massive economic investments in new and changing information should want the benefits of information asset preservation and reuse that SGML offers. It is gratifying that the TEI, representing the guardians of humanity's oldest and most truly valuable information, chose SGML for those same benefits.

The vaunted "information superhighway" would hardly be worth traveling if the landscape were dominated by industrial parks, office buildings, and shopping malls. Thanks to the Text Encoding Initiative, there will be museums, libraries, theaters, and universities as well.