Bulking Up: How Accepted Standards and Evolving Technology Advance Research in Chronicling America

Similar documents
Illinois Newspapers: Preparing Our Past for the Future. Anna FitzSimmons, Amy Sullivan, Tracy Nectoux, Nathan Yarasavage

Cataloging Electronic Resources: General

OCLC Update. Cynthia Whitacre. John Chapman. Sandi Jones. Manager, WorldCat Quality & Partner Content. Product Manager, Metadata Services

Today s WorldCat: New Uses, New Data

CODING TO WORK WITH ALMA AFTER VOYAGER

WORLD LIBRARY AND INFORMATION CONGRESS: 75TH IFLA GENERAL CONFERENCE AND COUNCIL

The digitized Newspaper Collection as National Patrimony of the Russian Federation

Resource Description and Access (RDA) The New Way to Say,

NDL s Digital Collection and Service for Information Access

Making sense of it all - combining digitized analogue collections with e-legal deposit and harvested web sites

Renovating Descriptive Practices: A Presentation for the ARL Fellows. Karen Calhoun OCLC Vice President WorldCat & Metadata Services November 1, 2007

The shelf-free generation

CALE (JOHN G.) PAPERS Mss Inventory

Effects of Civil War Pathfinder

This is a talk I did to Internet Archive Staff about the Open Library project. the amazing site that is

It's Not Just About Weeding: Using Collaborative Collection Analysis to Develop Consortial Collections

Information Standards Quarterly

Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop

Collection Development Duckworth Library

AACR2 s Updates for Electronic Resources Response of a Multinational Cataloguing Code A Case Study March 2002

IGeLU 2017 Content conversations

Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop

The Joint Transportation Research Program & Purdue Library Publishing Services

NYU Scholars for Department Coordinators:

History, Reputation Management, and Value: Discussing the Merits for

Newfound Press: The digital imprint of the University of Tennessee Libraries

A3 and A2 Book Scanners

Library Services Survey

The Frederick R. Karl Archive, Collection: Mss. 2000:1

(Presenter) Rome, Italy. locations. other. catalogue. strategy. Meeting: Manuscripts

News From OCLC Compiled by Susan Westberg SAA Annual, Boston, Massachusetts, August 2004

The Provincial Archives of Alberta. Price List

NLI Update Elhanan Adler, Marina Goldsmith

Dead Links? No Problem. We re In This Together

DDC22. Dewey at ALA Annual. Dewey Decimal Classification News

Astronomy Libraries - Your Gateway to Information. Uta Grothkopf ESO Library

Liz McKeen, Director, Resource Description Division, Published Heritage Branch, Library and Archives Canada

Welsh print online THE INSPIRATION THE THEATRE OF MEMORY:

Running head: HARRISON COLLGE 1

LEWIS GUION DIARY (Mss. 826) Inventory

The Estonian National Bibliography Challenges and Opportunities in the Digital Age

Digitised Content: How we Make It Relevant to Researchers, Teachers and Students

COLLECTION DEVELOPMENT

NYU Scholars for Individual & Proxy Users:

Literary Periods: Pseudonyms and Canadian Literature in English. Dewey Breakfast/Update ALA Annual Conference June 28, 2008

FDC020 FHSU Rare Book Collection Metadata Application Profile v1.1

Annual Report of the IFLA-PAC China Center

PUTTING IT ALL TOGETHER A guide to completing the 4-H Market or Breeding Livestock Record And BOOK

UNL Digital Commons -- An Introduction

Susan K. Reilly LIBER The Hague, Netherlands

Self-publishing services for book authors

Metadata Education and Research Information Clearinghouse (MERIC): Web Prototype

The Writer's Almanac with Garrison Keillor A - A poem each day, plus literary and historical notes from this day in history

Library Science Information Access Policy Clemson University Libraries

LIBER Road Map towards Digitisation

CRIS with in-text citations as interactive entities. Sergey Parinov CEMI RAS and RANEPA

SYRACUSE TELEVISION IMAGES OF AMERICA SYRACUSE TELEVISION IMAGES OF PDF SYRACUSE TELEVISION IMAGES OF AMERICA PDF ADMISSIONS - SYRACUSE UNIVERSITY

Introduction to

On-line literature searching. Outline

More than a feeling: I see my MARC life walking away. Eric Childress Consulting Project Manager OCLC Research

COLLECTION DEVELOPMENT POLICY

Inventory of the Firing Line (Television Program) Broadcast Records. No online items

from physical to digital worlds Tefko Saracevic, Ph.D.

The digital bookshelf. Vigdis Moe Skarstein, National Librarian, Norway

ARCHIVAL DESCRIPTION GOOD, BETTER, BEST

Success Providing Excellent Service in a Changing World of Digital Information Resources: Collection Services at McGill

Catalogs, MARC and Other Metadata

Collecting bits and pieces

Association for Library Collections and Technical Services (A Division of the American Library Association) Cataloging and Classification Section

Case study 1: Google Books at the Complutense University of Madrid CERL Annual Seminar 2012 October , British Library

Digital Signage Policy ADM 13.0

International Image Interoperability Framework (IIIF) Sharing high resolution images across institutional boundaries

PubMed Central. SPEC Kit 338: Library Management of Disciplinary Repositories 113

Library of Congress Portals to the World:

Aggregating Digital Resources for Musicology

2009 CDNLAO COUNTRY REPORT

B Index Term-Genre/Form (R)

SERENO TAYLOR PAPERS Mss. 617 Inventory. Compiled by Luana Henderson

UNESCO/Jikji Memory of the World Prize. Nomination form To be submitted by 31 December 2004

arxiv:cs/ v1 [cs.ir] 23 Sep 2005

A Finding Aid to the Alvord Eiseman research material concerning Charles Demuth, circa , in the Archives of American Art

Georgia Tech Library Catalog

Agenda. Conceptual models. Authority control. Cataloging principles. New cataloging codes

2019 ASCO Educational Book

Evaluating Microfilm: If You Think it Doesn't Matter, Think Again [2007]

C: The Complete Reference By J.K

The digital Beethoven house

Digital Collection Management through the Library Catalog

BOOKS AT JSTOR. books.jstor.org

Institutional Report. For my report, I chose to visit the Ralph Rinzler Folklife Archives located in Washington,

No online items

ANNIE JETER CARMOUCHE PAPERS (Mss. 2878) Inventory

Kapi`olani Community College s Kapi`o Student Newspaper Digitizing Project Report

Comparison of MARC Content Designation Utilization in OCLC WorldCat Records with National, Core, and Minimal Level Record Standards

ICDL FAQS FOR REVISED 3/18/05. What is the International Children s Digital Library (ICDL)? Who is the intended audience for the ICDL?

Capital Works process for Medium Works contracts

Mapping & Spatial History APRIL 26, 2017

Seeing Using Sound. By: Clayton Shepard Richard Hall Jared Flatow

Geoscience Librarianship 101 Geoscience Information Society (GSIS) Denver, CO September 24, 2016

A4 page of print publication, rush order 0.52

Transcription:

Bulking Up: How Accepted Standards and Evolving Technology Advance Research in Chronicling America 2014 IFLA International Newspapers Conference Salt Lake City, Utah, USA Nathan Yarasavage, Deborah Thomas & Georgia Higley Library of Congress

Chronicling America: Historic American Newspapers http://chroniclingamerica.loc.gov/ NDNP / Chronicling America p.2

NDNP / Chronicling America p.3

NDNP / Chronicling America p.4

NDNP / Chronicling America p.5

NDNP / Chronicling America p.6

NDNP / Chronicling America p.7

NDNP / Chronicling America p.8

NDNP / Chronicling America p.9

NDNP / Chronicling America p.10

NDNP / Chronicling America p.11

NDNP / Chronicling America p.12

NDNP / Chronicling America p.13

How did we get from this NDNP / Chronicling America p.14

to this?! NDNP / Chronicling America p.15

National Digital Newspaper Program Partners: 36 institutions 7 million pages now online 1836-1922 NDNP / Chronicling America p.16

National Digital Newspaper Program, 2005- GOALS: To enhance access to historic American newspapers from every state and territory To develop best practices for the digitization of historic newspapers (shared community) Free and open content, available to all NDNP / Chronicling America p.17

NDNP: Built for Sustainability Shared technical specifications Institutional cooperation Data integrity Data management Expect change NDNP / Chronicling America p.18

Shared technical specifications NDNP Technical Guidelines for Applicants (NDNP Tech Specs) http://www.loc.gov/ndnp/guidelines/ NDNP / Chronicling America p.19

Shared technical specifications Stable guidelines since 2005 Changes limited to: Clarifications to existing practice Version updates (ALTO 2.0) Expanding content scope Simplifying some metadata requirements Technical metadata from Microfilm collation now OPTIONAL Section / Edition Labels now OPTIONAL NDNP / Chronicling America p.20

Distributed Project / Shared Effort NDNP / Chronicling America p.21

NDNP Data For every page: For every issue and reel: Archival Image: TIFF Production Image: JPEG 2000 Printable Image: PDF OCR XML File: ALTO METS XML File Descriptive metadata Structural metadata Preservation metadata For newspaper title: MARC record Geographic metadata Time Period Subject metadata http://www.loc.gov/ndnp/guidelines/ NDNP / Chronicling America p.22

METS ALTO NDNP ALTO specification requires: Column level text block zoning and Coordinates to map or highlight text to image files. Chronicling America supports page level access with visual representation of search results. NDNP / Chronicling America p.23

Page Level Access Zoom, pan, and clip tools are available NDNP / Chronicling America p.24

Chronicling America - LC NDNP / Chronicling America p.25

Chronam: Open Source software https://github.com/libraryofcongress/chronam NDNP / Chronicling America p.26

chronam http://nyshistoricnewspapers.org/ http://oregonnews.uoregon.edu/ NDNP / Chronicling America p.27

Chronicling America - API http://chroniclingamerica.loc.gov/about/api/ NDNP / Chronicling America p.28

Chronicling America Bulk Data: OCR Downloads for External Services http://chroniclingamerica.loc.gov/ocr/ NDNP / Chronicling America p.29

Advancing Research with Chronam NDNP / Chronicling America p.30

The Growth of US Newspapers, 1690-2011 NDNP / Chronicling America p.31

Mapping Texts Mapping Texts is a collaboration between scholars, staff and students at Stanford University and the University of North Texas. It is supported by the National Endowment for the Humanities. http://mappingtexts.org/ NDNP / Chronicling America p.32

An Epidemiology of Information: Data Mining the 1918 Influenza Pandemic The goal is to develop methods for combining algorithmic techniques with the interpretive strength of traditional historical and rhetorical analysis in order to help researchers better understand reporting on the 1918 flu pandemic in American and Canadian newspapers. Text mining Topic Modeling Tone Analysis Professor E. Thomas Ewing, Principal Investigator and Project Director, Department of History, Virginia Tech http://www.flu1918.lib.vt.edu/ NDNP / Chronicling America p.33

Viral Networks in 19 th -Century Newspapers Infectious Texts is sponsored by Northeastern University's NULab for Texts, Maps, and Networks and generously funded by the National Endowment for the Humanities' Office of Digital Humanities. The project team includes Professors Ryan Cordell, Elizabeth Maddock Dillon, and David Smith, as well as Ph.D. students Abby Mullen and NATIONAL Matthew Williamson. ENDOWMENT FOR THE HUMANITIES http://www.viraltexts.org/ NDNP / Chronicling America p.34

Chronicling America NDNP / Chronicling America p.35

Weathering the Storm http://chroniclingamerica.loc.gov/lccn/sn83045433/1912-11-30/ed-1/seq-1 / NDNP / Chronicling America p.36

Beyond the Chronicling America Web Site Regular update/highlights information by RSS* or email subscription http://www.loc.gov/rss/ndnp/ndnp.xml; and Recent Additions RSS feed (newspaper titles added) Keep up with what s in Chronicling America! Open-source code for chronam application available on GitHub https://github.com/libraryofcongress/chronam chronam is the Django application that the Library of Congress uses to make its Chronicling America website (a core set of functionality for loading, modeling and indexing NDNP data) Check it out and take it for a spin! Linked data and API** access in Chronicling America RDF/XML views available to Open Archives Initiative Object Reuse and Exchange (OAI-ORE) OpenSearch and Atom APIs for search queries, Bookmarkable URIs for all pages http://chroniclingamerica.loc.gov/about/api/ NDNP Extras! from the NDNP program site http://www.loc.gov/ndnp/extras/ Visualizations, tutorials, podcasts, teaching resources, state project blogs, how Chronicling America is used in other projects and more! NDNP / Chronicling America p.37

Thank you! NDNP Public Web http://www.loc.gov/ndnp/ NDNP Web Service Chronicling America: Historic American Newspapers http://chroniclingamerica.loc.gov Contact us at ndnptech@loc.gov NDNP / Chronicling America p.38