Today s WorldCat: New Uses, New Data

Similar documents
OCLC Update. Cynthia Whitacre. John Chapman. Sandi Jones. Manager, WorldCat Quality & Partner Content. Product Manager, Metadata Services

Success Providing Excellent Service in a Changing World of Digital Information Resources: Collection Services at McGill

The Power of Shared Data and WorldCat & Open Access Ted Fons OCLC

Voyager and WorldCat Local - A Cataloger's Perspective

Presenter: JoEllen Ostendorf, Troup-Harris-Coweta Regional Library

Barbara Glackin Boise State University. A Cataloger s Perspective

Help! I m cataloging a monographic e-resource! What do I need to know from I-Share?

Visualize and model your collection with Sustainable Collection Services

Renovating Descriptive Practices: A Presentation for the ARL Fellows. Karen Calhoun OCLC Vice President WorldCat & Metadata Services November 1, 2007

BOOKS AT JSTOR. books.jstor.org

It's Not Just About Weeding: Using Collaborative Collection Analysis to Develop Consortial Collections

Cataloguing for the world: motivation, method and madness

Leveraging your investment in EAST: A series of perspectives

Why, How, Who, and other Questions

HELIN Cataloging Policies and Procedures Manual

Case study: Pepperdine University Libraries migration to OCLC s WorldShare

Managing content in the electronic world Anne Knight Acting Head of Information Systems / Resources & Facilities Manager

News From OCLC Compiled by Susan Westberg SAA Annual, Boston, Massachusetts, August 2004

SCS/GreenGlass: Decision Support for Print Book Collections

Libraries and MARC Holdings: From Works to Items

Emily Asch Head of Technical Services St. Catherine University

Illinois Statewide Cataloging Standards

California Community Colleges Library/Learning Resources Data Survey

Sha Li Zhang, Planning an Authority Control Project at a Medium-Sized University Library, College & Research Libraries 62, no.

Library Science Information Access Policy Clemson University Libraries

Happily ever after or not: E-book collection usage analysis and assessment at USC Library

Collection Development Duckworth Library

Automation of Processes in the National Library of China: Historical Review and Future Perspective

Overview. Cataloging & Processing BOOKS & LIBRARY SERVICES

Expert Selection & Monographs Use: A Brief History

The Emergence of the Collective Collection: Analyzing Aggregate Print Library Holdings By Lorcan Dempsey

THE AUTOMATING OF A LARGE RESEARCH LIBRARY. Susan Miller and Jean Yamauchi INTRODUCTION

STORYTELLING TOOLKIT. Research Tips

University of Wisconsin Libraries Last Copy Retention Guidelines

New ILS Data Delivery Guidelines

The shelf-free generation

Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

The Danish WorldCat Project

More than a feeling: I see my MARC life walking away. Eric Childress Consulting Project Manager OCLC Research

SpringerLink Inforum, Prague 26 May Frans Lettenström SpringerLink Licensing Executive South & East Europe SPRINGER

White Paper ABC. The Costs of Print Book Collections: Making the case for large scale ebook acquisitions. springer.com. Read Now

Migration to Alma/Primo: A Case Study of Central Washington University

Building Collections Cooperatively: Analysis of Collection Use in the OhioLINK Library Consortium

COLLECTION DEVELOPMENT GUIDELINES

An Environmental Scan. of OCLC Alternatives. A Final Report Prepared for. April Prepared by. JRM Consulting Inc. Carlsbad, CA.

Do we still need bibliographic standards in computer systems?

Questionnaire for Library of Congress Reclassification

Maximizing the Collective Collection (monographs) in Illinois I-Share: assessing our buy to share potential

An Introduction to Springer ebooks: Business Models, Product, and Lessons Learned

What are we getting ourselves into? KU Libraries investigates e-book vendors and publishers

The Joint Transportation Research Program & Purdue Library Publishing Services

Authority Control in the Online Environment

Follow this and additional works at: Part of the Library and Information Science Commons

OUR LIBRARY. Used by scientists, lecturers, experts, students and citizens. The special multidiscipline library of the Bulgarian Academy of Sciences.

UCSB LIBRARY COLLECTION SPACE PLANNING INITIATIVE: REPORT ON THE UCSB LIBRARY COLLECTIONS SURVEY OUTCOMES AND PLANNING STRATEGIES

Collection Rightsizing

LC GUIDELINES SUPPLEMENT TO THE MARC 21 FORMAT FOR AUTHORITY DATA

Maurits van der Graaf Pleiade Management & Consultancy

OCLC Print Archives Disclosure Pilot Final Report April Table of Contents

NLI Update Elhanan Adler, Marina Goldsmith

OCLC's CORC Service: A User's Perspective

The CYCU Chang Ching Yu Memorial Library Resource Development Policy

Monographic Collections Analysis Webinar

E-journals: the OhioLINK experience

Unit 2 Assignment - Selecting a Vendor. ILS 519 Collection Development. Dr. Arlene Bielefield. Prepared by: Lucinda D. Mazza

Introduction to the EUI Library for Historians Wednesday 6 September a.m a.m. Seminar Room 2, Badia Fiesolana

Missouri Evergreen Cataloging Policy. Adopted July 3, Cataloging Policy Purpose. Updating the Missouri Evergreen Cataloging Policy

AN ELECTRONIC JOURNAL IMPACT STUDY: THE FACTORS THAT CHANGE WHEN AN ACADEMIC LIBRARY MIGRATES FROM PRINT 1

Charting Strange Lands: The Acquisitions of e- Books

WorldCat Discovery User Guide 2018

William Shakalis 32 Fellen Road, Storrs, CT Tele. (860)

Georgia Tech Library Catalog

ROBERT E. WEBBER INSTITUTE FOR WORSHIP STUDIES

Consortial Cataloging Guidelines for Electronic Resources: I-Share Survey and Recommendations

UNIVERSITY OF CALIFORNIA LIBRARY STATISTICS July 2011

DART Advanced Library Research

Web of Science The First Stop to Research Discovery

What is Source Pulling? FINDING EVERY SOURCE IN A SUBMITTED ARTICLE

Credo Usage Boosters

THE IMPACT OF COLLECTION WEEDING ON THE ACCURACY OF WORLDCAT HOLDINGS. July, 2002

Jerry Falwell Library RDA Copy Cataloging

INTERLIBRARY LOAN FOR THE REST OF THE STAFF

UNIVERSITY OF CALIFORNIA LIBRARY STATISTICS July 2010

Primo. Michael Cotta-Schønberg. To cite this version: HAL Id: hprints

Search TSU Online Catalog for Print and Electronic

Amazon: competition or complement to OPACs Maja Žumer University of Ljubljana, Slovenia

Self-publishing services for book authors

DRAFT UC VENDOR/SHARED CATALOGING STANDARDS FOR AUDIO RECORDINGS JUNE 4, 2013 EDIT

Running head: HARRISON COLLGE 1

COLLECTION DEVELOPMENT

ACRL STATISTICS QUESTIONNAIRE, INSTRUCTIONS FOR COMPLETING THE QUESTIONNAIRE

Topics in Managing Serials in WMS. Daniel Jolley, Mary Thompson, Frank Newton

Do Off-Campus Students Use E-Books?

Digital Collection Management through the Library Catalog

From Aggregation to Access: Building Digital Collections collectively

Getting the Most from Alma. Patron Driven Acquisitions (PDA)

Journal Title Display and Citation Practices

NDL s Digital Collection and Service for Information Access

Glendale College Library Information Competency Workshops Introduction to the Library for New Students

Influence of Discovery Search Tools on Science and Engineering e-books Usage

Transcription:

OCLC Member Services October 21, 2011 Today s WorldCat: New Uses, New Data Ted Fons Executive Director, Data Services & WorldCat Quality Good Practices for Great Outcomes: Cataloging Efficiencies that Make a Difference Columbus, Ohio

WorldCat: Happy 40 th! 26 August 1971

WorldCat: Many Uses, Many Users WorldCat has changed a lot since it was born in 1971 It is growing g to suit many different uses Let s talk about those uses and what we are doing preserve the quality and reliability of WorldCat GLIMIR Deduplication WMS Full Text Content t More Links WorldCat Local Global Cooperative Cataloging

FIRST WHAT S IN WORLDCAT?

Representing the collective collection 21 2.1 billion items and growing! Physical holdings in WorldCat Licensed digital content/articles in library collections Local library content being digitized 243 million bib records 1.75+ billion holdings 526.6 million records 30 million items (Google, HathiTrust, OAIster) As of 30 September 2011

WorldCat Books 189,421,960 Serials 9,363,065 Visual materials 7,220,470 Maps 3,725,484 Mixed materials 4072021 4,072,021 as of July 1, 2011 Sound recordings 8,437,557 Scores 5736058 5,736,058 Computer files 7,946,335 Total 235,822,950

Multilingual WorldCat Total Records English German French Spanish Japanese Chinese Italian Dutch Russian Latin 30 June 2011 Percentage of records 235.5 m for non-english materials 97.7m 33.0m 21.3m 9.7 m 7.5 m 58.5% 5% 5.9 m 3.9 m 35 3.5 m 30 June 2011 3.2 m 3.2 m

Who uses WorldCat? Libraries: 389.3 million items cataloged 57.8 million records added to WorldCat 10.2 million interlibrary loans arranged 68.4 million cataloging records exported Public, college/university, State and national = 40% Source: OCLC annual report 2009/2010. http://www.oclc.org/news/publications/annualreports/2010/2010.pdf

Who uses WorldCat.org? End Users: Students, Teacher/professor, Business professional 69% Source: Online Catalogs study, PDF p. 16 http://www.oclc.org/us/en/reports/onlinecatalogs/default.htm

Where does the content come from?

WorldCat growth since 1998 250 Millions of records 235 200 197 150 What happened here? 139 100 108 86 50 39 41 44 47 50 52 55 61 67 0 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011

What Happened? WorldCat got very big from a huge variety of sources, formats & cataloging schemes. We started to look at WorldCat quality from the user s perspective.

Online Catalogs: What Users and Librarians Want End-Users expect online catalogs: to look/behave like popular Web sites to have summaries, abstracts, tables of contents to link directly to needed information Librarians expect online catalogs: to help staff carry out work responsibilities to have accurate, structured data to exhibit library principles of organization April 2009 http://www.oclc.org/us/en/reports/onlinecatalogs/default.htm

Librarian/Staff Results: Highlighted Differences End-User Results: Recommended Enhancements 9 Recommended enhancements to WorldCat Total end-user responses 1 4 Source: Online Catalogs study, PDF p. 51

Composite view of what end users and librarians want and what we are doing about it Basis of 2009-2010 WorldCat Quality Program Source: Online Catalogs study, PDF p. 52

SO, WHAT DID WE DO FOR END USERS?

WorldCat growth in electronic and digital content 45,000,000 40,000,000 35,000,000 30,000,000 25,000,000 Digital Items 20,000,000 ebooks 15,000,000 10,000,000 5,000,000 0 Jun 2006 Jun 2007 Jun 2008 Jun 2009 Jun 2010 Jan 2011

Links to content: Load metadata for ebooks from mass digitization, aggregator g and publisher partners into WorldCat 7.1 million ebooks records added d Some additions Project Gutenburg Sage BioDiversity Heritage Library MyiLibrary Google Book Search Google Book Search HathiTrust

Central index of citations leading to full text 500,000,000 Article Citations 450,000,000 400,000,000 350,000,000 300,000,000 250,000,000 200,000,000 Article Citations 150,000,000 100,000,000 50,000,000 0 Jun-2006 Jun-2007 Jun-2008 Jun-2009 Jun-2010 Jan-2011

Links to content: Adding article level l metadata t to WorldCat Local As of June 30, 2011 WorldCat Local Central Index Article Records WorldCat Local Central Index Journal Titles Indexed 530,886,124 85,834

Better links: brought to you by the WorldCat knowledge base One place to manage a library s electronic holdings--both ejournals and ebooks--at the network level (i.e., in the cloud) Collection or package level management of holdings Controls display of journal/book/article level links in WorldCat Local Enables resource sharing of e-articles (as licenses allow) WorldCat knowledge base

WHAT DID WE DO FOR LIBRARIANS?

Recommendations from librarian survey Merge duplicate bibliographic records Enrichment TOCs, summaries, cover art work with content suppliers, use APIs, etc. Make it easier to make corrections to records (fix typos; do upgrades); social cataloging experiment Wikipedia More emphasis on accuracy/currency of library holdings

MERGE DUPLICATE BIBLIOGRAPHIC RECORDS

Duplicates in WorldCat Real Duplicates: Delete them actually, we merge them so we can keep unique data Apparent duplicates Cluster them

Removing Real Duplicates http://www.oclc.org/worldcat/catalog/quality/ddr

Duplicate Detection and Resolution (DDR) of WorldCat bibliographic records Reimplementation and expansion of previous software - Now handles all types of material (not just books) Fully operational in early 2010 in 2 separate processes Walking the database (Complete September 2010) Selected records from each day s daily journal files (Ongoing) The result is continuous cleaning of WorldCat

MANAGING APPARENT DUPLICATES & IMPROVING WORKS

Managing Apparent Duplicates GLIMIR = Global LIbrary Manifestation IdentifieR. Clusters manifestations and assigns unique identifier to each manifestation Clusters records for parallel records (differing languages of cataloging for the same manifestation) ti and for reproductions Re-clusters FRBR work sets

Apparent duplicates in WorldCat.org

Future Enhancements for Discovery Non-English browser setting: Show the end user the record in their preferred language. Improved FRBR worksets will provide a better Work view in WorldCat.org (coming in 2012)

ENRICHING RECORDS

Capturing more data through batchloading 160,000,000000 000 July 2009-July 2011 140,000,000 63% 120,000,000 100,000,000 80,000,000 2009 July 60,000,000 40,000,000 34% 145% 430% 41% 87% 2011 July 20,000,000 0 LC-type call number Dewey-type call number Other call number Contents/Summaries Subject terms URLs

Enriched content from partners 43 million data elements under contract Over 9.5 million book jacket covers, summaries, 1 st chapters Over 14.4 million ToCs Over 360,000 music album covers (added Aug. 2010) Over 1.5 million non-us book covers (added Nov. 2010)

Mining WorldCat: Sharing data elements across a FRBR Work Set Work The novel Expression Original Text Translation Critical Edition Manifestation Summary Classification Subject Terms Records

MAKE IT EASIER TO CORRECT RECORDS: SOCIAL CATALOGING

Making it easier to correct the records: WorldCat community maintenance Activity it by Member Libraries i during FY2010 & FY2011 TOTAL FY10 TOTAL FY11 Expert Community 271,626 304,759 Database Enrichment 198,084 235,533 Minimal-Level Upgrade 176,618 194,634 Enhance Regular 176,491 155,713 Enhance National 45,451451 47,876 CONSER Authentication 15,705 21,208 CONSER Maintenance 61,949 57,917 TOTAL 945,924 1,017,640

WORLDCAT STEWARDSHIP BY OCLC STAFF

OCLC Staff Maintenance Activity in FY 2010 TOTAL FY10 TOTAL FY11 Bibliographic Records Replaced 12,511,044 23,305,162 Records Merged 150,992 301,667 Authority Records Created 1,977 119 Authority Records Replaced 94,744 797 CIP Records Upgraded 16,145 5,742

Other OCLC enhancements to WorldCat a couple of recent examples Updated subject headings Recent example: Cookery changed to Cooking Over 314,000 records affected ~75 new subject headings proposed to Library of Congress Adding Linking ISSNs (ISSN-L) Added to about 800,000 records thus far Provider-neutral records About 9.5 million records touched thus far

Additional OCLC staff enhancements Adding non-latin cross-references to authority records Almost 500,000 records affected Non-Latin forms derived from WorldCat records Authority records for geographic names Indirect subdivision forms added (about 90,000 records) Geographic coordinates added to field 034 (more than 78,000 records)

Quality enhancements looking ahead Looking ahead More automated enrichment of bibliographic records from mining FRBR work set data (summer/fall 2011)

More Work on Quality Another quality survey to begin. WorldCat Quality Whitepaper WorldCat quality whitepaper Formal definitions of data quality & consistency. September, 2011 http://www.oclc.org/us/en/reports/worldcatquality

NEW USES FOR WORLDCAT

New Uses for WorldCat Progressing from a cataloging and ILL database to Discovery Management System: Circulation Acquisitions Syndication to Google

Discovery

WorldCat Local more than just books Maps: 3.6 million Scores: 5.6 million Visual materials: 7.7 million Sound recordings: 8.5 million Serials: 9.5 million ebooks: 11.9 million Conference proceedings: 12.1 million Institutional repository records: 14.3 mil Archival materials: 16.1 million Theses and dissertations: 16.7 million Web/Internet resources: 28.2 million Articles 614 million Books 198 million As of Sept 1, 2011

Inbound OpenURL link resolution Print holdings display based on WorldCat knowledge base holdings

Evaluative content Goal: Offer a robust collection to enhance discovery How: Aggregate evaluative content t from multiple l sources Status: 44+ million data elements included 6.5 million book jacket covers 14.44 million ToCs 3.1 million summaries 294 thousand author biographies 291 thousand first chapters 398 thousand million album covers 1.7 million user reviews 360 thousand album reviews 12 million ratings

Unified request options for Delivery

Mobile access included

Quantitative results: Increased usage of resources ILL books 270 % sage creased u In ILL 101 % resolver hits consortial borrowing consortial resolver borrowing hits 74 % 80 % 59 % ILL 58 % articles circ 21 % 17 % University of Washington Willamette University Portland Community College

Management Systems

What s ahead: OCLC Web Scale Management Services The first cooperative management service for libraries. Lower total cost of ownership Simplify back-office operations Reduce support costs for disparate systems Customize and extend services through an open, extensive platform Free staff time for high-priority services Circulation Acquisitions License Management

New Data for WMS Circulation Patron/Identity Item Circ History Fines Holds Loan Rules Acquisitions Vendors Funds Invoices Payments License Management License terms Knowledge base

What is new? Circulation Transaction data at the network level Acquisitions Cooperatively managed vendor data License Management Cooperatively managed license terms Cooperatively manged knowledge base

Data Required for Discovery & Management Systems Local Data Fields: Required to record the full bibliographic and copy-level details Many libraries require this to provide a complete discovery and management system environment

Data Required for Discovery & Management Systems http://www.oclc.org/us/en/support/documentation/batchprocessing/using/storelocalbibdataforwclandwms.pdf

Discussion