Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop

Similar documents
Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop

Happily ever after or not: E-book collection usage analysis and assessment at USC Library

The Joint Transportation Research Program & Purdue Library Publishing Services

Digital Initiatives & Scholar Commons

POST-SUBMISSION INFORMATION PACKET

Metadata Education and Research Information Clearinghouse (MERIC): Web Prototype

Preserving Music Recitals before they fade away

FORMAT & SUBMISSION GUIDELINES FOR DISSERTATIONS UNIVERSITY OF HOUSTON CLEAR LAKE

E-Book Readers: Exploration and Experiences

Creating Digital Access to the OCAD University Zine Library through Artstor Shared Shelf MARTA CHUDOLINSKA LEARNING ZONE LIBRARIAN

Osgoode Digital Commons: Digital Repository Success Stories

Inventory Software Trials

Transitioning Your Institutional Repository into a Digital Archive

Leveraging your investment in EAST: A series of perspectives

Introduction to

Nicola Visits the Library. For my library visit, I traveled to beautiful Point Breeze in Pittsburgh to speak with

Usage metrics: tools for evaluating science collections

The Organization and description of the UNLV archives

Case study: Pepperdine University Libraries migration to OCLC s WorldShare

LMS301: Reference Management Software (Mendeley)

OCLC's CORC Service: A User's Perspective

BARC Tips for Tiny Libraries

Kapi`olani Community College s Kapi`o Student Newspaper Digitizing Project Report

Today s WorldCat: New Uses, New Data

Guide to the Dennis Lee Askew Papers

EndNote Menus Reference Guide. EndNote Training

FDC020 FHSU Rare Book Collection Metadata Application Profile v1.1

Faculty Governance Minutes A Compilation for online version

The Power of Shared Data and WorldCat & Open Access Ted Fons OCLC

Ad Specs. Digital Banner Ad Specs DESKTOP BROWSER CREATIVE SPECIFICATIONS CREATIVE DEADLINES. 300X250 pixels pixels.

Florida State University Libraries

A More-Product-Less-Process Approach to Cataloging Recordings

UNL Digital Commons -- An Introduction

Library Liaison Advisory Group Fall Quarter Meeting Minutes Tuesday, October 14, 2008 Tuesday, November 11, 2008 Thursday, November 20, 2008

Credo Usage Boosters

Mainstreaming University Publications: Designing Collaboration Across Library Units for Discovery and Access

Objective: Students will learn about the differences between a library and an archive and the different sources that might be available in each.

Use and Usability in Digital Library Development

LibGuides & LibAnswers & LibAnalytics

administration access control A security feature that determines who can edit the configuration settings for a given Transmitter.

EndNote on Windows: Class Notes. EndNote Training

Cisco StadiumVision Defining Channels and Channel Guides in SV Director

An Introduction to Springer ebooks: Business Models, Product, and Lessons Learned

LIBER Road Map towards Digitisation

What is Source Pulling? FINDING EVERY SOURCE IN A SUBMITTED ARTICLE

Measuring Your Research Impact: Citation and Altmetrics Tools

Using the Book Expert in Scholastic Achievement Manager

This guide covers the set-up of Oboe 3.5 and selling POS products on your website.

Exposing Hidden Collections Using Interdepartmental Collaboration: The Serials Collection in the Southern Folklife Collection

(Slide1) POD and The Long Tail

OCLC Update. Cynthia Whitacre. John Chapman. Sandi Jones. Manager, WorldCat Quality & Partner Content. Product Manager, Metadata Services

Development of Reference Management System in Cloud Computing Environment

USER DOCUMENTATION. How to Set Up Serial Issue Prediction

Guide to the Arthur B. and Sally Bruce Kinsolving papers (bulk )

Effects of Civil War Pathfinder

The APA Style Converter: A Web-based interface for converting articles to APA style for publication

ADAPTING IN A PERIOD OF IMMENSE CHANGE: Strategies you might consider

Un-hiding Strategies for outreach in times of austerity

BLUEBOOK RULE 18: CITATION TO INTERNET AND ELECTRONIC RESOURCES

STORYTELLING TOOLKIT. Research Tips

AGENDA. Mendeley Content. What are the advantages of Mendeley? How to use Mendeley? Mendeley Institutional Edition

Scopus. Advanced research tips and tricks. Massimiliano Bearzot Customer Consultant Elsevier

Comparing Scholars Portal & ebrary e-book platforms

What's new in EndNote Version 6?

Data Citation Analysis Framework for Open Science Data

INTRODUCTION TO ENDNOTE

EndNote X6 Basics. Health Sciences Library System University of Pittsburgh

Metadata Plus: How Libraries Assure Discovery of Locally Created Content

Maths-Whizz Investigations Paper-Back Book

Georgia Tech Library Catalog

Continuities. The Serialization of (Just About) Everything. By Steve Kelley

RELIABILITY REASON FOR A COMMERCIAL INADVERTENT-INTERCHANGE SETTLEMENT STANDARD.

Open Access Essentials

A Beginner s Experience With Design Analytics

Library. Summary Report

How to Begin. Digital Photo. Iwork in the local history/genealogy department at a medium-sized public library. Collection.

Delaware Division of Libraries Update A presentation at the joint Delaware Library Association/ Maryland Library Association Annual Conference 2013

RATE CARD & MEDIA KIT 2013

California Community Colleges Library/Learning Resources Data Survey

LIBRARY ASSIGNMENT. Level Three. Austin Community College Produced by Library Services and Developmental Reading. Name. Date due

The Lilly Library of rare books, manuscripts, and special collections at Indiana

Employment Cost Index Original Data Value Series Id: Seasonally adjusted Series Title: Ownership: Component: Occupation: Industry: Subcategory:

Finding aid for the Grand Rapids Public Library underground newspapers collection Collection 256

Swinburne University of Technology

Influence of Discovery Search Tools on Science and Engineering e-books Usage

Introduction to EndNote X7

Using EndNote Online to Manage your References. Workbook

EndNote Class Outline Building Your EndNote Library

A journey of many steps. Alma Integration with Libraries Australia

Digitised Content: How we Make It Relevant to Researchers, Teachers and Students

Introduction to Bell Library Resources

Reading & Writing Workshop Suggested Sequence for Units of Study K-5 Sequence of Units. Kindergarten

Springer Book Archives. Cem Üzüm Licensing Manager Central Europe, May 2014, Prague

Conditional Formatting in Microsoft Excel 2007 Midterm Debriefing Lynda Cannedy 6323 Multimedia/Hypermedia

3. Green OA (self-archiving) needs to be mandated

OpenOne Outage Management System

Calibrating the timecode signal input

Housing Inventory Setup Guide

How to find out information about the report

Using LibGuides & LibAnswers

Transcription:

University of San Diego Digital USD Digital Initiatives Symposium 2018 Digital Initiatives Symposium Apr 24th, 1:50 PM - 2:35 PM Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop Michele Gibney University of the Pacific, mgibney@pacific.edu Kristin Laughtin-Dunker Chapman University, laughtin@chapman.edu Elizabeth Chance Fort Hays State University, mechance2@fhsu.edu Follow this and additional works at: https://digital.sandiego.edu/symposium Part of the Archival Science Commons, Cataloging and Metadata Commons, Higher Education Commons, Scholarly Communication Commons, and the Scholarly Publishing Commons Gibney, Michele; Laughtin-Dunker, Kristin; and Chance, Elizabeth, "Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop" (2018). Digital Initiatives Symposium. 34. https://digital.sandiego.edu/symposium/2018/2018/34 This 45-minute concurrent session is brought to you for free and open access by Digital USD. It has been accepted for inclusion in Digital Initiatives Symposium by an authorized administrator of Digital USD. For more information, please contact digital@sandiego.edu.

Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop Presenter 1 Title Digital Repository Coordinator Presenter 2 Title Coordinator of Scholarly Communications & Electronic Resources Presenter 3 Title Digital Curation Librarian Session Type 45-minute concurrent session Abstract What is the importance of institutional history and special collections in a digital environment? Should these pieces of history have their own digital platform or be merged with the institutional repository? What role do repositories play in the institutional environment? What impact do digital historical collections have on the stakeholder contingent as well as the global community? The speakers will discuss the rationale behind migrating collections from CONTENTdm to institutional repositories (all using bepress s Digital Commons platform). Reasons range from subscription costs to file format concerns to increased search optimization. The migratory act will be covered in terms of method used (Excel spreadsheets, bepress add-on service, and manual) along with suggestions and lessons learned. Comparisons in usage statistics between CONTENTdm and Digital Commons demonstrating increased access to collections will also be discussed. Ultimately, the speakers hope to engage in a dialogue on the initial questions of this abstract. Location Room EF Keywords Digital Commons, CONTENTdm, migration, archival collections, institutional history, special collections, usage statistics, platform analysis, archives, digital materials Creative Commons License This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License. This 45-minute concurrent session is available at Digital USD: https://digital.sandiego.edu/symposium/2018/2018/34

Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop Elizabeth Chance, Fort Hays State University Michele Gibney, University of the Pacific Kristin Laughtin-Dunker, Chapman University

Rationales Migration Workflows & Usage Stats Outline Fort Hays Chapman Pacific Discussion

CHAPMAN & PACIFIC Launched an institutional repository using Digital Commons (DC) platform Rising dissatisfaction with CONTENTdm, where digital Special Collections and Archives were housed Rationales DC supports text & image files - the bulk of Special Collections therefore, combining was cost-effective FORT HAYS Acknowledged that CONTENTdm and Digital Commons do different things CONTENTdm cost is shared with other departments so decision was made to maintain both platforms but audit collections to determine which platform was the best choice for the content.

Workflows & Usage Differences Fort Hays - Targeted collections on our own Chapman - on our own Pacific - bepress paid service

Fort Hays State University Elizabeth Chance, MLIS Digital Curation Librarian

Fort Hays Targeted collections Rationale CONTENTdm Does well with image galleries Allows for in-depth description of complex items like scrapbooks Has native streaming abilities for audio and video Can easily present a variety of file formats in a single collection Has a hard time with large.pdf files New responsive website doesn t have a page-flip view for.pdf Difficult to embed third-party content DIGITAL COMMONS Great for text-based items Requires less metadata to get the discovery job done Better integration for OCR and in-text searching File formats other than.pdf require extra steps Can easily embed thirdparty content like book readers and YouTube videos No mechanism for handling compound objects like front and back of a photograph or items in a scrapbook

Identified qualities that warranted moving a collection Fort Hays Targeted collections Initial Steps Text-based collections.pdf collections Anything that would benefit from better integration of OCR Anything that would be better presented in a book reader format Scholarly works that would benefit from inclusion in the Digital Commons network

Journals Academic Leadership: The Online Journal Journal of Business & Leadership Fort Hays Targeted collections Collections Moved Archival Collections Reveille Yearbooks Athletics Programs Graduate Student Works Master of Liberal Studies Research Papers Masters Theses Collection (In Progress)

Used Bepress batch upload process pdf s were reduced in size to get them under 100mb This allowed us to use the first page as the cover image Re-ran OCR on all the files Fort Hays Targeted collections General Workflow Uploaded files to our library upload server using Filezilla Created new metadata where necessary and cleaned up old metadata where possible When required objects were uploaded to the Internet Archive so we could embed the book reader Took the opportunity to inventory all of the master scans so they could be accessioned into our formal preservation system Standardized file names

Fort Hays Targeted collections The Reveille The Reveille Official Yearbook of Fort Hays State University Published from 1914-2003 Initially Digitized in 2009: Reveille 1.0 Presented as its own collection in CONTENTdm Volumes were photographed and.pdf files were created Re-mastered in 2014 to mixed results Library leadership expressed a desire to get more out of this collection Wanted to see increased usage Access and discovery were difficult due to metadata issues and loading times Wanted to see improved user experience

Fort Hays Targeted collections The Reveille 3.0 Book Gallery highlights cover art Uses the collections tool to create decade sorted sub-galleries Embedded book reader

Fort Hays Targeted collections Usage Differences

Chapman University Kristin Laughtin-Dunker, MLIS Coordinator of Scholarly Communications & Electronic Resources

Step 1: Export metadata spreadsheets from CONTENTdm Chapman FrankenURLs Step 2: Recreate collection structures in DC Step 3: Convert CONTENTdm spreadsheets to DC Batch Upload Excel sheets Step 4: Missing URLs to the objects in CONTENTdm Step 5: Piece together FrankenURLs

The Problems of CONTENTdm URLs The metadata exports from CONTENTdm contained the URL for the record for each item. Chapman FrankenURLs However, in order to import an item, Digital Commons needs a direct URL to it, not the record. Right-clicking the download button in CONTENTdm and copying the link address also did not work. Thus, we had to come up with a system to generate direct URLs to each item ourselves.

Chapman FrankenURLs For each FrankenURL, the following pieces of information from ContentDM are needed: Instance Identifier: the unique numeric identifier for your university/instance Collection Identifier: the unique numeric identifier for the collection Item Identifier: the unique numeric identifier for each individual item Width and Height (for images, optional) Filename (for PDFs or other non-image documents)

The general formula for PDFs is: Chapman FrankenURLs http://cdm[instanceidentifier].contentdm.ocl c.org/utils/getfile/collection/p[instanceidenti fier]coll[collection identifier]/id/[item identifier]/filename/[filename].pdf Chapman s Instance identifier = 15046. For the item 116.pdf, which was item 115 in collection 20, the FrankenURL would be: http://cdm15046.contentdm.oclc.org/utils/ getfile/collection/p15046coll20/id/115/file name/116.pdf

If you can get all of the separate pieces of information into an Excel sheet, it is easy to use the Concatenate function to create the FrankenURLs. Column A: the URL root (http://cdm[instance identifier].contentdm.oclc.org/utils/getfile/collection/p[i nstance identifier]coll) Chapman FrankenURLs Column B: the collection identifier Column C: /id/ Column D: the item identifier Column E: /filename/ Column F: the file name and file extension You could then concatenate columns A-F together to create the FrankenURLs in column G, then Paste Values to add the text of the URLs to your batch upload sheets.

FrankenURL format for image files: Chapman FrankenURLs http://cdm[instanceidentifier].contentdm.ocl c.org/utils/ajaxhelper/?cisoroot=p[instanc eidentifier]coll[collectionidentifier]&cisopt R=[itemidentifier]&action=2&DMSCALE=100 &DMWIDTH=[width]&DMHEIGHT=[height] FrankenURL for item 20 in collection 1: http://cdm15046.contentdm.oclc.org/utils/aj axhelper/?cisoroot=p15046coll1&cisopt R=20&action=2&DMSCALE=100&DMWIDTH= 9999&DMHEIGHT=9999

FrankenURL format for image files: Tip! Put 9999 for the width and height, rather than trying to look up the width and height of each image. Digital Commons will import them at their correct size. However, if you put in a number that s too small, Digital Commons will only import part of that image and cut the rest off. Chapman FrankenURLs You can also use the Concatenate function to build the FrankenURLs in a spreadsheet, then Paste Values into your batch upload sheet: Column A: the URL root: http://cdm[instance identifier].contentdm.oclc.org/utils/ajaxhelper/?cisoroot =p[instance identifier]coll Column B: the collection identifier Column C: &CISOPTR= Column D: the item identifier Column E: &action=2&dmscale=100&dmwidth=9999&dmheight=9 999

Exceptions These tips will work for most instances of ContentDM, especially if the collection identifiers are numeric. Chapman FrankenURLs However, if your instance uses alphabetical or Quick Start collection identifiers, you may have to tweak the FrankenURLs a bit. In the base URL, use the following section as the collection identifier in your FrankenURLs: https://[instanceidentifier].contentdm.oclc.org/cdm/ref/c ollection/[collectionidentifier]/id/[itemidentifier] For alphabetical collection identifiers, it will be a string of letters. For Quick Start collections, it may include the instance identifier, and will end with qs.

Exceptions Once you have found your collection identifier, plug it into the FrankenURL after the /collection/ portion for PDFs or the CISOROOT portion for images: PDFs Chapman FrankenURLs http://cdm[instanceidentifier].contentdm.oclc.org/ut ils/getfile/collection/[collectionidentifier]/id/[itemid Images entifier]/filename/[filename].[fileextension] http://cdm[instanceidentifier].contentdm.oclc.org/ut ils/ajaxhelper/?cisoroot=[collectionidentifier]&ci SOPTR=[itemidentifier]&action=2&DMSCALE=100& DMWIDTH=9999&DMHEIGHT=9999 Note that you do not need to put p[instanceidentifier] before the collection identifiers here, as in earlier examples.

Exceptions Chapman FrankenURLs For images: we did experience a few instances where after clicking on or pasting the FrankenURL, we could not view the image on our computers. However, they did import successfully into Digital Commons. This workaround was only discovered in consultation with two other institutions in January and February 2018, so it has not been tested on ContentDM instances with purely numeric collection identifiers (like Chapman s were).

On ContentDM, usage had peaked at about 5,000 utilizations (page hits) per month in 2012, and had declined to about 2,300 utilizations per month in 2014. Digital Commons usage: First year (September 2014-August 2015): About 5,100 objects 6,381 page hits (531/month) Chapman Usage 4,874 downloads (406/month) Second year (September 2015-August 2016): About 5,700 objects 16,623 page hits (1,385/month) 63,892 downloads (5,324/month) Third year (September 2016-August 2017): About 7,800 objects 21,276 page hits (1,773/month) 70,764 downloads (5,894/month) First five months of fourth year (September 2017-February 2018) About 8,000 objects 11,927 page hits (2,385/month) 41,825 downloads (8,365/month)

Chapman Usage

Chapman Usage Special Collections objects account for about 40% of our content and 40% of our downloads. 4 or 5 of our Top 10 Downloads are consistently Special Collections objects that had been in ContentDM. 2 of these are photos of Miss Universe 1970, who is still a television personality in Puerto Rico. When people Google her, they are finding our photos, often as the top result. We have gotten many reproduction requests for our Japanese propaganda poster and United Press International photo collections.

University of the Pacific Michele Gibney Digital Repository Coordinator

Some one else Pacific bepress paid service does the work? SIGN ME UP.

Pacific bepress paid service 35,000+ items 216 structures

bepress requires: Structure collection (type of structure, URLs, titles, Grouping, etc.) Metadata fields for each structure (double check all spellings) Administrators Pacific bepress paid service OAI-PMH URLs from CONTENTdm with UNIQUE Dublin Core mapping for each field for each structure Migration mapping chart of fields from CONTENTdm to DC FOR EACH STRUCTURE (I ll say it again: double check all spellings) And then you sit back and wait And wait And wait And wait.

Pacific OAI-PMH Dublin Core mapping

Problems COMPOUND OBJECTS (DC doesn t handle these well) Empty top level rows Copy/paste nightmare Massive # of items = search and revision difficulties Pacific bepress paid service Certain fields Circa dates/original dates Author standardization JPEG2000 image files Download/Access Naming structure Embedded audio/video Needs of harvesters

Pacific Compound Objects

CONTENTdm hosted Pacific Embedded A/V Digital Commons does not host

Multi image records VS PDFs (preservation VS ease of use) Multi-format type collections Pacific Partial Manually Led to Manual migrations Metadata export Capturing full text URLs in CDM Conversion to single PDFs Batch uploading

0 0 18 71 0 502 296 499 1663 1883 3091 14987 USAGE COMPARISON CONTENTdm Digital Commons Pacific Usage OCT-DEC '16 JAN-MAR '17 APR-JUN '17 JUL-SEP '17 OCT-DEC '17 JAN-MAR '18 Content added to Digital Commons in May 2017 CONTENTdm subscription ended in October 2017

Pacific Usage Download Map for all collections in Digital Commons

Discussion

Discuss factors affecting migration File preservation Historical metadata preservation Preserving historical usage data Discussion Determine the pro/cons of migrations Migrate targeted collections Consolidate platforms Consider labor involved: DIY vs. paid service options Do your research Look at collections from similar institutions Talk to librarians who have done migrations Document your process to act as a resource in the future

Elizabeth Chance Digital Curation Librarian Fort Hays State University mechance2@fhsu.edu THANK YOU Questions? Michele Gibney Digital Repository Coordinator University of the Pacific mgibney@pacific.edu Kristin Laughtin-Dunker Coordinator of Scholarly Communications & Electronic Resources Chapman University laughtin@chapman.edu