Distributed Eprints Archives and Scientometrics. Resolving an Anomaly

Similar documents
3. Green OA (self-archiving) needs to be mandated

How and Why To Free All Refereed Research From Access- and Impact-Barriers Online, Now

Harnad, S. (2008) The Postgutenberg Open Access Journal. To appear in: Cope, B. & Phillips, A (Eds.) The Future of the Academic Journal. Chandos.

Quality Assurance in the Age of Author Self-Archiving

The Free Online Scholarship Movement: An Interview with Peter Suber

STI 2018 Conference Proceedings

Open Access Publishing and arxiv. Tommy Ohlsson KTH Royal Institute of Technology

Copyright Transfer Agreements in an Interdisciplinary Repository

Introduction to

Astronomy Libraries - Your Gateway to Information. Uta Grothkopf ESO Library

PubMed, PubMed Central, Open Access, and Public Access Sept 9, 2009

The Joint Transportation Research Program & Purdue Library Publishing Services

RoMEO Studies 8: Self-archiving when Yellow and Blue make Green: the logic behind the colour-coding used in the Copyright Knowledge Bank

The Business of E-Resources Publishing

Write to be read. Dr B. Pochet. BSA Gembloux Agro-Bio Tech - ULiège. Write to be read B. Pochet

Student and Early Career Researcher Workshop:

Research Impact Measures The Times They Are A Changin'

The Publishing Landscape for Humanities and Social Sciences: Navigation tips for early

Digital Initiatives & Scholar Commons

Archiving Your Research: the UNM Institutional Repository

Scientific Quality Assurance by Interactive Peer Review & Public Discussion

Quality Control in Scholarly Publishing. What are the Alternatives to Peer Review? William Y. Arms Cornell University

Author Deposit Mandates for Scholarly Journals: A View of the Economics

An Introduction to Bibliometrics Ciarán Quinn

Workshop on repositories and journals

A Scientometric Study of Digital Literacy in Online Library Information Science and Technology Abstracts (LISTA)

BiUM manual on how to deposit FBM/CHUV full text articles in Serval. BiUM Bibliothèque Universitaire de Médecine

Quality Control Experiences from a Large-Scale Film Digitisation Project

Scopus. Advanced research tips and tricks. Massimiliano Bearzot Customer Consultant Elsevier

Author Frequently Asked Questions

Suggested Publication Categories for a Research Publications Database. Introduction

Frequently Asked Questions about Rice University Open-Access Mandate

New directions in scholarly publishing: journal articles beyond the present

AU-6407 B.Lib.Inf.Sc. (First Semester) Examination 2014 Knowledge Organization Paper : Second. Prepared by Dr. Bhaskar Mukherjee

The potential of preprints to accelerate scholarly communication

Media and Data Converging Media and Content

Do we use standards? The presence of ISO/TC-46 standards in the scientific literature ( )

Professor Birger Hjørland and associate professor Jeppe Nicolaisen hereby endorse the proposal by

The citation advantage of open access articles

Managing content in the electronic world Anne Knight Acting Head of Information Systems / Resources & Facilities Manager

Web of Science User Training. #1: Getting Started. Setting up. 1) Search. Page1

WEB OF SCIENCE THE NEXT GENERATAION. Emma Dennis Account Manager Nordics

Bibliometric Study on LIS Journals Archived in DOAJ

Publishing your research in a peer reviewed journal: Tips for success. Los Angeles London New Delhi Singapore Washington DC

How to Publish Your Research Workshop

PUBLICATION OF RESEARCH RESULTS

What are Bibliometrics?

Overview of Open Access Books in Library and Information Science in DOAB

Scientometrics & Altmetrics

What is Web of Science Core Collection? Thomson Reuters Journal Selection Process for Web of Science

Elsevier Databases Training

Periodical Usage in an Education-Psychology Library

Scientific Publishing at Karger

ASTRONOMY LIBRARIES YOUR GATEWAY TO INFORMATION

Avoiding plagiarism - information, communication and referencing

1.1 What is CiteScore? Why don t you include articles-in-press in CiteScore? Why don t you include abstracts in CiteScore?

Scientometric Analysis of Astrophysics Research Output in India 26 years

The Liège ORBi model: Mandatory policy without rights retention but linked to assessment processes

Can editorial peer review survive in a digital environment?

WORKING NOTES AS AN. Michael Buckland, School of Information, UC Berkeley Andrew Hyslop, California State Archives. April 13, 2013

CHAPTER 8 CONCLUSION AND FUTURE SCOPE

AGENDA. Mendeley Content. What are the advantages of Mendeley? How to use Mendeley? Mendeley Institutional Edition

Bibliometric analysis of the field of folksonomy research

Web of Science Unlock the full potential of research discovery

Ethical Policy for the Journals of the London Mathematical Society

ARCHIVAL DESCRIPTION GOOD, BETTER, BEST

THE CARD CATALOGUE. THE WEALTH OF THE LIBRARY Christ, Henry I. Modern English in Action D.C. Heath and Company. Subject Card

HEBS: Histogram Equalization for Backlight Scaling

Urania, a Linked, Distributed Resource for Astronomy

Excerpt of the new core provisions. Article 1. Amendment of the Act on Copyright and Related Rights

What Happens to My Paper?

Electronic Journals and Electronic Publishing at CERN: A Case Study

Bibliometrics & Research Impact Measures

F5 Network Security for IoT

Institutional Report. For my report, I chose to visit the Ralph Rinzler Folklife Archives located in Washington,

Doctor of Nursing Practice Formatting Guidelines

administration access control A security feature that determines who can edit the configuration settings for a given Transmitter.

INTUITIVE, REAL-TIME LAUNDROMAT DATA THAT S CUSTOM-MADE FOR THE WAY YOU OPERATE. LAUNDROMAT - LOCATION 1 - HUEBSCH.COM/COMMAND

Getting Your Paper Published: An Editor's Perspective. Shawnna Buttery, PhD Scientific Editor BBA-Molecular Cell Research Elsevier

Where to present your results. V4 Seminars for Young Scientists on Publishing Techniques in the Field of Engineering Science

EndNote Menus Reference Guide. EndNote Training

Device Management Requirements

SEARCH about SCIENCE: databases, personal ID and evaluation

How to Publish a Great Journal Article. Parker J. Wigington, Jr., Ph.D. JAWRA Editor-in-Chief

Part III: How to Present in the Health Sciences

PRNANO Editorial Policy Version

Alcatel-Lucent 5620 Service Aware Manager. Unified management of IP/MPLS and Carrier Ethernet networks and the services they deliver

The cost of reading research. A study of Computer Science publication venues

Corso di Informatica Medica

Publishing Your Research

Journal Citation Reports Your gateway to find the most relevant and impactful journals. Subhasree A. Nag, PhD Solution consultant

Positional Effects on Citation and Readership in arxiv

Geoscience Librarianship 101 Geoscience Information Society (GSIS) Denver, CO September 24, 2016

Bibliographic Software and Online Resources for Research

Searching GeoRef for Archaeology

Research Project Preparation Course Writing Literature Reviews (part 1)

VISIBILITY OF AFRICAN SCHOLARS IN THE LITERATURE OF BIBLIOMETRICS

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education

Volume 6, Issue 2, August Open Access to Journal Content as a Case Study in Unlocking IP^

Scientometric Profile of Presbyopia in Medline Database

Transcription:

Distributed Eprints Archives and Scientometrics

H. G. Wells, World Brain: The Idea of a Permanent World Encyclopaedia Encyclopédie Française, August, 1937 Encyclopaedias of the past sufficed for the needs of a cultivated minority universal education was unthought of gigantic increase in recorded knowledge Discontent with the role of universities and libraries in the intellectual life of mankind Universities multiply but do not enlarge their scope thought & knowledge organization of the world No obstacle to the creation of an efficient index to all human knowledge, ideas and achievements

The Optimal and Inevitable for Researchers All of this will come to pass. The only question is How Soon? The entire full-text refereed corpus online On every researcher s desktop, everywhere 24 hours a day All papers citation-interlinked Fully searchable, navigable, retrievable For free, for all, forever

Globalizing Research Harvard Impact Access Harvard financial firewalls The Rest The Rest

The Subversive Proposal: Sufficient to free entire refereed corpus forever, immediately: 1. Universities install off-the-shelf, OAI-compliant Eprint software 2. Authors self-archive (preprints & postprints) 3. Institutions subsidize first start-up wave of self-archiving 4. The Give-Away corpus is freed Hypothetical Sequel: 5. Users prefer free version? 6. Publisher S/L/P revenues shrink, Library S/L/P savings grow? 7. Publishers downsize to QC/C service-providers + optional add-ons? 8. QC/C service costs funded by author-institution out of reader-institution S/L/P savings?

Five Essential PostGutenberg Distinctions: (if you don t make them, none of this will make sense) 1. Distinguish the non-give-away vs. give-away literature Litmus test: Does the author seek a royalty/fee? : books (yes) vs. refereed journal papers (no) 2. Distinguish income (from paper sale) vs. impact (from paper use) (and distinguish give-away-author imprint-income [0] vs. impact-income [??]) 3. Distinguish give-away author copyright protection from: theft-of-authorship (wanted) vs. theft-of-text (unwanted) 4. Distinguish self-publishing (vanity press) vs. self-archiving (of published, refereed research) 5. Distinguish unrefereed preprints vs. refereed postprints eprints = preprints + postprints

Zeno s Prima-FaQs I worry about self-archiving because : 1. Preservation 2. Authentication 3. Corruption 4. Navigation (info-glut) 5. Certification 6. Evaluation 7. Peer review 8. Paying the piper 9. Downsizing 10. Copyright 11. Plagiarism 12. Priority 13. Censorship 14. Capitalism 15. Readability 16. Graphics 17. Publishers future 18. Libraries future 19. Learned Societies future 20. University conspiracy 21. Serendipity 22. Tenure/Promotion 23. (your prima-faq here ) Answers available at < http://cogsci.soton.ac.uk/~harnad/tp/resolution.htm >

Eprints < > is dedicated to freeing the research literature, preand post-refereeing, through author/institution self-archiving in interoperable Open Archives < www.openarchives.org > To help the self-archiving initiative quickly gain momentum, archive-creating software, compliant with the OAi protocol, hence fully interoperable with all other Open Archives, has been developed at the University of Southampton. Eprints is designed to be as flexible and adaptable as possible, so that all universities world-wide can immediately adopt and configure it with minimal effort for all their disciplines selfarchiving needs. The Eprints software, has been available (for free, of course) from eprints.org since December 2000.

From Linear Growth to Exponential Deposit Rates Disciplines arxiv submission rates - linear growth only 30% of citations to papers deposited in arxiv Time Exponential growth in archiving to catch up with paper-based research 100% of papers archived, in all disciplines

Well s Global Research Database?

New OAI Services Multiple Updates by LANL Subfield (based on LANL meta-data) solv-int patt-sol nucl-ex nlin math-ph cs comp-gas chao-dyn adap-org physics hep-ex quant-ph hep-lat nucl-th math gr-qc hep-th cond-mat astro-ph 0 5000 10000 15000 20000 25000 No. of Papers with Updates No Updates 1 Update 2 Updates 3 Updates 4 Updates hep-ph Citation Linking & Scientometric Analysis

Citation-Ranked Searches

Citation-based Visualisation

Decreasing Citation Latencies Frequency of Citation Latencies: 1992-1999 5000 4500 4000 3500 Citations 3000 2500 2000 1500 1000 500 0 0 12 24 36 48 60 72 84 96 Time Difference/Months 99 98 97 96 95 94 93 92 The raw data show that the latency of the citation peak has been reducing over the period of the archive

The New Paper Rush Age of paper against number of downloads Number of Downloads 50000 45000 40000 35000 30000 25000 20000 15000 10000 5000 0 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 Age of Paper (days) Users subscribe to an email alerting service that informs them of new papers.

Article Embryology hep-th 200 175 150 125 Papers 100 75 50 25 0 199107 199201 199207 199301 199307 199401 199407 199501 199507 199601 199607 199701 199707 199801 199807 199901 199907 200001 With J-R With J-R/Report Report Unknow n Papers with a journal reference [J-R] cross papers without a J-R at an age of 13 months, suggesting a time difference of 13 months between pre-print and post-print

Effect of Paper Impact The papers were split into three sets based on the number of citations to them. There are an equal number of citations to the papers in the low, medium and high sets.

Author Impact Quartiles Quartile Total % Total Citations Papers Citations/Aut hor/paper Deposits Mean Updates/ Author High 25% 798 2.09% 240,092 2,732 0.11 6,720 0.48 Med 50% 9,262 24.20% 733,272 37,318 0.00212 93,671 0.37 Low 25% 28,211 73.71% 251,925 67,951 0.000131 165,971 0.27 High impact authors update more than medium or low High and medium impact authors deposit more papers than low

Citation Quality Do Papers Cite Papers of Like Impact 140000 120000 100000 High 80000 60000 40000 No of Citations Medium 20000 Dest. Impact Low Low Medium High Source Impact 0 Papers generally cite papers of like impact (χ 2 underway).

Citation Spread Histogram of Citations per Paper (author impact) 30,000 papers were by authors with no citation 40000 35000 30807 30000 25000 Papers 20000 15000 10000 5000 0 13668 11527 6784 3105 6534 4441 138 6072 5863 4781 121 170 257 249 No citations 1 Citation 2/3 Citations 4/5/6 Citations 7/8/9/10 Citations 2060 9627 1797 11 or more Citations High (2.53%) Medium (34.55%) Low (62.92%) A small number of papers receive a very large number of citations

Effect of Paper Impact on Usage All Papers 0.0025 0.002 0.0015 0.001 0.0005 0 0 109 218 Frequency Density 327 436 545 654 763 872 981 1090 1199 1308 1417 1526 1635 1744 1853 1962 2071 2180 2289 2398 Age of paper (days) High (2.0%) Medium (7.7%) Low (46.5%) Unknown (39.6%) Higher impact papers have a longer download life expectancy.

Correlating citations and downloads Download type r n All Papers 0.11155 63671 High Impact Papers (2.0%) 0.27293 1981 Medium Impact Papers (7.7%) 0.01288 5937 Low Impact Papers (46.5%) -0.01412 30163 There is a significant positive correlation between citations and downloads for high impact papers.

Implementation Issues Creating new metadata vs Creating new services