Data Citation Analysis Framework for Open Science Data

Similar documents
Scopus. Advanced research tips and tricks. Massimiliano Bearzot Customer Consultant Elsevier

Embedding Librarians into the STEM Publication Process. Scientists and librarians both recognize the importance of peer-reviewed scholarly

DATA CITATION. what you need to know

What is bibliometrics?

Quality Control in Scholarly Publishing. What are the Alternatives to Peer Review? William Y. Arms Cornell University

Impact Factors: Scientific Assessment by Numbers

Publishing Your Research

The digital revolution and the future of scientific publishing or Why ERSA's journal REGION is open access

Workshop on repositories and journals

Journal Citation Reports Your gateway to find the most relevant and impactful journals. Subhasree A. Nag, PhD Solution consultant

Research Playing the impact game how to improve your visibility. Helmien van den Berg Economic and Management Sciences Library 7 th May 2013

Research Evaluation Metrics. Gali Halevi, MLS, PhD Chief Director Mount Sinai Health System Libraries Assistant Professor Department of Medicine

Scientometrics & Altmetrics

MEASURING EMERGING SCIENTIFIC IMPACT AND CURRENT RESEARCH TRENDS: A COMPARISON OF ALTMETRIC AND HOT PAPERS INDICATORS

Information Networks

Journal Citation Reports on the Web. Don Sechler Customer Education Science and Scholarly Research

What is Web of Science Core Collection? Thomson Reuters Journal Selection Process for Web of Science

WEB OF SCIENCE THE NEXT GENERATAION. Emma Dennis Account Manager Nordics

USING THE UNISA LIBRARY S RESOURCES FOR E- visibility and NRF RATING. Mr. A. Tshikotshi Unisa Library

New directions in scholarly publishing: journal articles beyond the present

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education

Scientific and technical foundation for altmetrics in the US

University of Liverpool Library. Introduction to Journal Bibliometrics and Research Impact. Contents

UA Libraries; UW-Madison Libraries; IMLS: Advisory Committee; Program Manager; Support Staff

Publishing your research in a peer reviewed journal: Tips for success. Los Angeles London New Delhi Singapore Washington DC

Syddansk Universitet. The data sharing advantage in astrophysics Dorch, Bertil F.; Drachen, Thea Marie; Ellegaard, Ole

How to Choose the Right Journal? Navigating today s Scientific Publishing Environment

Data Citation Principles CODATA TG on Data Citation

2013 Environmental Monitoring, Evaluation, and Protection (EMEP) Citation Analysis

Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop

Bibliometric analysis of the field of folksonomy research

Calver, M.C. (2016) Reflections on two years of change at Pacific Conservation Biology. Pacific Conservation Biology, 22 (4). p

Introduction. Status quo AUTHOR IDENTIFIER OVERVIEW. by Martin Fenner

Academic Identity: an Overview. Mr. P. Kannan, Scientist C (LS)

Author Instructions for Environmental Control in Biology

PubMed, PubMed Central, Open Access, and Public Access Sept 9, 2009

Easy access to medical literature: Are user habits changing? Is this a threat to the quality of Science?

Presentation by Martie van Deventer to eresearch Africa 2013 Conference 08 October

EVALUATING THE IMPACT FACTOR: A CITATION STUDY FOR INFORMATION TECHNOLOGY JOURNALS

Open Access Essentials

Demystifying Citation Metrics. Michael Ladisch Pacific Libraries

Measuring Your Research Impact: Citation and Altmetrics Tools

Citation analysis: Web of science, scopus. Masoud Mohammadi Golestan University of Medical Sciences Information Management and Research Network

Instructions to Authors

The largest abstract and citation database

Finding a Home for Your Publication. Michael Ladisch Pacific Libraries

Promoting your journal for maximum impact

Open access. Open Access at Aarhus University. Make your publications visible and accessible on the web

AGENDA. Mendeley Content. What are the advantages of Mendeley? How to use Mendeley? Mendeley Institutional Edition

Migratory Patterns in IRs: CONTENTdm, Digital Commons and Flying the Coop

A Scientometric Study of Digital Literacy in Online Library Information Science and Technology Abstracts (LISTA)

Influence of Discovery Search Tools on Science and Engineering e-books Usage

Digitization : Basic Concepts

Full Page Ads. Against the Grain. Volume 28 Issue 3 Article 2

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education

Use and Usability in Digital Library Development

SEARCH about SCIENCE: databases, personal ID and evaluation

Research metrics. Anne Costigan University of Bradford

Development of Reference Management System in Cloud Computing Environment

Citation Metrics. From the SelectedWorks of Anne Rauh. Anne E. Rauh, Syracuse University Linda M. Galloway, Syracuse University.

Citations count: the provision of bibliometrics training by university libraries

Representing Social Sciences

Introduction. Citing data sources in the social sciences: do authors do it?

Corso di dottorato in Scienze Farmacologiche Information Literacy in Pharmacological Sciences 2018 WEB OF SCIENCE SCOPUS AUTHOR INDENTIFIERS

DOWNLOAD PDF BOWKER ANNUAL LIBRARY AND TRADE ALMANAC 2005

Web of Science Unlock the full potential of research discovery

AN INTRODUCTION TO BIBLIOMETRICS

Measuring the Impact of Electronic Publishing on Citation Indicators of Education Journals

Citation-Based Indices of Scholarly Impact: Databases and Norms

BIBLIOGRAPHIC DATA: A DIFFERENT ANALYSIS PERSPECTIVE. Francesca De Battisti *, Silvia Salini

Web of Science Core Collection

Enriching scientific citations to facilitate knowledge discovery

Telescope Bibliometrics 101. Uta Grothkopf & Jill Lagerstrom

Figures in Scientific Open Access Publications

SCOPUS : BEST PRACTICES. Presented by Ozge Sertdemir

International Journal of Library and Information Studies ISSN: Vol.3 (3) Jul-Sep, 2013

The Publishing Landscape for Humanities and Social Sciences: Navigation tips for early

ICDL FAQS FOR REVISED 3/18/05. What is the International Children s Digital Library (ICDL)? Who is the intended audience for the ICDL?

ICI JOURNALS MASTER LIST Detailed Report for 2017

Introduction to

DISCOVERING JOURNALS Journal Selection & Evaluation

Where to present your results. V4 Seminars for Young Scientists on Publishing Techniques in the Field of Engineering Science

Strategies for Enhancing Research Visibility and Improving Citations

Journal of American Computing Machinery: A Citation Study

Code Number: 174-E 142 Health and Biosciences Libraries

ISSN: ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 3, Issue 2, March 2014

Copyright Transfer Agreements in an Interdisciplinary Repository

Measuring Academic Impact

A Citation Analysis of Articles Published in the Top-Ranking Tourism Journals ( )

How to Get Published Elsevier Author Webinar. Jonathan Simpson, Publishing Director Elsevier Science & Technology Books

Horizon 2020 Policy Support Facility

Experiences with a bibliometric indicator for performance-based funding of research institutions in Norway

GfK Audience Measurements & Insights FREQUENTLY ASKED QUESTIONS TV AUDIENCE MEASUREMENT IN THE KINGDOM OF SAUDI ARABIA

Metadata Education and Research Information Clearinghouse (MERIC): Web Prototype

Purdue Libraries Publishing Services: The Domino Effect of Repository-Based Publishing, Outreach, and Promotion

Open Access Determinants and the Effect on Article Performance

Citation Analysis. Presented by: Rama R Ramakrishnan Librarian (Instructional Services) Engineering Librarian (Aerospace & Mechanical)

THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014

PUBLICATION OF RESEARCH RESULTS

Professor Birger Hjørland and associate professor Jeppe Nicolaisen hereby endorse the proposal by

Transcription:

Citation Analysis Framework for Open Science Koji Zettsu zettsu@nict.go.jp National Institute of Information and Communications Technology SCOSTEP-WDS Workshop on Global Activities for the Study of Solar-Terrestrial Variability September 28-30, 2014 Tokyo, Japan

Citation as a Driver of Scientific Open Publication of paper and data Reward Recognition Research community General society Publishing Original paper Citation publishing Reward Recognition Scientific findings Scientists Scholars Metadata producers contributors Roche DG, Lanfear R, Binning SA, Haff TM, Schwanz LE, et al. (2014) Reference: Society of Geomagnetism, Earth, Planetary and Space Sciences, http:// sgepss.org/sgepss/shorai/sgepss_syorai_jan2013.pdf [accessed on January 2013]. (C) NICT 2

Citation Practices Publisher Description Citation Example PANGAEA: The Publishing Network for Geo-scientific and Environmental http://www.pangaea.de/ ICPSR: The Inter-university Consortium for Political and Social Research https://www.icpsr.umich.edu/ Dryad http://http://datadryad.org// Open access library and data publisher for earth and environmental science International consortium of about 700 academic institutions and research organizations that maintains and provides assess to social science data International data repository of peer reviewed scholarly literature specialized in bioscience data Gershanovich, DE; Zinkovskiy, AB (1987): Distribution of particulate matter and particulate organic carbon in waters of the Caspian Sea.doi:10.1594/PANGAEA.756520 Escarce, Jose J., Nicole Lurie, and Adria Jewell. RAND Center for Population Health and Health Disparities (CPHHD) Core Series: Pollution, 1988-2004 [United States]. ICPSR27864-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2011-10-21. http://doi.org/10.3886/icpsr27864.v1 López-Rodríguez MJ, Tierno de Figueroa JM (2012) from: Life in the dark: on the biology of the cavernicolous stonefly Protonemura gevi (Insecta, Plecoptera). The American Naturalist http://dx.doi.org/10.5061/dryad.8m8r1 and more Source: CODATA-ICSTI Task Group on Citation Standards and Practices: Out of Cite, Out of Mind: The Current State of Practice, Policy, and Technology for the Citation of, Science Journal, Vol. 12, pp. CIDCR1-CIDCR75 (2013) (C) NICT 3

Citation Practices (Cont d) Citation Index (DCI) [Thomson Reuters] Harvests citations to research data from papers indexed in the Web of Science A. DCI overall statistics C. Distribution of citations by subject B. DCI statistics by area of data studies Source: Robinson-Garcia, N. et. al: Analyzing data citation practices using the Citation Index, Journal of the Association for Information Science and Technology, DOI: 10.1002/asi.23529 (June, 2015) (C) NICT 4

Citation Mechanism Online Journal forward Publisher Landing page lookup DOI server register Metadata Garrison, VH et al. (2014): Particulate matter (PM2.5 and PM10) in the air in Bamako, Mali (2012-2013). doi:10.1594/pangaea.834195 Cite CrossRef JaLC, etc. DOI 10.1594/PANGAEA.834195 citation PANGAEA, ICPSR, Dryad, etc. (C) NICT 5

Web of Citation Document Inter-university Consortium for Political and Social Research (ICPSR) 115,154 citations (C) NICT 6

Citation Analysis Macro analysis: Analyze structure of data citation network Discover communities of data citation, and characterize data by citations in a community Micro analysis: Analyze associations between document and data Discover typical associations (i.e. association rules) between documented knowledge and evidential data (C) NICT 7

Citation Analysis Framework Publisher OAI-PMH HTML OAI-PMH Citation Extraction Citation Archive Citation Structure Analysis Citation Association Rule Discovery Citation Graph Visualization (C) NICT 8

Citation Extraction Citation Metadata Referencing documents Landing pages (C) NICT 9

Citations Archive Citation Archive citation metadata > 1,285,000 citations Site Domain # of DC 322,477 Pangaea Earth & Environment 384,815 114,815 ICPSR Social Science 115,154 Cite (any) 773,173 DRYAD Bioscience 1,556 ADA Social Science 16,062 ESDS Economic & Social Science 59,471 (C) NICT 10

Citation Structure Analysis (1) Collection Community collection Physical properties of Hole # Document Pangaea Reports of the Deep Sea Drilling Project Catalogue document (C) NICT 11

Citation Structure Analysis (2) Sharing Community -sharing document clusters Inequality Australian Archive (ADA) Document Working Attitudes National Social Science Survey Shared data (C) NICT 12

Topic-Specific Community Discovery HITS algorithm [Kleinberg 99] A good hub links to many good authorities Hub score: HH xx = yy xx AA(yy) A good authority is referenced by many good hubs Authority score: AA xx = yy xx HH(yy) Discovery from result set Hub Auth ority (C) NICT 13

Hubs and Authorities Hub document Australian Broadcasting Cooperation (Audience Research) «Brisbane radio, 1980» «Sydney radio survey, 1981» Authority data «Melbourne radio survey, 1978» Document (C) NICT 14

Community Discovery Demo (C) NICT 15

Referential Context usage is often different from the data content Health insurance Population data Income data Both population data and income data are referenced by health insurance document Document (C) NICT 16

Citation Association Rule Discovery Discover typical combinations of document and data attributes frequently co-occurring in data citations (referential contexts) Citation Archive Referential Context: Document Attribute Title Authors Topics, etc. Attribute Subject Observation location, time Creators, etc. Association rule discovery tool (C) NICT 17

Typical Supporting Documented Knowledge subject Document topic Labor market (88) 39:39 31:31 40:112 7:7 1:1 Educational background (9) Career goals (3) Occupational mobility (35) Health (4) Energy assistance (1) 1:1 citations (#source: #target) Dating (social) (1) (C) NICT 18

Trend of Knowledge-Supporting Observation location Year Document topic 3:35 Equatorial Pacific (35) 1992 Total carbon dioxide (10) 3:279 Arabian Sea (279) 1995 4:247 Southern Ocean (247) 1996-1998 Observation data for carbon dioxide research goes south over years. (C) NICT 19

Supporting Wide Knowledge Document collected from 1963 ~ 1981 all over the world Diverse topics title Sedimentation rates calculated on surface sediment samples from different site of the Atlantic and Pacific Oceans, 1991. Created by Wallace Smith Broecker (1931 - ) (C) NICT 20

Use Cases Discover data-intensive community Collaborate with research communities having same or similar data [researcher] Survey the data common to a research community [data repository] Evaluate reputation of data Reward data (creators) based on its popularity and/or authority [funding organization] Manage quality of data [data creator] Provide superior discoverability for better reuse of data Search data by both content and context keywords [data repository] Discover related data for interdisciplinary research [data curator] Find research publications actionable for reuse of data [researcher, publisher] (C) NICT 21

Open Issues Unstable metadata of data citation Semantic compatibility among heterogeneous data citation metadata E.g.) relate to, supplement to (PANGAEA), related publications (ICPSR), is referenced by (Dryad), related materials (ADA),. Up-to-date? More citations for better analysis Unified and/or centralized access to distributed information of data citation Citation-creating applications with harnessing citation analysis E.g.) search with citation-based ranking (more citations, more exposure) data citation optimization Analyzing dynamic citations Behavior analysis on user (client)-to-data citations from DOI access log Intention analysis on keyword-to-data citations from search query log 2014/11/5 (C) NICT 22

Summary citation = link from documented knowledge to evidential data Instead of knowledge-to-knowledge link by document citation Analysis of Web of data citation citation structure analysis (macro analysis) citation association rule discovery (micro analysis) For better reuse & reward of data Discover data-intensive community Evaluate reputation of data Provide superior discoverability (C) NICT 23

THANK YOU isp-contact@ml.nict.go.jp Poster & demo are now on show! (C) NICT 24