Cirtec project (former CyrCitEc/CitEcCyr)


Open citation content data
Cirtec project (former CyrCitEc/CitEcCyr)
Sergey Parinov, CEMI RAS and RANEPA
The Cirtec project is funded by the Russian Presidential Academy of National Economy and Public Administration (RANEPA).

Cirtec main principles
- Open infrastructure. There are two initial nodes, the CitEc (http://citec.repec.org/) and Cirtec systems, each specializing in processing papers in specific languages. Other nodes, e.g. ones specialized in processing citation data in languages such as Chinese, Japanese, or Arabic, could be added in the same way. There is also an intention to integrate data about references into the OpenCitations Corpus (http://opencitations.net/).
- Transparency. Cirtec allows publishers, authors, and readers of papers to see how the citation data of their papers were extracted by the system. They can trace why some papers' references or in-text citations are not processed or not counted.
- Enrichment. Integration with a research information system (RIS). Authors of papers get tools to enter additional data that corrects errors in the processing of citations found in their papers and enriches their citation relationships.
- Public control. Readers of papers can react, publicly or privately, to authors who misuse the enrichment facilities to increase their number of citations.

Cirtec technology:
- Takes papers from RePEc and Socionet
- Returns citation data to RePEc/Socionet
- Is integrated with CitEc/RePEc at the data level
- Uses PDF.js to convert PDF to JSON
- Stores citation data as XML files
- Provides open access to the produced data

Cirtec outputs:
1. Open source software to parse paper metadata and full-text PDFs, available at https://github.com/citeccyr
2. Open service that processes paper PDFs to extract citation data, including citation contexts
3. Open dataset at http://cirtec.ranepa.ru/data/
4. Statistics and a monitoring tool for the citation data extraction process, used to monitor daily changes, missed/damaged papers, processed/unprocessed citation data, etc. A fragment of the main page: http://cirtec.ranepa.ru/stats.html

Statistics on the dataset of citation data
Totals as of 2018-09-01:
- processed collections of papers: 317
- metadata records available: 144,250
- records with links to the paper's full text: 132,035
- PDF files in Web ARChive: 108,823
- JSON files with found reference sections: 74,268
- total references: 1,272,126
- total citation contexts: 1,203,358
- total mentioned references: 1,091,996
- total citation relationships (including DOI): 166,976
- total non-mentioned references: 180,130
Percentages marked on the slide: 50% and 15%.
We have accumulated and stored all Cirtec statistics since 2018-07-05.
Source: http://cirtec.ranepa.ru/stats.html
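The totals in these statistics can be related to one another with simple arithmetic. The Python sketch below uses only figures reported for 2018-09-01; the variable names and the choice of ratios are illustrative assumptions, not indicators defined by Cirtec.

```python
# Totals copied from the Cirtec statistics for 2018-09-01.
stats = {
    "metadata_records": 144_250,
    "records_with_fulltext_links": 132_035,
    "json_with_reference_sections": 74_268,
    "total_references": 1_272_126,
    "mentioned_references": 1_091_996,
    "non_mentioned_references": 180_130,
}


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Consistency check: mentioned and non-mentioned references add up to the total.
references_sum = stats["mentioned_references"] + stats["non_mentioned_references"]

# Illustrative ratios (our choice, not official Cirtec indicators).
fulltext_share = share(stats["records_with_fulltext_links"], stats["metadata_records"])
parsed_share = share(stats["json_with_reference_sections"], stats["metadata_records"])
non_mentioned_share = share(stats["non_mentioned_references"], stats["total_references"])

print(fulltext_share, parsed_share, non_mentioned_share)
```

Under these assumptions, about 91.5% of metadata records link to a full text, about 51.5% yield a JSON file with a reference section, and about 14.2% of all references are never mentioned in the text.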

Current Cirtec activities: citation contexts analysis
- Index of references that provides, for each reference:
  - number/id of papers where the reference occurs
  - number/id of in-text citations for the reference (by papers)
  - citation contexts for the reference (by papers)
- Co-occurrence of references in papers:
  - frequencies and list of references with common citation contexts
  - common citation contexts as characteristics of similarities between references
- Polarity of citation contexts (sentiment analysis)
- Word2vec and Doc2vec analysis of citation contexts (similarity analysis)
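As a rough illustration of the index of references and the co-occurrence counts, here is a minimal Python sketch. The input layout (paper id, reference id, context text), the sample records, and the function names are hypothetical; this is not the Cirtec data format or implementation, and co-occurrence is simplified to "cited in the same paper".

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical input: one record per in-text citation.
citations = [
    ("paper1", "refA", "as shown in [1], citation contexts matter"),
    ("paper1", "refB", "the method of [2] extends this idea"),
    ("paper1", "refA", "we follow [1] in our setup"),
    ("paper2", "refA", "earlier work [3, 4] reported similar results"),
    ("paper2", "refB", "earlier work [3, 4] reported similar results"),
    ("paper2", "refC", "unlike [5], we use full texts"),
]


def build_index(records):
    """Map each reference to its citation contexts, grouped by citing paper."""
    index = defaultdict(lambda: defaultdict(list))
    for paper_id, ref_id, context in records:
        index[ref_id][paper_id].append(context)
    return index


def co_occurrence(index):
    """Count, for each pair of references, the papers that cite both."""
    refs_by_paper = defaultdict(set)
    for ref_id, papers in index.items():
        for paper_id in papers:
            refs_by_paper[paper_id].add(ref_id)
    pairs = Counter()
    for refs in refs_by_paper.values():
        pairs.update(combinations(sorted(refs), 2))
    return pairs


index = build_index(citations)
pairs = co_occurrence(index)

# Ids of papers where refA occurs, and its in-text citation counts by paper.
papers_citing_a = sorted(index["refA"])
counts_a = {paper: len(contexts) for paper, contexts in index["refA"].items()}
```

The same index keeps the context strings themselves, so it could feed later steps such as sentiment or similarity analysis of the contexts.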

Future Cirtec: ambitious aims
Transformation of in-text citations into interactive elements, to create channels for scholarly communication and research cooperation.
Using these channels:
- the cited authors know who used which of their outputs
- the cited authors can inform the citing authors about updates to the cited outputs
- the citing authors can send requests to cited authors about needed development of the cited outputs
As a result:
- the research community has wider scholarly cooperation than now
- scholars have better individual research performance

Research Information System (RIS)
If we integrate citation data into an RIS with a rich semantic layer, we can enrich the data with many additional attributes, such as contact data of the citing and cited authors.
[Diagram: in-text citation data and reference data connect the citing paper to the cited paper. For each side the RIS holds the paper's metadata and full-text PDF, the author's profile and contacts, the author's other papers, and the author's affiliation with the organization's profile, its other authors, and their papers.]

Interactive in-text citations: first experiments
- PDF.js module to convert PDF to JSON
- Hypothes.is annotation tool within Socionet
- Formatting citation data according to the Web Annotation Data Model
- Computer-generated annotations for the in-text citations
A fragment of a paper's PDF with annotated in-text citations (source: https://goo.gl/bzjwzz)
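To make the Web Annotation formatting concrete, the following Python sketch builds one annotation for an in-text citation using the general structure of the W3C Web Annotation Data Model (an annotation with a body and a target selected by a text quote). The function name, the example.org URLs, and the sample citation text are placeholders; the actual fields used by Cirtec and Socionet are not given in the slides.

```python
import json


def citation_annotation(annotation_id: str, pdf_url: str, reference_url: str,
                        exact: str, prefix: str, suffix: str) -> dict:
    """Build a Web Annotation linking an in-text citation to its reference."""
    return {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "id": annotation_id,
        "type": "Annotation",
        "motivation": "linking",
        # The body is the cited reference, given here as a plain IRI.
        "body": reference_url,
        "target": {
            # The citing paper, plus the quoted citation text and its context.
            "source": pdf_url,
            "selector": {
                "type": "TextQuoteSelector",
                "exact": exact,
                "prefix": prefix,
                "suffix": suffix,
            },
        },
    }


# Placeholder values for illustration only.
annotation = citation_annotation(
    annotation_id="http://example.org/annotations/1",
    pdf_url="http://example.org/papers/citing.pdf",
    reference_url="http://example.org/papers/cited",
    exact="(Smith, 2010)",
    prefix="as argued earlier ",
    suffix=", citation contexts",
)
serialized = json.dumps(annotation, indent=2)
```

A text-quote selector is used here because it anchors the annotation to the citation string and its surrounding words rather than to page coordinates; other selector types from the same model could be substituted.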

Taxonomy of cited author's reactions
Columns on the slide: values for citing authors, for cited authors, and for readers.
Reactions:
- agree with this citation, comment
- disagree with this citation, comment
- ready to improve my paper
- ready to help with getting a better effect from using my paper
- propose making a joint paper
- propose a joint development of my results
- misunderstanding of my paper
- protest against the style of this citation

Contacts
Web: http://cirtec.ranepa.ru/
Oxana Medvedeva, Cirtec project head, oxana.medvedeva.1984@gmail.com
Sergey Parinov, Cirtec development group leader, sparinov@gmail.com