Designing an Affiliation Extractor for Turkish Universities through Finite State Graphs

Similar documents
First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education

Analyzing the Intellectual Structure of World Information Literacy Literature through Citations and Co-Citations

THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014

A bibliometric analysis of the Journal of Academic Librarianship for the period of

CITATION INDEX AND ANALYSIS DATABASES

What is Web of Science Core Collection? Thomson Reuters Journal Selection Process for Web of Science

researchtrends IN THIS ISSUE: Did you know? Scientometrics from past to present Focus on Turkey: the influence of policy on research output

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education

Bibliometric glossary

Citation analysis: Web of science, scopus. Masoud Mohammadi Golestan University of Medical Sciences Information Management and Research Network

ICI JOURNALS MASTER LIST Detailed Report for 2017

2nd International Conference on Advances in Social Science, Humanities, and Management (ASSHM 2014)

Scientometric Measures in Scientometric, Technometric, Bibliometrics, Informetric, Webometric Research Publications

CONTRIBUTION OF INDIAN AUTHORS IN WEB OF SCIENCE: BIBLIOMETRIC ANALYSIS OF ARTS & HUMANITIES CITATION INDEX (A&HCI)

Web of Knowledge Workflow solution for the research community

Contribution of Academics towards University Rankings: South Eastern University of Sri Lanka

MEASURING EMERGING SCIENTIFIC IMPACT AND CURRENT RESEARCH TRENDS: A COMPARISON OF ALTMETRIC AND HOT PAPERS INDICATORS

Web of Science Unlock the full potential of research discovery

Citation Analysis in Research Evaluation

Journal Citation Reports Your gateway to find the most relevant and impactful journals. Subhasree A. Nag, PhD Solution consultant

Information Networks

BIBLIOMETRIC REPORT. Bibliometric analysis of Mälardalen University. Final Report - updated. April 28 th, 2014

Edited Volumes, Monographs, and Book Chapters in the Book Citation Index. (BCI) and Science Citation Index (SCI, SoSCI, A&HCI)

ISSN: ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 3, Issue 2, March 2014

What is bibliometrics?

The use of bibliometrics in the Italian Research Evaluation exercises

International Journal of Library and Information Studies ISSN: Vol.3 (3) Jul-Sep, 2013

BIBLIOMETRIC STUDY OF INDIAN JOURNAL OF MICROBIOLOGY:

International Journal of Library Science and Information Management (IJLSIM)

Bibliometric Analysis of the Indian Journal of Chemistry

Bibliometric Analysis of Electronic Journal of Knowledge Management

Trends in Research Librarianship Literature: A Social Network Analysis of Articles

Trends in Research Librarianship Literature: A Social Network Analysis of Articles

Valeria Aman Does the Scopus author ID suffice to track scientific international mobility? A case study based on Leibniz laureates (abstract IS10)

Horizon 2020 Policy Support Facility

Mapping and Bibliometric Analysis of American Historical Review Citations and Its Contribution to the Field of History

A Bibliometric Study of Chinese Librarianship: An International Electronic Journal,

Citation Analysis. Presented by: Rama R Ramakrishnan Librarian (Instructional Services) Engineering Librarian (Aerospace & Mechanical)

On the causes of subject-specific citation rates in Web of Science.

Complementary bibliometric analysis of the Educational Science (UV) research specialisation

JOURNAL IMPACT FACTOR. 3-year calculation window (2015, 2016, and 2017)

Journal of Documentation : a Bibliometric Study

RESEARCH TRENDS IN INFORMATION LITERACY: A BIBLIOMETRIC STUDY

The Google Scholar Revolution: a big data bibliometric tool

Scientometrics Study on Web: Tools and Techniques

Bibliometric analysis of the field of folksonomy research

Vol. 48, No.1, February

SEARCH about SCIENCE: databases, personal ID and evaluation

InCites Indicators Handbook

Complementary bibliometric analysis of the Health and Welfare (HV) research specialisation

Battle of the giants: a comparison of Web of Science, Scopus & Google Scholar

Scopus. Advanced research tips and tricks. Massimiliano Bearzot Customer Consultant Elsevier

Mapping the Research Productivity of Three Medical Sciences Journals Published in Saudi Arabia: A Comparative Bibliometric Study

Academic Identity: an Overview. Mr. P. Kannan, Scientist C (LS)

Bibliometric Analysis of Parasitological Research in Iran and Turkey: A Comparative Study

What are Bibliometrics?

FROM IMPACT FACTOR TO EIGENFACTOR An introduction to journal impact measures

Experiences with a bibliometric indicator for performance-based funding of research institutions in Norway

Professor Birger Hjørland and associate professor Jeppe Nicolaisen hereby endorse the proposal by

Bibliometrics and scientometrics in India: an overview of studies during

The 2016 Altmetrics Workshop (Bucharest, 27 September, 2016) Moving beyond counts: integrating context

Bibliometric practices and activities at the University of Vienna

Publication Output and Citation Impact

International Journal of Library and Information Studies

Application of Lotka s Law in the field of. Human Biology Journal 2007

Representing Social Sciences

Using InCites for strategic planning and research monitoring in St.Petersburg State University

A bibliometric analysis of publications by staff from Mid Yorkshire Hospitals NHS Trust,

Working Paper Series of the German Data Forum (RatSWD)

Applicability of Lotka s Law and Authorship pattern in the field of Mathematical Science Research: A Scientometric Study

CHAPTER I INTRODUCTION

VISIBILITY OF AFRICAN SCHOLARS IN THE LITERATURE OF BIBLIOMETRICS

Telescope Bibliometrics 101. Uta Grothkopf & Jill Lagerstrom

F. W. Lancaster: A Bibliometric Analysis

New analysis features of the CRExplorer for identifying influential publications

Indian LIS Literature in International Journals with Specific Reference to SSCI Database: A Bibliometric Study

Citation & Journal Impact Analysis

CITATION METRICS WORKSHOP (WEB of SCIENCE)

Missing author address information in Web of Science An explorative study Weishu Liu1, Guangyuan Hu2, Li Tang* Accepted by Journal of Informetrics

Assessing researchers performance in developing countries: is Google Scholar an alternative?

Using Bibliometric Analyses for Evaluating Leading Journals and Top Researchers in SoTL

Scientometrics and Evaluation of Humanities and Social Sciences

On the relationship between interdisciplinarity and scientific impact

Predicting the Importance of Current Papers

PBL Netherlands Environmental Assessment Agency (PBL): Research performance analysis ( )

Should author self- citations be excluded from citation- based research evaluation? Perspective from in- text citation functions

How to write the report

Managing the EUI digital library: Digital Library Services, Databases and CD-ROMs for historians

BIBLIOMETRIC REPORT. Netherlands Bureau for Economic Policy Analysis (CPB) research performance analysis ( ) October 6 th, 2015

Corso di dottorato in Scienze Farmacologiche Information Literacy in Pharmacological Sciences 2018 WEB OF SCIENCE SCOPUS AUTHOR INDENTIFIERS

The rate of growth in scientific publication and the decline in coverage provided by Science Citation Index

1.INTRODUCTION. compilations of science indicators heavily rely on publication and citation

Citation Indexes and Bibliometrics. Giovanni Colavizza

Focus on bibliometrics and altmetrics

BIBLIOMETRIC ANAYSIS OF ANNALS OF LIBRARY AND INFORMATION STUDIES ( )

Año 8, No.27, Ene Mar What does Hirsch index evolution explain us? A case study: Turkish Journal of Chemistry

Web of Science. Search and Navigation in the Web of Knowledge

Google Scholar and ISI WoS Author metrics within Earth Sciences subjects. Susanne Mikki Bergen University Library

Do we use standards? The presence of ISO/TC-46 standards in the scientific literature ( )

Transcription:

Designing an Affiliation Extractor for Turkish Universities through Finite State Graphs Zehra Taşkın & Umut Al {ztaskin, umutal}@hacettepe.edu.tr - 1

Plan Information retrieval and its relation to bibliometrics Web of Science and citation indexes Data inconsistency in citation indexes Methodology and the aim of the study Affiliation extractor model for Turkish Universities - 2

Information Retrieval and its Relation to Bibliometrics Information retrieval problem (high volume natural language texts) Bibliometrics is the the application of mathematical and statistical methods to books and other media of communication (Pritchard, 1969, p. 348) Research evaluation Fund distributions Academic appointments and incentives Impact of scientific outputs Science policy making - 3

WoS and Citation Indexes A platform and indexes Science Citation Index (SCI), Social Sciences Citation Index (SSCI) and Arts and Humanities Citation Index (A&HCI) One of the main sources for research evaluation Problem: Natural language indexing - 4

Data Inconsistency in Citation WYSIWYG Institution names Author names Journal names Indexes Character or spelling errors Translation errors Indexing errors Standardization errors - 5

Examples Harvard Univ => Harward Univ Hacettepe Univ => Hacetteppe Univ Univ Trakya => Univ Trakia Dumlupinar Univ => Durnlupinar Univ Standardization errors; Hacettepe Hosp >> Hacettepe Univ Hacettepe Fac Med >> Hacettepe Univ - 6

Methodology Data source: Web of Science 197,687 Turkey-addressed publications Published between 1928-2009 Deep data cleaning and unification process The addresses of 50 universities that have more than 1,000 publications were analyzed Nooj for finite state graphs - 7

Aim of the Study Designing an extractor for the identification of Turkish Universities affiliations by using finite state graphs Testing the possibility of employing machine learning for the task of affiliation identification and extraction by using finite state graphs - 8

Background (Taşkın & Al, 2014) - 9

Background - 10

Background - 11

Background - 12

Findings A total of 433 rules for 50 universities were found - 13

The FSG Model - 14

Concordance of Founded Affiliations - 15

Limitations & Future Studies The rule list for Turkish universities created manually due to not to lose any variations of affiliations This study can provide a basis for future studies focusing on automatic learning algorithms for affiliations to measure the success of machine learning - 16

Conclusion This model could be extracted 99.05% of the rules The affiliation extraction based on the general identification of main affiliation patterns for Turkish universities, can help the future studies Rule list creation is time consuming and impractical However, it is more useful for the future studies that used machine learning algorithms, since it provides opportunity for comparison - 17

References Pritchard, A. (1969). Statistical bibliography or bibliometrics? Journal of Documentation, 25(4), 348-349. Taşkın, Z. & Al, U. (2014). Standardization problem of author affiliations in citation indexes. Scientometrics, 98(1), 347-368. - 18

Designing an Affiliation Extractor for Turkish Universities through Finite State Graphs Zehra Taşkın & Umut Al {ztaskin, umutal}@hacettepe.edu.tr - 19