Constructing bibliometric networks: A comparison between full and fractional counting

Size: px
Start display at page:

Download "Constructing bibliometric networks: A comparison between full and fractional counting"

Transcription

1 Constructing bibliometric networks: A comparison between full and fractional counting Antonio Perianes-Rodriguez 1, Ludo Waltman 2, and Nees Jan van Eck 2 1 SCImago Research Group, Departamento de Biblioteconomia y Documentacion, Universidad Carlos III, Getafe, Madrid, Spain aperiane@bib.uc3m.es 2 Centre for Science and Technology Studies, Leiden University, Leiden, The Netherlands {waltmanlr, ecknjpvan}@cwts.leidenuniv.nl The analysis of bibliometric networks, such as co-authorship, bibliographic coupling, and co-citation networks, has received a considerable amount of attention. Much less attention has been paid to the construction of these networks. We point out that different approaches can be taken to construct a bibliometric network. Normally the full counting approach is used, but we propose an alternative fractional counting approach. The basic idea of the fractional counting approach is that each action, such as co-authoring or citing a publication, should have equal weight, regardless of for instance the number of authors, citations, or references of a publication. We present two empirical analyses in which the full and fractional counting approaches yield very different results. These analyses deal with co-authorship networks of universities and bibliographic coupling networks of journals. Based on theoretical considerations and on the empirical analyses, we conclude that for many purposes the fractional counting approach is preferable over the full counting one. 1. Introduction The study of bibliometric networks, such as co-authorship, bibliographic coupling, and co-citation networks, has a long history in the field of bibliometrics, with early work dating back to the 1960s and 1970s (e.g., De Solla Price, 1965; Kessler, 1963; Small, 1973). Many different methods for analyzing and visualizing bibliometric networks have been studied by bibliometricians (e.g., Börner, Chen, & Boyack, 2003; Milojević, 2014; Van Eck & Waltman, 2014; Zhao & Strotmann, 2015). However, 1

2 before bibliometric networks can be analyzed and visualized, they first need to be constructed. The construction of bibliometric networks has received remarkably little attention in the literature (for important exceptions, see Batagelj & Cerinšek, 2013; Park, Yoon, & Leydesdorff, 2016). It seems that the construction of bibliometric networks is typically seen as a more or less trivial step that does not need any special consideration. In this paper, we argue that this step is far from trivial. We point out that different approaches can be taken to construct bibliometric networks. Our aim is to draw attention to the existence of different approaches for constructing bibliometric networks, to clarify the conceptual differences between these approaches, and to show that these approaches may yield very different results. A well-known problem in the field of bibliometrics is the issue of assigning coauthored publications to individual authors. For instance, when a publication is coauthored by three researchers, how should the publication be counted for each individual researcher? In the context of the calculation of bibliometric indicators, many different approaches have been proposed to this problem (for overviews, see Gauffriau, Larsen, Maye, Roulin-Perriard, & Von Ins, 2007; Waltman, 2016, Section 7). The most popular approaches are the full counting method (also known as the whole counting method) and the fractional counting method (e.g., Aksnes, Schneider, & Gunnarsson, 2012; Waltman & Van Eck, 2015). In the case of the full counting method, a publication co-authored by three researchers is assigned to each researcher with a full weight of one. On the other hand, in the case of the fractional counting method, the publication is assigned to each researcher with a fractional weight of 1 / 3. In this paper, we show how the distinction between full and fractional counting, which has been studied extensively in the context of the calculation of bibliometric indicators, can be translated to the context of the construction of bibliometric networks. Consider for instance the construction of a co-authorship network. Suppose researcher X has co-authored a publication with five other researchers. In the conventional approach to the construction of bibliometric networks, this yields five co-authorship links with a weight of one for researcher X. We refer to this approach as the full counting method. An alternative approach is to assign a weight of 1 / 5 to each of the five co-authorship links. In this approach, which we refer to as the fractional counting method, the total weight of the co-authorship links that a 2

3 researcher obtains because of co-authoring a publication equals one. This total weight of one is distributed equally over the individual co-authorship links. To construct bibliometric networks, researchers have traditionally used the full counting method. To the best of our knowledge, the fractional counting method has hardly been used in the literature (for the only exception that we are aware of, see Newman, 2001c), although some related ideas have been proposed (Batagelj & Cerinšek, 2013; Cerinšek & Batagelj, 2015; Park et al., 2016; Persson, 1994, 2010). 1 In this paper, we carefully define the full and fractional counting methods. Our focus is on three popular types of bibliometric networks, namely co-authorship, bibliographic coupling, and co-citation networks, but our ideas extend to other types of bibliometric networks as well. We also provide two examples of situations in which the choice between the full and fractional counting methods makes a big difference. One example is about co-authorship networks of universities. The other example deals with bibliographic coupling networks of journals. In both examples, we argue that the fractional counting method is preferable over the full counting method. We note that the full and fractional counting methods are both available in the VOSviewer software ( Van Eck & Waltman, 2010, 2014) for constructing and visualizing bibliometric networks. The VOSviewer software can be used to construct bibliometric networks based on data downloaded from bibliographic databases such as Web of Science and Scopus. The software requests the user to choose between the use of the full and the fractional counting method. The information provided in this paper should help VOSviewer users in choosing the most appropriate counting method for their analyses. This paper is organized as follows. Formal definitions of the full and fractional counting methods in the context of the construction of bibliometric networks are provided in Section 2. An empirical comparison between the two counting methods is reported in Section 3. We present our conclusions in Section 4. 1 Small and Sweeney (1985) also use a fractional counting approach in the context of the construction of a bibliometric network. However, they do not use fractional counting in the actual construction of the network, but instead they use fractional counting to select the publications to be included in the network. 3

4 2. Constructing bibliometric networks In this section, we provide a detailed discussion of the full and fractional counting methods for constructing bibliometric networks. We first discuss in general terms the difference between full and fractional counting. We then focus specifically on coauthorship networks, followed by bibliographic coupling and co-citation networks. We focus on these three types of bibliometric networks because they seem to be the types of bibliometric networks that receive most attention in the literature. However, we emphasize that our ideas apply to other types of bibliometric networks as well. For an overview of the literature on different types of bibliometric networks, we refer to Van Eck and Waltman (2014, Subsection 2.1) Full counting vs. fractional counting In the context of the calculation of bibliometric indicators, the concepts of a publication and a co-author play a key role in the distinction between full and fractional counting. Full counting means that a co-authored publication is counted with a full weight of one for each co-author, which implies that the overall weight of a publication is equal to the number of authors of the publication. Fractional counting means that a co-authored publication is assigned fractionally to each of the coauthors, with the overall weight of the publication being equal to one. Hence, in the case of fractional counting, each publication has the same overall weight. In the context of the construction of bibliometric networks, a similar distinction between full and fractional counting can be made. However, in order to do so, the concepts of a publication and a co-author need to be replaced by appropriate networkrelated concepts. We replace the concept of a publication by the concept of an action. The concept of a co-author is replaced by the concept of a link. For specific types of bibliometric networks, the concepts of an action and a link can be given a more concrete interpretation. For instance, in the case of a co-authorship network, coauthoring a publication with other researchers is an action and this action results in co-authorship links. In the case of a bibliographic coupling or co-citation network, giving a citation is an action and this action results in bibliographic coupling or cocitation links. When full counting is used to construct a bibliometric network, each link resulting from an action has a full weight of one, which means that the overall weight of an action is equal to the number of links resulting from the action. On the other hand, 4

5 when fractional counting is used, each link has a fractional weight such that the overall weight of an action equals one. For instance, in the case of fractional counting, the decision of a researcher to co-author a publication with five other researchers should have the same weight as the decision of a researcher to co-author a publication with 500 other researchers. In the first situation, five new co-authorship links are introduced. Each of these links is assigned a fractional counting weight of 1 / 5, so that the total weight equals 5 (1 / 5) = 1. The second situation results in 500 new coauthorship links, each with a fractional counting weight of 1 / 500, which again yields a total weight of 500 (1 / 500) = 1. In the case of full counting, each co-authorship link has a weight of one in both situations, resulting in a total weight of 5 in the first situation and 500 in the second situation. Hence, based on full counting, the decision made in the second situation has 100 times as much weight as the decision made in the first situation. Table 1. Summary of the key differences between full and fractional counting, both in the context of the calculation of bibliometric indicators (where N denotes the number of co-authors of a publication) and in the context of the construction of bibliometric networks (where N denotes the number of links resulting from an action). Full counting Fractional counting Indicators Each co-author has a weight of 1. Each co-author has a weight of 1 / N. Each publication has a total weight of N. Each publication has a total weight of 1. Networks Each link has a weight of 1. Each link has a weight of 1 / N. Each action has a total weight of N. Each action has a total weight of 1. A completely analogous example can be given for the construction of a bibliographic coupling network, where links are created when two publications both cite the same third publication (Kessler, 1963). In the case of fractional counting, giving a citation to a publication that has already been cited by five other publications has the same weight as giving a citation to a publication that has already been cited by 500 other publications. In the first situation, five new bibliographic coupling links are introduced, each with a fractional counting weight of 1 / 5, which gives a total weight of 5 (1 / 5) = 1. The second situation results in 500 new bibliographic coupling links, each with a fractional counting weight of 1 / 500, and again a total weight of 500 (1 / 500) = 1 is obtained. In the case of full counting, all bibliographic coupling 5

6 links have a weight of one in both situations, and therefore the total weight equals 5 in the first situation and 500 in the second situation. The key differences between full and fractional counting are summarized in Table 1. The table also shows how full and fractional counting in the context of the construction of bibliometric networks relate to full and fractional counting in the context of the calculation of bibliometric indicators Arguments in favor of fractional counting In the context of the construction of bibliometric networks, why would fractional counting be preferable over full counting, at least for certain purposes? In other words, why would it be reasonable to require each action to have the same weight? Let us provide an argument in the context of bibliographic coupling analysis. Suppose we have a publication and suppose we want to use bibliographic coupling analysis to identify other related publications. Bibliographic coupling analysis starts from the idea that the references cited in a publication reflect what the publication is about and, consequently, that publications citing the same references are related to each other. In the case of full counting, references that are cited not only by our focal publication but also by many other publications have a larger overall influence on the bibliographic coupling analysis than references that are cited by just a few other publications. In a certain sense, this means that in the full counting case highly cited references are seen as more representative of what a publication is about than lowly cited references. This may not be desirable. Suppose for instance that our focal publication cites both a lowly cited research article dealing with a closely related topic and a highly cited review article that offers a broad overview of the literature, including many topics that are only weakly related to the topic of our focal publication. In this situation, the lowly cited research article is more representative of what our focal publication is about than the highly cited review article. However, in the full counting case, the reference to the highly cited review article has a much larger influence on the bibliographic coupling analysis than the reference to the lowly cited research article. One could therefore say that the reference to the highly cited review article is treated as being more representative of the topic of our focal publication than the reference to the lowly cited research article, while it actually should have been the other way around. 6

7 In the case of fractional counting, each reference cited in a publication has the same influence in a bibliographic coupling analysis, which essentially means that each reference is considered to be equally representative of what the publication is about. We believe this to be a very reasonable idea, more reasonable than the idea of highly cited references being more representative than lowly cited references. In practice, some references cited in a publication are of course more representative of what the publication is about than others. However, we see no reason to expect highly cited references to be systematically more representative than lowly cited references. Without any further information, the most reasonable idea seems to be to treat each reference cited in a publication as being equally representative, and this is what is done by fractional counting. The above argument in favor of fractional counting applies to bibliographic coupling analysis, but similar arguments can be given for other types of analysis as well. For instance, when co-authorship analysis is used to identify strong collaborative ties between researchers, it can be argued that the most reasonable approach is to consider each publication of a researcher to be equally important in the researcher s oeuvre. This may then result in fractional counting being preferable over full counting Co-authorship networks We now discuss in more detail the construction of co-authorship networks using full and fractional counting. We first provide a technical discussion, we then present a simple example, and finally we briefly refer to some related work in the literature. Constructing co-authorship networks Co-authorship networks can be constructed for different units of analysis, such as researchers, research institutions, and countries. In the discussion below, we use researchers as the unit of analysis (e.g., Newman, 2001a, 2001b, 2001c). However, we emphasize that the discussion also applies to other units of analysis. We use N and M to denote, respectively, the number of researchers and the number of publications included in the analysis, and we use A = [a ik ] to denote an N M authorship matrix. Element a ik of this matrix equals 1 if researcher i is an author of publication k and 0 otherwise. We further use n k to denote the number of authors of publication k, that is, 7

8 N n k = a ik. (1) i=1 Publications that have only one author do not provide any co-authorship links. For simplicity, we therefore assume that each publication included in the analysis has at least two authors. This means that n k > 1 for each publication k. We first consider the case of full counting. We use U = [u ij ] to denote the full counting co-authorship matrix. This is a symmetrical N N matrix. Element u ij of this matrix equals the number of full counting co-authorship links between researchers i and j and is given by M u ij = a ik a jk. (2) k=1 In matrix notation, the co-authorship matrix U is given by U = AA T. (3) Hence, the co-authorship matrix U is obtained by post-multiplying the authorship matrix A by its transpose. Self-links in a co-authorship network are usually of no interest, and therefore the main diagonal elements of the co-authorship matrix U are set to 0. We now consider the case of fractional counting, where we denote the fractional counting co-authorship matrix by U * = [u * ij]. The number of fractional counting coauthorship links between researchers i and j, denoted by u * ij, is given by u ij M = a ika jk. (4) n k 1 k=1 Equivalently, the co-authorship matrix U * is obtained by U = A diag(a T 1 1) 1 A T, (5) 8

9 where diag(v) denotes a diagonal matrix with the elements of the vector v on the main diagonal and where 1 denotes a column vector of length N with all elements equal to 1. The main diagonal elements of the co-authorship matrix U * are set to 0. Example To illustrate the use of full and fractional counting for constructing co-authorship networks, we consider a simple example in which we have four researchers and three publications. Table 2 presents the authorship matrix and Figure 1 displays the corresponding authorship network. Table 2. Authorship matrix. P1 P2 P3 Total R R R R Total Figure 1. Authorship network. The full and fractional counting co-authorship matrices and the corresponding coauthorship networks are presented in Table 3 and Figure 2, respectively. We note that for each researcher the total weight of the fractional counting co-authorship links is equal to the number of publications the researcher has authored. This is a general property of fractional counting co-authorship analyses. 9

10 Table 3. Full and fractional counting co-authorship matrices. Full counting Fractional counting R1 R2 R3 R4 Total R1 R2 R3 R4 Total R R R R R R R R Total Total Figure 2. Full and fractional counting co-authorship networks. To illustrate how the weights of the fractional counting co-authorship links have been obtained, we take the link between researchers 1 and 3 as an example. Researcher 1 has co-authored publication 1 with two other researchers. This yields two co-authorship links for researcher 1, and one of these links is with researcher 3. It follows from Eq. (4) that the two co-authorship links each have a weight of 1 / (3 1) = 0.5. Researcher 1 has co-authored publication 2 only with researcher 3, and this results in a co-authorship link with a weight of 1 / (2 1) = 1. In total, we obtain a weight of = 1.5 for the co-authorship link between researchers 1 and 3. As explained in Subsection 2.1, in the case of fractional counting, each action should have the same weight. For instance, the decision of researcher 2 to co-author publication 1 with researchers 1 and 3 should have the same weight as researcher 2 s decision to co-author publication 3 with researcher 4. The co-authorship links of researcher 2 with researchers 1 and 3 each have a weight of 1 / (3 1) = 0.5, which means that the weight of researcher 2 s decision to co-author publication 1 with researchers 1 and 3 equals = 1. The weight of researcher 2 s decision to coauthor publication 3 with researcher 4 equals 1 / (2 1) = 1. Hence, in the case of fractional counting, the two actions of researcher 2 indeed have the same weight. 10

11 We note that it is essential to have a denominator of n k 1 rather than n k in Eq. (4). We need to subtract 1 from n k in the denominator because we do not consider self-links in a co-authorship network. Without subtracting 1 from n k, the weight of researcher 2 s decision to co-author publication 1 with researchers 1 and 3 would have been 2 1 / 3 = 0.67, while the weight of researcher 2 s decision to co-author publication 3 with researcher 4 would have been 1 / 2 = 0.5. Hence, without subtracting 1 from n k, the weight of the two actions of researcher 2 would not have been the same. Related work Our fractional counting method for constructing co-authorship networks is equivalent to the approach for constructing weighted co-authorship networks proposed by Newman (2001c). Our fractional counting method is also related to the approaches for constructing co-authorship networks introduced by Batagelj and Cerinšek (2013) and Park et al. (2016). In the appendix, we discuss in more detail how our fractional counting method relates to these approaches for constructing coauthorship networks Bibliographic coupling networks In Subsection 2.3, the construction of co-authorship networks using full and fractional counting was discussed. We now turn to the construction of bibliographic coupling networks. The discussion below closely resembles the discussion in Subsection 2.3, but there are also some small differences. Constructing bibliographic coupling networks Bibliographic coupling networks can be constructed for different units of analysis, such as publications, journals, and researchers. Our focus will be on researchers as the unit of analysis (Zhao & Strotmann, 2008a), but we emphasize that the discussion below also applies to other units of analysis. In a bibliographic coupling analysis of researchers, the relatedness of researchers is determined based on the degree to which they cite the same publications. The more often two researchers cite the same publications, the stronger their relatedness. We use N and M to denote, respectively, the number of researchers and the number of publications included in the analysis, and we use C = [c ik ] to denote an N M citation matrix. Element c ik of this matrix equals the number of citations received 11

12 by publication k from researcher i. We further use n k to denote the total number of citations received by publication k from all researchers included in the analysis, that is, N n k = c ik. (6) i=1 Publications that have been cited fewer than two times do not provide any bibliographic coupling links. We therefore assume that each publication included in the analysis has received at least two citations, which means that n k > 1 for each publication k. We use V = [v ij ] to denote the N N full counting bibliographic coupling matrix. Element v ij of this matrix equals the number of full counting bibliographic coupling links between researchers i and j and is given by M v ij = c ik c jk. (7) k=1 Hence, the bibliographic coupling matrix V is given by V = CC T. (8) Turning now to the fractional counting case, we use V * = [v * ij] to denote the fractional counting bibliographic coupling matrix. The number of fractional counting bibliographic coupling links between researchers i and j, denoted by v * ij, is given by v ij M = c ikc jk. (9) n k 1 k=1 Equivalently, the bibliographic coupling matrix V * is obtained by V = C diag(c T 1 1) 1 C T. (10) 12

13 Self-links in a bibliographic coupling network are usually of no interest, and therefore the main diagonal elements of the bibliographic coupling matrices V and V * are set to 0. Example We consider an example with five researchers and four publications. The citation matrix and the corresponding citation network are presented in Table 4 and Figure 3, respectively. We note that a researcher can give multiple citations to the same publication. For instance, researcher 1 has cited publication 1 three times. This means that researcher 1 has authored three publications in which publication 1 is cited. Table 4. Citation matrix. P1 P2 P3 P4 Total R R R R R Total Figure 3. Citation network. The full and fractional counting bibliographic coupling matrices and the corresponding bibliographic coupling networks can be found in Table 5 and Figure 4, respectively. 13

14 Table 5. Full and fractional counting bibliographic coupling matrices. Full counting Fractional counting R1 R2 R3 R4 R5 Total R1 R2 R3 R4 R5 Total R R R R R R R R R R Total Total Figure 4. Full and fractional counting bibliographic coupling networks. This example can be used to illustrate how fractional counting implements the idea that each action should have the same weight. Researcher 5 cites publication 4, which results in a bibliographic coupling link with researcher 4 with a weight of 1 / (2 1) = 1. Likewise, researcher 5 cites publication 2, resulting in bibliographic coupling links with researchers 1 and 3 that have weights of, respectively, 1 / (4 1) = 0.33 and 2 / (4 1) = 0.67, which corresponds with a total weight of = 1. This shows that the two actions of researcher 5 both have the same weight of one. Let us now consider researcher 3. This researcher cites publication 1, which results in bibliographic coupling links with researchers 1 and 2 that have weights of, respectively, 3 / (6 1) = 0.6 and 2 / (6 1) = 0.4, yielding a total weight of = 1. Researcher 3 also gives two citations to publication 2. These citations require a more detailed discussion. In total, publication 2 is cited four times. Each citation of publication 2 therefore corresponds with three bibliographic coupling links, each with a weight of 1 / 3 = 0.33, which gives a total weight of one. However, because researcher 3 gives two citations to publication 2, one of the bibliographic coupling links that we have is a link between the two citing publications of researcher 3. Since we are not interested in researcher self-links, this link is ignored. As a consequence, for each of researcher 3 s citations to publication 2, the total weight of the 14

15 corresponding bibliographic coupling links is less than one. More specifically, each citation corresponds with a bibliographic coupling link with researcher 1 and a bibliographic coupling link with researcher 5, and these links each have a weight of 1 / 3 = 0.33, yielding a total weight of = Hence, if researcher self-links had been taken into consideration, a total weight of one would have been obtained, but by ignoring researcher self-links we obtain a total weight below one. 2 This also explains why for some researchers (i.e., researchers 1, 2, and 3) the total weight of their fractional counting bibliographic coupling links is less than the number of citations they have made. Related work We are not aware of earlier work discussing approaches for constructing bibliographic coupling networks similar to our fractional counting method. The most closely related work seems to be the approach proposed by Batagelj and Cerinšek (2013) for constructing normalized bibliographic coupling networks. Like our fractional counting method, the approach of Batagelj and Cerinšek (2013) is based on the idea of fractionalization. However, there is a fundamental difference. While we fractionalize based on the number of citations received by a cited publication from other publications, Batagelj and Cerinšek (2013) fractionalize based on the number of citations given by a citing publication to other publications Co-citation networks After discussing the construction of co-authorship and bibliographic coupling networks using full and fractional counting, we now consider the construction of cocitation networks. Since the construction of co-citation networks is very similar to the construction of co-authorship and bibliographic coupling networks, only a brief discussion will be provided. 2 If this is considered undesirable, it can be fixed by adapting the denominator in Eq. (9). If in the denominator we subtract c ik rather than 1 from n k, we always obtain a total weight of one. However, the bibliographic coupling matrix V * may no longer be symmetrical when this approach is taken. 3 A somewhat similar approach is taken by Sen and Gan (1983) and Glänzel and Czerwon (1996). These authors also perform a normalization based on the number of citations given by a citing publication to other publications. 15

16 Constructing co-citation networks Our focus will be on researchers as the unit of analysis (McCain, 1990; White & Griffith, 1981), but we emphasize that the discussion below also applies to other units of analysis, such as publications and journals. In a co-citation analysis of researchers, the relatedness of researchers is determined based on the degree to which they are cited in the same publications. The more often two researchers are cited in the same publications, the stronger their relatedness. Like in Subsection 2.4, we use N and M to denote, respectively, the number of researchers and the number of publications included in the analysis, and we use C = [c ik ] to denote an N M citation matrix. Importantly, however, the citation matrix is defined in a different way than in Subsection 2.4. Element c ik of the matrix equals the number of citations given by publication k to researcher i (rather than the number of citations received by publication k from researcher i). We further use n k to denote the total number of citations given by publication k to all researchers included in the analysis, that is, N n k = c ik. (11) i=1 We assume that n k > 1 for each publication k. Apart from the difference in the definition of the citation matrix C, co-citation analysis is mathematically identical to bibliographic coupling analysis. We use W = [w ij ] to denote the N N full counting co-citation matrix. Element w ij of this matrix equals the number of full counting co-citation links between researchers i and j and is given by M w ij = c ik c jk. (12) k=1 The co-citation matrix W is given by W = CC T. (13) 16

17 In the fractional counting case, we use W * = [w * ij] to denote the fractional counting co-citation matrix. The number of fractional counting co-citation links between researchers i and j, denoted by w * ij, is given by w ij M = c ikc jk. (14) n k 1 k=1 The co-citation matrix W * is obtained by W = C diag(c T 1 1) 1 C T. (15) Self-links in a co-citation network are usually of no interest, and therefore the main diagonal elements of the co-citation matrices W and W * are set to 0. Related work Our fractional counting method for constructing co-citation networks is somewhat similar to a method for constructing co-citation networks discussed by Persson (1994). The latter method is used to construct normalized co-citation networks. One element in the normalization is a fractionalization similar to the one proposed in Eq. (14). The difference is that a denominator of n k is used instead of the denominator of n k 1 used in Eq. (14). This is analogous to the difference between our fractional counting method for constructing co-authorship networks and one of the approaches for constructing co-authorship networks discussed by Batagelj and Cerinšek (2013) (see the appendix for more details on this difference). We further note that there has been some discussion in the literature on how to handle publications with multiple authors when constructing co-citation networks of researchers. These discussions are about the distinction between taking into account all authors of a publication or only the first or the last one (Persson, 2001; Zhao, 2006; Zhao & Strotmann, 2008b, 2011) and about the distinction between co-citation links and co-authorship links (Rousseau & Zuccala, 2004). We do not discuss these issues in more detail in this paper. 17

18 3. Empirical analysis We now present an empirical comparison of the full and fractional counting methods for constructing bibliometric networks. We will compare the results obtained using the two counting methods, but in addition we will also show why the two counting methods yield different results. Two analyses are presented. The first analysis focuses on co-authorship networks of universities. The second analysis is about bibliographic coupling networks of journals. We have selected these two analyses because full and fractional counting yield very different results in these analyses. The analyses therefore offer important insights into the differences between the two counting methods Co-authorship networks of universities We collected all 1.28 million publications indexed in the Web of Science database that were published in 2014 and that are authored by one or more of the 750 universities included in the 2015 edition of the CWTS Leiden Ranking ( Waltman et al., 2012). Based on these publications, we constructed a full counting and a fractional counting co-authorship network of the 750 universities. Other institutions that have co-authored with the 750 universities were ignored in the analysis. The co-authorship networks were constructed following the calculations discussed in Subsection 2.3. The VOSviewer software (Van Eck & Waltman, 2010, 2014) was used to create visualizations of the full and fractional counting co-authorship networks. Figures 5 and 6 present visualizations of the university co-authorship networks constructed using full and fractional counting, respectively. Each circle represents a university. To prevent the names of universities from overlapping each other, names are shown only for a subset of the universities. The size of a circle reflects the number of publications of the corresponding university. The distance between two circles approximately indicates the strength of the co-authorship link between the corresponding universities. In general, the closer two circles are located to each other, the stronger the co-authorship link between the universities. Colors represents clusters 18

19 of universities with strong co-authorship links. Lines are used to indicate the 1,500 strongest co-authorship links between universities. 4 It is evident that there are large differences between the visualizations presented in Figures 5 and 6. In Figure 5, it is hard to identify a clear pattern in the visualization. Almost all universities are located together in one big group, with the exception of universities from a number of Asian countries located in the bottom area of the visualization. No clear grouping of universities by country is visible, neither in the positioning of the universities in the visualization nor in the clustering of the universities. For instance, while many US universities are located in the left area of the visualization, where they belong to the cyan, yellow, and green clusters, US universities can also be found in the bottom-right area of the visualization, where they mostly belong to the purple cluster. In Figure 6, on the other hand, the visualization shows a very clear pattern, both in the positioning and in the clustering of the universities. A number of distinct groups of universities are visible, and to a large extent universities turn out to be grouped by country. US universities are located in the bottom area of the visualization. In the left area, groups of Chinese, Taiwanese, Japanese, and South Korean universities can be found. In the center of the visualization, we observe an Australian and a Canadian group of universities. European universities and universities from South American countries are located in the right area of the visualization, where again a reasonably strong separation by country can be observed. The visualizations presented in Figures 5 and 6 are based on the same underlying data, but nevertheless they give a very different impression of worldwide scientific collaboration. The visualization in Figure 6, based on fractional counting, suggests that scientific collaboration takes place mostly within national borders. On the other hand, the visualization in Figure 5, based on full counting, gives the impression that national borders play only a minor role in determining scientific collaboration. How can these large differences between the two visualizations be explained? 4 To produce the visualizations using the VOSviewer software, the layout attraction and layout repulsion parameters were set to 1 and 0, respectively. The clustering resolution and minimum cluster size parameters were set to 1.25 and 5, respectively. 19

20 Figure 5. Visualization of the university co-authorship network constructed using full counting. An interactive visualization is available at Figure 6. Visualization of the university co-authorship network constructed using fractional counting. An interactive visualization is available at It turns out that the differences can be explained largely by the fact that in the case of full counting a small number of publications that have been co-authored by a large 20

21 number of universities have a very strong effect on the co-authorship network. To demonstrate this, we constructed a full counting co-authorship network in the same way as above, except that in the construction of the network we did not take into account publications co-authored by more than 20 universities. There are 702 publications that have been co-authored by more than 20 universities (i.e., 0.05% of the total number of 1.28 million publications), and these publications were not used in the construction of the co-authorship network. A visualization of the co-authorship network that was obtained in this way is presented in Figure 7. Figure 7. Visualization of the university co-authorship network constructed using full counting by including only publications co-authored by at most 20 universities. An interactive visualization is available at Importantly, the visualization in Figure 7 based on full counting is very different from the full counting visualization in Figure 5, and in fact it is quite similar to the fractional counting visualization in Figure 6. Like in the visualization in Figure 6, distinct groups of universities can be easily distinguished, and these groups largely coincide with the countries in which universities are located. Hence, it can be concluded that to a large extent the differences between full and fractional counting co-authorship networks of universities are caused by a small number of publications that have been co-authored by a large number of universities. 21

22 Table 6 provides some statistics that indicate the effect of a small number of publications with many co-authors on university co-authorship networks constructed using full counting. When in our analysis we take into account all publications regardless of their number of co-authors, we have 1.28 million publications, which yield 2.90 million co-authorship links. 5 The statistics reported in Table 6 show what happens when publications for which the number of co-authoring universities exceeds a certain threshold are not considered in the construction of a co-authorship network. In the case of the construction of the co-authorship network visualized in Figure 7, publications with more than 20 co-authoring universities were not considered. This causes a decrease of 0.05% in the number of publications. However, as can be seen in Table 6, this negligible decrease in the number of publications is responsible for a decrease of 62% in the number of co-authorship links. Even more extreme results are obtained when we take into account all publications except for those with more than 100 co-authoring universities. In that case, we lose just 0.01% of all publications, but this leads to a reduction in the number of co-authorship links by almost 50%. Based on these statistics, it is clear that in the case of full counting a very small number of publications may have a huge effect on a co-authorship network. Table 6. Number of publications considered in the construction of a co-authorship network and number of co-authorship links included in the network when publications for which the number of co-authoring universities exceeds a certain threshold are not taken into account. Threshold on no. of co-authoring universities No. of publications % of publications No. of co-authorship links % of co-authorship links 5 1,266, % 722,935 25% 10 1,276, % 939,667 32% 20 1,278, % 1,102,564 38% 50 1,278, % 1,372,300 47% 100 1,278, % 1,532,105 53% No threshold 1,278, % 2,898, % 5 If two universities have co-authored 100 publications, this can be counted either as 100 unweighted co-authorship links or as one weighted co-authorship link, where the weight equals 100. We here count co-authorship links using the former approach. 22

23 Figure 8 offers more detailed insight into the effect of publications co-authored by a large number of universities. We again explore the situation where publications for which the number of co-authoring universities exceeds a certain threshold are not considered in the construction of a co-authorship network. The figure shows how the percentage of the publications that are taken into account in the construction of a coauthorship network increases as we increase the threshold. Moreover, the figure also shows the effect of increasing the threshold on the percentage of all co-authorship links that are included in the network. Figure 8. Percentage of publications considered in the construction of a co-authorship network and percentage of co-authorship links included in the network when publications for which the number of co-authoring universities exceeds a certain threshold are not taken into account. Figure 8 shows that most co-authorship links are due to publications that either have a limited number of co-authoring universities or a very large number of coauthoring universities. Publications co-authored by at most ten universities are responsible for somewhat more than 30% of all co-authorship links. Individually, each of these publications contributes only a very small number of co-authorship links. However, because there are so many publications co-authored by at most ten universities (i.e., 99.8% of all publications), these publications are still responsible for almost one-third of all co-authorship links. We note that most publications (i.e., 23

24 almost 70% of all publications) have been authored by just one university. These publications do not result in any co-authorship links at all. Publications co-authored by more than 100 universities are responsible for almost 50% of all co-authorship links. There are just 158 publications that have been coauthored by more than 100 universities, but each of these hyperauthorship publications (Cronin, 2001) is responsible for a very large number of co-authorship links. For instance, the publication co-authored by most universities is a publication that has 151 co-authoring universities 6, and this single publication results in / 2 = 11,325 co-authorship links, which is 0.4% of all co-authorship links. The 158 publications co-authored by more than 100 universities have all appeared in the field of physics, and they all or almost all seem to result from research related to the Large Hadron Collider at CERN. We have now seen how in the case of full counting a very small number of publications with many co-authors may have a huge effect on a co-authorship network. In the case of fractional counting, the effect of publications with many coauthors is much more limited. Fractional counting is based on the idea that each action should have the same weight. Hence, each decision of a university to co-author a publication has the same weight of one, regardless of the total number of universities by which a publication is co-authored. This means that the total weight of the co-authorship links related to a publication is equal to the number of co-authoring universities. In other words, in the fractional counting case, the effect of a publication on a co-authorship network increases linearly with the number of co-authors. In the full counting case, on the other hand, the effect of a publication increases quadratically with the number of co-authors. We have for instance seen that in the full counting case 0.05% of all publications are responsible for 62% of all co-authorship links. In the fractional counting case, the same publications turn out to be responsible for just 4.0% of all co-authorship links Bibliographic coupling networks of journals We now turn to the analysis of bibliographic coupling networks of journals. Our aim is to use bibliographic coupling to identify the journals that are most strongly 6 This is the following publication: Aad et al. (2014). Search for long-lived neutral particles decaying into lepton jets in proton-proton collisions at s = 8 Tev with the ATLAS detector. Journal of High Energy Physics, 11,

25 related to one specific focal journal. We use Scientometrics as the focal journal, since this is a journal that we expect many readers of this paper to be familiar with. We again performed our analysis using the Web of Science database. Following the calculations discussed in Subsection 2.4, two bibliographic coupling networks of journals were constructed, one based on full counting and one based on fractional counting. The networks were constructed based on citing publications in the period In Scientometrics, 1,350 publications appeared in this period. These 1,350 citing publications refer to 12,799 publications indexed in the Web of Science database, resulting in bibliographic coupling links of Scientometrics with 11,526 other journals. Table 7. The 20 journals most strongly related to Scientometrics in the full counting bibliographic coupling network. Rank No. of bib. Journal coupling links Full Frac. Full Frac. 1 1 Journal of Informetrics 94,561 1, PLOS ONE 76, J. of the Am. Soc. for Information Science and Technology 61,478 1, Physical Review E 43, Physica A 42, Research Policy 42, ,674 Acta Crystallographica Section E 22, Scientific Reports 17, Technological Forecasting and Social Change 15, Strategic Management Journal 14, J. of the Ass. for Information Science and Technology 13, Research Evaluation 13, Technovation 12, Organization Science 12, Journal of Technology Transfer 12, Europhysics Letters 12, Expert Systems with Applications 10, European Physical Journal B 10, Technology Analysis & Strategic Management 10, Physical Review B 10,

26 Table 7 lists the 20 journals that are most strongly related to Scientometrics in the full counting bibliographic coupling network. For each journal, both the number of full counting and the number of fractional counting bibliographic coupling links with Scientometrics is reported. Table 8 is similar to Table 7, but it shows the 20 journals that are most strongly related to Scientometrics in the fractional counting rather than the full counting bibliographic coupling network. Table 8. The 20 journals most strongly related to Scientometrics in the fractional counting bibliographic coupling network. Rank No. of bib. Journal coupling links Full Frac. Full Frac. 1 1 Journal of Informetrics 94,561 1, J. of the Am. Soc. for Information Science and Technology 61,478 1, Research Policy 42, PLOS ONE 76, Research Evaluation 13, Technological Forecasting and Social Change 15, J. of the Ass. for Information Science and Technology 13, Revista Espanola de Documentacion Cientifica 7, Journal of Technology Transfer 12, Malaysian Journal of Library & Information Science 7, Technology Analysis & Strategic Management 10, Technovation 12, Online Information Review 7, Expert Systems with Applications 10, Journal of Information Science 6, Current Science 7, Science and Public Policy 6, Information Processing & Management 7, Higher Education 4, Journal of Documentation 3, As can be seen in Table 8, journals that are highly ranked based on fractional counting also tend to be quite highly ranked based on full counting. Importantly, however, Table 7 shows that this does not apply in the reverse situation. Some journals are highly ranked based on full counting, while they are ranked much lower 26

A systematic empirical comparison of different approaches for normalizing citation impact indicators

A systematic empirical comparison of different approaches for normalizing citation impact indicators A systematic empirical comparison of different approaches for normalizing citation impact indicators Ludo Waltman and Nees Jan van Eck Paper number CWTS Working Paper Series CWTS-WP-2013-001 Publication

More information

CitNetExplorer: A new software tool for analyzing and visualizing citation networks

CitNetExplorer: A new software tool for analyzing and visualizing citation networks CitNetExplorer: A new software tool for analyzing and visualizing citation networks Nees Jan van Eck and Ludo Waltman Centre for Science and Technology Studies, Leiden University, The Netherlands {ecknjpvan,

More information

Source normalized indicators of citation impact: An overview of different approaches and an empirical comparison

Source normalized indicators of citation impact: An overview of different approaches and an empirical comparison Source normalized indicators of citation impact: An overview of different approaches and an empirical comparison Ludo Waltman and Nees Jan van Eck Centre for Science and Technology Studies, Leiden University,

More information

Alphabetical co-authorship in the social sciences and humanities: evidence from a comprehensive local database 1

Alphabetical co-authorship in the social sciences and humanities: evidence from a comprehensive local database 1 València, 14 16 September 2016 Proceedings of the 21 st International Conference on Science and Technology Indicators València (Spain) September 14-16, 2016 DOI: http://dx.doi.org/10.4995/sti2016.2016.xxxx

More information

Citation analysis: State of the art, good practices, and future developments

Citation analysis: State of the art, good practices, and future developments Citation analysis: State of the art, good practices, and future developments Ludo Waltman Centre for Science and Technology Studies, Leiden University Bibliometrics & Research Assessment: A Symposium for

More information

A Taxonomy of Bibliometric Performance Indicators Based on the Property of Consistency

A Taxonomy of Bibliometric Performance Indicators Based on the Property of Consistency A Taxonomy of Bibliometric Performance Indicators Based on the Property of Consistency Ludo Waltman and Nees Jan van Eck ERIM REPORT SERIES RESEARCH IN MANAGEMENT ERIM Report Series reference number ERS-2009-014-LIS

More information

Getting started with CitNetExplorer version 1.0.0

Getting started with CitNetExplorer version 1.0.0 Getting started with CitNetExplorer version 1.0.0 Nees Jan van Eck and Ludo Waltman Centre for Science and Technology Studies (CWTS), Leiden University March 10, 2014 CitNetExplorer is a software tool

More information

Scientometrics & Altmetrics

Scientometrics & Altmetrics www.know- center.at Scientometrics & Altmetrics Dr. Peter Kraker VU Science 2.0, 20.11.2014 funded within the Austrian Competence Center Programme Why Metrics? 2 One of the diseases of this age is the

More information

BIBLIOMETRIC REPORT. Bibliometric analysis of Mälardalen University. Final Report - updated. April 28 th, 2014

BIBLIOMETRIC REPORT. Bibliometric analysis of Mälardalen University. Final Report - updated. April 28 th, 2014 BIBLIOMETRIC REPORT Bibliometric analysis of Mälardalen University Final Report - updated April 28 th, 2014 Bibliometric analysis of Mälardalen University Report for Mälardalen University Per Nyström PhD,

More information

This is the preliminary version of the accepted JASIST paper

This is the preliminary version of the accepted JASIST paper This is the preliminary version of the accepted JASIST paper Scholarly network similarities: How bibliographic coupling networks, citation networks, co-citation networks, topical networks, coauthorship

More information

A tutorial for vosviewer. Clément Levallois. Version 1.6.5,

A tutorial for vosviewer. Clément Levallois. Version 1.6.5, A tutorial for vosviewer Clément Levallois Version 1.6.5, 2017-03-29 Table of Contents Presentation of this tutorial.................................................................. 1 Importing a dataset.........................................................................

More information

Mapping Interdisciplinarity at the Interfaces between the Science Citation Index and the Social Science Citation Index

Mapping Interdisciplinarity at the Interfaces between the Science Citation Index and the Social Science Citation Index Mapping Interdisciplinarity at the Interfaces between the Science Citation Index and the Social Science Citation Index Loet Leydesdorff University of Amsterdam, Amsterdam School of Communications Research

More information

PBL Netherlands Environmental Assessment Agency (PBL): Research performance analysis ( )

PBL Netherlands Environmental Assessment Agency (PBL): Research performance analysis ( ) PBL Netherlands Environmental Assessment Agency (PBL): Research performance analysis (2011-2016) Center for Science and Technology Studies (CWTS) Leiden University PO Box 9555, 2300 RB Leiden The Netherlands

More information

Citation analysis may severely underestimate the impact of clinical research as compared to basic research

Citation analysis may severely underestimate the impact of clinical research as compared to basic research Citation analysis may severely underestimate the impact of clinical research as compared to basic research Nees Jan van Eck 1, Ludo Waltman 1, Anthony F.J. van Raan 1, Robert J.M. Klautz 2, and Wilco C.

More information

Bibliometric Rankings of Journals Based on the Thomson Reuters Citations Database

Bibliometric Rankings of Journals Based on the Thomson Reuters Citations Database Instituto Complutense de Análisis Económico Bibliometric Rankings of Journals Based on the Thomson Reuters Citations Database Chia-Lin Chang Department of Applied Economics Department of Finance National

More information

F1000 recommendations as a new data source for research evaluation: A comparison with citations

F1000 recommendations as a new data source for research evaluation: A comparison with citations F1000 recommendations as a new data source for research evaluation: A comparison with citations Ludo Waltman and Rodrigo Costas Paper number CWTS Working Paper Series CWTS-WP-2013-003 Publication date

More information

VISIBILITY OF AFRICAN SCHOLARS IN THE LITERATURE OF BIBLIOMETRICS

VISIBILITY OF AFRICAN SCHOLARS IN THE LITERATURE OF BIBLIOMETRICS VISIBILITY OF AFRICAN SCHOLARS IN THE LITERATURE OF BIBLIOMETRICS Yahya Ibrahim Harande Department of Library and Information Sciences Bayero University Nigeria ABSTRACT This paper discusses the visibility

More information

Discussing some basic critique on Journal Impact Factors: revision of earlier comments

Discussing some basic critique on Journal Impact Factors: revision of earlier comments Scientometrics (2012) 92:443 455 DOI 107/s11192-012-0677-x Discussing some basic critique on Journal Impact Factors: revision of earlier comments Thed van Leeuwen Received: 1 February 2012 / Published

More information

CITATION CLASSES 1 : A NOVEL INDICATOR BASE TO CLASSIFY SCIENTIFIC OUTPUT

CITATION CLASSES 1 : A NOVEL INDICATOR BASE TO CLASSIFY SCIENTIFIC OUTPUT CITATION CLASSES 1 : A NOVEL INDICATOR BASE TO CLASSIFY SCIENTIFIC OUTPUT Wolfgang Glänzel *, Koenraad Debackere **, Bart Thijs **** * Wolfgang.Glänzel@kuleuven.be Centre for R&D Monitoring (ECOOM) and

More information

Global Journal of Engineering Science and Research Management

Global Journal of Engineering Science and Research Management BIBLIOMETRICS ANALYSIS TOOL A REVIEW Himansu Mohan Padhy*, Pranati Mishra, Subhashree Behera * Sophitorium Institute of Lifeskills & Technology, Khurda, Odisha DOI: 10.5281/zenodo.2536852 KEYWORDS: Bibliometrics,

More information

Publication boost in Web of Science journals and its effect on citation distributions

Publication boost in Web of Science journals and its effect on citation distributions Publication boost in Web of Science journals and its effect on citation distributions Lovro Šubelj a, * Dalibor Fiala b a University of Ljubljana, Faculty of Computer and Information Science Večna pot

More information

Visualizing the context of citations. referencing papers published by Eugene Garfield: A new type of keyword co-occurrence analysis

Visualizing the context of citations. referencing papers published by Eugene Garfield: A new type of keyword co-occurrence analysis Visualizing the context of citations referencing papers published by Eugene Garfield: A new type of keyword co-occurrence analysis Lutz Bornmann*, Robin Haunschild**, and Sven E. Hug*** *Corresponding

More information

Self-citations at the meso and individual levels: effects of different calculation methods

Self-citations at the meso and individual levels: effects of different calculation methods Scientometrics () 82:17 37 DOI.7/s11192--187-7 Self-citations at the meso and individual levels: effects of different calculation methods Rodrigo Costas Thed N. van Leeuwen María Bordons Received: 11 May

More information

STI 2018 Conference Proceedings

STI 2018 Conference Proceedings STI 2018 Conference Proceedings Proceedings of the 23rd International Conference on Science and Technology Indicators All papers published in this conference proceedings have been peer reviewed through

More information

Contribution of Chinese publications in computer science: A case study on LNCS

Contribution of Chinese publications in computer science: A case study on LNCS Jointly published by Akadémiai Kiadó, Budapest Scientometrics, Vol. 75, No. 3 (2008) 519 534 and Springer, Dordrecht DOI: 10.1007/s11192-007-1781-1 Contribution of Chinese publications in computer science:

More information

In basic science the percentage of authoritative references decreases as bibliographies become shorter

In basic science the percentage of authoritative references decreases as bibliographies become shorter Jointly published by Akademiai Kiado, Budapest and Kluwer Academic Publishers, Dordrecht Scientometrics, Vol. 60, No. 3 (2004) 295-303 In basic science the percentage of authoritative references decreases

More information

Scientometric Measures in Scientometric, Technometric, Bibliometrics, Informetric, Webometric Research Publications

Scientometric Measures in Scientometric, Technometric, Bibliometrics, Informetric, Webometric Research Publications International Journal of Librarianship and Administration ISSN 2231-1300 Volume 3, Number 2 (2012), pp. 87-94 Research India Publications http://www.ripublication.com/ijla.htm Scientometric Measures in

More information

A Correlation Analysis of Normalized Indicators of Citation

A Correlation Analysis of Normalized Indicators of Citation 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 Article A Correlation Analysis of Normalized Indicators of Citation Dmitry

More information

The mf-index: A Citation-Based Multiple Factor Index to Evaluate and Compare the Output of Scientists

The mf-index: A Citation-Based Multiple Factor Index to Evaluate and Compare the Output of Scientists c 2017 by the authors; licensee RonPub, Lübeck, Germany. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).

More information

Mapping and Bibliometric Analysis of American Historical Review Citations and Its Contribution to the Field of History

Mapping and Bibliometric Analysis of American Historical Review Citations and Its Contribution to the Field of History Journal of Information & Knowledge Management Vol. 15, No. 4 (2016) 1650039 (12 pages) #.c World Scienti c Publishing Co. DOI: 10.1142/S0219649216500398 Mapping and Bibliometric Analysis of American Historical

More information

Author Name Co-Mention Analysis: Testing a Poor Man's Author Co-Citation Analysis Method

Author Name Co-Mention Analysis: Testing a Poor Man's Author Co-Citation Analysis Method Author Name Co-Mention Analysis: Testing a Poor Man's Author Co-Citation Analysis Method Andreas Strotmann 1 and Arnim Bleier 2 1 andreas.strotmann@gesis.org 2 arnim.bleier@gesis.org GESIS Leibniz Institute

More information

Complementary bibliometric analysis of the Health and Welfare (HV) research specialisation

Complementary bibliometric analysis of the Health and Welfare (HV) research specialisation April 28th, 2014 Complementary bibliometric analysis of the Health and Welfare (HV) research specialisation Per Nyström, librarian Mälardalen University Library per.nystrom@mdh.se +46 (0)21 101 637 Viktor

More information

Peter Ingwersen and Howard D. White win the 2005 Derek John de Solla Price Medal

Peter Ingwersen and Howard D. White win the 2005 Derek John de Solla Price Medal Jointly published by Akadémiai Kiadó, Budapest Scientometrics, and Springer, Dordrecht Vol. 65, No. 3 (2005) 265 266 Peter Ingwersen and Howard D. White win the 2005 Derek John de Solla Price Medal The

More information

Which percentile-based approach should be preferred. for calculating normalized citation impact values? An empirical comparison of five approaches

Which percentile-based approach should be preferred. for calculating normalized citation impact values? An empirical comparison of five approaches Accepted for publication in the Journal of Informetrics Which percentile-based approach should be preferred for calculating normalized citation impact values? An empirical comparison of five approaches

More information

The 2016 Altmetrics Workshop (Bucharest, 27 September, 2016) Moving beyond counts: integrating context

The 2016 Altmetrics Workshop (Bucharest, 27 September, 2016) Moving beyond counts: integrating context The 2016 Altmetrics Workshop (Bucharest, 27 September, 2016) Moving beyond counts: integrating context On the relationships between bibliometric and altmetric indicators: the effect of discipline and density

More information

Universiteit Leiden. Date: 25/08/2014

Universiteit Leiden. Date: 25/08/2014 Universiteit Leiden ICT in Business Identification of Essential References Based on the Full Text of Scientific Papers and Its Application in Scientometrics Name: Xi Cui Student-no: s1242156 Date: 25/08/2014

More information

The problems of field-normalization of bibliometric data and comparison among research institutions: Recent Developments

The problems of field-normalization of bibliometric data and comparison among research institutions: Recent Developments The problems of field-normalization of bibliometric data and comparison among research institutions: Recent Developments Domenico MAISANO Evaluating research output 1. scientific publications (e.g. journal

More information

Direct Citations between Citing Publications

Direct Citations between Citing Publications Direct Citations between Citing Publications Yong Huang Information Retrieval and Knowledge Mining Laboratory, School of Information Management, Wuhan University, Wuhan, Hubei, China School of Informatics,

More information

Bibliometric glossary

Bibliometric glossary Bibliometric glossary Bibliometric glossary Benchmarking The process of comparing an institution s, organization s or country s performance to best practices from others in its field, always taking into

More information

hprints , version 1-1 Oct 2008

hprints , version 1-1 Oct 2008 Author manuscript, published in "Scientometrics 74, 3 (2008) 439-451" 1 On the ratio of citable versus non-citable items in economics journals Tove Faber Frandsen 1 tff@db.dk Royal School of Library and

More information

Bibliometric analysis of the field of folksonomy research

Bibliometric analysis of the field of folksonomy research This is a preprint version of a published paper. For citing purposes please use: Ivanjko, Tomislav; Špiranec, Sonja. Bibliometric Analysis of the Field of Folksonomy Research // Proceedings of the 14th

More information

Should author self- citations be excluded from citation- based research evaluation? Perspective from in- text citation functions

Should author self- citations be excluded from citation- based research evaluation? Perspective from in- text citation functions 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 Should author self- citations be excluded from citation- based research evaluation? Perspective

More information

2nd International Conference on Advances in Social Science, Humanities, and Management (ASSHM 2014)

2nd International Conference on Advances in Social Science, Humanities, and Management (ASSHM 2014) 2nd International Conference on Advances in Social Science, Humanities, and Management (ASSHM 2014) A bibliometric analysis of science and technology publication output of University of Electronic and

More information

The Statistical Analysis of the Influence of Chinese Mathematical Journals Cited by Journal Citation Reports

The Statistical Analysis of the Influence of Chinese Mathematical Journals Cited by Journal Citation Reports Cross-Cultural Communication Vol. 11, No. 9, 2015, pp. 24-28 DOI:10.3968/7523 ISSN 1712-8358[Print] ISSN 1923-6700[Online] www.cscanada.net www.cscanada.org The Statistical Analysis of the Influence of

More information

THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014

THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014 THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014 Agenda Academic Research Performance Evaluation & Bibliometric Analysis

More information

Identifying Related Documents For Research Paper Recommender By CPA and COA

Identifying Related Documents For Research Paper Recommender By CPA and COA Preprint of: Bela Gipp and Jöran Beel. Identifying Related uments For Research Paper Recommender By CPA And COA. In S. I. Ao, C. Douglas, W. S. Grundfest, and J. Burgstone, editors, International Conference

More information

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education

INTRODUCTION TO SCIENTOMETRICS. Farzaneh Aminpour, PhD. Ministry of Health and Medical Education INTRODUCTION TO SCIENTOMETRICS Farzaneh Aminpour, PhD. aminpour@behdasht.gov.ir Ministry of Health and Medical Education Workshop Objectives Scientometrics: Basics Citation Databases Scientometrics Indices

More information

Bibliometric Analysis of the Indian Journal of Chemistry

Bibliometric Analysis of the Indian Journal of Chemistry http://unllib.unl.edu/lpp/ Library Philosophy and Practice 2011 ISSN 1522-0222 Bibliometric Analysis of the Indian Journal of Chemistry S. Thanuskodi Library & Information Science Wing, Directorate of

More information

Accpeted for publication in the Journal of Korean Medical Science (JKMS)

Accpeted for publication in the Journal of Korean Medical Science (JKMS) The Journal Impact Factor Should Not Be Discarded Running title: JIF Should Not Be Discarded Lutz Bornmann, 1 Alexander I. Pudovkin 2 1 Division for Science and Innovation Studies, Administrative Headquarters

More information

arxiv: v2 [cs.dl] 6 Feb 2017

arxiv: v2 [cs.dl] 6 Feb 2017 Bibliometric author evaluation through linear regression on the coauthor network Rasmus A. X. Persson Department of Chemistry & Molecular Biology, University of Gothenburg, SE-412 96 Gothenburg, Sweden

More information

Bibliometric analysis of publications from North Korea indexed in the Web of Science Core Collection from 1988 to 2016

Bibliometric analysis of publications from North Korea indexed in the Web of Science Core Collection from 1988 to 2016 pissn 2288-8063 eissn 2288-7474 Sci Ed 2017;4(1):24-29 https://doi.org/10.6087/kcse.85 Original Article Bibliometric analysis of publications from North Korea indexed in the Web of Science Core Collection

More information

HIGHLY CITED PAPERS IN SLOVENIA

HIGHLY CITED PAPERS IN SLOVENIA * HIGHLY CITED PAPERS IN SLOVENIA 972 Abstract. Despite some criticism and the search for alternative methods of citation analysis it's an important bibliometric method, which measures the impact of published

More information

Citation Impact on Authorship Pattern

Citation Impact on Authorship Pattern Citation Impact on Authorship Pattern Dr. V. Viswanathan Librarian Misrimal Navajee Munoth Jain Engineering College Thoraipakkam, Chennai viswanathan.vaidhyanathan@gmail.com Dr. M. Tamizhchelvan Deputy

More information

BIBLIOGRAPHIC DATA: A DIFFERENT ANALYSIS PERSPECTIVE. Francesca De Battisti *, Silvia Salini

BIBLIOGRAPHIC DATA: A DIFFERENT ANALYSIS PERSPECTIVE. Francesca De Battisti *, Silvia Salini Electronic Journal of Applied Statistical Analysis EJASA (2012), Electron. J. App. Stat. Anal., Vol. 5, Issue 3, 353 359 e-issn 2070-5948, DOI 10.1285/i20705948v5n3p353 2012 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index

More information

Citation Proximity Analysis (CPA) A new approach for identifying related work based on Co-Citation Analysis

Citation Proximity Analysis (CPA) A new approach for identifying related work based on Co-Citation Analysis Bela Gipp and Joeran Beel. Citation Proximity Analysis (CPA) - A new approach for identifying related work based on Co-Citation Analysis. In Birger Larsen and Jacqueline Leta, editors, Proceedings of the

More information

Inequality of Publishing Performance and International Collaboration in Physics

Inequality of Publishing Performance and International Collaboration in Physics Inequality of Publishing Performance and International Collaboration in Physics Mu-Hsuan Huang and Muh-Chyun Tang Department of Library and Information Science, National Taiwan University, Taipei, Taiwan.

More information

Predicting the Importance of Current Papers

Predicting the Importance of Current Papers Predicting the Importance of Current Papers Kevin W. Boyack * and Richard Klavans ** kboyack@sandia.gov * Sandia National Laboratories, P.O. Box 5800, MS-0310, Albuquerque, NM 87185, USA rklavans@mapofscience.com

More information

More Precise Methods for National Research Citation Impact Comparisons 1

More Precise Methods for National Research Citation Impact Comparisons 1 1 More Precise Methods for National Research Citation Impact Comparisons 1 Ruth Fairclough, Mike Thelwall Statistical Cybermetrics Research Group, School of Mathematics and Computer Science, University

More information

Journal of Informetrics

Journal of Informetrics Journal of Informetrics 4 (2010) 581 590 Contents lists available at ScienceDirect Journal of Informetrics journal homepage: www. elsevier. com/ locate/ joi A research impact indicator for institutions

More information

CONTRIBUTION OF INDIAN AUTHORS IN WEB OF SCIENCE: BIBLIOMETRIC ANALYSIS OF ARTS & HUMANITIES CITATION INDEX (A&HCI)

CONTRIBUTION OF INDIAN AUTHORS IN WEB OF SCIENCE: BIBLIOMETRIC ANALYSIS OF ARTS & HUMANITIES CITATION INDEX (A&HCI) International Journal of Library & Information Science (IJLIS) Volume 6, Issue 5, September October 2017, pp. 10 16, Article ID: IJLIS_06_05_002 Available online at http://www.iaeme.com/ijlis/issues.asp?jtype=ijlis&vtype=6&itype=5

More information

Año 8, No.27, Ene Mar What does Hirsch index evolution explain us? A case study: Turkish Journal of Chemistry

Año 8, No.27, Ene Mar What does Hirsch index evolution explain us? A case study: Turkish Journal of Chemistry essay What does Hirsch index evolution explain us? A case study: Turkish Journal of Chemistry Metin Orbay, Orhan Karamustafaoğlu and Feda Öner Amasya University (Turkey) morbay@omu.edu.tr, orseka@yahoo.com,

More information

Growth of Literature and Collaboration of Authors in MEMS: A Bibliometric Study on BRIC and G8 countries

Growth of Literature and Collaboration of Authors in MEMS: A Bibliometric Study on BRIC and G8 countries Growth of Literature and Collaboration of Authors in MEMS: A Bibliometric Study on BRIC and G8 countries Dr. M. Tamizhchelvan Deputy Librarian Gandhigram Rural Institute-Deemed University Gandhigram, Dindigul,

More information

International Journal of Library and Information Studies ISSN: Vol.3 (3) Jul-Sep, 2013

International Journal of Library and Information Studies ISSN: Vol.3 (3) Jul-Sep, 2013 SCIENTOMETRIC ANALYSIS: ANNALS OF LIBRARY AND INFORMATION STUDIES PUBLICATIONS OUTPUT DURING 2007-2012 C. Velmurugan Librarian Department of Central Library Siva Institute of Frontier Technology Vengal,

More information

University of Liverpool Library. Introduction to Journal Bibliometrics and Research Impact. Contents

University of Liverpool Library. Introduction to Journal Bibliometrics and Research Impact. Contents University of Liverpool Library Introduction to Journal Bibliometrics and Research Impact Contents Journal Citation Reports How to access JCR (Web of Knowledge) 2 Comparing the metrics for a group of journals

More information

Bibliometric Analysis of Literature Published in Emerald Journals on Cloud Computing

Bibliometric Analysis of Literature Published in Emerald Journals on Cloud Computing International Journal of Computational Engineering & Management, Vol. 18 Issue 1, January 2015 www..org 21 Bibliometric Analysis of Literature Published in Emerald Journals on Cloud Computing Jayaprakash

More information

Results of the bibliometric study on the Faculty of Veterinary Medicine of the Utrecht University

Results of the bibliometric study on the Faculty of Veterinary Medicine of the Utrecht University Results of the bibliometric study on the Faculty of Veterinary Medicine of the Utrecht University 2001 2010 Ed Noyons and Clara Calero Medina Center for Science and Technology Studies (CWTS) Leiden University

More information

Research Ideas for the Journal of Informatics and Data Mining: Opinion*

Research Ideas for the Journal of Informatics and Data Mining: Opinion* Research Ideas for the Journal of Informatics and Data Mining: Opinion* Editor-in-Chief Michael McAleer Department of Quantitative Finance National Tsing Hua University Taiwan and Econometric Institute

More information

Evaluating Research and Patenting Performance Using Elites: A Preliminary Classification Scheme

Evaluating Research and Patenting Performance Using Elites: A Preliminary Classification Scheme Evaluating Research and Patenting Performance Using Elites: A Preliminary Classification Scheme Chung-Huei Kuan, Ta-Chan Chiang Graduate Institute of Patent Research, National Taiwan University of Science

More information

Comparing Bibliometric Statistics Obtained from the Web of Science and Scopus

Comparing Bibliometric Statistics Obtained from the Web of Science and Scopus Comparing Bibliometric Statistics Obtained from the Web of Science and Scopus Éric Archambault Science-Metrix, 1335A avenue du Mont-Royal E., Montréal, Québec, H2J 1Y6, Canada and Observatoire des sciences

More information

Complementary bibliometric analysis of the Educational Science (UV) research specialisation

Complementary bibliometric analysis of the Educational Science (UV) research specialisation April 28th, 2014 Complementary bibliometric analysis of the Educational Science (UV) research specialisation Per Nyström, librarian Mälardalen University Library per.nystrom@mdh.se +46 (0)21 101 637 Viktor

More information

The journal relative impact: an indicator for journal assessment

The journal relative impact: an indicator for journal assessment Scientometrics (2011) 89:631 651 DOI 10.1007/s11192-011-0469-8 The journal relative impact: an indicator for journal assessment Elizabeth S. Vieira José A. N. F. Gomes Received: 30 March 2011 / Published

More information

Scientometric and Webometric Methods

Scientometric and Webometric Methods Scientometric and Webometric Methods By Peter Ingwersen Royal School of Library and Information Science Birketinget 6, DK 2300 Copenhagen S. Denmark pi@db.dk; www.db.dk/pi Abstract The paper presents two

More information

1.1 What is CiteScore? Why don t you include articles-in-press in CiteScore? Why don t you include abstracts in CiteScore?

1.1 What is CiteScore? Why don t you include articles-in-press in CiteScore? Why don t you include abstracts in CiteScore? June 2018 FAQs Contents 1. About CiteScore and its derivative metrics 4 1.1 What is CiteScore? 5 1.2 Why don t you include articles-in-press in CiteScore? 5 1.3 Why don t you include abstracts in CiteScore?

More information

Citation Analysis with Microsoft Academic

Citation Analysis with Microsoft Academic Hug, S. E., Ochsner M., and Brändle, M. P. (2017): Citation analysis with Microsoft Academic. Scientometrics. DOI 10.1007/s11192-017-2247-8 Submitted to Scientometrics on Sept 16, 2016; accepted Nov 7,

More information

Publication Output and Citation Impact

Publication Output and Citation Impact 1 Publication Output and Citation Impact A bibliometric analysis of the MPI-C in the publication period 2003 2013 contributed by Robin Haunschild 1, Hermann Schier 1, and Lutz Bornmann 2 1 Max Planck Society,

More information

Author Productivity Indexing via Topic Sensitive Weighted Citations

Author Productivity Indexing via Topic Sensitive Weighted Citations Author Productivity Indexing via Topic Sensitive Weighted Citations Tehmina Amjad 1, Shabnum Bibi 2, Ali Daud 3 Islamabad Islamic University Islamabad, Pakistan tehminaamjad@iiu.edu.pk ali.daud@iiu.edu.pk

More information

A New Format For The Ph.D. Dissertation and Masters Thesis. A Proposal by the Department of Physical Performance and Development

A New Format For The Ph.D. Dissertation and Masters Thesis. A Proposal by the Department of Physical Performance and Development A New Format For The Ph.D. Dissertation and Masters Thesis A Proposal by the Department of Physical Performance and Development March, 2003 DISSERTATION AND THESIS FORMAT Overview The chapter structure

More information

researchtrends IN THIS ISSUE: Did you know? Scientometrics from past to present Focus on Turkey: the influence of policy on research output

researchtrends IN THIS ISSUE: Did you know? Scientometrics from past to present Focus on Turkey: the influence of policy on research output ISSUE 1 SEPTEMBER 2007 researchtrends IN THIS ISSUE: PAGE 2 The value of bibliometric measures Scientometrics from past to present The origins of scientometric research can be traced back to the beginning

More information

Citation Analysis in Research Evaluation

Citation Analysis in Research Evaluation Citation Analysis in Research Evaluation (Published by Springer, July 2005) Henk F. Moed CWTS, Leiden University Part No 1 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 Part Title General introduction and conclusions

More information

Mendeley readership as a filtering tool to identify highly cited publications 1

Mendeley readership as a filtering tool to identify highly cited publications 1 Mendeley readership as a filtering tool to identify highly cited publications 1 Zohreh Zahedi, Rodrigo Costas and Paul Wouters z.zahedi.2@cwts.leidenuniv.nl; rcostas@cwts.leidenuniv.nl; p.f.wouters@cwts.leidenuniv.nl

More information

Celebrating Scholarly Communication Studies

Celebrating Scholarly Communication Studies Celebrating Scholarly Communication Studies A Festschrift for Olle Persson at his 60 th Birthday special volume of the e-newsletter of the international society for scientometrics and informetrics vol.

More information

Weighted citation: An indicator of an article s prestige

Weighted citation: An indicator of an article s prestige Weighted citation: An indicator of an article s prestige Erjia Yan 1, Ying Ding School of Library and Information Science, Indiana University, Bloomington, USA Abstract We propose using the technique of

More information

A BIBLIOMETRIC ANALYSIS OF ASIAN AUTHORSHIP PATTERN IN JASIST,

A BIBLIOMETRIC ANALYSIS OF ASIAN AUTHORSHIP PATTERN IN JASIST, A BIBLIOMETRIC ANALYSIS OF ASIAN AUTHORSHIP PATTERN IN JASIST, 1981-2005 HAN-WEN CHANG Department and Graduate Institute of Library and Information Science, National Taiwan University No. 1, Sec. 4, Roosevelt

More information

An Introduction to Bibliometrics Ciarán Quinn

An Introduction to Bibliometrics Ciarán Quinn An Introduction to Bibliometrics Ciarán Quinn What are Bibliometrics? What are Altmetrics? Why are they important? How can you measure? What are the metrics? What resources are available to you? Subscribed

More information

The use of citation speed to understand the effects of a multi-institutional science center

The use of citation speed to understand the effects of a multi-institutional science center Georgia Institute of Technology From the SelectedWorks of Jan Youtie 2014 The use of citation speed to understand the effects of a multi-institutional science center Jan Youtie, Georgia Institute of Technology

More information

Kent Academic Repository

Kent Academic Repository Kent Academic Repository Full text document (pdf) Citation for published version Mingers, John and Lipitakis, Evangelia A. E. C. G. (2013) Evaluating a Department s Research: Testing the Leiden Methodology

More information

SCOPUS : BEST PRACTICES. Presented by Ozge Sertdemir

SCOPUS : BEST PRACTICES. Presented by Ozge Sertdemir SCOPUS : BEST PRACTICES Presented by Ozge Sertdemir o.sertdemir@elsevier.com AGENDA o Scopus content o Why Use Scopus? o Who uses Scopus? 3 Facts and Figures - The largest abstract and citation database

More information

Edited Volumes, Monographs, and Book Chapters in the Book Citation Index. (BCI) and Science Citation Index (SCI, SoSCI, A&HCI)

Edited Volumes, Monographs, and Book Chapters in the Book Citation Index. (BCI) and Science Citation Index (SCI, SoSCI, A&HCI) Edited Volumes, Monographs, and Book Chapters in the Book Citation Index (BCI) and Science Citation Index (SCI, SoSCI, A&HCI) Loet Leydesdorff i & Ulrike Felt ii Abstract In 2011, Thomson-Reuters introduced

More information

PUBLICATION RESEARCH TRENDS ON TECHNICAL REVIEW JOURNAL: A SCIENTOMETRIC STUDY

PUBLICATION RESEARCH TRENDS ON TECHNICAL REVIEW JOURNAL: A SCIENTOMETRIC STUDY PUBLICATION RESEARCH TRENDS ON TECHNICAL REVIEW JOURNAL: A SCIENTOMETRIC STUDY Velmurugan, C Research Scholar Department of Library and Information Science, Periyar University, Salem-636 011, Tamilnadu,

More information

Coverage of highly-cited documents in Google Scholar, Web of Science, and Scopus: a multidisciplinary comparison

Coverage of highly-cited documents in Google Scholar, Web of Science, and Scopus: a multidisciplinary comparison Coverage of highly-cited documents in Google Scholar, Web of Science, and Scopus: a multidisciplinary comparison Alberto Martín-Martín 1, Enrique Orduna-Malea 2, Emilio Delgado López-Cózar 1 Version 0.5

More information

Citations and Self Citations of Indian Authors in Library and Information Science: A Study Based on Indian Citation Index

Citations and Self Citations of Indian Authors in Library and Information Science: A Study Based on Indian Citation Index Research Journal of Library Sciences ISSN 2320 8929 Citations and Self Citations of Indian Authors in Library and Information Science: A Study Based on Indian Citation Index Abstract S. Dhanavandan and

More information

Bibliometric Analysis of Electronic Journal of Knowledge Management

Bibliometric Analysis of Electronic Journal of Knowledge Management Cloud Publications International Journal of Advanced Library and Information Science 2013, Volume 1, Issue 1, pp. 23-32, Article ID Sci-101 Research Article Open Access Bibliometric Analysis of Electronic

More information

Journal of American Computing Machinery: A Citation Study

Journal of American Computing Machinery: A Citation Study B.Vimala 1 and J.Dominic 2 1 Library, PSGR Krishnammal College for Women, Coimbatore - 641004, Tamil Nadu, India 2 University Library, Karunya University, Coimbatore - 641 114, Tamil Nadu, India E-mail:

More information

Publication Boost in Web of Science Journals and Its Effect on Citation Distributions

Publication Boost in Web of Science Journals and Its Effect on Citation Distributions Publication Boost in Web of Science Journals and Its Effect on Citation Distributions Lovro Subelj Faculty of Computer and Information Science, University of Ljubljana, Večna pot 113, 1000 Ljubljana, Slovenia.

More information

A Visualization of Relationships Among Papers Using Citation and Co-citation Information

A Visualization of Relationships Among Papers Using Citation and Co-citation Information A Visualization of Relationships Among Papers Using Citation and Co-citation Information Yu Nakano, Toshiyuki Shimizu, and Masatoshi Yoshikawa Graduate School of Informatics, Kyoto University, Kyoto 606-8501,

More information

SEKITAR PERPUSTAKAAN : A BIBLIOMETRIC STUDY USING CITATION ANALYSIS. Nasimah Badaruddin Institut Latihan Islam Malaysia.

SEKITAR PERPUSTAKAAN : A BIBLIOMETRIC STUDY USING CITATION ANALYSIS. Nasimah Badaruddin Institut Latihan Islam Malaysia. '~"JJ~ SEKITAR PERPUSTAKAAN 2004-2005: A BIBLIOMETRIC STUDY USING CITATION ANALYSIS ~_I~I_Jf_fJ_JJ_ll_fl_fJJJ_Jll_'_' '_JJ fjj_'~""_':"_"_ff I By Nasimah Badaruddin Institut Latihan Islam Malaysia ~~ ~

More information

Abstract. Introduction

Abstract. Introduction Are multi-authored articles cited more than single-authored ones? Are collaborations with authors from other countries more cited than collaborations within the country? A case study. Ronald Rousseau UIA,

More information

Scientometric Profile of Presbyopia in Medline Database

Scientometric Profile of Presbyopia in Medline Database Scientometric Profile of Presbyopia in Medline Database Pooja PrakashKharat M.Phil. Student Department of Library & Information Science Dr. Babasaheb Ambedkar Marathwada University. e-mail:kharatpooja90@gmail.com

More information

Københavns Universitet

Københavns Universitet university of copenhagen Københavns Universitet ACUMEN DELIVERABLE 5.4c Cluster analysis of bibliometric indicators of individual scientific performance Wildgaard, Lorna Elizabeth; Larsen, Birger; Schneider,

More information

What is bibliometrics?

What is bibliometrics? Bibliometrics as a tool for research evaluation Olessia Kirtchik, senior researcher Research Laboratory for Science and Technology Studies, HSE ISSEK What is bibliometrics? statistical analysis of scientific

More information