The Slavery of the h-index—Measuring the Unmeasurable

Kreiner, Grzegorz

doi:10.3389/fnhum.2016.00556

OPINION article

Front. Hum. Neurosci., 02 November 2016

Sec. Cognitive Neuroscience

Volume 10 - 2016 | https://doi.org/10.3389/fnhum.2016.00556

The Slavery of the h-index—Measuring the Unmeasurable

Grzegorz Kreiner^*

Department of Brain Biochemistry, Institute of Pharmacology, Polish Academy of Sciences, Krakow, Poland

Introduction

Last year we “celebrated” the 10th anniversary of the invention of the h-index (also known as the Hirsch factor; Hirsch, 2005), an indicator created by Jorge E. Hirsch, that attempts to measure the achievements of a research scientist. However, it not only appears that h-index has taken on a life of its own but also that the popularity of this formula currently surpasses the initial idea for its use envisioned by the inventor. Originally introduced as a simple characterization of the scientific output of a researcher (Hirsch, 2005), the h-index has come to be uncritically regarded as a “magic tool” that is applied to measure what is unmeasurable—the quality of science. As a result, it has become a “must have” indicator when applying for funds or a new position. Surprisingly, many decision-makers apparently are not fully aware of what it represents. According to its inventor, the h-index is the number of papers coauthored by the investigator with at least h citations each (Hirsch, 2005). That is, to be the proud owner of a high h-index, it is not sufficient to have authored many articles or for some of them to have been extensively cited. Rather, such an achievement must satisfy both issues: a considerable number of articles must be highly cited, which is reflected by ranking them according to the number of citations they have collected, and finding the one where the position on the list equals or is less than the number of the citations it garnered.

The Pros and Cons of the h-index

The initial idea of Hirsch was to discriminate the investigators who are persistently productive from those who experienced an isolated auspicious moment in their scientific life, and who currently only cut coupons from their popularity roll. Nevertheless, it assumes that researcher A, who published a breakthrough story that was extensively cited, should deserve less respect than researcher B who publishes often and regularly; however, the outcome of the latter's work has not contributed yet to any remarkable discovery. A good example is the inventor of the RNA isolation method, Piotr Chomczynski. He has in total over 65 000 citations to which he contributed almost exclusively (92.9% of all citations) with one single paper regarding the method he introduced in 1987 (Chomczynski and Sacchi, 1987). His current h-index is 23, relatively low for such a prodigious number of citations. Yet, could we imagine working with RNA over the past number of years without possessing this simple technique that is now the principle of virtually every commercial protocol related to RNA extraction? Notwithstanding, it might be argued—depreciating that discovery—that because this method is so simple, someone else would have been discovered it sooner or later. In response, let me cite my former mentor, professor Günther Schütz, who would disprove similar arguments with a simple statement: If you say this is so trivial, why was it not you who discovered PCR? Breakthrough discoveries are not solely dependent on sophisticated science.

Another problem with the h-index is the impossibility of comparing the investigators during different stages of their careers (even assuming comparisons among those representing the same field, which is another ambiguous factor). There is a certain correlation between the age of an investigator and h-index. Clearly and in any case, some of articles will accumulate citations and this number will increase over the time since they were first published. However, even the comparison between investigators at a similar career stage may often be misleading, particularly among young post-docs whose careers have just begun. We must be honest and acknowledge the fact that it is a rare occurrence that right after completing a PhD, such an investigator is able to independently decide about own career development. Mostly, the scientific achievements at this stage are primarily a derivative of the power of the PhD mentor and the reputation of the hosting institution.

Another issue contributing to h-index limitations is that many research groups have different regulations regarding authorship. It is assumed that a researcher's name will be added to the authorship list only after considerable contribution has been made to the published work. However, what occurs fairly often is that being a “middle man” on the listing does not necessary reflect the significant contribution and, worth to be emphasized, the h-index does not differentiate between article authors who hold the most valuable first and last authorship position and those wherein the author's name appears as one among perhaps even 100 authors, as occurs with articles containing vast meta-analyses of clinical data.

Although, some of these concerns were raised in the discussion part of the initial paper published by Hirsch (2005), they were overshadowed by the enormous popularity of this tool and its indiscriminate application. Despite of many critical yet unofficial discussions about the h-index, its limitations, and perhaps dangerous influence on science, the topic—if only tackled by publications—is usually narrowed to the pursuit of more and more sophisticated bibliometrics and proposals of new indicators (Bharathi, 2013; Biswal, 2013; Díaz et al., 2016; Würtz and Schmidt, 2016). The critical voices seem to be less represented, nevertheless there are of course existing articles pointing out that past achievements of the scientists may not necessarily be correlated with future success and that all such rankings need context which means that the best method to gain an impression of the quality is still simply reading the papers (Wendl, 2007; von Bohlen Und Halbach, 2011). Perhaps the combination of looking into the context of the particular paper and the journal reputation gives the best assumption in this matter, however even this approach can be biased toward personal preferences and requires considerable amount of time.

Last but not least, when examining the reasons why the h-index is not trustworthy, we should not neglect issues regarding fraud. Because the h-index does not discriminate self-citations, it is not difficult to predict that even the investigators who are poorly cited by others but who publish prodigiously, citing mostly themselves, will easily increase their h-index in the long run. Moreover, some evidence of misuse has been reported involving artificially pumped h-indices based on i.e., regular cross-citations between good friends (Kotov, 2010).

The h-indices of Nobel Prize Winners

Great discoveries require time for experiments. Furthermore, quality investigational pursuits require time for planning, critical thought, discussion, repetition. This process cannot be conducted in haste and under pressure to simply publish all that has been collected. Isaak Newton was said to think twice and to examine all of his discoveries time after time until he finally decided to publish them. Currently, with such an approach, he would most likely have been released nowadays from the university due to the lack of progress in his research. Seemingly, this example perhaps would be difficult to prove. However, consider instead scenarios that are closer to reality—the Nobel Prize laureates in Physiology & Medicine, an unbiased selection of preeminent scientists in life sciences. Do they actually possess h-indices above 100 or thereabouts? Indeed and not surprisingly, many of them do. However, there are also laureates who would have inevitably encountered roadblocks had their h-index been the decisive factor in their nomination. If we consider the last 25 laureates from the period of the recent 10 years, the outcomes are somewhat surprising. The h-index values vary among these esteemed professors within the range of 24–139 (Figure 1A), although no one would rightfully propose that one of these laureates should be regarded as a 5-fold better scientist than another. Clearly, this may again have reflected the time when the Nobel-worthy discovery was made and simply the age of the laureate, another problematic issue raised in one of recent commentaries in Nature journal (Fortunato, 2014). Nonetheless, even when comparing the laureates by their h-indices at the same age of 35 years, the average age of a young post-doc (Figure 1B), or the year before the laureates' famous discoveries were first announced in top-flight journals (Figure 1C), no correlation exists and moreover, examples may be found of investigators who had virtually no bibliometric track records. This is also true among relatively young Nobel laureates (those born >1960) whose careers were certainly influenced by the IT era from the very beginning (Figures 1A–C, green bars). It could be surmised that in many cases the moderate h-indices at this stage are simply correlated to the persistence in pursuing the hypothesis that ultimately was proven, thereby yielding a scientific breakthrough.

FIGURE 1

Figure 1. The h-indices of Nobel Prize laureates granted during the period 2005–2015; (A) current standings; (B) h-indices at the age of 35 years; (C) h-indices for the year prior to the breakthrough discovery. Green bars, Nobel Prize laureates born after 1960. Following papers were selected as the firstly announced breakthrough discovery of evaluated laureates: John O'Keefe, JOK (Morris et al., 1982); May-Britt Moser, MBM (Fyhn et al., 2004); Edvard I. Moser, EIM (Fyhn et al., 2004); James E. Rothman, JER (Söllner et al., 1993); Randy W. Schekman, RWS (Novick et al., 1980); Thomas C. Südhof, TCS (Südhof et al., 1987); John B. Gurdon, JBG (Gurdon et al., 1971); Shinya Yamanaka, SY (Takahashi and Yamanaka, 2006); Bruce A. Beutler, BAB (Poltorak et al., 1998); Jules A. Hoffmann, JAH (Lemaitre et al., 1996); Ralph M. Steinman, RMS (Steinman and Cohn, 1973); Robert G. Edwards, RGE (Edwards, 1965); Elizabeth H. Blackburn, EHB (Greider and Blackburn, 1985); Carol W. Greider, CWG (Harley et al., 1990); Jack W. Szostak, JWS (Lundblad and Szostak, 1989); Harald zur Hausen, HzH (Devilliers et al., 1987); Françoise Barré-Sinoussi, FBS (Barré-Sinoussi et al., 1983); Luc Montagnier, LM (Barré-Sinoussi et al., 1983); Mario R. Capecchi, MRC (Thomas and Capecchi, 1987); Sir Martin J. Evans, MJE (Evans and Kaufman, 1981); Oliver Smithies, OS (Blattner et al., 1977); Andrew Fire, AF (Fire et al., 1998); Craig C. Mello, CCM (Mello et al., 1991); Barry J. Marshall, BJM (Marshall and Warren, 1984); J. Robin Warren, JRW (Marshall and Warren, 1984). The h-indices of the examples provided were calculated according to the Web of Science Core Collection database (Thomson Reuters).

A Lesson from Poland

We have had a unique experience in Poland, arising from the Communist period of our history, when all work efforts were required to be quantified by numbers, that were the most important, even though the outcomes of that work were far from expected. That was a time during which, e.g., a car factory would proudly announce that it had produced the 1-millionth vehicle, although it completely disregarded that it was barely possible to start the engine in most of the cars or to exit a parking spot because of a variety of mechanical failures.

It appears that looking through the lens of the h-index, we are perilously reverting back to those times. What is conveyed in the article, the quality of the work and the actual impact on the topic in focus does not appear to be important. What counts is quantity, not quality. With more articles and more citations, the magic “h” letter will increase. As a result, this activity will considerably increase the approval chances of grant application or will lend favor to tenure position evaluation. Yet, what will be the feedback regarding these articles? Can anything be learned from them? Apparently, this is out of the question—it does not affect the h-index.

Poland is now enjoying a good period in its history. As a result of maintaining the gross domestic product in the black, despite worldwide economic crises, the monies spent on research are continuously increasing. Moreover, several attempts and programs are underway that aim to reorganize science in Poland to be more competitive and efficient. However, all these efforts may backfire if we remain stuck to the easy and old-fashioned way of evaluating our work.

Unquestionably, we scientists must somehow be evaluated as our job is based on spending public monies and such work should be accomplished in an efficient manner. However, the critical evaluation of science requires a great deal of effort and cannot be done using a simple comparison of numbers. As stated by the Hirsch himself, the single number can give only the rough approximation to an individual's researcher profile with many other factors contributing as well (Hirsch, 2005). The overall assessment is also affected by an average number of citations per paper, the quality of journals where the work was published, and goes far beyond the publication track records including other scientific achievements i.e., invited lectures, international experience (post-doctoral fellowships, sabbaticals, collaborations), project managements.

Another issue is the system of evaluation built upon assumption that future success of the project depends on the previous achievements of the applicant. Thus, it remains tempting for some of the grant reviewers to assess the applicants solely through the prism of their respective h-indices, and such a relentless pursuit of bibliometric factors may bring global science to its knees, leaving no room for independent research that may lead to the elucidation of fundamental biological quandaries.

Author Note

The author is a research associate at the Department of Brain Biochemistry, Institute of Pharmacology, Polish Academy of Sciences (IF PAS), Krakow, Poland. A former post-doc (2005–2008) at the German Cancer Research Center (DKFZ), Heidelberg, Germany. The PI on 2 research grants funded by the Polish National Science Center (NCN).

Author Contributions

GK designed, calculated data, and wrote the paper. No other person was involved.

Conflict of Interest Statement

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The content of this article is solely the responsibility of the author and does not necessarily represent the official view of the affiliated organization.

References

Barré-Sinoussi, F., Chermann, J. C., Rey, F., Nugeyre, M. T., Chamaret, S., Gruest, J., et al. (1983). Isolation of a T-Lymphotropic retrovirus from a patient at risk for Acquired Immune Deficiency Syndrome (AIDS). Science 220, 868–871. doi: 10.1126/science.6189183

PubMed Abstract | CrossRef Full Text | Google Scholar

Bharathi, D. G. (2013). Evaluation and ranking of researchers–Bh Index. PLoS ONE 8:e82050. doi: 10.1371/journal.pone.0082050

PubMed Abstract | CrossRef Full Text | Google Scholar

Biswal, A. K. (2013). An absolute index (Ab-index) to measure a researcher's useful contributions and productivity. PLoS ONE 8:e84334. doi: 10.1371/journal.pone.0084334

PubMed Abstract | CrossRef Full Text | Google Scholar

Blattner, F. R., Williams, B. G., Blechl, A. E., Denniston-Thompson, K., Faber, H. E., Furlong, L., et al. (1977). Charon phages: safer derivatives of bacteriophage lambda for DNA cloning. Science 196, 161–169. doi: 10.1126/science.847462

PubMed Abstract | CrossRef Full Text | Google Scholar

Chomczynski, P., and Sacchi, N. (1987). Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal. Biochem. 162, 156–159. doi: 10.1016/0003-2697(87)90021-2

PubMed Abstract | CrossRef Full Text | Google Scholar

De villiers, E. M., Schneider, A., Miklaw, H., Papendick, U., Wagner, D., Wesch, H., et al. (1987). Human papillomavirus infections in women with and without abnormal cervical cytology. Lancet 2, 703–706. doi: 10.1016/S0140-6736(87)91072-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Díaz, I., Cortey, M., Olvera, Á., and Segalés, J. (2016). Use of h-index and other bibliometric indicators to evaluate research productivity outcome on Swine diseases. PLoS ONE 11:e0149690. doi: 10.1371/journal.pone.0149690

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, R. G. (1965). Maturation in vitro of human ovarian oocytes. Lancet 2, 926–929. doi: 10.1016/S0140-6736(65)92903-X

PubMed Abstract | CrossRef Full Text

Evans, M. J., and Kaufman, M. H. (1981). Establishment in culture of pluripotential cells from mouse embryos. Nature 292, 154–156. doi: 10.1038/292154a0

PubMed Abstract | CrossRef Full Text | Google Scholar

Fire, A., Xu, S., Montgomery, M. K., Kostas, S. A., Driver, S. E., and Mello, C. C. (1998). Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature 391, 806–811. doi: 10.1038/35888

PubMed Abstract | CrossRef Full Text | Google Scholar

Fortunato, S. (2014). Prizes: growing time lag threatens Nobels. Nature 508:186. doi: 10.1038/508186a

PubMed Abstract | CrossRef Full Text | Google Scholar

Fyhn, M., Molden, S., Witter, M. P., Moser, E. I., and Moser, M. B. (2004). Spatial representation in the entorhinal cortex. Science 305, 1258–1264. doi: 10.1126/science.1099901

PubMed Abstract | CrossRef Full Text | Google Scholar

Greider, C. W., and Blackburn, E. H. (1985). Identification of a specific telomere terminal transferase activity in Tetrahymena extracts. Cell 43, 405–413. doi: 10.1016/0092-8674(85)90170-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Gurdon, J. B., Lane, C. D., Woodland, H. R., and Marbaix, G. (1971). Use of frog eggs and oocytes for study of messenger RNA and its translation in living cells. Nature 233, 177–182. doi: 10.1038/233177a0

PubMed Abstract | CrossRef Full Text

Harley, C. B., Futcher, A. B., and Greider, C. W. (1990). Telomeres Shorten during Aging of Human Fibroblasts. Nature 345, 458–460. doi: 10.1038/345458a0

PubMed Abstract | CrossRef Full Text | Google Scholar

Hirsch, J. E. (2005). An index to quantify an individual's scientific research output. Proc. Natl. Acad. Sci. U.S.A. 102, 16569–16572. doi: 10.1073/pnas.0507655102

PubMed Abstract | CrossRef Full Text | Google Scholar

Kotov, N. A. (2010). Fraud, the h-index, and Pasternak. ACS Nano 4, 585–586. doi: 10.1021/nn100182y

PubMed Abstract | CrossRef Full Text | Google Scholar

Lemaitre, B., Nicolas, E., Michaut, L., Reichhart, J. M., and Hoffmann, J. A. (1996). The dorsoventral regulatory gene cassette spatzle/Toll/cactus controls the potent antifungal response in Drosophila adults. Cell 86, 973–983. doi: 10.1016/S0092-8674(00)80172-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Lundblad, V., and Szostak, J. W. (1989). A mutant with a defect in telomere elongation leads to senescence in yeast. Cell 57, 633–643. doi: 10.1016/0092-8674(89)90132-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Marshall, B. J., and Warren, J. R. (1984). Unidentified curved bacilli in the stomach of patients with gastritis and peptic ulceration. Lancet 1, 1311–1315. doi: 10.1016/S0140-6736(84)91816-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Mello, C. C., Kramer, J. M., Stinchcomb, D., and Ambros, V. (1991). Efficient gene transfer in C. elegans: extrachromosomal maintenance and integration of transforming sequences. Embo J. 10, 3959–3970.

PubMed Abstract | Google Scholar

Morris, R. G., Garrud, P., Rawlins, J. N., and O'keefe, J. (1982). Place navigation impaired in rats with hippocampal lesions. Nature 297, 681–683. doi: 10.1038/297681a0

PubMed Abstract | CrossRef Full Text | Google Scholar

Novick, P., Field, C., and Schekman, R. (1980). Identification of 23 complementation groups required for post-translational events in the yeast secretory pathway. Cell 21, 205–215. doi: 10.1016/0092-8674(80)90128-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Poltorak, A., He, X., Smirnova, I., Liu, M. Y., Van Huffel, C., Du, X., et al. (1998). Defective LPS signaling in C3H/HeJ and C57BL/10ScCr mice: mutations in Tlr4 gene. Science 282, 2085–2088. doi: 10.1126/science.282.5396.2085

PubMed Abstract | CrossRef Full Text | Google Scholar

Söllner, T., Whiteheart, S. W., Brunner, M., Erdjument-Bromage, H., Geromanos, S., Tempst, P., et al. (1993). SNAP receptors implicated in vesicle targeting and fusion. Nature 362, 318–324. doi: 10.1038/362318a0

PubMed Abstract | CrossRef Full Text | Google Scholar

Steinman, R. M., and Cohn, Z. A. (1973). Identification of a novel cell type in peripheral lymphoid organs of mice. I. Morphology, quantitation, tissue distribution. J. Exp. Med. 137, 1142–1162. doi: 10.1084/jem.137.5.1142

PubMed Abstract | CrossRef Full Text | Google Scholar

Südhof, T. C., Lottspeich, F., Greengard, P., Mehl, E., and Jahn, R. (1987). A synaptic vesicle protein with a novel cytoplasmic domain and four transmembrane regions. Science 238, 1142–1144. doi: 10.1126/science.3120313

PubMed Abstract | CrossRef Full Text | Google Scholar

Takahashi, K., and Yamanaka, S. (2006). Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126, 663–676. doi: 10.1016/j.cell.2006.07.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Thomas, K. R., and Capecchi, M. R. (1987). Site-directed mutagenesis by gene targeting in mouse embryo-derived stem cells. Cell 51, 503–512. doi: 10.1016/0092-8674(87)90646-5

PubMed Abstract | CrossRef Full Text | Google Scholar

von Bohlen Und Halbach, O. (2011). How to judge a book by its cover? How useful are bibliometric indices for the evaluation of “scientific quality” or “scientific productivity”? Ann. Anat. 193, 191–196. doi: 10.1016/j.aanat.2011.03.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Wendl, M. C. (2007). H-index: however ranked, citations need context. Nature 449:403. doi: 10.1038/449403b

PubMed Abstract | CrossRef Full Text | Google Scholar

Würtz, M., and Schmidt, M. (2016). The stratified H-index. Ann. Epidemiol. 26, 299–300. doi: 10.1016/j.annepidem.2016.01.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: h-index, bibliometrics, citations, nobel prize, research evaluation

Citation: Kreiner G (2016) The Slavery of the h-index—Measuring the Unmeasurable. Front. Hum. Neurosci. 10:556. doi: 10.3389/fnhum.2016.00556

Received: 30 September 2016; Accepted: 18 October 2016;
Published: 02 November 2016.

Edited by:

Mikhail Lebedev, Duke University, USA

Reviewed by:

Ludvic Zrinzo, UCL Institute of Neurology, UK
David Engblom, Linköping University, Sweden
Matjaž Perc, University of Maribor, Slovenia

Copyright © 2016 Kreiner. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Grzegorz Kreiner, a3JlaW5lckBpZi1wYW4ua3Jha293LnBs

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.