Assessing the impact of biomedical research in academic institutions of disparate sizes

Background The evaluation of academic research performance is nowadays a priority issue. Bibliometric indicators such as the number of publications, total citation counts and the h-index are an indispensable tool in this task, but their inherent association with the size of the research output may result in rewarding high production when evaluating institutions of disparate sizes. The aim of this study is to propose an indicator that may facilitate the comparison of institutions of disparate sizes.
Methods The Modified Impact Index (MII) was defined as the ratio of the observed h-index (h) of an institution over the h-index anticipated for that institution on average, given the number of publications (N) it produces, i.e. $\mathrm{MII} = h / (10^{\alpha} N^{\beta})$ (α and β denote the intercept and the slope, respectively, of the line describing the dependence of the h-index on the number of publications in log10 scale). MII values higher than 1 indicate that an institution performs better than the average, in terms of its h-index. Data on scientific papers published during 2002–2006 and within 36 medical fields for 219 Academic Medical Institutions from 16 European countries were used to estimate α and β and to calculate the MII of their total and field-specific production.
Results In our biomedical research data, the slope β governing the dependence of the h-index on the number of publications was found to be similar to that estimated in other disciplines (≈0.4). The MII was positively associated with the average number of citations/publication (r = 0.653, p < 0.001), the h-index (r = 0.213, p = 0.002) and the number of publications with ≥ 100 citations (r = 0.211, p = 0.004), but not with the number of publications (r = -0.020, p = 0.765). Among the indicators examined, it was the one most strongly associated with the country-specific share of government budget appropriations or outlays for research and development as % of GDP in 2004 (r = 0.229), followed by the average number of citations/publication (r = 0.153), whereas the corresponding correlation coefficient for the h-index was close to 0 (r = 0.029). The MII was calculated for the 10 top-ranked European universities in life sciences and biomedicine, as provided by the Times Higher Education ranking system, and their total and field-specific performance was compared.
Conclusion The MII should complement the use of the h-index when comparing the research output of institutions of disparate sizes. It has a conceptual interpretation and, with the data provided here, can be computed for the total research output as well as for field-specific publication sets of institutions in biomedicine.


Background
Bibliometric indices are an indispensable tool in evaluating the research output of individuals and institutions. Recently, novel indicators have been proposed with the aim of overcoming deficiencies of the "traditional" bibliometric indices (e.g. number of publications, total citation count, average number of citations per publication) and of combining more efficiently information on both the quantity and the quality of the research output [1][2][3][4]. The h-index is the best-known example of such an indicator [1] and is now routinely provided by Thomson Scientific Web of Science and other bibliometric databases. This indicator is defined as the number h of papers of an individual or an institution that have each received at least h citations. As a result, it combines information on both the number of papers and the number of citations. However, due to its inherent association with the size of the research output, it may result in rewarding institutions with high production [2]. Thus, when comparing institutions, a proper calibration of the h-index for the size of the output may provide additional information.
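The definition of the h-index given above can be illustrated with a minimal sketch. The function name and the citation counts below are hypothetical and serve only as an illustration; they are not taken from the study data.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank   # the paper at this rank still has enough citations
        else:
            break      # all remaining papers have fewer citations than their rank
    return h


# Hypothetical citation counts for six papers (illustrative only).
example_citations = [25, 8, 5, 3, 3, 0]
print(h_index(example_citations))
```

Because h can never exceed the number of papers in the set, the sketch also makes visible why a small publication set is capped at a low h-index regardless of how highly cited its papers are.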
Recently, it has been shown that when evaluating sets of publications ranging from several hundred to $10^{5}$ papers, the dependence of the h-index on the size of the set is characterised by a "universal" growth rate [2]. This was shown for interdisciplinary, mechanics and materials science data [2] as well as for nonbiomedical research data [5]. Thus, the h-index can be decomposed into the product of a factor depending on the population size and of an impact index. This impact index can be used to compare the research output of institutions with disparate numbers of publications. However, like most bibliometric indicators, the impact index of an institution is not informative on its own, unless it is compared to the corresponding indices of other institutions. Furthermore, Molinari and Molinari [2] have provided parameter estimates to calculate this index only for a large number of papers; therefore, it cannot be extended to assess the impact in, e.g., specific fields where the publication sets are much smaller.
In the present study we aim to extend the interpretation of the h-index by proposing a size-corrected, h-index based indicator, the Modified Impact Index (MII). The concept of this index is to assess whether the h-index of an institution deviates from the average h-index, as estimated for a particular number of publications. The MII shares all the merits of the impact index. Additionally, we will show that it has a more informative numerical interpretation and that, with the data provided in the following sections, it may also be used in the case of smaller publication sets. We will illustrate the use of this index in biomedical research and explore its application within specific biomedical disciplines.

Methods
The Academic Medical Institutions located in 16 European countries (Austria, Belgium, Denmark, Finland, France, Germany, Greece, Ireland, Italy, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, United Kingdom) were identified from the database of medical schools provided by the Institute for International Medical Education [6]. Once the final list of 219 institutions was compiled, all publications affiliated to the corresponding universities (excluding meeting abstracts) and classified into any of the 36 pre-specified medical subjects (Table 1)

Modified Impact Index (MII) in biomedical research
When the h-index of each institution was plotted against the corresponding number of papers from 36 medical fields on a log-log plot, the resulting points were fitted by a regression line (Figure 1), $\log_{10} h = \alpha + \beta \log_{10} N$ (equation 1). The slope β in biomedical research was found to be similar to that estimated in other disciplines (≈0.4). The number of publications ranged from $10^{2}$ to $10^{4}$ papers, with the exception of one institution with a very low number of publications. The exclusion of this institution did not alter the estimated slope. Our estimate for β in biomedical sciences was consistent among different countries (Figure 2).
The fitted regression line of equation (1) provides the average h-index for a particular number of publications.
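As an illustration of how the intercept α and slope β might be estimated, the following is a minimal sketch using ordinary least squares on the $\log_{10}$-transformed values. The function name and the input data are hypothetical; the source does not state which software or fitting routine was used, so this should be read as one plausible procedure rather than the authors' implementation.

```python
import math


def fit_loglog(n_pubs, h_indices):
    """Least-squares fit of log10(h) = alpha + beta * log10(N).

    Returns (alpha, beta). Assumes all inputs are positive and that at
    least two distinct publication counts are supplied.
    """
    xs = [math.log10(n) for n in n_pubs]
    ys = [math.log10(h) for h in h_indices]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Slope = covariance(x, y) / variance(x); intercept from the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    alpha = mean_y - beta * mean_x
    return alpha, beta


# Hypothetical institutions (publication count, h-index); illustrative only.
pubs = [150, 400, 1200, 3500, 9000]
hs = [14, 22, 33, 52, 78]
alpha, beta = fit_loglog(pubs, hs)
print(f"alpha = {alpha:.3f}, beta = {beta:.3f}")
```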
Thus, points above the regression line correspond to institutions with h-index higher than the average. Similarly, points below the regression line correspond to institutions with h-index lower than the average. The difference $\log_{10} h_i - (\alpha + \beta \log_{10} N_i)$ between the observed $\log_{10} h_i$ (denoted as circles in Figures 1 and 2) and the corresponding fitted value $\alpha + \beta \log_{10} N_i$ (superimposed regression line) expresses the deviation $\varepsilon_i$ of the observed h-index of the $i$th institution from the average estimate for the number of publications it produces. In the original scale, this difference is transformed into the ratio $h_i / (10^{\alpha} N_i^{\beta})$.
This ratio expresses how many times the observed h-index is higher than that estimated by the regression model based on the number of publications. Thus, a value higher than 1 indicates that the particular institution performs better in terms of h-index than would be expected for the number of publications it produces. Similarly, a value lower than 1 indicates that the particular institution performs worse in terms of h-index than would be expected for the number of publications it produces. The ratio was found to be equivalent to the impact index proposed by Molinari and Molinari [2].
We also examined whether the country-specific modified impact indices (calculated as the median of the MIIs of the institutions for each country) correlated with the share of government budget appropriations or outlays for research and development (GBAORD) as % of GDP in 2004 (GBAORD are a way of measuring government support to R&D activities) [8]. The MII was the most highly associated indicator (r = 0.229), followed by the average number of citations/publication (r = 0.153), whereas the corresponding correlation coefficient for the h-index was close to 0 (r = 0.029).
We used the database with the number of publications and corresponding h-indices per subfield. Plots similar to Figure 1 were constructed for each one of the 36 medical fields and the parameters α and β were estimated (Table 1). These parameters can be used to estimate the field-specific MII of an institution or a department. The field-specific slopes had a mean (SD) of 0.571 (0.045) and ranged from 0.488 (subfield: Otorhinolaryngology) to 0.668 (subfield: Allergy). There was a slight negative association between the slopes and the number of publications per field (i.e. higher slopes in subfields with few publications), which was not statistically significant (r = -0.126, p = 0.465).
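The ratio of the observed h-index to the h-index expected from the regression line can be computed as in the sketch below. The function name is hypothetical, and the intercept used in the example is an arbitrary placeholder rather than an estimate from this study; only the slope 0.445 corresponds to the overall estimate reported in the text.

```python
def modified_impact_index(h, n_pubs, alpha, beta):
    """Ratio of the observed h-index to the h-index expected for n_pubs papers.

    The expected h-index follows from log10(h) = alpha + beta * log10(N),
    i.e. expected_h = 10**alpha * N**beta.
    """
    expected_h = (10 ** alpha) * (n_pubs ** beta)
    return h / expected_h


# Illustrative call: alpha = 0.30 is a placeholder, NOT a value from the study.
mii = modified_impact_index(h=60, n_pubs=2000, alpha=0.30, beta=0.445)
print("above average" if mii > 1 else "at or below average", round(mii, 3))
```

For field-specific comparisons, the same function would be called with the α and β estimated for that field, provided the number of publications lies within the range over which those parameters were fitted.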

MII for selected top-ranked universities
To illustrate our findings, we compared the 10 top-ranked European universities in life sciences and biomedicine, as provided by Times Higher Education [7]. In Table 2, the number of publications, h-index, impact index proposed by Molinari and Molinari [2] and MII are presented for all 36 medical fields (publication years: 2002-2006). All universities had an MII higher than 1 (range: 1.027-1.403), i.e. their performance based on the h-index was higher than or around that expected based on the number of papers they produced. In terms of h-index, the two most productive institutions (Imperial College and UCL) occupied the A higher heterogeneity was observed in the estimated MIIs for selected subfields, such as "Cardiac and Cardiovascular Systems", where the MII was found to range within 0.842-1.720 (Table 3). Uppsala, Cambridge and Edinburgh ranked first according to MII in the subfields "Medicine, General and Internal", "Cardiac and Cardiovascular Systems" and "Infectious Diseases", respectively.

Discussion
The h-index is a valuable bibliometric indicator that combines information on both the quantity and the quality of the research output. Moreover, the findings of a recent paper indicate that it is better at predicting researchers' future scientific achievement than other indicators (total citation count, average number of citations per paper, total paper count) [9]. However, the h-index has various shortcomings, in particular when comparing individual scientists, discussed in detail by others [10][11][12][13]: it cannot differentiate between active and inactive scientists, it depends on the scientific age, it is affected by different discipline-dependent citation patterns, etc. Numerous variants have been proposed that aim to overcome some of these disadvantages. For example, the m quotient allows comparison of scientific careers of different lengths [1], the g and h(2) indices give more weight to highly cited papers [14,15], the impact index $h_m$ provides an evaluation of the impact of the production [2] and the contemporary h-index [13] gives more weight to newer articles.
The proposed index deals with the fact that the inherent association of the h-index with the size of the research output may result in rewarding high production when evaluating institutions of disparate sizes. By definition, the h-index cannot exceed the number of publications. Thus, as noted by Glänzel [12], "it puts small but highly-cited paper sets at a disadvantage ('small is not beautiful')". An institution with a moderate-size production will not reach the h-index of a very large institution, even if its publications are of similar or even better quality, simply because its total production may be even less than h.
An application of the proposed modified impact index was presented using biomedical data. In biomedical research, the parameter β that characterises the dependence of the h-index on the number of publications was approximately 0.4 and similar to that estimated in other disciplines (interdisciplinary, mechanics and materials science data [2], nonbiomedical research data [5] and chemical research data [16]). These estimates were based on publication sets ranging from a few hundred to several thousand papers. When the number of publications ranges from a few papers up to approximately 500, as e.g. when evaluating the research output within specific subfields, the parameter β was higher than the overall estimate of 0.445. This was also noted by Molinari & Molinari [2], who have shown that the slope of the line describing the dependence of the h-index on the number of publications is higher when the number of evaluated papers is small. In our biomedical data, the field-specific slopes ranged from 0.488 to 0.668. For example, in the field "Medicine, General & Internal" Uppsala had 223 papers with an h-index of 40, so using the appropriate field-specific values for the intercept α and slope β the corresponding MII was calculated as $\mathrm{MII} = 40 / (10^{\alpha} \cdot 223^{\beta})$.
The proposed index correlated with the share of government budget appropriations or outlays for research and development as % of GDP in 2004 (r = 0.229) whereas the corresponding correlation coefficient for the h-index was close to 0. Additionally, it was positively associated with the average number of citations/publication, the h-index and the number of highly cited papers. Furthermore, for a given β the MII provides the same ranking as the impact index proposed by Molinari and Molinari [2]. Actually, the estimates of β provided here can be used to calculate the impact index of institutions in biomedical research and within specific biomedical disciplines. Both indices have the advantage that they can be well estimated by using a representative subset of the publications rather than the total set of publications produced by an institution [2]. The advantage of MII over the impact index is its conceptual interpretation.
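The ranking equivalence noted above can be made explicit with a short derivation. It assumes that the impact index of [2] takes the form $h_m = h / N^{\beta}$, which is how the decomposition of the h-index into a size factor and an impact index is understood here; the exact form should be checked against [2].

```latex
\[
\mathrm{MII}_i
  = \frac{h_i}{10^{\alpha} N_i^{\beta}}
  = 10^{-\alpha}\,\frac{h_i}{N_i^{\beta}}
  = 10^{-\alpha}\, h_{m,i}
\]
```

Under this assumption, and for a fixed fit (fixed α and β), the MII is the impact index multiplied by the positive constant $10^{-\alpha}$, so the two indices order institutions identically. The rescaling is what places the reference value for average performance at 1.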
The estimates of the αs and βs were based on data from European Medical Institutions. In order to assess whether these estimates can also be used to calculate the MII for non-European institutions, we performed a preliminary analysis to check whether the slope based on data from top-ranked US universities is similar to that obtained from the top-ranked European ones. We observed that these slopes were similar unless universities with a number of publications outside the evaluated range were included (e.g. Harvard and Johns Hopkins). Thus, we advocate that the estimates provided here can be used to calculate the MII
Bibliometric methods have been criticised due to technical and methodological problems generally encountered when they are employed to assess the research output of a university [17,18]. Furthermore, the bibliometric indices currently used appear to be related to the size of research output and thus probably tend to favour large institutions. The proposed index presents some clear advantages compared to existing bibliometric indices: it is not associated with the size of the publication output and thus can be used to compare institutions of disparate size, it has a conceptual interpretation (performance below or above the average) and it can be computed by using a representative subset of the publications rather than the total set of publications produced by an institution. However, its computation requires estimates for the αs and βs and thus is not as straightforward as in the case of the usual bibliometric indices. As mentioned before, the parameter β has a "universal" estimate of 0.4, independent of the discipline but dependent on the size of the publication set. As a result, the estimates for the αs and βs, as e.g. those provided here for biomedicine, can be applied to compute the MII of an institution as long as the number of its publications falls within the evaluated range (e.g. $10^{2}$ to $10^{4}$ papers in our case). Thus, it would not be safe to use them for outliers, i.e. for institutions with productivity outside the evaluated range.

Conclusion
In conclusion, there is a growing demand for transparent and valid evaluation of universities, but any ranking is bound to give rise to controversy. The assessment of medical research performance, in particular, is a challenging task. Peer review, currently regarded as the gold standard of research evaluation, is usually not feasible for large-scale evaluations. For large-scale evaluative purposes, we advocate the use of a combination of bibliometric indices that includes an indicator not associated with the size of the research output. The proposed modified impact index is such an indicator: it has a conceptual interpretation and, with the data provided here, can be computed for large as well as for small field-specific publication sets in biomedicine.