Skip to main content

Validation of diabetes mellitus and hypertension diagnosis in computerized medical records in primary health care



Computerized Clinical Records, which are incorporated in primary health care practice, have great potential for research. In order to use this information, data quality and reliability must be assessed to prevent compromising the validity of the results.

The aim of this study is to validate the diagnosis of hypertension and diabetes mellitus in the computerized clinical records of primary health care, taking the diagnosis criteria established in the most prominently used clinical guidelines as the gold standard against which what measure the sensitivity, specificity, and determine the predictive values.

The gold standard for diabetes mellitus was the diagnostic criteria established in 2003 American Diabetes Association Consensus Statement for diabetic subjects. The gold standard for hypertension was the diagnostic criteria established in the Joint National Committee published in 2003.


A cross-sectional multicentre validation study of diabetes mellitus and hypertension diagnoses in computerized clinical records of primary health care was carried out. Diagnostic criteria from the most prominently clinical practice guidelines were considered for standard reference.

Sensitivity, specificity, positive and negative predictive values, and global agreement (with kappa index), were calculated. Results were shown overall and stratified by sex and age groups.


The agreement for diabetes mellitus with the reference standard as determined by the guideline was almost perfect (κ = 0.990), with a sensitivity of 99.53%, a specificity of 99.49%, a positive predictive value of 91.23% and a negative predictive value of 99.98%.

Hypertension diagnosis showed substantial agreement with the reference standard as determined by the guideline (κ = 0.778), the sensitivity was 85.22%, the specificity 96.95%, the positive predictive value 85.24%, and the negative predictive value was 96.95%. Sensitivity results were worse in patients who also had diabetes and in those aged 70 years or over.


Our results substantiate the validity of using diagnoses of diabetes and hypertension found within the computerized clinical records for epidemiologic studies.

Peer Review reports


In recent decades, Computerized Clinical Records (CCR) are being used in the routine medical practice of primary health care (PHC) in the Spanish National Health System (NHS). Indeed, by 2007, 98.8% of general practices were computerized and 88% of the population had a primary care electronic health records [1]. There are many software systems to manage the CCR. In Spain the most frequently implemented CCR is the OMI-AP® program.

Other countries like Canada, the United States of America, and some European Countries have a significant experience using computerized health databases obtained with PHC records [24]. The most widely used is the GPRD (General Practice Research Database), which contains information introduced prospectively since 1987 by more than 1,500 general practitioners (GPs) in the PHC of the United Kingdom and includes 7% of the population [5]. The GPRD has been widely used for research studies, with over 700 associated papers published up to date in peer-reviewed journals [5].

Electronic Health Records provide great potential for research, because of their ability to provide data for large populations. Even though the CCR can be used for research, it is important to note that the data are collected primarily for routine clinical rather than for researching purposes. Data quality and reliability must be assessed by researchers who use the resources found in the CCR in order to prevent compromising the results.

There are several approaches found in the literature but there is not an agreed upon standard approach to evaluate the quality and accuracy.

Related validation studies have attempted to show whether the cases with diagnostic codes indeed have that condition. Two recent systematic reviews of validation studies within the GPRD have shown that most (90%) of the coded diagnoses, from many diseases, are 'validated'[6, 7]. In order to perform these validations, 83% of them used sources of information external to the GPRD, mostly questionnaires from GPs, hospital reports, copies of the Medical history or comparisons with disease rates obtained from other registries.

The validation of the diagnosis information included in BIFAP (Base de Datos para la Investigación Farmacoepidemiológica en Atención Primaria), population-based database in Spain containing information of more than 2.5 million patients, was carried out by sending questionnaires to collaborating doctors requiring hospital reports and other information in paper format of the clinical history and doctor's activities reports [8]. External validations were also carried out comparing the results with other sources of information, such as the national health survey and the death registry.

In order to guarantee the quality of the studies performed with the data from the CCR and because few studies have evaluated the quality of those registries, it is necessary to check their validity in our setting. The abovementioned methodology, that used the medical records of the second health care level as the standard reference, does not seem appropriate for chronic diseases such as hypertension (HTN) and diabetes mellitus (DM), which mainly were diagnosed and followed up in the PHC.

The aim of this study is to validate the diagnosis of HTN and DM coded in the CCR of PHC, taking the diagnosis criteria established in the most prominently used clinical guidelines as the gold standard.



Cross-sectional validation study of the diagnoses of DM and HTN in the CCR of PHC.


The study was carried out in the 21 health centers of the health area 4, in the northeast urban zone of the Community of Madrid, which serves a population of 777,426 people. All health centers have computerized patient records since at least for 10 years.

Sources of information

The CCR administered by the software OMI-AP® were structured around a list of episodes (problems in the bio-psycho-social sphere, reasons for consultation, etc). The episodes are listed with an alphanumeric code, which corresponds to the International Classification of Primary Care (ICPC), and a description or clinical label. The same code can be described by using many different clinical labels.

It is possible to link these episodes with diagnostic tests, prescriptions, protocols, therapeutic interventions, referrals, temporary incapacity to work reports, and free-text annotations. The laboratory results are automatically recorded in the CCR.

In PHC, text and codes are entered by the GPs during clinical care, as part of their routine clinical practice. The CCR incorporates a user-friendly instrument to encode episodes in order to make it acceptable and useful to the GPs, who are not professional coders. This instrument is a search system based on clinical labels that assigns the code automatically. The program allows the modification of the descriptions, but not the code which could be substituted or erased if deemed necessary.

Data extraction was conducted using the ICPC code from the CCR of patients.

Inclusion and exclusion criteria

The study population comprised those patients who met the following inclusion criteria: had at least one record within the CCR in the health centers of health area 4 as of January 1st 2010; over 18 years of age; had an ICPC code in their CCR corresponding to DM (32,377 patients with code T90) or to HTN (91,065 patients with codes K86 or K87), respectively.

Patients were not included if they met any of the following exclusion criteria: had not at least one plasma glucose measurement (7.3%) or two Blood pressure (BP) measurements (22.9%) in their CCR for the validation of DM and HTN, respectively.


Given the absence of reference information over the proportion of incorrectly classified cases (false negatives or false positives), the maximum indetermination was assumed (p = (1-p) = 0.5).

With this assumption, and in order to obtain a confidence of 95% and a precision of 5%, the required sample size was 384 patients for each variable. We increased it up to 423 to adjust for a foreseeable loss of 10% between the sampling and validation of the diagnosis (change of address, death or other reasons).

Four different samples of patients were obtained: with DM code (sample 1), without DM code (sample 2), with HTN code (sample 3), and without HTN code (sample 4) in the CCR. The first two were used to validate the episodes of DM and the latter two were used to validate the episodes of HTN.

Samples 1 and 3 were obtained by simple random way whilst samples samples 2 and 4 were obtained by individual matching by age and sex with samples 1 and 3, respectively.


Diagnostic test aims to correctly classify patients and healthy for a disease or clinical condition. The validation of a diagnostic test is performed by comparing their results, both positives and negatives, with those obtained by the best instrument for measuring the phenomenon under study (gold standard).

In this study, documented diagnosis of DM and HTN were considered as the diagnostic tests. In order to perform the validation, they were compared against the gold standards.

The gold standard for DM were the diagnostic criteria established in 2003 American Diabetes Association Consensus Statement for diabetic subjects, that that were still in effect in 2010 [9]. The reference diagnostic criteria for DM were shown in Table 1.

Table 1 Diabetes Mellitus diagnostic criteria.

The gold standard for HTN were the diagnostic criteria established in the Joint National Committee (JNC 7) of the United States published in 2003, which were shown in Table 2[10].

Table 2 Hypertension diagnostic criteria.

These criteria agree with recommendations given by other major scientific societies, such as the World Health Organization [11], the European Society for the Study of Diabetes (EASD) [12], the European Society of Hypertension (ESH) [13, 14], the European Society of Cardiology [14], the Canadian Diabetes Association [15] and with the recommended guidelines in Spain [1619].

Subjects of samples 1 and 2 were considered with DM as long as they fulfilled at least one of the criteria described in Table 1. Subjects from samples 3 and 4 that fulfilled any criteria from Table 2 were considered patients with HTN.

We consulted the computerized medical records of patients to verify concordance with the above criteria. The validation algorithm is shown in the Figure 1.

Figure 1

Validation algorithm. DM: diabetes mellitus; HTN: hypertension.

The evaluation team consisted of three general practitioners, with experience using the OMI-AP program. We conducted a peer evaluation with two reviewers and a third evaluator who resolved discrepancies.

Statistical Analysis

A descriptive analysis of the study population and samples was carried out. The age was expressed by means of average percentiles 25 and 75, and the qualitative variables were summarized with their relative frequency.

Sensitivity (Sn), specificity (Sp), positive predictive values (PPV) and negative predictive values (NPV) were calculated, with their confidence intervals at 95% (CI), overall and stratified by gender, age group and DM.

In addition, a sensitivity analysis was performed determining the relative impact on predictive values of varying assumptions regarding the prevalence of DM and HTN.

The Sn is the proportion of cases with DM or HTN codes in the CCR among those that fulfilled the diagnosis criteria of Tables 1, respectively. The Sp is the proportion of cases without DM or HTN code in the CCR among those that did not fulfill the diagnosis criteria. Positive predictive value is the probability that people with DM or HTN code in the CCR meet the diagnostic criteria, while negative predictive value is the probability that people without DM or HTN code do not meet the criteria.

We checked whether the Sn and Sp were different by gender, age group and DM Through the homogeneity test based on a χ2 statistic. In case that the application conditions of the test were not met (any expected cell count under 5), then the Fisher's exact bilateral test was used.

When diagnostic tests are applied to the population, the proportion of those testing positive (apparent prevalence) can not be used as an estimation of the prevalence of a disease in that population, because the Sn and Sp of these tests are usually less than 100%. Thus, the proportion of individuals with a positive result includes false positive cases and excludes cases that are false negatives, so in order to estimate the true prevalence of a disease from diagnostic tests; it is required to adjust for the misclassification resulting from the Sn and Sp of the used test. In this study we have used the formula proposed by Rogan and Gladen for this adjustment [20].

True prevalence  = apparent prevalence  +  Sp  -  1 Sn  +  Sp  - 1

The degree of overall agreement between the registered diagnosis and the reference standard, as well as the inter-observer agreement, was determined by the kappa index with their CI. According to this value, the agreement was considered slight (≤ 0.20), fair (0.21-0.40), moderate, (0.41-0.60), substantial (0.61-0.80) or almost perfect agreement (≥ 0.81) [21].

The statistic analysis of the information was performed with SPSS software (version 15.0, SPSS Inc., Chicago, Illinois), the CI of the kappa index and the predictive values were calculated with macros for SPSS of the Applied Statistics Laboratory in Universidad Autónoma de Barcelona: !KAPPA and !DT, respectively [22, 23].

Ethical Aspects

In order to ensure the confidentiality, the study was developed as stipulated in the Spanish Personal Data Protection Law. The protocol of the study was approved by the ethics committee of the Hospital Carlos III in Madrid and all of the evaluators signed a confidentiality clause.


The main demographic characteristics of the population of patients over 18 years old attended in the health area 4 with episodes of DM (ICPC T90) and HTN (ICPC K86 or K87) registered in the CCR, as well as the selected samples are described in Table 3.

Table 3 Main demographic characteristics of the population of patients over 18 years old attended in Health Area 4 and of samples.

The 7.3% of patients from sample 2 (without DM code) had to be excluded because there was not at least one fasting plasma glucose. They were 64.5% males with a mean age of 73.6 (SD 14.4) years. There were significant differences in mean age between patients excluded and not excluded.

The 22.9% of patients from sample 4 (without HTN code) had to be excluded because there were not at least two BP measurements. They were 36.1% males with a mean age of 69.1 (SD 11.6) years. There were no significant differences in mean age and female proportion between patients excluded and not excluded from the sample.

The prevalence of patients diagnosed with DM was 5.02%, slightly higher in males and in patients aged 65 years or older. The prevalence of patients with diagnosis of HTN was 14.11%, higher in women, in people aged 70 years or older and in patients with diagnosis of DM.

In our study, the diagnosis of DM was confirmed in 99.5% of the cases (sensitivity) and in 99.49% of those without diagnosis of DM did not meet the diagnosis criteria (specificity). There were no significant differences when stratifying by age groups or sex.

The sensitivity of the diagnosis of HTN was 85.22%, decreasing significantly in people over 69 years old (81.76%) and in patients with diagnosis of DM (79.85). The specificity of the diagnosis of HTN was 96.95%, with no significant differences when stratifying by sex or age groups (Table 4).

Table 4 Sensivity, specificity and agreement for diabetes mellitus and hypertension.

The degree of overall agreement between the diagnosis in the CCR and the standard of reference, measured as the kappa index, was almost perfect for DM (κ = 0.990), and substantial agreement for the HTN (κ = 0.778), as shown in Table 4. The worst result was obtained for HTN in the subgroup of patients with diagnosis of diabetes.

The degree of global inter-observers agreement, measured with the kappa index was very high, both for DM (κ = 0.988) and HTN (κ = 0.941), and for the different categories of sex, age groups and diagnosed diabetes. In all cases, the kappa index was higher than 0.880.

Table 5 shows the apparent prevalences (diagnosed in the CCR), true prevalences (fulfillment of the diagnosis criteria) as well as the positive and negative predictive values. The true prevalences were, for both diseases, higher than the apparent prevalences. The between them was 0.89% for the DM diagnosis and 21.45% for HTN, which increased up to 32.36% in patients diagnosed with diabetes. The probability that the diagnostic criteria could be confirmed in patients with diagnosed DM was 91.23% and 99.98% in HTN.

Table 5 Apparent prevalence (diagnosis in the CCR), true prevalence (fulfillment of the diagnosis criteria) and predictive values of diabetes and hypertension.

Given that the PPV is directly proportional to the prevalence of the disease and the NPV is inversely proportional to the prevalence, we estimated the PPV and NPV for different true prevalences of HTN and DM (Table 6).

Table 6 Predictive values for hypertension and diabetes for different prevalences (sensitivity analysis).


The results of the study show a very high agreement of the diagnosis of DM in the CCR with the gold standard, and also a high sensitivity and specificity of the diagnosis of DM. The information obtained from the CCR provides a good estimation of the true prevalence of the illness, overall and in each category of sex and age groups.

For HTN, there was also a good overall agreement of the diagnosis in the CCR with the gold standard, high Sp and Sn, but lower than for diabetes. In patients with DM subgroup, the agreement was strikingly lower especially at the expense of Sn, because the under-diagnosis of HTN is much higher and the NPV, very influenced by the prevalence, is ostensibly lower. Similar results, of less magnitude, are found in the subgroup of patients aged 70 or over.

In our study the DM diagnosis was confirmed in 99.53% of the cases and HTN in 98.11%.

The systematic review of GPRD validation studies by Herrett and cols [6] shows that the different diagnoses studied were confirmed in 89% of the cases, although DM and HTN were not validated. Only a small proportion of the studies provided quantitative estimates of validity such as sensitivity and specificity.

We have not found published studies that used similar methodology to ours for the validation of HTN and DM diagnoses. For this reason, our results can only be compared with most similar validation studies, which used self-reported diagnosis by patients compared to biometric measures as reference standards.

The Sn obtained in our study for the diagnosis of DM (99.53%) is much higher than 69.7% they found in the DINO study [24], which validates the self-reported diagnosis of diabetes, HTN and hyperlipidemia of a population of 20 years and older in southern Spain. Published studies in other countries, also performed with self-reported diagnosis, present great heterogeneity, with Sn of DM between 58.9% in a Dutch study [25] and 85.2% in Taiwan [26].

Regarding the diagnosis of HTN, our Sn was a little bit lower (85.22%). In Spain, the DINO study estimated the Sn in 49.4% [24], in a subsample of the SUN study the Sn was 90.3% [27], and the EPIC Murcia cohort study obtained a sensitivity of 63.5% [28], taking medical records as the reference standard. In other countries, we found values that oscillate between 34.5% in the Dutch study aforementioned [25], and 82% found in a North-American study [29].

The high specificity obtained for the diagnosis of DM (99.49%) is consistent with the findings of other published studies, where the Sp is situated between 95.2% [30] and 99.6% [24]. The Sp obtained for the HTN (96.95%) is slightly above what was found by other authors, which oscillated between 80% [31] up to 96.8% [24].

The agreement found between the diagnosis of DM registered in the CCR and the fulfillment of the diagnosis criteria were very good (k = 0.990), above the one obtained in the DINO study (K = 0.78) [24].

For HTN, the agreement (k = 0.778) was lower than for diabetes, but higher than those found in the DINO study (k = 0.51) [24], EPIC-Murcia (k = 0.58) [28] and in the SUN (k = 0.66) [27].

The validation indexes obtained in our study are higher than those found by other authors, possibly because we have checked the diagnosis done by physicians and not those self-reported by patients.

Our study confirms the hypothesis found in other publications in which DM diagnosis has higher validity than HTN diagnosis [25, 30, 32]. This could be related to the perceived higher seriousness of physicians for DM than for HTN, and due to the DM diagnostic criteria that were changed less frequently and more uniform than those for HTN. In general, the parameters of validity found enable us to realize a precise estimation of the prevalence of diabetes [33] but a sub-estimation of the prevalence of HTN.

The under-diagnosis of HTN is a well-known phenomenon whose magnitude varies greatly in published studies. In a systematic review of 44 studies from different countries published in 2009 [34], the proportion of undiagnosed HTN on a worldwide level was estimated to be 46.2% for men and 58.5% for women. Different Spanish studies offer results from 14.9% in Navarra [27], to 49.4% in Galicia [35] and 31.74% obtained in the PREDIMERC study in Madrid [36]. In our study, we found 21.45% in global and 32.36% in the subgroup of patients with diabetes.

Very little is known about the prevalence of undiagnosed HTN in diabetic patients. In Spain, the DIAPA study found that 56.8% of patients with type 2 DM had BP > 130/85 mmHg, even though they were not previously diagnosed with HTN [37]. The lack of HTN diagnosis in patients with DM could be related to the cutoff values of diagnosis, which are lower for these patients (BP ≥ 130/80 mmHg) [10], and it is possible that some GPs had been using the diagnostic criteria of the general population (BP ≥ 140/90 mmHg).

When stratifying by sex and age groups, there were significant differences in the Sn of HTN with worse results in patients with DM and in those over 69 years, despite the fact that these patients are subject to a large number of revisions and so there were more chances to detect HTN. These results are different than other studies [24, 25, 27, 28, 30, 32], where better results were obtained in older and diabetic patients. One possible explanation could be that other studies were done through questionnaires with volunteer participants. The methods used may have resulted in a selection bias, since the patients who are more worried about their health and those who have worse perception of their health may be more predisposed to participate.

The true prevalence of DM in our study was 5.06%, which is very similar to the obtained with the BIFAP database (5.8%) [8] and in the Spanish National Health Survey (4.79%) [38] but lower than 8.1% in the PREDIMERC study in Madrid [36].

The true prevalence of HTN in our study (17.14%) is similar to those obtained with the BIFAP database (16.1%) [8] and in the Spanish National Health Survey (18.89%) [38] but also lower than the findings in other studies in Spain. These studies found about 35% in the adult population [39] and 29.3% in the PREDIMERC study [36].

This differences observed in the magnitude of the prevalences could be due to the age of the patients included in PREDIMERC [36]. The patient range age was between 30 and 74 whereas our study includes all those aged 18 and over. If we had selected people aged 30 or over in our database, the true prevalence would have been 6.87% for DM and 23.72% for HTN.

Moreover, PREDIMERC was undertaken with volunteers, with an overall response rate of 56.4%, which may have produced a selection bias as was mentioned before. Furthermore, we cannot assure that false positives were due to misdiagnosis, simply that the verification of diagnostic criteria could not be met. This may have led to an underestimation of the prevalence in our study.

The prevalence of HTN in diabetic patients obtained in our study (80.22%) is close to the highest found in the studies published, which oscillate between 50% and 84% [37, 40].

The study presents some limitations. These are that the information included may not have been completely exhaustive. Because of this a potential selection bias may exist. This is particularly true in view of the proportion of adults in nursing homes, chronic disease hospitals, or those treated in private practice. In addition, as the health area 4 in Madrid covers only urban population, patients living in rural areas were not represented in this study.

Alternatively, more than 95% of citizens have public health coverage with the Spanish National Health System [41], so we suppose that the proportion of assigned persons in the health area 4 who are not included in our study is low.

The high proportion of patients without available information to perform the validation (7.3% had not at least one plasma glucose level and 22.9% had not two or more BP measurements) could be due to the fact that these patients do not usually go to health centers. At least one or two years of active data are basically required in order to include clinical patient records in a study.

In both the GPRD and the BIFAP, GPs who voluntarily participate in the projects are the ones that refer the information, possibly introducing a selection bias (e.g. the pattern of patients care can differ among volunteer physicians and those who are not volunteers). Furthermore, the response rate of physicians to questionnaires can be low, as occurred in BIFAP with the validation of Upper Gastrointestinal Bleeding with a rate of the 58.4% [8]. Our database is exhaustive, containing information from the entire population attended in the health area, and registered by all the professionals, avoiding the possibility of the mentioned biases.

Published studies related to validation are aimed to show whether cases with diagnostic codes indeed have that condition. The use of the PPV as the only measurement presents the inconvenience that it depends on the prevalence of the disease, as shown in Table 6. Another weakness of these validation studies is that, with a few exceptions, they do not address the question of false negatives, which are cases of the disease who have not received a diagnostic code. There were missed cases in which the GPs did not make the diagnosis or when made diagnosis but encoded it wrong [42]. We argue that in any validation study, PPV, NPV, Sn, and Sp should be identified, as far as possible.

The validation of the diagnosis of CCR of PHC has facilitated the detection of areas of improvement in the clinical practice, such as under-diagnosis of HTN with differential classification bias for patients who also suffers diabetes.

These findings can be used to alert clinicians of subgroups for which the interventions could be more beneficial.

The use of secondary sources of information stored in computerized databases enables access data from large populations. This can facilitate the rapid identification of patients for observational studies or inclusion in interventions and may reduce the time and resources needed to obtain results. This greater efficiency constitutes one of the main advantages for using the databases as epidemiological research tools.


The results obtained in this validation enable the usage of both DM and HTN diagnoses codes of the computerized clinical records of PHC as a valid tool, which can be used with confidence to perform epidemiological studies.

However, the HTN diagnosis in the CCR has lower sensitivity than DM diagnosis, especially in diabetic patients. Therefore, in this group of patients, the code of HTN diagnosis in the CCR is not enough in order to detect people without HTN since there would be selected a high amount of false negative results.


  1. 1.

    Ministry of Health and Social Policy: ICT in the Health System. The Healthcare Online program. Progress update January 2010. []

  2. 2.

    Jick H, Madsen S, Nudelman PM, Perera DR, Stergachis A: Postmarketing follow-up at Group Health Cooperative of Puget Sound. Pharmacotherapy. 1984, 4 (2): 99-100.

    CAS  Article  PubMed  Google Scholar 

  3. 3.

    Malcolm E, Downey W, Strand L, McNutt M, West R: Saskatchewan Health's linkable data bases and pharmacoepidemiology. PMS. 1993, 6: 175-264.

    Google Scholar 

  4. 4.

    Ray WA, Griffin MR: Use of Medicaid data pharmacoepidemiology. Am J Epidemiol. 1989, 129 (4): 837-49.

    CAS  PubMed  Google Scholar 

  5. 5.

    General Practice Research Database [Internet]. []

  6. 6.

    Herrett E, Thomas SL, Schoonen WM, Smeeth L, Hall AJ: Validation and validity of diagnoses in the General Practice Research Database: a systematic review. Br J Clin Pharmacol. 2010, 69 (1): 4-14. 10.1111/j.1365-2125.2009.03537.x.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Khan NF, Harrison SE, Rose PW: Validity of diagnostic coding within the General Practice Research Database: a systematic review. Br J Gen Pract. 2010, 60 (572): 128-136. 10.3399/bjgp10X483562.

    Article  Google Scholar 

  8. 8.

    Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria BIFAP. []

  9. 9.

    American Diabetes Association: Treatment of hypertension in adults with diabetes (Position Statement). Diabetes Care. 2003, 26 (Suppl 1): S80-S82.

    Google Scholar 

  10. 10.

    Chobanian AV, Bakris GL, Black HR, Cushman WC, Green LA, Izzo JL, et al: The Seventh Report of the Joint National Committee on Prevention, Detection, Evaluation, and Treatment of High Blood Pressure. JAMA. 2003, 289: 2560-71. 10.1001/jama.289.19.2560.

    CAS  Article  PubMed  Google Scholar 

  11. 11.

    World Health Organization: Definition, Diagnosis and classification of diabetes mellitus and its complications: Report of a WHO Consultation. Part 1: Diagnosis and clasification of diabetes mellitus. 1999, Geneve: World Heath Organization, 539-53.

    Google Scholar 

  12. 12.

    European Diabetes Policy Group 1999: A desktop guide to type 2 diabetes Mellitus. Diabet Med. 1999, 16 (9): 716-30. 10.1046/j.1464-5491.1999.00166.x.

    Article  Google Scholar 

  13. 13.

    The Task Force for the Management of Arterial Hypertension of the European Society of Hypertension (ESH) and of the European Society of Cardiology (ESC): 2007 Guidelines for the Management of Arterial Hypertension. J Hypert. 2007, 25: 1005-87.

    Google Scholar 

  14. 14.

    Mancia G, Laurent S, Agabiti-Rosei E, Ambrosioni E, Burnier M, Caulfield MJ, et al: Reappraisal of European guidelines on hypertension management: a European Society of Hypertension Task Force document. J Hypertens. 2009, 27: 2121-58. 10.1097/HJH.0b013e328333146d.

    CAS  Article  PubMed  Google Scholar 

  15. 15.

    Canadian Diabetes Association: Canadian Diabetes Association 2008 clinical practice guidelines for the prevention and management of diabetes in Canada. Can J Diabetes. 2008, 32: S1-201.

    Google Scholar 

  16. 16.

    Grupo de trabajo de la Guía de Práctica Clínica sobre Diabetes tipo 2: Guía de Práctica Clínica sobre Diabetes tipo 2. Madrid: Plan Nacional para el SNS del MSC. 2008, Agencia de Evaluación de Tecnologías Sanitarias de El País Vasco, Guías de Práctica Clínica en el SNS: OSTEBA N° 2006/08

    Google Scholar 

  17. 17.

    Marín R, de la Sierra A, Armario P, Campo C, Banegas JR, Gorostidi M: Guía sobre el diagnóstico y tratamiento de la hipertensión arterial en España 2005. Med Clin(Barc). 2005, 125: 24-34. 10.1157/13076402.

    Article  Google Scholar 

  18. 18.

    Coca A, Aranda P, Bertomeu V, Bonet A, Esmatjes E, Guillén F, et al: Strategies for effective control of arterial hypertension in Spain. Consensus document. Rev ClinEsp. 2006, 206: 510-4.

    CAS  Google Scholar 

  19. 19.

    De la Sierra A, Gorostidi M, Marín R, Redón J, Banegas JR, Armario P, et al: Evaluación y tratamiento de la hipertensión arterial en España. Documento de consenso. Med Clin (Barc). 2008, 131 (3): 104-16. 10.1157/13124015.

    Article  Google Scholar 

  20. 20.

    Rogan WJ, Gladen B: Estimating prevalence from the results of a screening test. Am J Epidemiol. 1978, 107: 71-6.

    CAS  PubMed  Google Scholar 

  21. 21.

    Landis JR, Koch GG: The measurement of observer agreement for categorical data. Biometrics. 1977, 33: 159-174. 10.2307/2529310.

    CAS  Article  PubMed  Google Scholar 

  22. 22.

    Domenech JM, Granero R: Macro !KAPPA for SPSS Statistics. Weighted Kappa [computer program]. V2009.07.31. 2009, Bellaterra: Universitat Autonoma de Barcelona, []

    Google Scholar 

  23. 23.

    Domenech JM: SPSS Macro !DT. Diagnostic Tests [computer program]. V2008.05.27. 2008, Bellaterra: Universitat Autònoma de Barcelona, []

    Google Scholar 

  24. 24.

    Huerta JM, Tormo MJ, Egea-Caparrós JM, Ortolá-Devesa JB, Navarro C: Validez del diagnóstico referido de diabetes, hipertensión e hiperlipemia en población adulta española. Resultados del estudio DINO. Rev Esp Cardiol. 2009, 62: 143-52.

    Article  Google Scholar 

  25. 25.

    Molenaar EA, Van Ameijden EJ, Grobbee DE, Numans ME: Comparison of routine care self-reported and biometrical data on hypertension and diabetes: results of the Utrecht Health Project. Eur J Public Health. 2007, 17: 199-205. 10.1093/eurpub/ckl113.

    Article  PubMed  Google Scholar 

  26. 26.

    Goldman N, Lin IF, Weinstein M, Lin YH: Evaluating the quality of self-reports of hypertension and diabetes. J Clin Epidemiol. 2003, 56 (2): 148-54. 10.1016/S0895-4356(02)00580-2.

    Article  PubMed  Google Scholar 

  27. 27.

    Alonso A, Beunza JJ, Delgado-Rodríguez M, Martínez-González MA: Validation of self reported diagnosis of hypertension in a cohort of university graduates in Spain. BMC Public Health. 2005, 5: 94-10.1186/1471-2458-5-94.

    Article  PubMed  PubMed Central  Google Scholar 

  28. 28.

    Tormo MJ, Navarro C, Chirlaque MD, Barber X: Validation of self diagnosis of high blood pressure in a sample of the Spanish EPIC cohort: overall agreement and predictive values. EPIC Group of Spain. J Epidemiol Community Health. 2000, 54: 221-6. 10.1136/jech.54.3.221.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Giles WH, Croft JB, Keenan NL, Lane MJ, Wheeler FC: The validity of self-reported hypertension and correlates of hypertension awareness among blacks and whites within the stroke belt. J Prev Med. 1995, 11 (3): 163-9.

    CAS  Google Scholar 

  30. 30.

    Wu SC, Li CY, Ke DS: The agreement between self-reporting and clinical diagnosis for selected medical conditions among the elderly in Taiwan. Public Health. 2000, 114 (2): 137-42. 10.1016/S0033-3506(00)00323-1.

    CAS  Article  PubMed  Google Scholar 

  31. 31.

    Johansson J, Hellenius ML, Elofsson S, Krakau I: Self-report as a selection instrument in screening for cardiovascular disease risk. Am J Prev Med. 1999, 16 (4): 322-4. 10.1016/S0749-3797(99)00019-7.

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Bowlin SJ, Morrill BD, Nafziger AN, Lewis C, Pearson TA: Reliability and changes in validity of self-reported cardiovascular disease risk factors using dual response: the behavioral risk factor survey. J Clin Epidemiol. 1996, 49: 511-17. 10.1016/0895-4356(96)00010-8.

    CAS  Article  PubMed  Google Scholar 

  33. 33.

    Bays HE, Bazata DD, Clark NG, Gavin JR, Green AJ, Lewis SJ, Reed ML, Stewart W, Chapman RH, Fox KM, Grandy S: Prevalence of self-reported diagnosis of diabetes mellitus and associated risk factors in a national survey in the US population: SHIELD (Study to Help Improve Early evaluation and management of risk factors Leading to Diabetes). BMC Public Health. 2007, 3;7: 277-

    Article  Google Scholar 

  34. 34.

    Pereira M, Lunet N, Azevedo A, Barros H: Differences in prevalence, awareness, treatment and control of hypertension between developing and developed countries. J Hypertens. 2009, 27 (5): 963-75. 10.1097/HJH.0b013e3283282f65.

    CAS  Article  PubMed  Google Scholar 

  35. 35.

    Perez-Fernandez R, Mariño AF, Cadarso-Suarez C, Botana MA, Tome MA, Solache I, et al: Prevalence, awareness, treatment and control of hypertension in Galicia (Spain) and association with related diseases. J Hum Hypertens. 2007, 21 (5): 366-73.

    CAS  PubMed  Google Scholar 

  36. 36.

    Zorrilla B, De la Calle H, Martínez M, Gil E, Sánchez JL, Nogales P, et al: Prevalencia de diabetes mellitus en la Comunidad de Madrid: estudio PREDIMERC. Av Diabetol. 2008, 24 (Suppl 1): 45-112.

    Google Scholar 

  37. 37.

    García Vallejo O, Vicente Lozano J, Vegazo O, Jiménez Jiménez FJ, Llisterri Caro JL, Redón J: Control de la presión arterial de los pacientes diabéticos en el ámbito de atención primaria. Estudio DIAPA. Med Clin (Barc). 2003, 120 (14): 529-34. 10.1157/13045871.

    Article  Google Scholar 

  38. 38.

    Ministerio de Sanidad y Política social: Spanish National Health Survey 2006. [Internet database]. []

  39. 39.

    Banegas JR: Epidemiología de la hipertensión arterial en España. Situación actual y perspectivas. Hipertensión. 2005, 22: 353-62.

    Article  Google Scholar 

  40. 40.

    Kannel WB, Wilson PW, Zhang TJ: The epidemiology of impaired glucose tolerance and hypertension. Am Heart J. 1991, 121: 1268-73. 10.1016/0002-8703(91)90432-H.

    CAS  Article  PubMed  Google Scholar 

  41. 41.

    Fernández Cuenca R: Análisis de los servicios sanitarios. Sociedad Española de Salud Pública y Administración Sanitaria. Informe SESPAS 1998: La salud pública y el futuro del estado de bienestar. 1998, Granada: Escuela Andaluza de Salud Pública, 249-98.

    Google Scholar 

  42. 42.

    Nicholson A, Tate AR, Koeling R, Cassell JA: What does validation of cases in electronic record databases mean? The potential contribution of free text. Pharmacoepidemiol Drug Saf. 2011, 20 (3): 321-4. 10.1002/pds.2086.

    Article  PubMed  PubMed Central  Google Scholar 

Pre-publication history

  1. The pre-publication history for this paper can be accessed here:

Download references


We thank Javier Díez Castro and Enrique Carrillo de Santa Pau for their useful contributions to the manuscript.

Author information



Corresponding author

Correspondence to Carmen de Burgos-Lunar.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

CBL conceived the study, participated in its design, performed the statistical analysis and drafted the manuscript. MASF participated in the design and drafted the manuscript. JCV participated in the design and coordinated the research group. JCAH, ICG, SSD drafted the manuscript and made substantial contributions to the analysis and interpretation. CYFR helped in the statistical analysis and drafted the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

de Burgos-Lunar, C., Salinero-Fort, M.A., Cárdenas-Valladolid, J. et al. Validation of diabetes mellitus and hypertension diagnosis in computerized medical records in primary health care. BMC Med Res Methodol 11, 146 (2011).

Download citation


  • Positive Predictive Value
  • Primary Health Care
  • Negative Predictive Value
  • Diagnosis Criterion
  • Kappa Index