This article has Open Peer Review reports available.
Investigating linkage rates among probabilistically linked birth and hospitalization records
© Bentley et al.; licensee BioMed Central Ltd. 2012
Received: 29 February 2012
Accepted: 28 August 2012
Published: 25 September 2012
With the increasing use of probabilistically linked administrative data in health research, it is important to understand whether systematic differences occur between the populations with linked and unlinked records. While probabilistic linkage involves combining records for individuals, population perinatal health research requires a combination of information from both the mother and her infant(s). The aims of this study were to (i) describe probabilistic linkage for perinatal records in New South Wales (NSW) Australia, (ii) determine linkage proportions for these perinatal records, and (iii) assess records with linked mother and infant hospital-birth record, and unlinked records for systematic differences.
This is a population-based study of probabilistically linked statutory birth and hospital records from New South Wales, Australia, 2001-2008. Linkage groups were created where the birth record had complete linkage with hospital admission records for both the mother and infant(s), partial linkage (the mother only or the infant(s) only) or neither. Unlinked hospital records for mothers and infants were also examined. Rates of linkage as a percentage of birth records and descriptive statistics for maternal and infant characteristics by linkage groups were determined.
Complete linkage (mother hospital record – birth record – infant hospital record) was available for 95.9% of birth records, partial linkage for 3.6%, and 0.5% with no linked hospital records (unlinked). Among live born singletons (complete linkage = 96.5%) the mothers without linked infant records (1.6%) had slightly higher proportions of young, non-Australian born, socially disadvantaged women with adverse pregnancy outcomes. The unlinked birth records (0.4%) had slightly higher proportions of nulliparous, older, Australian born women giving birth in private hospitals by caesarean section. Stillbirths had the highest rate of unlinked records (3-4%).
This study shows that probabilistic linkage of perinatal records can achieve high, representative levels of complete linkage. Records for mother’s that did not link to infant records and unlinked records had slightly different characteristics to fully linked records. However, these groups were small and unlikely to bias results and conclusions in a substantive way. Stillbirths present additional challenges to the linkage process due to lower rates of linkage for lower gestational ages, where most stillbirths occur.
The ability to conduct linkage of perinatal records, obtained as part of routinely collected administrative health data, has increased the scope for population based studies of mother and infant health . When a unique identifier is available, deterministic linkage is used to identify records for the same person [2, 3], however, when no unique identifiers are available, increasingly large databases are being linked using probabilistic-based linkage methods. While probabilistic linkage usually involves combining records for individuals, perinatal research typically requires a combination of information from both the mother and her infant(s).
Advantages of linkage of administrative health records include; describing the total disease burden in a population, assessment of risk factors  and investigating rare outcomes , which are all relevant to addressing key issues in health and health policy [6, 7]. Other advantages include; improved coverage, ascertainment , completeness and validity , and large samples with standardized reporting to produce generalisable results . Longitudinal record linkage allows the study of recurrence risk [10–12], mortality, major morbidities  and co-morbidities and impacts on childhood development . Probabilistic linkage of administrative health records is undertaken routinely in Scotland , Wales [16, 17], Canada [18–20], the United States , and Australia [22, 23].
Mismatches are possible with probabilistic linkage. Two different individuals could be linked resulting in incorrectly reported outcomes or risk factors (false positive links), or two records from the same individual may not be linked (false negative links), resulting in missing information. The success of linkage, often described in terms of minimizing mismatches, can depend upon a number of factors, including the quality of the information used in the linkage process and how uniquely identifying reported information is. Recent studies have shown that, unlike deterministic methods, the flexibility of probabilistic record linkage allows for minimization of mismatches under variations in data quality . With the potential for mismatches it is important to consider the possibility of systematic biases that may arise between linked and unlinked populations of records. Researchers are becoming increasingly aware of the potential bias created by excluding unlinked records, and more recently this has prompted a publication of guidelines for reporting studies using linked data .
The aims of this study were to (i) describe probabilistic linkage for perinatal records in New South Wales (NSW) Australia, (ii) determine linkage proportions for these perinatal records, and (iii) assess records with complete linkage of mother and infant hospital-birth record and unlinked records for systematic differences.
This study used linked records of the NSW Perinatal Data Collection (PDC), and the NSW Admitted Patient Data Collection (APDC). The PDC (referred to as ‘birth records’) is a population-based statutory surveillance system that includes all live births and stillbirths of at least 20 weeks gestation or if gestational age is not known of at least 400 grams birth weight, and includes information on maternal characteristics, pregnancy, labor and delivery factors and infant outcomes. ‘Hospital records’ (for mothers and infants) that relate to the birth (birth admission records) were obtained from the APDC, which includes demographic and hospitalization related data for every inpatient admitted to any public or private hospital in NSW. Diagnoses and procedures for each hospital admission are coded according to the 10th revision of the International Classification of Disease, Australian Modification (ICD10-AM) and the Australian Classification of Health Interventions (ACHI).
The study population included all mothers who gave birth, and their infants, in NSW, Australia, from 1 January 2001-31 December 2008. NSW is the largest state in Australia with around 7,287,600 million people representing 32% of the Australian population . Homebirths (0.2%) as identified in the birth records were excluded as these would not have a linked hospital birth admission.
Probabilistic record linkage
Birth, and maternal and infant hospital records for 2001 to 2008 were probabilistically linked  by the Centre for Health Record Linkage (CHeReL)  using a best practice approach in privacy preserving record linkage  and the open source probabilistic record linkage software Choice Maker . Best practice involves ensuring separation of personal identifiers and health information. The CHeReL receives personal identifiers only (i.e. no health information) from the data custodians to generate a linkage key, and a linkage key is returned to the data custodians. Finally, researchers receive only health information and a linkage key from the data custodians.
The link between the mother and infant is provided by the common birth record. Probabilistic linkage is used to link records for the same individual, and in this context the outline that follows is in reference to linking the mothers’ birth and hospital records, and the infants’ birth and hospital records.
The CHeReL used a variety of fields that are common to both datasets for matching records in the linkage process. These include first name, last name, address, sex, date of birth, and country of birth. Additional information used, where available, includes hospital code and medical record number (MRN), admission date, discharge date, hospital discharged from, hospital discharged to, alias names, plurality and birth order for multiple pregnancies (twins, triplets and higher order multiple pregnancies).
The CHeReL undertakes quality assurance for any data linkage and assesses the linkage quality by manually reviewing personal identifiers for a sample of the records obtained for linkage. For this project, the CHeReL reported the linkage quality as < 1/1,000 missed links and < 2/1,000 false positive links.
For this study we defined six different groups of records based on the linkage configuration. The ‘linked mothers and infants’ group includes birth records with a linked hospital admission for both the mother and the infant(s), representing the ‘complete’ set of perinatal records. The ‘mothers only’ group includes birth records with a linked hospital birth admission record for the mother but without one for the infant, while the ‘infants only’ group includes birth records with a linked hospital birth admission record for the infant but without one for the mother. These two groups represent the ‘partial linkage’ groups. Finally, there are three different groups of unlinked records. The first is ‘unlinked birth records’ which includes birth records without a linked birth admission record for either the mother or the infant. The second is the ‘unlinked maternal hospital records’ which includes hospital birth admission records identified for a pregnancy that did not link to the birth records. The third is the ‘unlinked infant hospital records’ which includes hospital birth admission records identified for infants that did not link to the birth record.
Stillbirths and plurality
Stillbirths are reported on the mother’s hospital birth admission record and do not usually generate an infant hospital admission record for the infant. Therefore most will not have complete linked mother and infants records. Further, there may be misclassification of stillbirths and miscarriages and it has been indicated previously that linkage for stillbirths is problematic .
Linking is conducted separately for singleton and multiple pregnancies as multiple pregnancies generate infant records with identical information such as mothers name, date of birth, hospital of birth and even sex, so extra care is required [31, 32].
Identification of hospital birth admission records
ICD10-AM  diagnosis and ACHI procedure codes, and administrative information, were used to identify hospital birth admission records for mothers and infants independently of the birth record.
Variables used to identify birth admission hospital record for infants
Liveborn infants according to place of birth
Conditions originating in the perinatal period
An age of 0 or 1 days old
A birth weight is recorded in the hospital record
Source of referral
Born in hospital
The hospital record is for birth in hospital
The hospital record is the first for an infant
Variables used to identify delivery admission hospital records for mothers
Outcome of delivery
16571, 16573, 90479-90481, 90485
Other procedures associated with delivery
Analgesia and anaesthesia during labour and delivery procedure
Induction and augmentation of labour
Identification of variables for unlinked hospital records
Apgar1 < 4
O24, E10, E11, E13, E14
O10, O11, O13-O16
O82, Procedures: 16520
Duration of pregnancy < 25 weeks
O90.1, O90.2, O90.3
Z37.0-Z37.1, Z38.0-Z38.2, O80-O83
Z37.2-Z37.7, Z38.3-Z38.8, O84
Z37.1, Z37.3, Z37.4, Z37.6, Z37.7
Reported for all births are (i) rates of linkage for the birth-hospital record linkage groups by plurality and live born/stillborn as a percentage of all birth records and (ii) rates of identification for deliveries and births as ascertained from the hospital birth admissions as a percentage of the number of deliveries/births reported in the birth records. Note that delivery is used to refer to a mother giving birth, and birth to refer to a baby being born. Thereafter, we limited the analysis to live born singleton deliveries/births. Descriptive statistics of both maternal and infant characteristics by linkage groups were reported using either information from the birth or hospital birth record. For those variables reported on both, information from the birth record was used unless the hospital birth admission record was indicated as being more reliable according to validation studies of birth and hospital data [35–37]. Descriptive analysis was performed in SAS 9.2 . Ethical approval was obtained from the NSW Population and Health Services Research Ethics Committee.
Linkage rates for all births
Linkage rates for all births, NSW 2001-2008
N = 691 197
N = 21 907
N = 4 018
N = 442
N = 717 982
Birth-hospital record linked groups
Mother and infants
667 315 (96.5)
21 024 (96.0)
688 802 (95.9)
11 312 (1.6)
3 787 (94.3)
16 029 (2.2)
9 553 (1.4)
9 884 (1.4)
3 017 (0.4)
3 267 (0.5)
Infants hospital records
13 469 (-)
14 504 (-)
Maternal hospital records†
8 145 (-)
8 984 (-)
From the hospital records, 713,190 infant birth records were identified, almost the same number of live born birth admissions as reported in the birth records (N = 713,522), > 99.9%. From the hospital records, 704,009 delivery records (mothers) were identified, representing 99.6% of those reported in the birth records (N = 706,906).
For the largest group of birth records, live born singletons, 96.5% of records had complete linkage to both a mother and an infant birth admission record compared to 96.0% of live born multiple births. For stillbirths, the largest linkage group was the ‘mothers only’ at around 94% for both singletons and multiple births. Unlinked birth records were more common for stillbirths (3-4%) than live births (0.3-0.4%).
Given the incomplete linkage of stillbirths (recorded as a maternal outcome) and the difficulty of presenting results for multiple births (requiring duplication of maternal information), comparisons of maternal and infant linkage groups are presented for singleton live births. Coding of stillbirth/live birth and plurality could not be identified for 1,505 of the 704,009 deliveries identified in the hospital records (0.2%) and pregnancies with duration <26 weeks were over-represented in this group (3.2%). Similarly, 415 infant birth admissions (<0.1%) could not be classified and preterm birth was over-represented in this group (6.4%).
Singleton live births
Among singleton live births the rate of complete linkage dropped from around 96% at 25 weeks gestation to only 72% at 20 weeks gestation (Figure 2). For birth weight, complete linkage was around 96% for weights above 1000 grams, but below this dropped to around 80% by 400 grams (Figure 3).
Maternal demographic and birth-related characteristics by linkage group for liveborn singleton pregnancies, NSW 2001-2008
Birth-hospital record linked groups
Mothers and infants
N = 667315
N = 11312
N = 9553
N = 3017
N = 8145
122 417 (18.3)
2 837 (25.1)
1 870 (19.6)
1 691 (20.7)
407 468 (61.1)
6 418 (56.7)
5 779 (60.5)
1 916 (63.5)
4 753 (58.3)
137 304 (20.6)
2 055 (18.2)
1 851 (19.4)
1 716 (21.0)
Marital status (Married)
546 152 (81.8)
8 167 (72.2)
6 343 (77.7)
277 713 (41.6)
5 077 (44.9)
3 994 (41.8)
1 378 (45.7)
224 843 (33.7)
3 561 (31.5)
2 879 (30.1)
102 521 (15.4)
1 533 (13.6)
1 410 (14.7)
61 127 (9.2)
1 123 (9.9)
1 248 (13.1)
Australian born mother
478 317 (71.7)
7 458 (65.9)
6 847 (71.7)
2 233 (74.0)
5 740 (70.3)
140 069 (21.0)
2 711 (24.0)
1 835 (19.2)
2 044 (25.1)
367 930 (55.1)
5 596 (49.5)
5 143 (53.8)
1 463 (48.5)
4 084 (50.1)
158 193 (23.7)
2 918 (25.8)
2 483 (26.0)
2 030 (24.9)
Smoked during pregnancy
95 866 (14.4)
2 225 (19.7)
1 904 (19.9)
Antenatal care ≥ 15 weeks
164 940 (24.7)
3 416 (30.2)
2 598 (27.2)
34 760 (5.2)
50 582 (7.6)
2 555 (0.4)
3 595 (0.5)
Induction of labour
166 647 (25.0)
2 633 (23.3)
2 146 (22.5)
1 465 (18.0)
Delivery by caesarean
179 528 (26.9)
3 189 (28.2)
2 028 (21.2)
2 136 (26.2)
Duration of pregnancy < 26 weeks
1 166 (0.2)
Birth in private hospital
168 036 (25.2)
3 486 (30.8)
1 939 (20.3)
1 731 (57.4)
2 047 (25.1)
The ‘mothers only’ group (no associated infant hospital record), had higher levels of social disadvantage (quintile 5), women aged less than 25, non-Australian born mothers, births by unmarried women, smoking during pregnancy, commencement of antenatal care after 14 weeks gestation, caesarean section, placental abruption, and duration of pregnancy less than 26 weeks.
Infant demographic and birth-related characteristics by linkage group for liveborn singleton births, NSW 2001-2008
Birth-hospital record linked groups
Mothers and infants
N = 667315
N = 11312
N = 9553
N = 3017
N = 13469
343 655 (51.5)
5 782 (51.1)
4 851 (50.8)
1 484 (49.2)
6 867 (51.0)
323 261 (48.4)
5 517 (48.8)
4 698 (49.2)
1 496 (49.6)
6 599 (49.0)
Birthweight < 1000 grams
1 948 (0.3)
35 776 (5.4)
Agpar1 < 4
12 642 (1.9)
Admission to SCN/NICU
100 498 (15.1)
1 908 (16.9)
1 534 (16.1)
Death in hospital
1 714 (0.3)
To our knowledge, this is the first study that has assessed the linkage of mother and infant birth and hospital records rather than mothers and infants separately. As maternal and pregnancy factors are important predictors of infant outcomes, assessment of the complete linkage is important. In this study the level of complete linkage (95.9%) was high for all births and highest for live singleton births (96.5%). Partially linked mother records (no infant hospital record) had slightly higher rates of adverse events and common risk factors while the partially linked infant records (no mother hospital record) were very similar to those with complete linkage.
This study has shown that stratifying linkage by plurality to overcome the recognized difficulty of linking multiple births [31, 32] has generated comparable linkage rates for singleton and multiple live births. Stillbirths represent a very different group in terms of linkage. As infant hospital admission records are not generated, stillbirths should not be present in the complete linkage group. While this explains the majority of stillbirth records being in the ‘mothers only’ group, the proportion of unlinked birth records for stillbirths was also much greater than that for live births (4% vs. 0.4%), reflecting that stillbirths remain a problem for linkage. The lower rate of linkage for stillbirths and the issue of lower rates of complete linkage for live born singletons ≤24 weeks gestation are probably related. Infants born close to the border of viability (misclassification of stillbirths and live births, and births and miscarriages) have been previously identified as a problematic domain for perinatal record linkage . For these reasons, unless infants ≤24 weeks are of particular interest, studies using probabilistically linked records may benefit from restriction to the population of at least 24 weeks gestation. For stillbirth studies, specialist linkages may be needed to improve linkage rates to the levels needed for robust research.
Among singleton live births, the proportions of birth records with partial (1.4-1.6%) or no linkage (0.4%) to hospital records was small. However, there was some evidence of systematic differences for the partially linked records that had no infant hospitalization record (‘mothers only’). This group has slightly higher rates of adverse infant outcomes and associated risk factors, consistent with observations in other studies [10, 39–41]. Reduced matching of infant records may be related to the association between missing information, social disadvantage and adverse outcomes, or that severely ill infants with prolonged hospitalization may not necessarily be coded as a birth admission. Restriction to later gestational ages would further reduce the already small size of this group of records. It is important to quantify the number and characteristics of unlinked or partially linked records to assess the potential for bias in estimation of the burden of disease and association between risk factors and outcomes. In our study inclusion of additional records would not change, for example, the estimated preterm birth rate nor is it likely to change risk estimates. However, in other settings with higher proportions of unlinked or partially linked records, exclusion of such records could introduce bias.
Our finding that the unlinked birth records represent a relatively low risk group of mothers and babies is likely to be a local phenomenon. The over-representation of births in private hospitals in the unlinked birth records is likely a result of missing name information. It is at the discretion of private hospitals as to whether name information is collected, and so generally have a large amount of missing name information for both mothers and infants, thus affecting linkage rates for both mothers and infants. Changes to the data provided from private hospitals for linkage could potentially reduce the size of the unlinked birth records.
The results highlight the importance of comparing the characteristics of probabilistic record linkage for perinatal research for mothers and infants, given the potential bias introduced into analysis by incomplete record linkage. It is recommended that for the chosen study population, linked and unlinked records should be requested for analysis and a comparison of linked and unlinked records be undertaken as part of any research using probabilistically linked data. This is of even greater importance when newly-established datasets and linkages are used, which is in contrast to the well-established datasets and linkage protocols used by the CHeReL which generated the linked data for this study. Further, in order to properly discuss the potential impacts, it is necessary for researchers to have a reasonable understanding of how the probabilistic linkage process works and the matching processes involved.
The hospital birth admission records for mothers and infants that did not link to a birth record were small in number and of comparable size to the number of unlinked birth records, and inevitably include some missed links. However, particularly for mothers, there is difficulty in establishing birth admission records as more than one hospitalization may be identified as a birth admission. Although used in the past [42, 43], we found that selecting maternal hospital records on a single outcome of delivery code (ICD10: Z37, ICD9: V27) to be inadequate and a much more comprehensive list was required (Table 2). This agrees with a US study that showed that identifying maternal hospital records using outcome of delivery missed complicated pregnancies . Furthermore, due to the nature of ICD coding there was difficulty in classifying the plurality and whether the birth(s) were live born or stillborn. In general a good understanding of coding practices can help to improve identification of these records.
Probabilistic methods can achieve high, representative levels of complete linkage for mothers and infants. Although some systematic differences occur for the mothers records that do not link to a corresponding infant record, and to a lesser degree for unlinked birth records with respect to private hospitals, these groups are very small and unlikely to bias estimates of effect or conclusions in a substantive way, particularly if the study population is live born singletons.
We thank the NSW Ministry of Health for access to the population health data and the Centre for Health Record Linkage (CHeReL) for linking the data sets.
This work was supported by a NSW Ministry of Health and Australian National Health and Medical Research Council (NHMRC) Partnership Building Grant (#571451) and the Stillbirth Foundation Australia. Christine Roberts is supported by a NHMRC Senior Research Fellowship (#457078).
Choice Maker Technologies Inc. developed the Choice Maker software and contributed it to the open source community.
- Donati S, Senatore S, Ronconi A: Maternal mortality in Italy: a record-linkage study. BJOG. 2011, 118 (7): 872-879. 10.1111/j.1471-0528.2011.02916.x.View ArticlePubMedGoogle Scholar
- Artama M, Gissler M, Malm H, Ritvanen A: Nationwide register-based surveillance system on drugs and pregnancy in Finland 1996-2006. Pharmacoepidemiol Drug Saf. 2011, 20 (7): 729-738. 10.1002/pds.2159.View ArticlePubMedGoogle Scholar
- Bonamy AK, Parikh NI, Cnattingius S, Ludvigsson JF, Ingelsson E: Birth characteristics and subsequent risks of maternal cardiovascular disease: effects of gestational age and fetal growth. Circulation. 2011, 124 (25): 2839-2846. 10.1161/CIRCULATIONAHA.111.034884.View ArticlePubMedGoogle Scholar
- Stanley FJ, Croft ML, Gibbins J, Read AW: A population database for maternal and child health research in Western Australia using record linkage. Paediatr Perinat Epidemiol. 1994, 8 (4): 433-447. 10.1111/j.1365-3016.1994.tb00482.x.View ArticlePubMedGoogle Scholar
- Bright RA, Avorn J, Everitt DE: Medicaid data as a resource for epidemiologic studies: strengths and limitations. J Clin Epidemiol. 1989, 42 (10): 937-945. 10.1016/0895-4356(89)90158-3.View ArticlePubMedGoogle Scholar
- Roos LL, Nicol JP: A research registry: uses, development, and accuracy. J Clin Epidemiol. 1999, 52 (1): 39-47. 10.1016/S0895-4356(98)00126-7.View ArticlePubMedGoogle Scholar
- Schwartz RM, Gagnon DE, Muri JH, Zhao QR, Kellogg R: Administrative data for quality improvement. Pediatrics. 1999, 103 (1 Suppl E): 291-301.PubMedGoogle Scholar
- Roberts CL, Algert CS, Ford JB: Methods for dealing with discrepant records in linked population health datasets: a cross-sectional study. BMC Health Serv Res. 2007, 7: 12-10.1186/1472-6963-7-12.View ArticlePubMedPubMed CentralGoogle Scholar
- Ananth CV, Getahun D, Peltier MR, Salihu HM, Vintzileos AM: Recurrence of spontaneous versus medically indicated preterm birth. Am J Obstet Gynecol. 2006, 195 (3): 643-650. 10.1016/j.ajog.2006.05.022.View ArticlePubMedGoogle Scholar
- Adams MM, Wilson HG, Casto DL, Berg CJ, McDermott JM, Gaudino JA, McCarthy BJ: Constructing reproductive histories by linking vital records. Am J Epidemiol. 1997, 145 (4): 339-348. 10.1093/oxfordjournals.aje.a009111.View ArticlePubMedGoogle Scholar
- Savitz DA, Stein CR, Ye F, Kellerman L, Silverman M: The epidemiology of hospitalized postpartum depression in New York State, 1995-2004. Ann Epidemiol. 2011, 21 (6): 399-406. 10.1016/j.annepidem.2011.03.003.View ArticlePubMedPubMed CentralGoogle Scholar
- von Katterfeld B, Li J, McNamara B, Langridge AT: Obstetric profiles of foreign-born women in Western Australia using data linkage, 1998-2006. Aust N Z J Obstet Gynaecol. 2011, 51 (3): 225-232. 10.1111/j.1479-828X.2010.01282.x.View ArticlePubMedGoogle Scholar
- Parrish KM, Holt VL, Connell FA, Williams B, LoGerfo JP: Variations in the accuracy of obstetric procedures and diagnoses on birth records in Washington State, 1989. Am J Epidemiol. 1993, 138 (2): 119-127.PubMedGoogle Scholar
- Liang W, Chikritzhs T: Obstetric conditions and risk of first ever mental health contact during infancy, childhood and adolescence. Midwifery. 2012, 28 (4): 379-384.View ArticlePubMedGoogle Scholar
- Kendrick S, Clarke J: The Scottish Record Linkage System. Health Bull (Edinb). 1993, 51 (2): 72-79.Google Scholar
- Centre for Health Information, Research and Evaluation: http://www.healthinformaticsresearchlabs.swansea.ac.uk/sailproject.html,
- Lyons RA, Jones KH, John G, Brooks CJ, Verplancke JP, Ford DV, Brown G, Leake K: The SAIL databank: linking multiple health and social care datasets. BMC Med Inform Decis Mak. 2009, 9: 3-10.1186/1472-6947-9-3.View ArticlePubMedPubMed CentralGoogle Scholar
- Record linkage at Statistics Canada: http://www.statcan.gc.ca/record-enregistrement/index-eng.htm,
- Roos NP, Black CD, Frohlich N, Decoster C, Cohen MM, Tataryn DJ, Mustard CA, Toll F, Carriere KC, Burchill CA, et al: A population-based health information system. Med Care. 1995, 33 (12 Suppl): DS13-20.PubMedGoogle Scholar
- Chamberlayne R, Green B, Barer ML, Hertzman C, Lawrence WJ, Sheps SB: Creating a population-based linked health database: a new resource for health services research. Can J Public Health. 1998, 89 (4): 270-273.PubMedGoogle Scholar
- Buehler JW, Prager K, Hogue CJ: The role of linked birth and infant death certificates in maternal and child health epidemiology in the United States. Am J Prev Med. 2000, 19 (1 Suppl): 3-11.View ArticlePubMedGoogle Scholar
- Holman CD, Bass AJ, Rouse IL, Hobbs MS: Population-based linkage of health records in Western Australia: development of a health services research linked database. Aust N Z J Public Health. 1999, 23 (5): 453-459. 10.1111/j.1467-842X.1999.tb01297.x.View ArticlePubMedGoogle Scholar
- Centre for Health Record Linkage: http://www.cherel.org.au,
- Tromp M, Ravelli AC, Bonsel GJ, Hasman A, Reitsma JB: Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage. J Clin Epidemiol. 2011, 64 (5): 565-572. 10.1016/j.jclinepi.2010.05.008.View ArticlePubMedGoogle Scholar
- Bohensky MA, Jolley D, Sundararajan V, Evans S, Ibrahim J, Brand C: Development and validation of reporting guidelines for studies involving data linkage. Aust N Z J Public Health. 2011, 35 (5): 486-489. 10.1111/j.1753-6405.2011.00741.x.View ArticlePubMedGoogle Scholar
- Australian Bureau of Statistics. Australian Demographic Statistics: 2011, Catalogue 3101.0 http://www.abs.gov.au/AUSSTATS/abs@.nsf/Lookup/3101.0Main+Features1Mar%202011?OpenDocument,
- Jaro MA: Probabilistic linkage of large public health data files. Stat Med. 1995, 14 (5–7): 491-498.View ArticlePubMedGoogle Scholar
- Kelman CW, Bass AJ, Holman CD: Research use of linked health data–a best practice protocol. Aust N Z J Public Health. 2002, 26 (3): 251-255. 10.1111/j.1467-842X.2002.tb00682.x.View ArticlePubMedGoogle Scholar
- Open Source ChoiceMaker Technology: http://oscmt.sourceforge.net,
- Ford JB, Roberts CL, Taylor LK: Characteristics of unmatched maternal and baby records in linked birth records and hospital discharge data. Paediatr Perinat Epidemiol. 2006, 20 (4): 329-337. 10.1111/j.1365-3016.2006.00715.x.View ArticlePubMedGoogle Scholar
- Meray N, Reitsma JB, Ravelli AC, Bonsel GJ: Probabilistic record linkage is a valid and transparent tool to combine databases without a patient identification number. J Clin Epidemiol. 2007, 60 (9): 883-891.View ArticlePubMedGoogle Scholar
- Tromp M, Reitsma JB, Ravelli AC, Meray N, Bonsel GJ: Record linkage: making the most out of errors in linking variables. AMIA Annu Symp Proc. 2006, 2006: 779-783.PubMed CentralGoogle Scholar
- The International Statistical Classification of Diseases and Related Health Problems, Australian Modification – Tabular List of Diseases and Alphabetic Index of Diseases: http://nccc.uow.edu.au/icd10am/icd10am/index.html,
- Australian Bureau of Statistics. Socio-economic Indexes for Areas (SEIFA), Data only: 2006, Catalogue 2033.0.55.001 http://www.abs.gov.au/ausstats/abs@.nsf/mf/2033.0.55.001/,
- Taylor LK, Travis S, Pym M, Olive E, Henderson-Smart DJ: How useful are hospital morbidity data for monitoring conditions occurring in the perinatal period?. Aust N Z J Obstet Gynaecol. 2005, 45 (1): 36-41. 10.1111/j.1479-828X.2005.00339.x.View ArticlePubMedGoogle Scholar
- Validation Study: NSW Midwives Data Collection 1998. NSW Mothers and Babies 1998. In: State Publication No (EPI) 000029. 2000, 9 (S-2): 97-99. Sydney: NSW Public Health Bulletin. NSW Department of HealthGoogle Scholar
- Pym M, Taylor L: Validation study of the NSW Midwives Data Collection 1990. State Publication No (EHSEB) 93-167 Sydney. NSW Public Health Bulletin. 1993, 4 (S-8): 1-6.Google Scholar
- SAS Institute Inc: SAS 9.2 [computer program]. 2008, Cary: SAS Institute IncGoogle Scholar
- Adams MM, Kirby RS: Measuring the accuracy and completeness of linking certificates for deliveries to the same woman. Paediatr Perinat Epidemiol. 2007, 21 (Suppl 1): 58-62.View ArticlePubMedGoogle Scholar
- Adams MM, Berg CJ, McDermott JM, Gaudino JA, Casto DL, Wilson HG, McCarthy BJ: Evaluation of reproductive histories constructed by linking vital records. Paediatr Perinat Epidemiol. 1997, 11 (1): 78-92. 10.1111/j.1365-3016.1997.tb00799.x.View ArticlePubMedGoogle Scholar
- Gyllstrom ME, Jensen JL, Vaughan JN, Castellano SE, Oswald JW: Linking birth certificates with Medicaid data to enhance population health assessment: methodological issues addressed. J Public Health Manag Pract. 2002, 8 (4): 38-44.View ArticlePubMedGoogle Scholar
- Ananth CV, Oyelese Y, Yeo L, Pradhan A, Vintzileos AM: Placental abruption in the United States, 1979 through 2001: temporal trends and potential determinants. Am J Obstet Gynecol. 2005, 192 (1): 191-198. 10.1016/j.ajog.2004.05.087.View ArticlePubMedGoogle Scholar
- Danel I, Berg C, Johnson CH, Atrash H: Magnitude of maternal morbidity during labor and delivery: United States, 1993-1997. Am J Public Health. 2003, 93 (4): 631-634. 10.2105/AJPH.93.4.631.View ArticlePubMedPubMed CentralGoogle Scholar
- Kuklina EV, Whiteman MK, Hillis SD, Jamieson DJ, Meikle SF, Posner SF, Marchbanks PA: An enhanced method for identifying obstetric deliveries: implications for estimating maternal morbidity. Matern Child Health J. 2008, 12 (4): 469-477. 10.1007/s10995-007-0256-6.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/12/149/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.