Data preparation techniques for a perinatal psychiatric study based on linked data
© Xu et al.; licensee BioMed Central Ltd. 2012
Received: 7 March 2012
Accepted: 28 May 2012
Published: 8 June 2012
In recent years there has been an increase in the use of population-based linked data. However, there is little literature that describes the method of linked data preparation. This paper describes the method for merging data, calculating the statistical variable (SV), recoding psychiatric diagnoses and summarizing hospital admissions for a perinatal psychiatric study.
The data preparation techniques described in this paper are based on linked birth data from the New South Wales (NSW) Midwives Data Collection (MDC), the Register of Congenital Conditions (RCC), the Admitted Patient Data Collection (APDC) and the Pharmaceutical Drugs of Addiction System (PHDAS).
The master dataset is the meaningfully linked data which include all or major study data collections. The master dataset can be used to improve the data quality, calculate the SV and can be tailored for different analyses. To identify hospital admissions in the periods before pregnancy, during pregnancy and after birth, a statistical variable of time interval (SVTI) needs to be calculated. The methods and SPSS syntax for building a master dataset, calculating the SVTI, recoding the principal diagnoses of mental illness and summarizing hospital admissions are described.
Linked data preparation, including building the master dataset and calculating the SV, can improve data quality and enhance data function.
KeywordsData preparation Method Psychiatric study Australia
Existing population and hospital data are valuable sources of surveillance and research. Data linkage provides a tangible route to make the data sources more powerful for research [1, 2]. In recent years, there has been a marked increase in the use of registry data for births, deaths and diseases such as psychiatric disorders [3, 4]. Hospital admission data has become one of the key data collections for medical research especially for population-based cohort studies. In New South Wales (NSW), the Midwives Data Collection (MDC), the Registry of Births, Deaths and Marriages (RBDM) registration data, the Admitted Patient Data Collection (APDC) and the Cancer Registry (CR) are commonly used by medical researchers for their studies [6–9]. However, there is little literature that describes the method of linked data preparation. Research articles using linked data did not describe data preparation details in method section because of the word limit. Method papers about linked data mainly focused on strategies rather than on techniques [2, 5, 10]. However, the data preparation techniques including syntax are often inquired by researchers.
This paper aims to answer four frequently asked questions in data preparation using linked birth and hospital data. The first question is how are data collections merged? The second is how are hospital admissions distributed into the periods before pregnancy, during pregnancy and after birth? The third is how are psychiatric diagnoses recoded and grouped? The fourth is how are hospital admissions summarised and ordered in sequence?
Data sources used in this paper as examples are: The NSW Midwives Data Collection (MDC) which includes mothers’ and babies’ records, the NSW Register of Congenital Conditions (RCC), the NSW Admitted Patient Data Collection (APDC) and the NSW Pharmaceutical Drugs of Addiction System (PHDAS). The MDC is a population-based data collection of all births in NSW. It covers all births of at least 20 weeks gestation or at least 400 grams birthweight in public and private hospitals and homebirths. It includes information on maternal characteristics, pregnancy, labour, delivery and neonatal outcomes. The NSW Department of Health has managed the MDC since 1987. Personal identifiers were included in the data collection after 1992 and there was a revision to the MDC form in 1993. It is recommended that record linkage studies be carried out on data collected since 1994. The RCC is a population-based surveillance system and monitors birth defects detected during pregnancy, at birth or up to one year after birth. The data is available for a rolling five-year period. It covers structural birth defects but not functional problems. The RCC was established in 1990. As the RCC was voluntary up to 1997, the birth defects records were incomplete especially for terminations of pregnancy. Since 1998, the reporting of terminations of pregnancy has improved and doctors, hospitals and laboratories have been required to notify the register of all birth defects. The APDC is a routinely collected census of all hospital separations. It includes all patient hospitalisations from NSW public, private and repatriation hospitals, private day procedures centres and public nursing homes. The data include patient demographics, diagnoses and clinical procedures. The APDC is collected on a financial year basis. A limitation of APDC data is that there were no names in the APDC prior to 1 July 2000. Since 1 July 2000, the APDC has included names for patients admitted to public hospitals but does not have names for patients admitted to private hospitals. Other information such as sex, date of birth, medical record number, hospital code and address can be used to assist in linkage. The PHDAS is an administrative database used by the Pharmaceutical Services Unit of the NSW Ministry of Health to facilitate the authorisation of medical practitioners to prescribe drugs of addiction. It consists of the Methadone Subsystem implemented in 1985, the Non-Methadone Subsystem implemented in 1985 and the Stimulant Notification Subsystem implemented in 1999. This study used the NSW Opioid Treatment Program (OTP) in the Methadone Subsystem and included treatment information such as program type, start and end date, reason for ending program and other drugs of concern.
Data linkage was performed by The Centre for Health Record Linkage (CHeReL) of the NSW Department of Health using probabilistic record linkage methods and choicemaker software (http://www.cherel.org.au). Each record was assigned a record identification number, the Project Person Number, which allows the records for the same individual to be identified, extracted and linked. The CHeReL Checked a random sample of 1,000 persons, the false positive rate of the linkage was 0.3% and false negative <0.5%.
The study was approved by the NSW Population & Health Services Research Ethics Committee and the Human Research Ethics Committee of the University of New South Wales.
The aims of this study are: to investigate the pattern and rate of hospital admission for maternal psychiatric disorders and substance use before and after birth; explore the factors associated with these problems; and compare the perinatal outcomes of babies whose mothers were admitted or not admitted to hospital for mental illness before birth.
This study was based on linked data which include the MDC, RCC, APDC and PHDAS. The study population included all mothers aged from 18 to 44 years who gave birth between 1 January 2003 and 31 December 2004 in NSW, and their babies. Birth records from 2003 to 2004 in the MDC were linked with the RCC in the same period and with APDC and PHDAS records between 1 January 2001 and 31 December 2006. This enabled hospital admissions for the mothers to be traced back to at least two years before birth and followed up for at least two years after birth. Each mother was followed up from the year before pregnancy to the end of the 24th month after birth.
The key variables included in the study included Project Person Number for mother and baby; mother’s age in days at birth; mother’s age in days at hospital admission and discharge; diagnoses for psychiatric disorders, substance use and birth defects; maternal age; mother’s country of birth; maternal diabetes mellitus and hypertension; pregnancy complications (pre-eclampsia, gestational diabetes); smoking status during pregnancy; remoteness of living area and a socioeconomic indicator (i.e. the Index of Relative Socio-economic Disadvantage Quintile); delivery method; infant gender; birthweight; gestational age; admission to a neonatal intensive care unit (NICU) or special care nursery (SCN); and fetal/neonatal death type (stillbirth, neonatal death, post-neonatal death). The reason for using mother’s age in days at birth, at hospital admission and discharge rather than date of birth, hospital admission and discharge is to maintain privacy and decrease the risk of inadvertent identification of individuals from the data.
The main measurements for mothers were hospital admission rate of maternal psychiatric disorders and substance use before pregnancy, during pregnancy and after birth; and factors associated with maternal psychiatric disorders and substance use. The main pregnancy outcomes included birthweight, preterm birth, birth defects and admission to an NICU or SCN. The outcomes were compared between babies whose mothers were admitted and not admitted to hospital with the diagnoses of mental illness.
The diagnoses in hospital data were classified using ICD-10-AM (International Statistical Classification of Diseases and Related Health Problems Tenth Revision, Australian Modification). The diagnoses included principal, stay and all diagnosis. The principal diagnosis, coded ‘icd10d1’, referred to a medical condition that was chiefly responsible for the hospital admission . An additional diagnosis, coded from ‘icd10d2’ to ‘icd10d55’, was a condition or a complaint either coexisting with the principal diagnosis or arising during the hospitalization. Stay diagnosis was an additional diagnosis coded ‘icd10d2’ and referred to the diagnosis that most influenced the length of stay in hospital. The stay diagnosis was frequently the same as the principal diagnosis; however, it may be different .
The steps and SPSS syntax to calculate and recode the statistical variable of Admmonth
Variable and label
Admdays: number of days between hospital admission and birth.
COMPUTE Admdays = AGEAdmMum – AgeBirthMum.VARIABLE LABELS Admdays 'Hospital admission days before and after birth'.EXECUTE.
If the value of Admdays is less than 0, it means the hospital admission is before birth.A positive value means the admission is after birth.
Admmonth: admission month before pregnancyGage : gestational age in days
IF (Admdays < 0) Admmonth = Admdays + Gage.EXECUTE.RECODE Admmonth (lowest thru −361 = 0)(−360 thru −331 = 1)(−330 thru −301 = 2)……(−60 thru −31 = 11) (−30 thru −1 = 12).EXECUTE.
Value 0 refers to the admission before the 12th month before pregnancy.1: the admission in the 12th month before pregnancy……11: the admission in the second month before pregnancy.12: the admission in the first month before pregnancy.
Admmonth: admission month during pregnancy
RECODE Admmonth (0 thru 29 = 13) (30 thru 59 = 14)…… (300 thru 329 = 23).EXECUTE.
13: the admission in the first month of pregnancy.14: the admission in the second month of pregnancy……23: the admission in the 11th month of pregnancy.
Admmonth: admission month after birth
DO IF (Admdays > = 0).RECODE Admdays (0 thru 29 = 24) (30 thru 59 = 25)…… (330 thru 359 = 35) (360 thru Highest = 36) INTO Admmonth.END IF.EXECUTE.VARIABLE LABELS Admmonth 'Hospital admission month before and after birth'.VALUE LABELS Admmonth 0 'before' 1 ' the 12th month before pregnancy' ……11 ' the 2nd month before pregnancy' 12 ' the 1st month before pregnancy’ 13 ' the first month of pregnancy ' 14' the second month of pregnancy '……23' the 11th month of pregnancy' 24 ' the first month after birth ' 25' the second month after birth '……35' the 12th month after birth '36' after'.
24: refers to the admission in the first month after birth.25: refers to the admission in the second month after birth……35: refers to the admission in the 12th month after birth.36: refers to the admission after the 12th month after birth.
Prepare the variable which distributes hospital admissions over the periods before and after birth
In order to describe the trend of hospitalisations before and after birth (see Figure1), an SVTI needs to be created. The SVTI can identify hospital admissions over different time intervals such as a week, month or year, before and after birth. It also shows the time of admission before and during pregnancy. The steps and SPSS syntax for calculating and recoding an SVTI variable, Admmonth, which identifies hospital admissions during the months before and after birth, are shown in Table1. The Admmonth is calculated by three variables: maternal age in days at birth in the MDC (AgeBirthMum); mother’s age in days at admission in the APDC (AGEAdmMum); and gestational age in days (Gage). Gestational age was provided from the MDC as the number of completed weeks since the start of the last menstrual period. As a result, it needs to be converted into days to keep the unit consistent with the previous two variables. Because the Gage is reported in completed weeks, the accuracy of hospital admission time before birth is also in weeks rather than in days. For interval recoding, each month is defined as 30 days and each year as 360 days.
The hospital admissions in other time intervals such as year (Admyear) can be calculated the same way. The only difference is the length of the interval recoded. For example, each value of Admyear covers 360 days while Admmonth covers 30 days.
Classify diagnoses of mental illness
Classification of principal diagnoses for mental illness
Prin1: Schizophrenia and schizophrenia-like disorder
COMPUTE Prin1 = 0.if ( icd10d1 GE 'F20' AND icd10d1 LE 'F22.9' ) Prin1 = 1.if ( icd10d1 GE 'F24' AND icd10d1 LE 'F29' ) Prin1 = 1.EXECUTE.VARIABLE LABELS Prin1' principal diagnoses of Schizophrenia and schizophrenia-like disorder F20-22, F24-29'.
Persistent delusional disorders
Induced delusional disorder
Other nonorganic psychotic disorders
Unspecified nonorganic psychosis
Prin2: Unipolar depressions
Depressive episode except F32.3 see below (psychotic).
COMPUTE Prin2 = 0.if ( icd10d1 GE 'F32' AND icd10d1 LE 'F32.21' ) Prin2 = 1.if ( icd10d1 GE 'F32.8' AND icd10d1 LE 'F32.91' ) Prin2 = 1.if ( icd10d1 EQ 'F53.0' ) Prin2 = 1.if ( icd10d1 EQ 'F34.1' ) Prin2 = 1.if ( icd10d1 EQ 'F38.1' ) Prin2 = 1.if ( icd10d1 EQ 'F38.8' ) Prin2 = 1.EXECUTE.VARIABLE LABELS Prin2 ' principal diagnoses of unipolar depressions F32 (exclude 32.3), F53.0, F34.1, F38 (exclude 38.0)'.
Mild disorders associated with the puerperium, not elsewhere classified
Persistent mood [affective] disorders: Dysthymia
Other mood [affective] disorders except F38.0 see below (bipolar affective episode)
Prin3: Acute Psychotic episodes : reactive, brief, affective
Acute and transient psychotic disorders
COMPUTE Prin3 = 0.if ( icd10d1 GE 'F23' AND icd10d1 LE 'F23.91' ) Prin3 = 1.if ( icd10d1 GE 'F32.3' AND icd10d1 LE 'F32.31' ) Prin3 = 1.if ( icd10d1 EQ 'F30.2' ) Prin3 = 1.if ( icd10d1 EQ 'F33.3' ) Prin3 = 1.if ( icd10d1 EQ 'F39' ) Prin3 = 1.if ( icd10d1 EQ 'F53.1' ) Prin3 = 1.EXECUTE.VARIABLE LABELS Prin3' principal diagnoses of acute psychotic episodes: reactive, brief, affective F23, 30.2,32.3, 33.3, 39, 53.1'.
Mania with psychotic features
Severe depressive episode with psychotic symptoms
Recurrent depressive disorder, current severe with psychotic symptoms
Unspecified mood [affective] disorder
Severe MH disorders associated with the puerperium, not elsewhere classified
Prin4: Bipolar affective disorders
COMPUTE Prin4 = 0.if ( icd10d1 GE 'F30.8' AND icd10d1 LE 'F30.9' )Prin4 = 1.if ( icd10d1 GE 'F31' AND icd10d1 LE 'F31.6' ) Prin4 = 1.if ( icd10d1 GE 'F31.8' AND icd10d1 LE 'F31.9' ) Prin4 = 1.if ( icd10d1 EQ 'F30.0' ) Prin4 = 1.if ( icd10d1 EQ 'F34.0' ) Prin4 = 1.if ( icd10d1 EQ 'F38.0' ) Prin4 = 1.EXECUTE.VARIABLE LABELS Prin4' principal diagnoses of bipolar affective disorders F30 (exclude 30.2), 31(exclude 31.7),34.0,38.0 '.
Other manic episodes
Manic episode, unspecified
Bipolar affective disorder F31.0-31.9 excluding F31.7 (remission).
Persistent mood [affective] disorders: Cyclothymia
Other single mood [affective] disorders
Prin5: Adjustment disorders
Reaction to severe stress and adjustment disorders
COMPUTE Prin5 = 0.if ( icd10d1 GE 'F43' AND icd10d1 LE 'F43.9' ) Prin5 = 1.EXECUTE.VARIABLE LABELS Prin5' principal diagnoses of adjustment disorders F43 '.
Prin6: Anxiety disorders
Phobic anxiety disorders
COMPUTE Prin6 = 0.if ( icd10d1 GE 'F40' AND icd10d1 LE 'F42.9' ) Prin6 = 1.EXECUTE.VARIABLE LABELS Prin6' principal diagnosis of anxiety disorders F40-42'.
Other anxiety disorders
Prin7: Personality disorders
COMPUTE Prin7 = 0.if ( icd10d1 GE 'F60' AND icd10d1 LE 'F69' ) Prin7 = 1.EXECUTE.VARIABLE LABELS Prin7' principal diagnoses of personality disorders F60-69'.
Prin8: Mental illness due to substance use
Mental illness due to substance use
COMPUTE Prin8 = 0.if ( icd10d1 GE 'F10' AND icd10d1 LE 'F19.8' ) Prin8 = 1.EXECUTE.VARIABLE LABELS Prin8 ' principal diagnoses of mental illness due to substance use F10-19. '.
Prin9: Remaining diagnoses
F00-99 Exclude the diagnoses above
COMPUTE Prin9 = 0.if ( icd10d1 GE 'F00' AND icd10d1 LE 'F09' ) Prin9 = 1.if ( icd10d1 EQ 'F30.1' ) Prin9 = 1.if ( icd10d1 EQ 'F31.7' ) Prin9 = 1.if ( icd10d1 GE 'F33.1' AND icd10d1 LE 'F33.2' ) Prin9 = 1.if ( icd10d1 GE 'F33.8' AND icd10d1 LE 'F33.9' ) Prin9 = 1.if ( icd10d1 EQ 'F34.9' ) Prin9 = 1.if ( icd10d1 GE 'F44.5' AND icd10d1 LE 'F52.9' ) Prin9 = 1.if ( icd10d1 GE 'F53.8' AND icd10d1 LE 'F59' ) Prin9 = 1.if ( icd10d1 GE 'F70.1' AND icd10d1 LE 'F99' ) Prin9 = 1.EXECUTE.VARIABLE LABELS Prin9 ' remaining principal diagnoses F00-99 exclude diagnosis above'.
Prin10: Overall diagnoses of mental illness
Overall diagnoses of mental illness
COMPUTE Prin10 = 0.if ( icd10d1 GE 'F00' AND icd10d1 LE 'F99' ) Prin10 = 1EXECUTE.VARIABLE LABELS Prin10 ' principal diagnoses of mental illness F00-99.’.
VALUE LABELS Prin1 Prin2 Prin3 Prin4 Prin5 Prin6 Prin7 Prin8 Prin9 Prin10 0'no' 1'yes'.
The principal diagnosis in Table2 refers to the diagnosis which is chiefly responsible for the hospital admission . ICD-10-AM refers to International Statistical Classification of Diseases and Related Health Problems, Tenth Revision, Australian Modification . In NSW APDC data, the variable ‘icd10d1’ refers to principal diagnosis; ‘icd10d2’ refers to stay diagnosis and ‘icd10d3’ and following refer to other diagnoses. If stay and other diagnoses are recoded into one diagnosis, the command of ‘DO REPEAT’ and ‘END REPEAT’ should be added to the syntax. For example, the variable which includes one-stay diagnosis and 50 other diagnoses for schizophrenia and schizophrenia-like disorder (Other1) is recoded by the syntax: COMPUTE Other1 = 0. DO REPEAT icd10d = icd10d2 TO icd10d52. If (icd10d GE ‘F20’ AND icd10d LE ‘F22.9’) Other1 = 1. If (icd10d GE ‘F24’ AND icd10d LE ‘F29’) Other1 = 1. END REPEAT. EXECUTE. VARIABLE LABELS Other1' stay and other diagnoses of schizophrenia and schizophrenia-like disorder F20-22, F24-29'.
Aggregate hospital admissions
The steps and SPSS syntax to aggregate hospital admissions for mental illness during pregnancy
Variable and explanation
Summarize hospital admissions of all principal diagnoses of mental illness during pregnancy
Select the period of pregnancy and sort record according to hospital admission date.PPN_mum: mother’s Project Person Number AGEAdmMum: mother’s age in days at hospital admission
USE ALL.COMPUTE filter_$ = ( Admmonth > = 13 Admmonth < = 23).VARIABLE LABELS filter_$ ' Admmonth >= 13 Admmonth < = 23 (FILTER)'.VALUE LABELS filter_$ 0 'Not Selected' 1 'Selected'.FORMATS filter_$ (f1.0).FILTER BY filter_$.EXECUTE.SORT CASES BY PPN_mum (A) AGEAdmMum(A).
Prin10_sum: the total hospital admissions for principal diagnosis of mental illness during pregnancy
AGGREGATE/OUTFILE = * MODE = ADDVARIABLES/BREAK = PPN_mum /Prin10_sum = SUM(Prin10).
Order the principal diagnoses of mental illness during pregnancy in sequence
Select and sort the records during pregnancy
FILTER OFF.USE ALL.SELECT IF (Admmonth > = 13 & Admmonth < = 23).EXECUTE.SORT CASES BY PPN_mum(A) AGEAdmMum(A).
Morder: The sequent order of mother’s hospital admission
compute Morder = 1.if ( PPN_mum = lag(PPN_mum )) Morder = lag(Morder ) + 1.format Morder (F2).VARIABLE LABELS Morder 'Mother’s hospital admission order (during pregnancy)'.EXECUTE.
Order the principal diagnosis of mental illness. Prin10order: the order of mother’s hospital admissions for principal diagnosis of mental illness (F00-99) during pregnancy
IF ( Morder EQ 1) #counter = 0.IF (Prin10 EQ 1 AND #counter GE 1) #counter = #counter + 1.IF (Prin10 EQ 1 AND #counter GE 1) Prin10order = #counter.IF (Prin10 EQ 1 AND #counter EQ 0) Prin10order = 1.IF (Prin10 EQ 1 AND #counter EQ 0) #counter = 1.FORMATS Prin10order (F2).VARIABLE LABELS Prin10order 'The order of mother’s hospital admissions for principal diagnosis of mental illness (F00-99) during pregnancy'.EXECUTE.
For the analysis of total length of hospital stay during pregnancy, the duration of each hospital stay during the period is calculated firstly by using maternal age in days at discharge minus maternal age in days at admission. Then the durations of hospital stay during pregnancy are added together by using the function of aggregate data in SPSS (see Table3).
Register-based and routinely collected data are important sources of disease surveillance and epidemiological studies [21, 22]. The data sources have been widely utilized in the Nordic countries, Scotland, United Kingdom, United States, Canada and Australia [5, 21]. For rare conditions such as birth defects, spinal surgery and arthroplasty, the data provide an effective means for monitoring the rates [23, 24]. By compiling more similar data sources from different countries or regions, the study population for the rare diseases increased significantly and the study results became more reliable [24, 25]. On the other hand, data linkage provides a way to extend the study field to broader areas and improve the quality of the linked data [2, 10, 26]. By linking different data sources according to study objectives, study variables can be increased and the completeness of the variable values can be improved significantly [10, 26].
A common limitation of registered or routinely collected data is loss of registration or under-reporting. Favourable health was generally more frequent among the registered than the non-registered, and non-registration may lead to bias in analyses of health inequalities [10, 27]. In NSW, Aboriginal mothers were less likely to register their births . The magnitude of under-estimation can be estimated by the capture-recapture method . The master dataset provides a platform for creating an SV which can improve the under-estimation . The SV is created from linked data and adds value to the data.
Building a master dataset is essential for linked data analysis. The master dataset is useful for data quality improvement. The more data to be linked when building the master dataset, including internal and external datasets, the greater the chance of improving data quality (including consistency and completeness), and the larger the amount of information made available for research. For a mother’s perinatal psychiatric study, the master dataset will be more useful if it includes, in addition to mothers’ and babies’ information, fathers’ data and other data collections such as the national health insurance data (Medicare), RBDM and the hospital- based Emergency Department Data Collection (EDDC). An optimal linked master dataset should cover all life events and health conditions of a study population in the long term.
The techniques for building a master dataset were derived from the current study data which were relatively simple. For very complicated data, such as the Western Australian Data Linkage System (WADLS), which was instigated in 1995 to link up to 40 years of data from over 30 collections for an historical population of 3.7 million, more linking methods were applied . For example, firstly the study records and variables for specific topics were selected from different data sources and then the data were linked .
It should be borne in mind when describing rates or risks before birth that gestational age in MDC refers to the time interval from the first day of women’s last menstrual period (LMP) to her baby’s date of birth rather than the interval between conception and date of birth. The conception date is about 14 days after the first day of women’s LMP. Furthermore, gestational age data is provided in completed weeks rather than days. As a result, the shortest time interval before birth should be expressed in weeks.
For more complicated data linkage, building a variable dictionary, including the SV, is very helpful when checking and analysing data.
To provide a comprehensive and representative picture of maternal mental illness before and after birth, some other limitations of linked data should also be considered when interpreting and disseminating the results [28, 29]. Patients’ access to health care impacts hospital admission rates. For psychiatric disorders, severity needs to be considered because only mild and severe patients admitted to hospital.
Linked data preparation including building a master dataset and calculating the SV can improve data quality and enhance data function.
Admitted Patient Data Collection
The Centre for Health Record Linkage
Emergency Department Data Collection
International Classification of Diseases
International Statistical Classification of Diseases and Related Health Problems Tenth Revision Australian Modification
Last menstrual period
Midwives Data Collection
New South Wales
Pharmaceutical Drugs of Addiction System
Register of Congenital Conditions
Statistical Package for the Social Sciences
statistical variable of time interval.
We would like to thank data custodians Lee Taylor, Kim Lim and Tony Dunn of the NSW Department of Health, Hilarie Tardif and Glenda Lawrence of the Centre for Health Record Linkage (CheReL) and John Lumby and Pia Salmelainen of the Pharmaceutical Services Branch of NSW Health for providing the data, undertaking the linkage and providing expert technical advice. We acknowledge Nicole Reilly for helping with diagnoses grouping and D’Arcy J Holman for providing training courses on the analysis of linked health data. We acknowledge the families who have contributed their data and professional staff involved in the data collection and management of this study.
Funding for this study was provided by Australia’s National Health and Medical Research Council (NHMRC) Training Fellowship.
- Burns L, Mattick RP, Cooke M: Use of record linkage to examine alcohol use in pregnancy. Alcohol Clin Exp Res. 2006, 30 (4): 642-8. 10.1111/j.1530-0277.2006.00075.x.View ArticlePubMedGoogle Scholar
- Morgan VA, Jablensky AV: From inventory to benchmark: quality of psychiatric case registers in research. Br J Psychiatry. 2010, 197 (1): 8-10. 10.1192/bjp.bp.109.076588.View ArticlePubMedGoogle Scholar
- Mai Q, et al: Mental illness related disparities in diabetes prevalence, quality of care and outcomes: a population-based longitudinal study. BMC Med. 2011, 9: 118-10.1186/1741-7015-9-118.View ArticlePubMedPubMed CentralGoogle Scholar
- Munk-Olsen T, Laursen TM, et al: Psychiatric Disorders with Postpartum Onset: Possible Early Manifestations of Bipolar Affective Disorders. Arch Gen Psychiatry. 2011, 69 (4): 428-34.PubMedGoogle Scholar
- Holman CD, et al: A decade of data linkage in Western Australia: strategic design, applications and benefits of the WA data linkage system. Aust Health Rev. 2008, 32 (4): 766-77. 10.1071/AH080766.View ArticlePubMedGoogle Scholar
- Goldsbury DE, et al: Using linked routinely collected health data to describe prostate cancer treatment in New South Wales.Australia: A validation study. BMC Health Serv Res. 2011, 11: 253-10.1186/1472-6963-11-253.View ArticlePubMedPubMed CentralGoogle Scholar
- Yu XQ, et al: A population-based study from New South Wales, Australia 1996–2001: area variation in survival from colorectal cancer. Eur J Cancer. 2005, 41 (17): 2715-21. 10.1016/j.ejca.2005.05.018.View ArticlePubMedGoogle Scholar
- Roberts CL, Tracy S, Peat B: Rates for obstetric intervention among private and public patients in Australia: population based descriptive study. BMJ. 2000, 321 (7254): 137-41. 10.1136/bmj.321.7254.137.View ArticlePubMedPubMed CentralGoogle Scholar
- Algert CS, et al: Regional block versus general anaesthesia for caesarean section and neonatal outcomes: a population-based study. BMC Med. 2009, 7: 20-10.1186/1741-7015-7-20.View ArticlePubMedPubMed CentralGoogle Scholar
- Xu F, et al: Improvement of maternal Aboriginality in NSW birth data. BMC Med Res Methodol. 2012, 12 (1): 1-8. 10.1186/1471-2288-12-1.View ArticleGoogle Scholar
- Australian Institute of Health and Welfare: Australian hospital statistics 2008–09. Health services series no. 17. Cat. no. HSE 84. 2010, AIHW, CanberraGoogle Scholar
- Centre for Epidemiology and Research: Diabetes as a principal diagnosis and as a comorbidity:hospitalizations by sex, NSW 1991–92 to 2010–2011. 2012, http://www.healthstats.nsw.gov.au/Indicator/dia_pcohos,Google Scholar
- Xu F, et al: Improvement of maternal Aboriginality in NSW birth data. BMC Med Res Methodol. 2012, 12 (1): 8-10.1186/1471-2288-12-8.View ArticlePubMedPubMed CentralGoogle Scholar
- Leeds KL, et al: Indigenous mothers and their babies, Australia 2001–2004. cat. no. PER 38. 2007, AIHW, CanberraGoogle Scholar
- Li Z, et al: Australia’s mothers and babies 2009. 2011, AIHW National Perinatal Epidemiology and Statistics Unit, Sydney, 1-122.Google Scholar
- Lawrence D, et al: Mortality in Western Australian psychiatric patients. Soc Psychiatry Psychiatr Epidemiol. 2000, 35 (8): 341-7. 10.1007/s001270050248.View ArticlePubMedGoogle Scholar
- Lawrence D, et al: Excess cancer mortality in Western Australian psychiatric patients due to higher case fatality rates. Acta Psychiatr Scand. 2000, 101 (5): 382-8. 10.1034/j.1600-0447.2000.101005382.x.View ArticlePubMedGoogle Scholar
- Munk-Olsen T, et al: Perinatal mental disorders in native Danes and immigrant women. Archives of Women's Mental Health. 2010, 13 (4): 319-326. 10.1007/s00737-009-0131-0.View ArticlePubMedGoogle Scholar
- Mai Q, et al: Do users of mental health services lack access to general practitioner services?. Medical Journal of Australia. 2010, 192 (9): 501-506.PubMedGoogle Scholar
- Page A, Ambrose S, HDGlover J, Atlas of Avoidable Hospitalisations in Australia: Atlas of Avoidable Hospitalisations in Australia: ambulatory care-sensitive conditions. 2007, PHIDU, University of Adelaide, AdelaideGoogle Scholar
- Miettunen J, et al: Use of Register Data for Psychiatric Epidemiology in the Nordic Countries. Textbook in Psychiatric Epidemiology. Edited by: John Wiley. 2011, 117, 131Google Scholar
- Munk-Olsen T, et al: New parents and mental disorders: a population-based register study. JAMA. 2006, 296 (21): 2582-9. 10.1001/jama.296.21.2582.View ArticlePubMedGoogle Scholar
- Surman G, da Silva AA, Kurinczuk JJ: Cerebral palsy registers and high-quality data: an evaluation of completeness of the 4Child register using capture-recapture techniques. Child Care Health Dev. 2012, 38 (1): 98-107. 10.1111/j.1365-2214.2011.01280.x.View ArticlePubMedGoogle Scholar
- Ohrn A, et al: Adverse events in spine surgery in Sweden: a comparison of patient claims data and national quality register (Swespine) data. Acta Orthop. 2011, 82 (6): 727-31. 10.3109/17453674.2011.636673.View ArticlePubMedPubMed CentralGoogle Scholar
- Labek G, et al: Organisation, data evaluation, interpretation and effect of arthroplasty register data on the outcome in terms of revision rate in total hip arthroplasty. Int Orthop. 2011, 35 (2): 157-63. 10.1007/s00264-010-1131-4.View ArticlePubMedGoogle Scholar
- Mai Q, et al: Mental illness related disparities in diabetes prevalence, quality of care and outcomes: a population-based longitudinal study. BMC Medicine. 2011, 9 (1): 118-10.1186/1741-7015-9-118.View ArticlePubMedPubMed CentralGoogle Scholar
- Nummela O, et al: Register-based data indicated nonparticipation bias in a health study among aging people. J Clin Epidemiol. 2011, 64 (12): 1418-25. 10.1016/j.jclinepi.2011.04.003.View ArticlePubMedGoogle Scholar
- Schuh R, et al: Validity of published outcome data concerning Anatomic Graduated Component total knee arthroplasty: a structured literature review including arthroplasty register data. Int Orthop. 2012, 36 (1): 51-6. 10.1007/s00264-011-1255-1.View ArticlePubMedGoogle Scholar
- Kallen B, Nilsson E, Olausson PO: Antidepressant use during pregnancy: comparison of data obtained from a prescription register and from antenatal care records. Eur J Clin Pharmacol. 2011, 67 (8): 839-45. 10.1007/s00228-011-1021-8.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/12/71/prepub