- Research
- Open access
- Published:
Cause of death coding in asthma
BMC Medical Research Methodology volume 24, Article number: 129 (2024)
Abstract
Background
While clinical coding is intended to be an objective and standardized practice, it is important to recognize that it is not entirely the case. The clinical and bureaucratic practices from event of death to a case being entered into a research dataset are important context for analysing and interpreting this data. Variation in practices can influence the accuracy of the final coded record in two different stages: the reporting of the death certificate, and the International Classification of Diseases (Version 10; ICD-10) coding of that certificate.
Methods
This study investigated 91,022 deaths recorded in the Scottish Asthma Learning Healthcare System dataset between 2000 and 2017. Asthma-related deaths were identified by the presence of any of ICD-10 codes J45 or J46, in any position. These codes were categorized either as relating to asthma attacks specifically (status asthmatic; J46) or generally to asthma diagnosis (J45).
Results
We found that one in every 200 deaths in this were coded as being asthma related. Less than 1% of asthma-related mortality records used both J45 and J46 ICD-10 codes as causes. Infection (predominantly pneumonia) was more commonly reported as a contributing cause of death when J45 was the primary coded cause, compared to J46, which specifically denotes asthma attacks.
Conclusion
Further inspection of patient history can be essential to validate deaths recorded as caused by asthma, and to identify potentially mis-recorded non-asthma deaths, particularly in those with complex comorbidities.
Background
In countries with full civil registration coverage, death records are used to legally certify someone as deceased, to monitor mortality patterns and the epidemiology of specific conditions, to inform family members about their relative’s medical history which may be of relevance to themselves, and to act as a form of emotional closure for the family and healthcare team who had looked after the patient [1]. The correct and accurate recording of death is therefore important on a personal, national, and international level and this information is processed by these different stakeholders in a variety of ways.
The Medical Certificate of Cause of Death (MCCD) [1] is an internationally standardized form which requires a qualified medical practitioner to complete. In the UK, the issuing of a MCCD is the legal responsibility of the doctor who had attended to the patient during their last illness within the past 28 days [2]. In practical terms, this tends to fall upon the patient’s most recent hospital consultant or General Practitioner who is expected to assign the final cause of death except in the rare circumstance where certification is conducted by a coroner. Cause of death assignment is often based on the patient’s medical records, relevant investigation results, and the responsible doctor’s experience of meeting and caring for the patient [2].
Legally, deaths must be registered within 5 days unless referred to a coroner whom may also initiate a post-mortem or an inquest [2]. In such cases, the coroner then shares their findings with the registrar, and their final diagnosis is utilized instead of the information provided on MCCD for the purpose of officially registering the death [3]. The MCCD assignment is divided into two parts: the causes of death (Part I) and any other health or circumstantial condition which is thought to have indirectly significantly contributed to a person’s final cause of death (Part II) [1]. Part I is further divided into the immediate cause of death (Part Ia) and the contributing causes (if required; Part Ib) which sequentially led to this event such that the diagnosis on the lowest line caused the conditions above it (thus known as the Underlying Cause Of Death, or UCOD).
The reported causes of death are then coded according to the International Classification of Diseases (ICD) by a trained coder or validated clinical coding software. This is conducted by the Office for National Statistics (ONS) in England and Wales and National Records of Scotland (NRS) in Scotland. Between 2000 and 2017, NRS used the automated Mortality Medical Data System (MMDS) for cause of death coding. Since then, the NRS transitioned to Iris, as recommended by the World Health Organization (WHO) [4], but the ONS have been using version 5.8 of the Multicausal and Unicausal Selection Engine, or MUSE, since January 2022 [5].
While clinical coding is intended to be an objective and standardized practice, it is important to recognize that it is not entirely devoid of bias. Sometimes, the available medical information may be ambiguous or incomplete, making it challenging to determine the exact cause of death [6]. In such cases, clinical coders must rely on their judgment and interpretation to assign the most appropriate code. Additionally, coding guidelines evolve over time to accommodate new medical knowledge and practices [7]. Changes in guidelines or updates may require coders to adapt their coding practices, which can introduce inconsistencies until widespread adoption and understanding of the new guidelines are achieved [8, 9]. This bias can introduce variability in the coding process, leading to discrepancies, particularly in assigning codes for complex or rare conditions [10]. As well as ambiguity and uncertainty, there can be cultural and political influences on how cause of death is reported and subsequently coded (Fig. 1). Suicide is notoriously poorly reported, due in part to the sensitivity of the event and the legality of suicidal behavior, resulting in substantial under-reporting [11, 12].
Shifts and variations in the use of the specific reporting and coding practices have two primary impacts in public health research. Firstly, comparisons between different populations may be hindered due to differences (either due to administrative or cultural variations), resulting in poor inference about associations between population-level factors and mortality. Secondly, it also hinders within population estimates over a duration of time. For example, interrupted time-series analyses may be affected by changes which do not reflect the underlying mortality rate. Additionally, the performance of risk prediction models may become biased by these differences for testing done by a temporal split if the practices have changed [13,14,15].
In this study, we aimed to investigate how asthma deaths in Scotland are coded, and whether there has been any change in practices over time.
Methods
Mortality data
In UK population mortality research datasets, the core variables are the personal identifier (typically pseudonymized), the date of death, and the causes of death. Additional variables may include the location of death, date of death registration, and details of coronial enquiry [16, 17].
The causes of death are presented as the ‘primary’ cause (the UCOD), and up to ten secondary causes, ordered from 0 to 9 (as per the sequence of events recorded in the first part of the MCCD). There is also a field denoting the version of the ICD that was in use in the data. In the ICD-10 ontology, each code is of four to six characters in length: the first three characters indicate the category of diagnosis, and characters four to six indicate aetiology, anatomical site, severity, or other clinical details [18].
The asthma learning healthcare system study population
The Asthma Learning Healthcare System (ALHS) study recruited over half a million patients from 75 general practices in Scotland, with primary care records linked to national accident and emergency (A&E), hospital, and mortality datasets, for all participants [19]. The study period was between January 2000 and March 2017. In this dataset, each diagnostic code is formatted to be 5 characters in length, with the form of a letter (denoting the chapter) plus either two or three digits (and one or two trailing spaces) as per ICD-10.
Analysis plan
Mortality records were reviewed to identify duplicated codes in different positions (primary and secondary). The number of (de-duplicated) codes recorded per death and change over time (by year) were examined, as well as the odds ratios of code incidence compared to those without asthma as a primary or contributing cause of death.
Asthma-related deaths were identified by the presence of any of J45 or J46, in any position. These codes were categorized either as relating to asthma attacks specifically (status asthmatic; J46) or generally to asthma diagnosis (J45). Our study did not use J44 codes, which incorporates chronic obstructive asthmatic bronchitis under J448, to identify cases as this category pertains primarily to chronic obstructive pulmonary disease (COPD) rather than asthma. In individuals who died of asthma-related causes, the location (primary versus secondary) of the code(s) were assessed, and specifically whether they were more commonly asthma or asthma-attack codes.
Any counts of five or below have been suppressed to protect patient confidentiality. Analysis was conducted in the R programming language (version 4.3.1), and code is available on the open-source platform GitHub at https://github.com/hollytibble/asthma_mortality_ICD.
Results
The ALHS dataset contained records for 91,022 deaths (excluding stillbirths) between January 1st 2000, and March 31st, 2017. After duplicate removal, there were a median of 2 causes coded per death (interquartile range 2 to 3). The number of codes increased on average over time, with 25.4% having only a single code, and 6.0% having five or more in 2000, compared to 18.6% with a single code and 16.3% with five or more in 2017 (Fig. 2).
There were 487 deaths in this time (0.54%) which included asthma-related causes. There was no observed change in frequency by year. Fewer than 5 of these deaths had both J45 and J46 codes (in any position; less than 1%), and 92% (450/487) deaths had only the J45 code, rather than the J46 code.
For the deaths mentioning asthma as a cause, it was the primary cause of death in 190 cases (39.0%). Of these, the most common code was J45 (82.1%, n = 156). Across years, the median percentage of primary asthma codes that were J45, rather than J46, was 87.1% (interquartile range of annual percentages = 68.2 to 89.3%), with no consistent trend over time.
There was an infection reported as a contributing or primary cause of death for 56.4% of deaths with asthma (J45) as the primary cause, ≤ 14.7% (n ≤ 5, exact number masked to protect patient confidentiality) when asthma attack (J46) was the primary cause. In comparison, infection was reported as a contributing or primary cause of death for 24.2% of deaths in which asthma was only a secondary cause (n = 72/297), and 25.2% (n = 22,793/90,535) of deaths in which asthma was not recorded in any position in the death record. Compared to ‘no asthma’, J45 as primary cause had 3.85 times higher odds of recorded infection (95% CI = 2.80 to 5.28), while there was no significant difference for J46 primary cause or non-primary asthma cause. The difference in infection record between those with J45 and J46 as the primary cause was also significant (7.51 times higher odds with J45 as primary code, 95% CI = 2.76 to 20.41).
When the primary cause of death was asthma-related, the most common secondary cause (any position) was unspecified bronchopneumonia (26.3%; top five presented in Table 1).
However, given the general prevalence of some of these as secondary causes, the odds of unspecified chronic ischemic heart disease was in fact lower when asthma was the primary cause of death than when asthma was not recorded as a cause at all (odds ratio = 0.56, 95% CI = 0.34 to 0.93). There were four secondary causes with odds ratios of higher than 3, as listed in Table 2.
For the 297 deaths with asthma as a secondary cause rather than the primary cause, the most common primary cause chapter was diseases of the circulatory system (ICD chapter I; 42.1%). The top five chapters by prevalence are shown in Table 3.
There were 1.93 times higher odds of endocrine, nutritional and metabolic diseases (chapter E) when asthma was a secondary cause (compared to asthma not recorded in any position, 95% CI 1.03 to 3.63), 1.59 times higher odds for diseases of the circulatory system (chapter I, 95% CI = 1.26 to 2.00), but an odds ratio of 0.53 for Mental, Behavioural and Neurodevelopmental disorders (chapter F, 95% CI = 0.31 to 0.93).
Discussions
Summary of results
0.5% of death records included an asthma-related cause. When the primary cause of death was asthma-related, ICD-10 code J45 (asthma) rather than J46 (asthma attack) was used 82% of the time. There were higher odds of infection (predominantly pneumonia) being reported as a contributing cause of death when J45 was the primary coded cause, compared to J46 (odds ratio = 7.51, 95% CI = 2.76 to 20.41). When asthma was only a secondary cause, 42% of deaths had primary cause related to diseases of the circulatory system.
Results in context
The process of completing a death certificate is liable to bias, particularly where there may be very co-morbid patients with chronic diseases. In the 2014 National Review of Asthma Deaths (NRAD), the MCCDs had been predominantly completed by junior doctors and therefore more likely to have inaccuracies than if they were completed by a senior doctor [20]. An expert panel review of the medical records of 900 cases classified with asthma as the underlying cause of death, they found that 10% had no evidence of asthma diagnosis at all, and 13% had asthma but did not die from it [21]. We consider such cross-referencing with primary care records to be the gold standard in reliable ascertainment of validated asthma deaths.
This bias can influence the accuracy of the final coded record in two different stages: the reporting of the death on the MCCD certificate by the attending physician, and the ICD coding of that certificate by the medical coder. A Japanese 2019 study reviewed 103 asthma ICD-10 coded deaths, and found that 16% were not asthma deaths, and that for 13% the cause could not be ascertained without further investigation [22]. A Swiss study compared the mortality data for in-hospital deaths to their terminal hospital discharge records, and found that for asthma mortality record cases (n = 50) there was only 24% agreement as primary cause [23]. Conversely, for the 20 cases with asthma as the principal terminal hospital discharge cause, the agreement to the mortality record was 60%. Unfortunately, we were not able to identify any UK data to directly compare these studies to.
The ontological coding from death certificate to ICD-10 value may also introduce some degree of bias, particularly when conducted by a human-coder rather than automated coding software. A study in the Netherlands compared the results of two independent ICD coders, and found that for respiratory deaths (n = 1145) there was agreement to the 4-digit level in 81% of cases, and to the 3-digit level in 84% of cases, but only to the chapter level in 88% of cases [24]. The chapter-level agreement was less than 70% for infectious, endocrine, and skin diseases, but over 95% for neoplasms. Although this study used manual coding, there may also be variation in automated coding between software and versions which affect outputs. An NRS review of the change to Iris in January 2017 from the previous Mortality Medical Data System (MMDS) system, which was an automated system in use from 2000, observed a decrease of 4.8% in the number of deaths allocated to respiratory causes in Scotland. This was mainly due to the switch of deaths from chest infections and aspiration pneumonia to dementia and diseases of the nervous system [25].
The typical standard for asthma death ascertainment from ICD-10 coded data is either J45 or J46 (and lower-level codes under these parents) as the primary cause of death. However, consideration must be paid to patients with a prior misdiagnosis of asthma [26], with an incorrect prior diagnosis of a different respiratory condition [27], with no reported history of asthma diagnosis [21], and with comorbid conditions [21], including infections and overlapping COPD and asthma [20, 21, 27]. As highlighted in our methods, the code J448 includes chronic obstructive asthmatic bronchitis however as it does not differentiate between asthma related and emphysematous related COPD, it was not used as an identification criterion in our study. As such our study may have underestimated any deaths associated with chronic asthmatic bronchitis. More complex rules and exclusions may be required to improve the accuracy of asthma mortality ascertainment, especially if such data were to be used for training disease prediction models.
We investigated changes in practices over the duration of the observed data (2000 to 2017). Variation in practice over time was observed, such as in the use of J45 versus J46 as the UCOD, but we failed to identify any clear trends. Further investigation is required to explore possible causes of temporal changes. Furthermore, the effect of the disruption to the healthcare system resulting from the CoVID-19 pandemic warrants further exploration [28].
The ALHS dataset contains records for a subset of the Scottish population: over half a million patients from 75 general practices in Scotland [19]. In this study population, Scottish people from areas with lower socioeconomic deprivation are slightly over-represented, however the population is otherwise fairly representative.
Conclusion
One in every 200 deaths in this Scottish dataset, between 2000 and 2017, were coded as being asthma related, as denoted by the inclusion of ICD-10 codes J45 and J46. Infection (predominantly pneumonia) was more commonly reported as a contributing cause of death when J45 was the primary coded cause, compared to J46, which specifically denotes asthma attacks. However, our study highlights the potential impact that bias can have on final cause of death reporting and coding. This is important to consider when creating disease prediction models which utilize retrospective data spanning large population and points in time.
Data availability
The ALHS data are held by the National Services Scotland electronic Data Research and Innovation Service (eDRIS) in the National Safe Haven. Restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data would be made available from a reasonable request to phs.edris@phs.scot. Code scripts, in the R language, for all components of the data cleaning and analysis are available at https://github.com/hollytibble/asthma_mortality_ICD.
Abbreviations
- ALHS:
-
Asthma Learning Healthcare System study
- A&E:
-
Accident and Emergency Department
- COPD:
-
Chronic Obstructive Pulmonary Disease
- ICD-10:
-
International Classification of Diseases, Version 10
- MCCD:
-
Medical Certificate of Cause of Death
- MUSE:
-
Multicausal and Unicausal Selection Engine
- UCOD:
-
Underlying Cause Of Death
- NRAD:
-
National Review of Asthma Deaths
- NRS:
-
National Records of Scotland
- ONS:
-
the Office for National Statistics
- WHO:
-
World Health Organization
References
World Health Organization. Medical certification of cause of death: instructions for physicians on use of international form of medical certificate of cause of death [Internet]. World Health Organization. 1979 [cited 2023 Jul 6]. https://apps.who.int/iris/handle/10665/40557.
GOV.UK [Internet]. [cited 2023 Jun 20]. Guidance for doctors completing medical certificates of cause of death in England and Wales (accessible version). https://www.gov.uk/government/publications/guidance-notes-for-completing-a-medical-certificate-of-cause-of-death/guidance-for-doctors-completing-medical-certificates-of-cause-of-death-in-england-and-wales-accessible-version.
User guide to mortality statistics -. Office for National Statistics [Internet]. [cited 2023 Jun 20]. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/methodologies/userguidetomortalitystatisticsjuly2017#cause-of-death-coding.
Deaths - ScotPHO [Internet]. [cited 2023 Jul 11]. https://www.scotpho.org.uk/methods-and-data/overview-of-key-data-sources/scottish-national-data-schemes/deaths/.
Cause of death coding in mortality statistics, software changes. January 2022 - Office for National Statistics [Internet]. [cited 2023 Jun 20]. https://www.ons.gov.uk/releases/causeofdeathcodinginmortalitystatisticssoftwarechangesjanuary2022.
Harteloh P. The implementation of an automated coding system for cause-of-death statistics. Inform Health Soc Care. 2020;45(1):1–14.
Hirsch JA, Nicola G, McGinty G, Liu RW, Barr RM, Chittle MD, et al. ICD-10: history and context. Am J Neuroradiol. 2016;37(4):596–9.
Quan H, Li B, Duncan Saunders L, Parsons GA, Nilsson CI, Alibhai A, et al. Assessing validity of ICD-9-CM and ICD-10 Administrative Data in Recording Clinical conditions in a Unique dually coded database. Health Serv Res. 2008;43(4):1424–41.
Anderson RN, Miniño AM, Hoyert DL, Rosenberg HM. Comparability of cause of death between ICD-9 and ICD-10: preliminary estimates. Natl Vital Stat Rep. 2001;49(2):1–32.
Millares Martin P. Medical certificate of cause of death: looking for an European single standard. J Forensic Leg Med. 2020;75:102052.
Tøllefsen IM, Hem E, Ekeberg Ø. The reliability of suicide statistics: a systematic review. BMC Psychiatry. 2012;12(1):9.
Pritchard C, Iqbal W, Dray R. Undetermined and accidental mortality rates as possible sources of underreported suicides: population-based study comparing islamic countries and traditionally religious western countries. BJPsych Open. 2020;6(4):e56.
Davis SE, Lasko TA, Chen G, Matheny ME. Calibration Drift Among Regression and Machine Learning Models for Hospital Mortality. AMIA Annu Symp Proc. 2018;2017:625–34.
Davis SE, Greevy RA, Lasko TA, Walsh CG, Matheny ME. Detection of calibration drift in clinical prediction models to inform model updating. J Biomed Inform. 2020;112:103611.
Hickey GL, Grant SW, Murphy GJ, Bhabra M, Pagano D, McAllister K, et al. Dynamic trends in cardiac surgery: why the logistic EuroSCORE is no longer suitable for contemporary cardiac surgery and implications for future risk models. Eur J Cardiothorac Surg. 2013;43(6):1146–52.
NHS Digital [Internet]. [cited 2023 Jun 29]. Primary Care Mortality Database. https://digital.nhs.uk/services/primary-care-mortality-database.
2023.2. NRS Death - DataLoch Metadata Catalogue - Wiki Service [Internet]. [cited 2023 Jun 29]. https://www.wiki.ed.ac.uk/display/DMCatalogue/2023.2%3A+NRS+Death.
ISD Scotland. Code Formats for ICD-10 [Internet]. [cited 2023 Jun 29]. https://www.ndc.scot.nhs.uk/Dictionary-A-Z/Definitions/index.asp?Search=I&ID=987&Title=ICD-10%20Code%20Formats
Soyiri IN, Sheikh A, Reis S, Kavanagh K, Vieno M, Clemens T, et al. Improving predictive asthma algorithms with modelled environment data for Scotland: an observational cohort study protocol. BMJ Open. 2018;8:e23289.
Levy ML. The national review of asthma deaths: what did we learn and what needs to change? Breathe (Sheff). 2015;11(1):14–24.
Royal College of Physcians. Why asthma still kills: The National Review of Asthma Deaths (NRAD) [Internet]. 2014. Report No.: 978-1-86016-532–0. www.rcplondon.ac.uk/nrad.
Kamei T, Kanaji N, Nakamura H, Arakawa Y, Miyawaki H, Kishimoto N, et al. Asthma mortality based on death certificates: a demographic survey in Kagawa, Japan. Respiratory Invest. 2019;57(3):268–73.
Zellweger U, Junker C, Bopp M, Egger M, Spoerri A, Zwahlen M, et al. Cause of death coding in Switzerland: evaluation based on a nationwide individual linkage of mortality and hospital in-patient records. Popul Health Metrics. 2019;17(1):2.
Harteloh P, de Bruin K, Kardaun J. The reliability of cause-of-death coding in the Netherlands. Eur J Epidemiol. 2010;25(8):531–8.
National Records for Scotland. The Impact of the Implementation of IRIS Software for ICD-10 Cause of Death Coding on Mortality Statistics in Scotland. 2017.
Kavanagh J, Jackson DJ, Kent BD. Over- and under-diagnosis in asthma. Breathe. 2019;15(1):e20–7.
Yamauchi Y, Yasunaga H, Matsui H, Hasegawa W, Jo T, Takami K, et al. Comparison of in-hospital mortality in patients with COPD, asthma and asthma–COPD overlap exacerbations. Respirology. 2015;20(6):940–6.
Duckworth C, Chmiel FP, Burns DK, Zlatev ZD, White NM, Daniels TWV, et al. Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19. Sci Rep. 2021;11(1):23017.
Acknowledgements
The authors would like to extend their gratitude to Prof. Colin Simpson and Dr. Irenous Soyiri for their work in the conception and development of the Asthma Learning Healthcare System (ALHS) study, the data for which was used to support these analyses.
Funding
Not Applicable.
Author information
Authors and Affiliations
Contributions
HT and AC conceived the analysis and wrote the manuscript. HT conducted the analysis. GAOP contributed to the analysis plan and to the final version of the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Ethical approval for current study is obtained from the South East Scotland Research Ethics Committee 02 [16/SS/0130] and the Public Benefit and Privacy Panel (PBPP) for Health and Social Care [1516 − 0489], who waived the need for informed consent from individuals. All methods were carried out in accordance with relevant guidelines and regulations.
Consent for publication
Not Applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.