Skip to main content
  • Research article
  • Open access
  • Published:

Machine learning techniques for mortality prediction in critical traumatic patients: anatomic and physiologic variables from the RETRAUCI study



Interest in models for calculating the risk of death in traumatic patients admitted to ICUs remains high. These models use variables derived from the deviation of physiological parameters and/or the severity of anatomical lesions with respect to the affected body areas. Our objective is to create different predictive models of the mortality of critically traumatic patients using machine learning techniques.


We used 9625 records from the RETRAUCI database (National Trauma Registry of 52 Spanish ICUs in the period of 2015–2019). Hospital mortality was 12.6%. Data on demographic variables, affected anatomical areas and physiological repercussions were used. The Weka Platform was used, along with a ten-fold cross-validation for the construction of nine supervised algorithms: logistic regression binary (LR), neural network (NN), sequential minimal optimization (SMO), classification rules (JRip), classification trees (CT), Bayesian networks (BN), adaptive boosting (ADABOOST), bootstrap aggregating (BAGGING) and random forest (RFOREST). The performance of the models was evaluated by accuracy, specificity, precision, recall, F-measure, and AUC.


In all algorithms, the most important factors are those associated with traumatic brain injury (TBI) and organic failures. The LR finds thorax and limb injuries as independent protective factors of mortality. The CT generates 24 decision rules and uses those related to TBI as the first variables (range 2.0–81.6%). The JRip detects the eight rules with the highest risk of mortality (65.0–94.1%). The NN model uses a hidden layer of ten nodes, which requires 200 weights for its interpretation. The BN find the relationships between the different factors that identify different patient profiles. Models with the ensemble methodology (ADABOOST, BAGGING and RandomForest) do not have greater performance. All models obtain high values ​​in accuracy, specificity, and AUC, but obtain lower values ​​in recall. The greatest precision is achieved by the SMO model, and the BN obtains the best recall, F-measure, and AUC.


Machine learning techniques are useful for creating mortality classification models in critically traumatic patients. With clinical interpretation, the algorithms establish different patient profiles according to the relationship between the variables used, determine groups of patients with different evolutions, and alert clinicians to the presence of rules that indicate the greatest severity.

Peer Review reports


Models for calculating the risk of death are used to assess the severity of the condition of traumatic patients. Classically, models for calculating the risk of death in traumatic patients have used two types of approximations. One approach consists of using physiological variables, which can define organic failures, which indicate a greater risk if their values are far from the levels defined as normality; and another approximation determines the severity according to the graduation of the anatomical lesions produced in the different body areas [1].

Although various studies have tried to take advantage of these two approaches – anatomical and physiological – there is still a need to look for systems that achieve better results and to obtain tools that can be used in healthcare practice [2].

Lesions are divided into groups associated with different anatomical areas, and their intensity can be assessed according to the Abbreviated Injury Scale (AIS) [3]. The physiological impact is assessed at the neurological, hemodynamic, and respiratory levels according to the Triage-Revised Trauma Score (T-RTS) [4].

In order to analyse the relationship between mortality, anatomical extension of the injury and its physiological repercussion, it is necessary to have large databases that include records of critically traumatic patients. The RETRAUCI (National Trauma Registry in ICU) study includes the participation of 52 ICUs in Spain and almost 10,000 patients [5].

Classification systems that use machine learning techniques (MLT) provide a global methodological vision and allow us to create multiple algorithms to achieve a more accurate result [6]. The current interest in MLT methodology applied to biomedical research is especially keen, and the development of these techniques requires adequate standardization and evaluation guidelines [7].

There are platforms that make it possible to work with multiple algorithms and that make the work more user-friendly and accurate in the construction and evaluation of results [8]. Among others, the WEKA (Waikato Environment for Knowledge Analysis) platform, developed by the University of Waikato, offers the possibility of using various algorithms and evaluating them with a single tool [9].

From a theoretical point of view, in the ideal conditions of a refined database with sufficient records, the No Free Lunch Theorem establishes that all the algorithms will optimize their results. With real data, however, this same theorem forces us to use different algorithms that will obtain different degrees of precision [10].

Some algorithms produce models with clinical interpretation, such as those based on classification trees, decision rules or Bayesian networks [11, 12]. Understanding the relationships between the variables that influence the classification of patients according to their severity offers us the possibility of understanding the different profiles of traumatic patients admitted to the ICU.

Our objective is to create different predictive models of the mortality of critically traumatic patients using machine learning techniques, to evaluate their performance and, if they can be interpreted, to evaluate relationships between the different types of variables included.


RETRAUCI database

RETRAUCI is an observational, prospective, and multicentre nationwide registry that currently includes 52 ICUs in Spain. The RETRAUCI database only collects traumatic patients admitted to the ICU. It has the endorsement of the Neurointensive Care and Trauma Working Group of the Spanish Society of Intensive Care Medicine (SEMICYUC) and currently operates in a web-based electronic format [13]. We include a five-year study period (2015–2019). Ethics Committee approval for the registry was obtained (Hospital Universitario 12 de Octubre, Madrid: 12/209). Due to the retrospective analysis of de-identified collected data, informed consent was not obtained. Hospital mortality was used as the outcome variable.

The variables collected were classified into several groups (Table 1).

Table 1 Risk factors associated with mortality. Description and attribute evaluation

First, we considered patient variables, such as Age and Sex. Variables were used that describe the importance of injuries by anatomical area according to the AIS model (2005 version) - severity levels ranging from 1(least severe) to 4–6 (most severe) [3]. The anatomical areas were head (AHEAD), neck (ANECK), face (AFACE), thorax (ATHORAX), abdomen (AABDOM), spine (ASPINE), upper extremity (AUPPEREXT), lower extremity (LOWEREXT) and external and thermal injuries (AEXTERNAL).

Also, we considered variables derived from the T-RTS, obtained from first medical attention before initiating resuscitation and/or mechanical ventilation, such as the Respiratory Rate (PointRF), Systolic Blood Pressure (PointSBP) and the Glasgow Coma Score (PointGCS), which range between 0 points (greater severity) and 4 points (normality) [4].

Next, patient treatment variables, such as the presence at the ICU of mechanical ventilation (MV) or the occurrence of a Massive Haemorrhage (MASSIVEHEM) requiring activation of the massive transfusion protocol, were also included [14].

Finally, variables that defined organic failures during the ICU stay: hemodynamic failure (HEMODINAM) indicated by the presentation of an SBP lower than 90 mmHg requiring the administration of volume, blood products, and vasoconstrictor support; respiratory failure (RESPIRATORY), indicated by the presence of PO2/FiO2 below 300; renal failure (AKIDNEY), indicated by an increase in creatinine > 1.5 times the initial, or 25% reduction in urine flow to less than 0.5 ml/kg/h for at least 6 h; and the presence of coagulopathy (COAGULOP), indicated by the prolongation of prothrombin and activated partial thromboplastin times in > 1.5 times the control or by levels of fibrinogen < 150 mg / dl or thrombocytopenia < 100,000 [13, 15, 16].

Conventional statistics

Variable distribution was tested with the Kolmogorov–Smirnov test. The variable AGE did not meet the criteria of normality (p > 0.05). Variables are described as median (interquartile range) or as a percentage. For the comparison of survivors (A-ALIVE) and non-survivors (D-DIED), the Mann-Whitney test was used for continuous variables, and the chi-square test or Fisher’s exact test for categorical variables. A p-value of < 0.05 was taken as significant.

Machine learning techniques

We used the WEKA Platform (version 3.8). We first use attribute selection methodology. Attribute selection is a technique used to extract the ranking of attributes and can help us by reducing the work of processing algorithms by discarding irrelevant variables. WEKA incorporates various attribute selection techniques. We use the Information Gain Attribute evaluation method. This method measures the significance of attribute by measure of information gain calculated with respect to target class and orders the variables according to their importance [17].

Second, we use WEKA’s EXPLORER module to determine the optimal parameters for each algorithm used. The parameters chosen were those that achieved the best performance values (see Algorithm evaluation section). A ten-fold cross-validation process system was used in all algorithms [18].

And third, using WEKA’s EXPERIMENTER module, run all algorithms 10 times, using repeated ten-fold cross-validation, to facilitate comparison of the predictive performance based on the different evaluation criteria that are available in WEKA [18].

Algorithm selection

Of the multiple algorithms included in WEKA, we selected nine supervised algorithms classified in traditional and ensemble methodology. The first six are traditional models based on logistic regression binary (LR) functions, a neural network according to multilayer perceptron (NN), sequential minimal optimization (SMO), classification rules (JRip), classification trees (CT) and Bayesian networks (BN), respectively. We also included three models that use ensemble classification algorithms: adaptive boosting (ADABOOST), bootstrap aggregating (BAGGING), and random forest (RFOREST) [18]. With the WEKA EXPLORER module we select the optimal parameters of the different algorithms used.

For the LR model, we used a backward stepwise regression system with variable input with p <  0.05 and removal with p <  0.10. Odds ratios (OR) with a 95% confidence interval were calculated.

In the CT model, we used the J48 algorithm based on C4.5, obtaining a pruned tree [19]. The JRip algorithm uses a rule learner: Repeated Incremental Pruning to Produce Error Reduction (RIPPER) [20]. We limited tree growth (CT) and the number of rules (JRip), with a minimum of 20 instances.

For the BN, we used the TAN (Tree Augmented Network) variable relation search algorithm, which generates a graph that can be interpreted. This method does not assume the independence of the variables [21, 22].

The SMO implements John Platt’s sequential minimal optimization algorithm for training a support vector classifier [23]. In NN, we used the automatic mode for selecting the number of nodes in the hidden layer, with a learning rate of 0.3 and a momentum of 0.2 [24]. In RFOREST, we selected ten trees with the C4.5 algorithm [25]. In the rest of the algorithms (ADABOOST and BAGGING), we used the parameters that WEKA incorporates by default [18, 26].

Algorithm evaluation

To evaluate the performance of the algorithms, we used the calculation of accuracy, specificity, precision, recall, F-measure, and the area under curve ROC (AUC).

A patient who dies can be classified correctly (true positive-TP) or incorrectly (false negative-FN) and a patient who survives can be classified correctly (true negative-TN) or incorrectly (false positive-FP). We define the evaluation indices as:

  • Accuracy. The proportion of patients that are correctly labelled among the total number of patients. Accuracy = (TP + TN)/(TP + TN + FP + FN).

  • Specificity. The proportion of patients predicted as survivors and are correctly identified. Specificity = TN/(TN + FP).

  • Precision. The proportion of patients that are correctly predicted as dead among those labelled as dead. Precision = TP/(TP + FP)

  • Recall (Sensitivity). The proportion of dead patients that are correctly labelled. Recall = TP/(TP + FN)

  • F-measure. A measure that combine both Precision and Recall. F-measure = (2 x Precision x Recall)/ (Precision + Recall).”

WEKA’s Experimenter module, with ten repetitions, allows one to establish whether there are statistical differences between the evaluated properties of the algorithms using the paired T-Test (corrected) [18].


The RETRAUCI database enrolled 9790 patients in the 2015–2019 period. With 165 records, the data was not complete. The study group includes 9625 patients with a median age of 48 (33–64) years, 77.8% men and a hospital mortality of 12.6% (1212 patients).

Table 2 show demographic and clinical characteristics of patients according to mortality. Table 3 show values in the AIS model scale according to anatomical zone and mortality. It is observed that the factors without significant differences between survivors and non-survivors also have less weight (see last column of Table 1) according to WEKA’s attribute selection criteria. The ANECK and AFACE variables, with the lowest values, were not used in the construction of the models. The most important factors (PointGCS and AHEAD) are those associated with traumatic brain injury (TBI).

Table 2 Demographic and clinical characteristics of patients according to mortality
Table 3 Values in the AIS model scale according to anatomical zone and mortality

Table 4 shows the results of the LR model. 13 variables are included. Do not include SEX or PointFR variables, and that there are three anatomical areas with some OR with values less than 1 (THORAX, LOWEREXT and UPPEREXT).

Table 4 Logistic Regression Binary model for mortality prediction

The CT algorithm (Fig. 1) generates 24 decision rules and uses those related to TBI as the first variables. A range of probability of death is obtained between 2.0 and 81.6%.

Fig. 1
figure 1

Mortality Classification Tree Model in Critically Traumatic Patients. A: Alive. D: Died

The JRip algorithm detects eight classification rules (Fig. 2) that define the patients with the highest risk of mortality. The mortality rate ranges between 65.0 and 94.1%. Patients who do not comply with any of these rules have a lower mortality of 5.8%.

Fig. 2
figure 2

JRip-based classification rules. Output: Mortality. A: Alive. D: Died

The NN model was established automatically with a hidden layer of ten nodes. The model is fully interconnected and requires 200 weights - 180 × 10 nodes in hidden layer and 10 × 2 in output layer - to be used for its interpretation.

The BN model offers us a graph (Fig. 3) with which we can identify the relationships between the different factors, which, in turn, can help us identify different patient profiles. For example, the relationships are observed in the variables associated with TBI (AHEAD and PointGCS), hemodynamic failure with COAGULOP and AKIDNEY, respiratory failure with ATHORAX and AUPPEREXT and the relationship between ALOWEREXT injury with MASSIVEHEM and COAGULOP.

Fig. 3
figure 3

Bayesian network model (TAN) of mortality classification in critically traumatic patients. Output: Mortality

The performance data of the nine algorithms evaluated are shown in Table 5. No great differences were found in the precision measurements. All models obtained high values ​​in accuracy, specificity, and AUC, but obtained lower values ​​in recall. The highest precision was achieved by the SMO model, and the BN obtained the best recall, F-measure, and AUC.

Table 5 Performance properties of the 9 algorithms analysed

Despite using more complex algorithms, models with the ensemble methodology (ADABOOST, BAGGING and RandomForest) did not manage to increase the performance of the classification.


The availability of a database such as RETRAUCI gives us the opportunity to apply classification model methodology to stratify the risk of mortality and, therefore, establish the severity of traumatic patients admitted to the ICU [27]. Critical trauma patients have different characteristics from those who do not require admission to the ICU. Therefore, specific models for these patients should be developed using databases such as RETRAUCI [2, 28].

Classifying these patients in groups of different severity can help to prioritize the allocation of healthcare resources for the most seriously ill patients. There are several studies on the application of MLT in biomedical problems and in other aspects of critically ill patients [29, 30]. The WEKA platform enables us to carry out multiple classification models using a single tool [31].

In our results, the different algorithms have found certain common factors to be the most important in determining the risk of mortality. The most influential factors are those derived from TBI, both measured by anatomical involvement and by physiological repercussion. These results have already been studied in other studies on the severity of critical trauma patients [32]. The presence of an organic failure has also been shown to influence mortality [33]. Age is also a particularly important factor in the evolution of these patients [34].

In general, with some differences, the algorithms used have achieved similar levels of performance. The models have failed to classify the group of deceased patients with moderate recall values. Although this result coincides with other studies with different groups of patients, it requires us to continue searching for more precise algorithms [8].

The algorithms used have specific characteristics based on the clinical interpretation of the groups of patients with different severities and on the relationships of the different variables studied, which must be considered.

The LR results identify the variables that are independently associated with higher mortality. They also indicate that in critically ill patients, those with only more severe chest or limb injuries are a group which requires intensive surveillance, but which has a lower mortality rate among those admitted to the ICU [35].

The CT model serves to establish a hierarchy of variables and, through decision rules, establish different groups of patients according to their mortality rate. There are two large groups of patients: those with TBI and those without. It is also interesting to observe the different cut-off points for age according to each decision rule. Other models have been built into classification trees for both traumatic patients and other critical pathologies, and these have also found an increased risk of mortality associated with TBI [19, 36].

The JRip model shows easily interpretable classification rules [32]. In our work, it identifies the groups of patients with the highest mortality rate. This set of classification rules should become an alert system that identifies those patients with the highest risk of mortality early. In these groups of patients, the most important factors are advanced age, the presence of TBI and organic failure.

The BN-based model shows the relationship between the different factors studied. The relationship between factors dependent on head trauma can be appreciated. For example, the relationship between thoracic injury, respiratory involvement and upper extremity injuries is observed. On the other hand, the relationship between the presence of lower limb injuries (including the pelvis), coagulopathy, hemodynamic alteration and massive bleeding is observable. The study of these relationships is capable of differentiating groups of patients with different profiles of anatomical involvement and physiological repercussions. Traumatic patients admitted to the ICU share a critical process, but they express different forms of involvement that can be grouped into different profiles with specific characteristics in their severity and treatment. As in other works, the BN algorithm obtained better precision values [37].

The NN works with all possible relationships between the analysed factors. This characteristic has resulted in NN models obtaining the best classification results in other databases [24, 38]. In our case, it did not manage to improve the performance. In addition, the great complexity of its structure turned the model into a black box that It is difficult to interpret due to the large number of parameters to evaluate.

The ensemble algorithms, although more complex in their methodology, have been shown to obtain greater performance in other works. In our results, however, they also did not achieve greater performance values [39].

The ideal mortality risk calculation model must take two aspects into account. On the one hand, it must have the highest possible performance that can be achieved using more complex techniques in its calculation and/or include more predictor variables and their relationships. Complex models usually use specific programs for their use and are difficult to interpret due to the large number of parameters to be evaluated (for example NN). And, on the other hand, simpler models that have a great facility of clinical interpretation (R-TS and based on classification rules). We believe that they are not two divergent aspects, the creation of complex models can help to achieve that interpretable models improve their performance, for example, identifying variables and their relationships to incorporate them into models of clinical use.


Our work has several limitations. Other variables concerning the type or mechanism of the trauma, analytical or evolutionary, could have been included. Our objective required working with variables of anatomical involvement and physiological repercussion. More types of classification algorithms could also have been used [40, 41].

Also consider that an interpretable model may not learn ground truth relationships. For example, a causal interpretation of the Bayesian network in Fig. 3 would indicate that a person’s sex is caused by their age.

The incorporation of new records, which is carried out continuously in the RETRAUCI database, will allow the validation of the algorithms created and will incorporate more patients in the less numerous groups, such as in the neck lesions. The WEKA platform is a dynamic project that continuously improves the learning methodology by incorporating new algorithms and further automating the construction process [25].


Machine learning techniques are useful for creating mortality classification models in critically traumatic patients. Even with some differences, the different algorithms achieved similar performance values. In addition, the algorithms that have a clinical interpretation help us to establish different patient profiles according to the relationship between the variables used and establish groups of patients with different evolutions, and some of the rules can even become alert systems to identify patients with the highest severity.

The models for classifying the severity of critically ill patients should have the common objective of determining the variables and their relationship in order to improve precision by establishing groups of patients with a greater probability of dying who could benefit from priority care that improves their survival.

Availability of data and materials

The datasets analysed during the current study are not publicly available due they are the property of the RETRAUCI project. Data are however available from the authors upon reasonable request and with permission of RETRAUCI project.



adaptive boosting


Abbreviated Injury Scale


Bootstrap aggregating


Bayesian networks


Classification trees


Fraction of Inspired Oxygen


Glasgow Coma Score


Intensive Care Unit


Classification rules


Logistic Regression Binary


Machine Learning Techniques


mechanical ventilation


neural network


Partial pressure of oxygen


National Trauma Registry in ICU


Respiratory rate


Random forest


Systolic blood pressure


Sequential minimal optimization


Traumatic brain injury


Triage-Revised Trauma Score


Waikato Environment for Knowledge Analysis


  1. de Munter L, Polinder S, Lansink KW, et al. Mortality prediction models in the general trauma population: a systematic review. Injury. 2017;48:221–9.

    Article  PubMed  Google Scholar 

  2. Lefering R, Huber-Wagner S, Bouillon B, et al. Cross-validation of two prognostic trauma scores in severely injured patients. Eur J Trauma Emerg Surg. 2020;

  3. Gennarelli TA, Wodzin E. AIS 2005: a contemporary injury scale. Injury. 2006;37:1083–91.

    Article  PubMed  Google Scholar 

  4. Champion HR, Sacco WJ, Copes WS, Gann DS, Gennarelli TA, Flanagan ME. A revision of the trauma score. J Trauma. 1989;29:623–9.

    Article  CAS  PubMed  Google Scholar 

  5. Chico-Fernández M, Sánchez-Casado M, Llompart-Pou JA. Trauma registry in Spain. Comment to "trauma systems around the world: a systematic overview". J Trauma Acute Care Surg. 2018;84(1):217–8.

    Article  PubMed  Google Scholar 

  6. Ma H, Xu CF, Shen Z, Yu CH, Li YM. Application of machine learning techniques for clinical predictive modeling: a cross-sectional study on nonalcoholic fatty liver disease in China. Biomed Res Int. 2018;2018:4304376.

    PubMed  PubMed Central  Google Scholar 

  7. Luo W, Phung D, Tran T, et al. Guidelines for developing and reporting machine learning predictive models in biomedical research: a multidisciplinary view. J Med Internet Res. 2016;18(12):e323.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Deist TM, Dankers FJWM, Valdes G, et al. Machine learning algorithms for outcome prediction in (chemo)radiotherapy: an empirical comparison of classifiers. Med Phys. 2018;45(7):3449–59.

    Article  PubMed  Google Scholar 

  9. Frank E, Hall M, Trigg L, Holmes G, Witten IH. Data mining in bioinformatics using Weka. Bioinformatics. 2004;20(15):2479–81.

    Article  CAS  PubMed  Google Scholar 

  10. Gómez D, Rojas A. An empirical overview of the no free lunch theorem and its effect on real-world machine learning classification. Neural Comput. 2016;28(1):216–28.

    Article  PubMed  Google Scholar 

  11. Pourhoseingholi MA, Kheirian S, Zali MR. Comparison of basic and ensemble data mining methods in predicting 5-year survival of colorectal Cancer patients. Acta Inform Med. 2017;25(4):254–8.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Zador Z, Sperrin M, King AT. Predictors of outcome in traumatic brain injury: new insight using receiver operating curve indices and Bayesian network analysis. PLoS One. 2016;11(7):e0158762.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Chico-Fernández M, Llompart-Pou JA, Guerrero-López F, et al. Epidemiology of severe trauma in Spain. Registry of trauma in the ICU (RETRAUCI). Pilot phase. Med Int. 2016;40(6):327–47.

    Google Scholar 

  14. Llau JV, Acosta FJ, Escolar G, et al. Multidisciplinary consensus document on the management of massive haemorrhage (HEMOMAS document). Med Intensiva. 2015;39(8):483–504 O.

    Article  CAS  PubMed  Google Scholar 

  15. Søvik S, Isachsen MS, Nordhuus KM, et al. Acute kidney injury in trauma patients admitted to the ICU: a systematic review and meta-analysis. Intensive Care Med. 2019;45(4):407–19.

    Article  PubMed  Google Scholar 

  16. Spahn DR, Bouillon B, Cerny V, et al. The European guideline on management of major bleeding and coagulopathy following trauma: fifth edition. Crit Care. 2019;23(1):98.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Gnanambal S, Thangaraj M, Meenatchi VT, Gayathri V. Classification algorithms with attribute selection: an evaluation study using WEKA. Int J Advanced Networking Applications. 2018;9(6):3640–4.

    Google Scholar 

  18. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The WEKA data mining software. ACM SIGKDD Explor Newslett. 2009;11:10.

    Article  Google Scholar 

  19. Trujillano J, Badia M, Serviá L, March J, Rodriguez-Pozo A. Stratification of the severity of critically ill patients with classification trees. BMC Med Res Methodol. 2009;9:83.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Rajput A, Prasad R, Dubey M, Saxena SP, Raghuvanshi M. J48 and JRIP rules for E-governance data. IJCSS. 2011;5(2):201–7.

    Google Scholar 

  21. Banu AB. Thirumalaikolundusubramanian P. comparison of Bayes classifiers for breast Cancer classification. Asian Pac J Cancer Prev. 2018;19(10):2917–20.

    Google Scholar 

  22. Friedman N, Geiger D, Goldszmidt M. Bayesian networks classifiers. Mach Learn. 1997;29:131–63.

    Article  Google Scholar 

  23. Zhang YH, Hu Y, Zhang Y, Hu LD, Kong X. Distinguishing three subtypes of hematopoietic cells based on gene expression profiles using a support vector machine. Biochim Biophys Acta Mol Basis Dis. 2018;1864(6 Pt B):2255–65.

    Article  CAS  PubMed  Google Scholar 

  24. Lee KH, Dong JJ, Jeong SJ, et al. Early detection of Bacteraemia using ten clinical variables with an artificial neural network approach. J Clin Med. 2019;8(10):1592.

    Article  PubMed Central  Google Scholar 

  25. Wang HL, Hsu WY, Lee MH, et al. Automatic machine-learning-based outcome prediction in patients with primary Intracerebral hemorrhage. Front Neurol. 2019;10:910.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Somnay YR, Craven M, McCoy KL, et al. Improving diagnostic recognition of primary hyperparathyroidism with machine learning. Surgery. 2017;161(4):1113–21.

    Article  PubMed  Google Scholar 

  27. Chico-Fernández M, Llompart-Pou JA, Sánchez-Casado M, et al. Mortality prediction using TRISS methodology in the Spanish ICU trauma registry (RETRAUCI). Med Int. 2016;40(7):395–402.

    Google Scholar 

  28. DiMaggio CJ, Avraham JB, Lee DC, Frangos SG, Wall SP. The epidemiology of emergency department trauma discharges in the United States. Acad Emerg Med. 2017;24(10):1244–56.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019;19(1):64.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Shillan D, Sterne JAC, Champneys A, Gibbison B. Use of machine learning to analyse routinely collected intensive care unit data: a systematic review. Crit Care. 2019;23(1):284.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Smith TC, Frank E. Introducing machine learning concepts with WEKA. Methods Mol Biol. 2016;1418:353–78.

    Article  PubMed  Google Scholar 

  32. Rau CS, Kuo PJ, Chien PC, Huang CY, Hsieh HY, Hsieh CH. Mortality prediction in patients with isolated moderate and severe traumatic brain injury using machine learning models. PLoS One. 2018;13(11):e0207192.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Fröhlich M, Lefering R, Probst C, et al. Epidemiology and risk factors of multiple-organ failure after multiple trauma: an analysis of 31,154 patients from the TraumaRegister DGU. J Trauma Acute Care Surg. 2014;76(4):921–8.

    Article  PubMed  Google Scholar 

  34. Llompart-Pou JA, Chico-Fernández M, Sánchez-Casado M, et al. Age-related injury patterns in Spanish trauma ICU patients. Results from the RETRAUCI. Injury. 2016;47(Suppl 3):S61–5.

    Article  PubMed  Google Scholar 

  35. Lin FC, Tsai SC, Li RY, Chen HC, Tung YW, Chou MC. Factors associated with intensive care unit admission in patients with traumatic thoracic injury. J Int Med Res. 2013;41(4):1310–7.

    Article  PubMed  Google Scholar 

  36. Serviá L, Badia M, Montserrat N, Trujillano J. Severity scores in trauma patients admitted to ICU. Physiological and anatomic models. Med Int. 2019;43(1):26–34.

    Google Scholar 

  37. Zampieri FG, Aguiar FJ, Bozza FA, Salluh JIF, Soares M, ORCHESTRA Study Investigators. Modulators of systemic inflammatory response syndrome presence in patients admitted to intensive care units with acute infection: a Bayesian network approach. Intensive Care Med. 2019;45(8):1156–8.

    Article  PubMed  Google Scholar 

  38. Gholipour C, Rahim F, Fakhree A, Ziapour B. Using an artificial neural networks (ANNs) model for prediction of intensive care unit (ICU) outcome and length of stay at Hospital in Traumatic Patients. J Clin Diagn Res. 2015;9(4):OC19–23.

    PubMed  PubMed Central  Google Scholar 

  39. Hosni M, Abnane I, Idri A. Carrillo de Gea JM, Fernández Alemán JL. Reviewing ensemble classification methods in breast cancer. Comput Methods Prog Biomed. 2019;177:89–112.

    Article  Google Scholar 

  40. Sandri M, Berchialla P, Baldi I, Gregori D, De Blasi RA. Dynamic Bayesian networks to predict sequences of organ failures in patients admitted to ICU. J Biomed Inform. 2014;48:106–13.

    Article  PubMed  Google Scholar 

  41. Sierra B, Serrano N, Larrañaga P, et al. Using Bayesian networks in the construction of a bi-level multi-classifier. A case study using intensive care unit patient’s data. Artif Intell Med. 2001;22(3):233–48.

    Article  CAS  PubMed  Google Scholar 

Download references


To all the professionals of the UCIs participating in the RETRAUCI project.


This work has not received any funding.

Author information

Authors and Affiliations



LS, NM, MB, and JT conceived and supervised the project. JLL, JA, MC, MS, JJ, and DM analyzed the data, interpreted the findings, and drafted the manuscript. JLL, JA, MC, and MS designed and collected data study. All authors edited the manuscript and approved the final version.

Corresponding author

Correspondence to Javier Trujillano.

Ethics declarations

Ethics approval and consent to participate

Ethics Committee approval for the registry was obtained (Hospital Universitario 12 de Octubre, Madrid: 12/209).

Consent for publication

Not applicable.

Competing interests

The authors report no conflicts of interests related to the work described.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Serviá, L., Montserrat, N., Badia, M. et al. Machine learning techniques for mortality prediction in critical traumatic patients: anatomic and physiologic variables from the RETRAUCI study. BMC Med Res Methodol 20, 262 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: