Stratification of the severity of critically ill patients with classification trees
© Trujillano et al; licensee BioMed Central Ltd. 2009
Received: 22 February 2009
Accepted: 9 December 2009
Published: 9 December 2009
Development of three classification trees (CT) based on the CART (Classification and Regression Trees), CHAID (Chi-Square Automatic Interaction Detection) and C4.5 methodologies for the calculation of probability of hospital mortality; the comparison of the results with the APACHE II, SAPS II and MPM II-24 scores, and with a model based on multiple logistic regression (LR).
Retrospective study of 2864 patients. Random partition (70:30) into a Development Set (DS) n = 1808 and Validation Set (VS) n = 808. Their properties of discrimination are compared with the ROC curve (AUC CI 95%), Percent of correct classification (PCC CI 95%); and the calibration with the Calibration Curve and the Standardized Mortality Ratio (SMR CI 95%).
CTs are produced with a different selection of variables and decision rules: CART (5 variables and 8 decision rules), CHAID (7 variables and 15 rules) and C4.5 (6 variables and 10 rules). The common variables were: inotropic therapy, Glasgow, age, (A-a)O2 gradient and antecedent of chronic illness. In VS: all the models achieved acceptable discrimination with AUC above 0.7. CT: CART (0.75(0.71-0.81)), CHAID (0.76(0.72-0.79)) and C4.5 (0.76(0.73-0.80)). PCC: CART (72(69-75)), CHAID (72(69-75)) and C4.5 (76(73-79)). Calibration (SMR) better in the CT: CART (1.04(0.95-1.31)), CHAID (1.06(0.97-1.15) and C4.5 (1.08(0.98-1.16)).
With different methodologies of CTs, trees are generated with different selection of variables and decision rules. The CTs are easy to interpret, and they stratify the risk of hospital mortality. The CTs should be taken into account for the classification of the prognosis of critically ill patients.
Stratifying the patients into risk groups, according to their severity, is essential for the comparison of treatments and the establishment of differences between different units or hospital centres. As a result, working in an intensive care unit (ICU) necessitates making prognoses for patients within the first 24 hours of their admission. Establishing a prognosis consists of assigning a probability of death by using variables commonly used for the diagnosis and treatment of critically ill patients .
Severity scores are classic tools used in establishing this probability. The most commonly used scores are the APACHE II (Acute Physiology and Chronic Health Evaluation II), the SAPS II (Simplified Acute Physiology Score II) and the MPM II-24 (Mortality Probability Models II-24) scores [2–4].
Other systems of severity classification based on different mathematical strategies have also been used .
In the last decade, classification trees (CT), which were developed more than 20 years ago, have acquired greater importance in the immediate interpretation of the decision rules that they generate, and they are readily accepted by professionals in clinical practice .
A CT is a graphic representation of a series of decision rules. Beginning with a root node that includes all cases, the tree branches are divided into different child nodes that contain a subgroup of cases. The criterion for branching (or partitioning) is selected after examining all possible values of all available predictive variables. In the terminal nodes (the "leaves" of the tree), a grouping of cases is obtained, such that the cases are as homogeneous as possible with respect to the value of the dependent variable .
Characteristics of the classification tree methods
Classification and Regression Tree
Chi-Square Automatic Interaction Detection
Concept Learning Systems
Breiman et al. (1984)
Many disciplines with little data
Gini reduction or twoing
Best binary split
Number of values of the input
Best binary split
The aim of the present study was to develop (with a population of critically ill patients) three classification trees (based on CART, CHAID and C4.5 methodologies) to calculate the probability of hospital mortality and to compare these trees with each other, with the classic scores (APACHE II, SAPS II and MPM II-24) and with a model based on multiple logistic regression.
This is a retrospective study carried out using the database of a mixed ICU (with medical and surgical services) of 14 beds located at the University Hospital Arnau de Vilanova of Lleida. The ethical committee of the hospital was informed that the study was being carried out, and informed consent was not deemed necessary, since all the variables were collected for the diagnosis and treatment of the patients and their anonymity was assured at all times.
Data collected over ten years (from January 1997 to December 2006) were used. In this study, all patients were over the age of 16 years and remained in the ICU for more than 24 hours. Patient records with incomplete data were not used.
A random partition, in a 70:30 ratio, was made to establish the development and the validation sets, respectively.
Data concerning age, sex, length of stay in the ICU and procedures specific to the ICU were used. The outcome variable of interest was the probability of hospital mortality. The patients were divided according to their diagnostic groups following the Knaus classification . Six diagnostic groups were established according to the case mix and level of severity of the ICU, including two trauma categories of TBI (traumatic brain injury) and Multiple trauma (multiple trauma without brain injury), Respiratory (chronic respiratory problems with decompensation), Neurological (ischemic or hemorrhagic strokes), Surgery (surgical problems not included in other categories) and Medicine (medical pathology not included in other categories).
Each patient's medical records and laboratory database files were used to obtain information pertaining to baseline (at ICU admission) demographics, pre-existing comorbidities and scores (APACHE II, SAPS II and MPM II-24). The data were then compiled (manual recording) into single data using a relational database management system (Microsoft Access©).
The presence of acute renal failure was defined (according to the model MPM II-24) by levels of serum creatinin above 2 mg/dL . The antecedents of chronic organ insufficiency (defined according to the APACHE II model) were included in the variable COI .
Logistic models and classification trees
Models were created with the development set and were subsequently checked in the validation set.
Working with the development set, first, a univariate analysis was performed for all the variables included in the three scores to select those that predicted survival. Those that were statistically significant predictors were included in the development of the multivariable models.
We used a model of multiple logistic regression (LR) with forward stepwise selection of variables .
The computer programs used for creating the CTs are presented in Table 1. The program WEKA (a project of Waikato University) is freely accessible and includes a CT module, named J48, that includes CART and C4.5 .
Answer-Tree©, a module of SPSS (Statistical Package for the Social Sciences), includes options for CART and CHAID, and the program DTREG© (version 3.5) is based on a CART-type methodology.
To create the three types of CTs, a cross-validation system with ten partitions was used, and the only common restriction for terminating the growth of the tree was the minimum number of subjects in the terminal nodes (which was fixed at 50 patients).
The variables are presented as the mean (standard deviation), the median (interquartile interval) or as a percentage. For a comparison of the variables, the chi-squared (χ2) test was used for categorical variables, and the ANOVA test or non-parametric Mann-Whitney test was used for continuous variables, depending on the characteristics of the distribution.
To compare the different models, we measured their precision (discrimination and calibration) with the Brier score. The discrimination was measured by calculating the percentage of correctly classified patients (PCC) with a cut-off point with a probability of 0.5 and by the area below the ROC curve (AUC) . For calibration, the Hosmer-Lemeshow C test (HL-C) was used  by constructing the calibration curve and calculating the standardized mortality ratio (SMR) . These calculations were made both in the development set and in the validation set. We used a correlation matrix (Spearman correlation coefficients) and the Bland-Altman test to analyse the individual probabilities generated by the CT models .
The statistical analysis was carried out with the program SPSS (version 14.0).
Among 2823 patients, 139 were excluded due to incomplete or erroneous data (4.9%), leaving 2684 eligible patients. The development group consisted of 1880 patients (70%) and the validation group consisted of 804 (30%).
Demographic characteristics of patients
(n = 2684)
(n = 1880)
(n = 804)
Sex, male (%)
O Medicine (%)
Inotropic therapy (%)
Acute renal failure (%)
Outcome trend over the observation period
APACHE II a
SAPS II a
MPM II-24 a
Variable selection: univariate analysis
Univariate analyses of characteristics of patients at discharge, by survival status.
(n = 1302)
(n = 578)
Inotropic therapy (%)
Intracranial mass (%)
(A-a)O2 gradient (mmHg)
Urine output (cc/24 h)
Acute renal failure (%)
Urine output < 150 cc/8 h (%)
Only the COI variable reflected the chronic illnesses of the patient. For variables related to diagnoses, the surgery group was associated with a greater possibility of hospital mortality, while the trauma group was associated with a lower likelihood of mortality.
Multiple Logistic Regression Model
Results of multiple logistic regression
1.033 - 1.050
1.005 - 1.013
1.585 - 2.714
0.805 - 0.867
1.245 - 2.201
1.002 - 1.003
Acute renal failure
1.180 - 2.123
2.054 - 3.788
0.511 - 0.957
Classification Tree Models
The variables common to the three CTs and the LR model are inotropic therapy (INOT), Glasgow value, (A-a)O2 gradient ((A-a)O2), age and COI.
It is noted that a CT can use the same variables in various decision rules and that, for continuous variables, different cut-off points can be selected.
Comparison of model properties
The three CT models and the LR model were also compared with those generated using the APACHE II, SAPS II and MPM II-24 scores.
The severity scores were applied without making recalibration in all the population (development and validation sets).
Performance of the classification models: development and validation sets
DEVELOPMENT (n = 1880)
AUC (CI 95%)
PPV (CI 95%)
PCC (CI 95%)
SMR (CI 95%)
0.81 (0.79 - 0.83)
0.72 (0.66 - 0.78)
0.75 (0.73 - 0.77)
1.30 (1.23 - 1.37)
0.82 (0.80 - 0.84)
0.74 (0.68 - 0.79)
0.74 (0.68 - 0.75)
1.31 (1.24 - 1.38)
MPM II 24
0.81 (0.79 - 0.83)
0.75 (0.70 - 0.80)
0.77 (0.75 - 0.79)
1.29 (1.22 - 1.36)
0.83 (0.81 - 0.85)
0.75 (0.70 - 0.80)
0.77 (0.76 - 0.79)
1.00 (0.92 - 1.10)
0.78 (0.76 - 0.80)
0.67 (0.61 - 0.72)
0.75 (0.73 - 0.77)
1.00 (0.94 - 1.06)
0.80 (0.78 - 0.82)
0.68 (0.63 - 0.73)
0.75 (0.73 - 0.77)
1.00 (0.93 - 1.08)
0.80 (0.78 - 0.82)
0.69 (0.65 - 0.74)
0.78 (0.76 - 0.80)
1.00 (0.94 - 1.06)
VALIDATION (n = 804)
0.77 (0.74 - 0.81)
0.69 (0.60 - 0.70)
0.73 (0.70 - 0.76)
1.36 (1.26 - 1.47)
0.79 (0.76 - 0.83)
0.71 (0.63 - 0.78)
0.74 (0.71 - 0.77)
1.39 (1.28 - 1.49)
MPM II 24
0.79 (0.75 - 0.82)
0.71 (0.63 - 0.78)
0.74 (0.71 - 0.77)
1.36 (1.25 - 1.46)
0.81 (0.78 - 0.84)
0.73 (0.66 - 0.81)
0.75 (0.73 - 0.78)
1.22 (1.16 - 1.29)
0.75 (0.71 - 0.81)
0.64 (0.57 - 0.72)
0.72 (0.69 - 0.75)
1.04 (0.95 - 1.31)
0.76 (0.72 - 0.79)
0.64 (0.56 - 0.72)
0.72 (0.69 - 0.75)
1.06 (0.97 - 1.15)
0.76 (0.73 - 0.80)
0.70 (0.63 - 0.76)
0.76 (0.73 - 0.79)
1.08 (0.98 - 1.16)
All the models correctly classified approximately 75% of the cases evaluated.
Comparison of individual probabilities generated by the CT models
Correlation matrix of the probabilities (CTs and LR models)
DEVELOPMENT SET (n = 1880)
VALIDATION SET (n = 804)
We observed that there were patients for whom the difference in the probabilities exceeded the acceptable limit of the test. There were 116 patients included in the comparison of the CART and CHAID CTs, and 245 in the comparison of the CART and C4.5 CTs. The differences can be partly attributed to the behaviour of the Glasgow variable (different cut-off points or partitions) and to the influence of the COI variable in the different divisions of the tree branches.
The different models generate, in some patients, different allocation of death provability. When performing a validation with records not used in the phase of development, the different allocation of probability determines in our case a conservation of a similar discrimination but that the calibration is different (being better for the AC).
The results illustrate that the ICU had particular demographic characteristics due to its case mix, with a low percentage of scheduled patients, a long length of stay and high mortality. These data are important when it comes to appraising and generalising the results obtained with our database .
The results yielded mortality rates that were higher than expected (according to the classic APACHE II, SAPS II and MPM II-24 scores), which can be partly attributed to these individual characteristics . However, this finding also necessitates a recalibration of these models in order to achieve a correct stratification of the patients' risk of hospital mortality .
Previously, CTs have been used with critical patients, e.g., for the calculation of the probability of death from coronary pathology , intracerebral haemorrhages  or traumatic brain injuries , for the prediction of persistent vegetative states  or (as in our study) for stratifying the probability of death in a general population of ICU patients [23, 24].
The common variables selected by the three CT types (and also by the model based on LR) were: the necessity of inotropic therapy, the point value on the Glasgow coma scale, the alveolar-arterial gradient in oxygen, age and the presence of antecedents of important chronic diseases. This group of variables included information concerning chronic health and age (which were variables specific to the patient), the point value on the Glasgow coma scale and the (A-a)O2 gradient as deviations from the normal state as well as a variable specific to the intensive treatment (INOT). The selection of some of these variables has also been reported in other studies of mortality in other groups of critical patients .
These five variables are capable of stratifying the examined population of critical patients (for example, as in the CART CT), using eight simple decision rules, with acceptable properties of discrimination and calibration.
We also observed that the three CT types exhibited differences. Even when incorporating the five common variables mentioned earlier, these CTs differed in the first variable to be selected, in the details of "branching", in the cut-off points (and subgroups), in the order of variable selection and in the incorporation of other variables.
We have already seen that the CART CT includes the five general variables. The C4.5 CT adds the MAP (Mean Arterial Pressure) variable and the CHAID CT includes MV (Mechanical Ventilation) variables and the fact of belonging to the trauma group. The LR model uses those of the CHAID CT model, also including the Acute Renal Failure and HR (Heart Rate) variables.
The CT software allows to adjust the levels and the number of partitions for each branch in order to get more complex models . In our case, our only restriction (in the 3 CT models) was that the minimum number of subjects in the terminal nodes should be of 50 patients.
We cannot state which CT was optimal (since they had similar general properties). The CART and CHAID CTs were similar in their order of partitioning, although the CHAID CT (due to its inherent characteristics) separated the continuous variables into more than two possibilities and generated more decision rules. The CART CT was simpler, while the CHAID CT showed greater complexity (and also selected more variables). Different CTs can select different first variables, and in the C4.5 CT, the first variable, the Glasgow point value, was different from that of the other CTs; the C4.5 CT also incorporated different variables. The analysis of the individual probabilities generated by the different CTs (in spite of a good correlation) assisted in the identification of possible "problem" variables, e.g. the Glasgow point value and the COI variable, in their order of appearance in the decision rules generated.
When there is a classification problem, there is no model that can be chosen a priori to be the best . Even with the same information, different CTs develop models with different interpretations . Based on our data, the CTs do not compete with the classic scores in their function of calculating individual probabilities. In the case of a large database, the CTs generated would be too complex to interpret and use with regularity (many branches and decision rules). The immediate interpretative advantage of CTs is only obtained with simple trees [31, 32].
Our study had several limitations. In the first place, it was carried out in only one ICU and within a ten-year span database (although no variation was observed during the period of study). It would also have been possible to employ more methodologies for comparison or to improve those that were used, by incorporating relations and/or ranks of a priori variables, as do the classic scores.
As exposed by one of the reviewers, we found a great difference between the observed and expected mortality in the validation group in the LR model. The LR-based model could have been carried out using the variables as categorical, thus minimizing the possible effect that outlier values (using the variables as continuous) have on the predicted outcome. One of the advantages of CT-based models is that they automatically change the continuous variables into categorical ones and that their cut-off points could also be used to create a LR model with discreet variables
We must mention the effort at Waikato University (New Zealand) regarding the free-access program WEKA, which strives to collect (in a single tool) the majority of the methodologies that are used to classify, select and group variables .
The principal advantage of CTs is that they are easy to interpret. However, this advantage could turn into an obstacle, since we tend to choose the optimal CT as that which more closely approaches the clinical reasoning that coincides with that of the program user . An understanding of the clinical problem is necessary in order to adequately interpret CTs.
One contribution of our effort was the demonstration that the CT methodology is not unique and that different CTs could be generated according to various methodologies. The CTs assisted in both selecting variables of greater importance in the problem of classification and determining the best cut-off points for the continual variables.
We believe that CTs (e.g., the model based on CART) are mainly useful in obtaining homogenous groups for the assignation of the probability of hospital mortality. These groups with different characteristics (defined by rules of classification that can be interpreted) can serve, for example, as a basis for the creation of new scores.
We intend to do further research including a multi-centre study, with the incorporation of more methodologies and the possible use of hybrid models. In order to generalise our results, external validation will be required .
The main benefits to CT analysis are to identify a relatively small number of groups that are reasonably homogeneous with regard to the outcome. The CTs can be used in intensive care medicine for assisting in diagnosis and prognosis [37, 38]. Those less familiar with CTs should realise that this us a class of methods including many different approaches, and that these different approaches may result in considerable differences in classifications.
- Lemeshow S, Le Gall JR: Modeling the severity of illness of ICU patients. JAMA. 1994, 272: 1049-1055. 10.1001/jama.272.13.1049.View ArticlePubMedGoogle Scholar
- Knaus WA, Draper EA, Wagner DP, Zimmerman JE: APACHE II: A severity of disease classification system. Crit Care Med. 1985, 13: 818-829. 10.1097/00003246-198510000-00009.View ArticlePubMedGoogle Scholar
- Le Gall JR, Lemeshow S, Saulnier F: A new simplified acute physiology score (SAPS II) based on a European/North American multicenter study. JAMA. 1993, 270: 2957-63. 10.1001/jama.270.24.2957.View ArticlePubMedGoogle Scholar
- Lemeshow S, Teres D, Klar J, Avrunin JS, Gehlbach SH, Rapoport J: Mortality probability models (MPM II) based on an international cohort of intensive care unit patients. JAMA. 1993, 270: 2478-86. 10.1001/jama.270.20.2478.View ArticlePubMedGoogle Scholar
- Tom E, Schulman KA: Mathematical models in decision analysis. Infect Control Hosp Epidemiol. 1997, 18: 65-73. 10.1086/647503.View ArticlePubMedGoogle Scholar
- Harper PR: A review and comparison of classification algorithms for medical decision making. Health Policy. 2005, 71: 315-31. 10.1016/j.healthpol.2004.05.002.View ArticlePubMedGoogle Scholar
- Trujillano J, Sarria-Santamera A, Esquerda A, Badia M, Palma M, March J: Aproximación a la metodología basada en árboles de decisión (CART). Mortalidad hospitalaria del infarto agudo de miocardio. Gac Sanit. 2008, 22: 65-72. 10.1157/13115113.View ArticlePubMedGoogle Scholar
- Crichton NJ, Hinde JP, Marchini J: Models for diagnosing chest pain: is CART helpful?. Stat Med. 1997, 16: 717-27. 10.1002/(SICI)1097-0258(19970415)16:7<717::AID-SIM504>3.0.CO;2-E.View ArticlePubMedGoogle Scholar
- Knaus WA, Wagner DP, Draper EA, Zimmerman JE, Bergner M, Bastos PG, et al: The APACHE III prognostic system. Risk prediction of hospital mortality for critically ill hospitalized adults. Chest. 1991, 100: 1619-1636. 10.1378/chest.100.6.1619.View ArticlePubMedGoogle Scholar
- Hosmer DW, Lemeshow S: Applied logistic regression. 2000, John Wiley & Sons New York, full_text. 2View ArticleGoogle Scholar
- Ian H: Witten and Eibe Frank. Data Mining: Practical machine learning tools and techniques. 2005, Morgan Kaufmann, San Francisco, 2Google Scholar
- Hanley JA, McNeil BJ: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982, 143: 29-36.View ArticlePubMedGoogle Scholar
- Lemeshow S, Hosmer DW: A review of goodness of fit statistics for use in the development of logistic regression models. Am J Epidemiol. 1982, 115: 92-106.PubMedGoogle Scholar
- Rapoport J, Teres D, Lemeshow S, Gehlbach S: A method for assessing the clinical performance and cost-effectiveness of intensive care units: A multicenter inception cohort study. Crit Care Med. 1994, 22: 1385-1391. 10.1097/00003246-199409000-00006.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986, 1: 307-10.View ArticlePubMedGoogle Scholar
- Trujillano J, March J, Badia M, Rodriguez A, Sorribas A: Aplicación de las Redes Neuronales Artificiales para la estratificación de riesgo de mortalidad hospitalaria. Gac Sanit. 2003, 17: 504-11. 10.1157/13055392.View ArticlePubMedGoogle Scholar
- Zimmerman JE, Wagner DP: Prognostic systems in intensive care: How do you interpret an observed mortality that is higher than expected?. Crit Care Med. 2000, 28: 258-260. 10.1097/00003246-200001000-00048.View ArticlePubMedGoogle Scholar
- Zhu BP, Lemeshow S, Hosmer DW, Klar J, Avrunin JS, Teres D: Factors affecting the performance of the models in the Mortality Probability Model II system and strategies of customization: A simulation study. Crit Care Med. 1996, 24: 57-63. 10.1097/00003246-199601000-00011.View ArticlePubMedGoogle Scholar
- Austin PC: A comparison of regression trees, logistic regression, generalized additive models, and multivariate adaptive regression splines for predicting AMI mortality. Stat Med. 2007, 26: 2937-57. 10.1002/sim.2770.View ArticlePubMedGoogle Scholar
- Takahashi O, Cook EF, Nakamura T, Saito J, Ikawa F, Fukui T: Risk stratification for in-hospital mortality in spontaneous intracerebral haemorrhage: a Classification and Regression Tree analysis. QJM. 2006, 99: 743-50. 10.1093/qjmed/hcl107.View ArticlePubMedGoogle Scholar
- Rovlias A, Kotsou S: Classification and Regression tree for prediction of outcome after severe head injury using simple clinical and laboratory variables. J Neurotrauma. 2004, 21: 886-893. 10.1089/0897715041526249.View ArticlePubMedGoogle Scholar
- Dolce G, Quinteri M, Serra S, Lagani V, Pignolo L: Clinical signs and early prognosis in vegetative state: a decisional tree, data-minig study. Brain Inj. 2008, 22: 617-23. 10.1080/02699050802132503.View ArticlePubMedGoogle Scholar
- Abu-Hanna A, de Keizer N: Integrating classification trees with local logistic regression in Intensive Care prognosis. Artif Intell Med. 2003, 29: 5-23. 10.1016/S0933-3657(03)00047-2.View ArticlePubMedGoogle Scholar
- Gortzis LG, Sakellaropoulos F, Ilias I, Stamoulis K, Dimopoulou I: Predicting ICU survival: a meta-level approach. BMC Health Serv Res. 2008, 26: 8-157.Google Scholar
- de Rooij SE, Abu-Hanna A, Levi M, de Jonge E: Identification of high-risk subgroups in very elderly intensive care unit patients. Crit Care. 2007, 11: R33-10.1186/cc5716.View ArticlePubMedPubMed CentralGoogle Scholar
- Gerald LB, Tang S, Bruce F, Redden D, Kimerling ME, Brook N, Dunlap N, Bailey WC: A decision tree for tuberculosis contact investigation. Am J Respir Crit Care Med. 2002, 166: 1122-7. 10.1164/rccm.200202-124OC.View ArticlePubMedGoogle Scholar
- Costanza MC, Paccaud F: Binary classification of dyslipidemia from the waist-to-hip ratio and body mass index: a comparison of linear, logistic, and CART models. BMC Med Res Methodol. 2004, 4: 7-17. 10.1186/1471-2288-4-7.View ArticlePubMedPubMed CentralGoogle Scholar
- Muller R, Möckel M: Logistic regression and CART in the analysis of multimarker studies. Clin Chim Acta. 2008, 394: 1-6. 10.1016/j.cca.2008.04.007.View ArticlePubMedGoogle Scholar
- Magdon-Ismail M: No free lunch for noise prediction. Neural Comput. 2000, 12: 547-64. 10.1162/089976600300015709.View ArticlePubMedGoogle Scholar
- Wolfe R, McKenzie DP, Black J, Simpson P, Gabbe BJ, Cameron PA: Models developed by three techniques did not achieve acceptable prediction of binary trauma outcomes. J Clin Epidemiol. 2006, 59: 26-35. 10.1016/j.jclinepi.2005.05.007.View ArticlePubMedGoogle Scholar
- Peters RP, Twisk JW, van Agtmael MA, Groeneveld AB: The role of procalcitonin in a decision tree for prediction of bloodstream infection in febrile patients. Clin Microbiol Infect. 2006, 12: 1207-13. 10.1111/j.1469-0691.2006.01556.x.View ArticlePubMedGoogle Scholar
- Mann JJ, Ellis SP, Waternaux CM, Liu X, Oquendo MA, Malone KM, Brodsky BS, Haas GL, Currier D: Classification trees distinguish suicide attempters in major psychiatric disorders: a model of clinical decision making. J Clin Psychiatry. 2008, 69: 23-31. 10.4088/JCP.v69n0104.View ArticlePubMedPubMed CentralGoogle Scholar
- Pang BC, Kuralmani V, Joshi R, Hongli Y, Lee KK, Ang BT, Li J, Leong TY, Ng I: Hybrid outcome prediction model for severe traumatic brain injury. J Neurotrauma. 2007, 24: 136-46. 10.1089/neu.2006.0113.View ArticlePubMedGoogle Scholar
- Gaudart J, Poudiogou B, Ranque S, Doumbo O: Oblique decision trees for spatial pattern detection: optimal algorithm and application to malaria risk. BMC Med Res Methodol. 2005, 5: 22-10.1186/1471-2288-5-22.View ArticlePubMedPubMed CentralGoogle Scholar
- Podgorelec V, Kokol P, Stiglic B, Rozman I: Decision trees: an overview and their use in medicine. J Med Syst. 2002, 26: 445-63. 10.1023/A:1016409317640.View ArticlePubMedGoogle Scholar
- van Dijk MR, Steyerberg EW, Stenning SP, Habbema JD: Identifying subgroups among poor prognosis patients with nonseminomatous germ cell cancer by tree modelling: a validation study. Ann Oncol. 2004, 15: 1400-5. 10.1093/annonc/mdh350.View ArticlePubMedGoogle Scholar
- Webster AP, Goodacre S, Walker D, Burke D: How do clinical features help identify paediatric patients with fractures following blunt wrist trauma?. Emerg Med J. 2006, 23: 354-7. 10.1136/emj.2005.029249.View ArticlePubMedPubMed CentralGoogle Scholar
- Köhne CH, Cunningham D, Di CF, Glimelius B, Blijham G, Aranda E, Scheithauer W, Rougier P, Palmer M, Wils J, Baron B, Pignatti F, Schöffski P, Micheel S, Hecker H: Clinical determinants of survival in patients with 5-fluorouracil-based treatment for metastatic colorectal cancer: results of a multivariate analysis of 3825 patients. Ann Oncol. 2002, 13: 308-17. 10.1093/annonc/mdf034.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/9/83/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.