Skip to main content

Table 2 Descriptive statistics

From: A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance

Descriptive Statistics

Number of Words

Number of Characters

Before Cleaning

After Cleaning

Before Cleaning

After Cleaning

Minimum

146

50

903

410

25% Percentile

819

391

4798

3089

Median

1084

517

6391

4098

Mean

1170

557

6870

4429

75% Percentile

1425

687

8404

5420

Maximum

4280

2098

25,842

16,976

Standard Deviation

506

242

2960

1931