Skip to main content

Table 3 Original data model AUROCs (with 95% confidence intervals)

From: Impact of random oversampling and random undersampling on the performance of prediction models developed using observational health data

Outcome of interest

Classifier

CCAE

MDCD

MDCR

IQVIA Germany

Acute myocardial infarction

Lasso

 

0.86 (0.82–0.89)

0.71 (0.69–0.73)

 

Random forest

 

0.87 (0.84–0.90)

0.69 (0.66–0.71)

 

XGBoost

 

0.87 (0.85–0.90)

0.71 (0.69–0.73)

 

Alopecia

Lasso

0.61 (0.57–0.66)

0.69 (0.65–0.73)

0.69 (0.65–0.73)

 

Random forest

0.58 (0.53–0.63)

0.65 (0.61–0.70)

0.68 (0.64–0.72)

 

XGBoost

0.64 (0.59–0.68)

0.68 (0.64–0.73)

0.68 (0.64–0.72)

 

Constipation

Lasso

0.67 (0.64–0.69)

0.65 (0.63–0.66)

0.66 (0.65–0.68)

0.80 (0.78–0.83)

Random forest

0.66 (0.64–0.69)

0.64 (0.62–0.66)

0.64 (0.63–0.66)

0.81 (0.79–0.83)

XGBoost

0.67 (0.65–0.69)

0.65 (0.63–0.66)

0.66 (0.65–0.68)

0.80 (0.77–0.83)

Delirium

Lasso

 

0.79 (0.75–0.84)

0.75 (0.72–0.78)

 

Random forest

 

0.80 (0.76–0.84)

0.73 (0.70–0.76)

 

XGBoost

 

0.80 (0.75–0.84)

0.74 (0.71–0.77)

 

Diarrhea

Lasso

0.65 (0.63–0.67)

0.67 (0.66–0.69)

0.64 (0.62–0.65)

 

Random forest

0.64 (0.62–0.66)

0.67 (0.65–0.69)

0.62 (0.61–0.64)

 

XGBoost

0.63 (0.61–0.66)

0.67 (0.66–0.69)

0.63 (0.61–0.65)

 

Fracture

Lasso

0.61 (0.56–0.66)

0.70 (0.67–0.74)

0.67 (0.65–0.70)

0.82 (0.78–0.86)

Random forest

0.61 (0.56–0.65)

0.66 (0.63–0.70)

0.65 (0.63–0.67)

0.80 (0.77–0.84)

XGBoost

0.62 (0.57–0.67)

0.69 (0.65–0.72)

0.67 (0.65–0.69)

0.82 (0.79–0.86)

Gastrointestinal hemorrhage

Lasso

0.73 (0.67–0.78)

0.74 (0.71–0.77)

0.73 (0.71–0.76)

 

Random forest

0.72 (0.67–0.77)

0.75 (0.72–0.78)

0.72 (0.70–0.74)

 

XGBoost

0.70 (0.65–0.75)

0.74 (0.71–0.77)

0.72 (0.70–0.75)

 

Hyponatremia

Lasso

0.74 (0.69–0.78)

0.84 (0.81–0.86)

0.66 (0.64–0.68)

 

Random forest

0.73 (0.68–0.77)

0.83 (0.80–0.85)

0.64 (0.62–0.66)

 

XGBoost

0.74 (0.70–0.78)

0.84 (0.81–0.86)

0.66 (0.64–0.68)

 

Hypotension

Lasso

0.74 (0.70–0.78)

0.75 (0.73–0.77)

0.72 (0.71–0.74)

0.71 (0.66–0.75)

Random forest

0.74 (0.70–0.78)

0.74 (0.72–0.77)

0.71 (0.70–0.73)

0.71 (0.67–0.75)

XGBoost

0.74 (0.71–0.78)

0.75 (0.73–0.78)

0.72 (0.70–0.74)

0.71 (0.67–0.75)

Hypothyroidism

Lasso

0.80 (0.78–0.83)

0.76 (0.72–0.79)

0.83 (0.81–0.85)

0.86 (0.82–0.89)

Random forest

0.79 (0.76–0.82)

0.74 (0.71–0.78)

0.82 (0.80–0.84)

0.87 (0.84–0.90)

XGBoost

0.80 (0.77–0.83)

0.75 (0.72–0.78)

0.83 (0.81–0.85)

0.86 (0.82–0.89)

Insomnia

Lasso

0.64 (0.62–0.66)

0.61 (0.60–0.63)

0.67 (0.65–0.69)

0.60 (0.57–0.63)

Random forest

0.62 (0.61–0.64)

0.60 (0.58–0.61)

0.66 (0.64–0.67)

0.58 (0.55–0.60)

XGBoost

0.64 (0.62–0.66)

0.61 (0.60–0.63)

0.67 (0.65–0.69)

0.59 (0.56–0.62)

Ischemic stroke inpatient

Lasso

  

0.79 (0.76–0.82)

 

Random forest

  

0.76 (0.73–0.79)

 

XGBoost

  

0.78 (0.75–0.81)

 

Nausea

Lasso

0.67 (0.66–0.69)

0.66 (0.65–0.68)

0.66 (0.64–0.68)

0.75 (0.73–0.77)

Random forest

0.65 (0.64–0.67)

0.65 (0.64–0.66)

0.64 (0.63–0.66)

0.75 (0.72–0.77)

XGBoost

0.66 (0.65–0.68)

0.66 (0.65–0.67)

0.66 (0.64–0.68)

0.75 (0.73–0.77)

Open-angle glaucoma

Lasso

  

0.76 (0.71–0.82)

 

Random forest

  

0.77 (0.72–0.82)

 

XGBoost

  

0.79 (0.75–0.84)

 

Seizure

Lasso

0.75 (0.70–0.79)

0.74 (0.71–0.77)

0.74 (0.70–0.77)

 

Random forest

0.73 (0.69–0.78)

0.71 (0.68–0.74)

0.73 (0.70–0.77)

 

XGBoost

0.72 (0.67–0.76)

0.73 (0.70–0.76)

0.73 (0.69–0.76)

 

Suicide and ideation

Lasso

0.79 (0.77–0.81)

0.76 (0.74–0.77)

0.73 (0.69–0.77)

 

Random forest

0.75 (0.73–0.77)

0.72 (0.71–0.74)

0.64 (0.59–0.68)

 

XGBoost

0.79 (0.77–0.81)

0.75 (0.74–0.77)

0.71 (0.67–0.75)

 

Tinnitus

Lasso

0.66 (0.62–0.70)

0.69 (0.64–0.74)

0.60 (0.56–0.63)

0.60 (0.56–0.65)

Random forest

0.64 (0.60–0.68)

0.71 (0.67–0.76)

0.58 (0.55–0.62)

0.62 (0.58–0.66)

XGBoost

0.66 (0.62–0.70)

0.69 (0.65–0.74)

0.59 (0.55–0.62)

0.60 (0.55–0.65)

Ventricular arrhythmia and sudden cardiac death inpatient

Lasso

 

0.83 (0.79–0.87)

0.77 (0.74–0.79)

 

Random forest

 

0.84 (0.81–0.87)

0.76 (0.73–0.79)

 

XGBoost

 

0.83 (0.79–0.87)

0.77 (0.74–0.80)

 

Vertigo

Lasso

0.65 (0.61–0.70)

0.72 (0.67–0.76)

0.62 (0.59–0.65)

0.63 (0.57–0.68)

Random forest

0.63 (0.58–0.68)

0.70 (0.66–0.74)

0.59 (0.55–0.62)

0.65 (0.60–0.70)

XGBoost

0.63 (0.58–0.67)

0.71 (0.66–0.75)

0.60 (0.57–0.64)

0.63 (0.59–0.68)

  1. Each column represents a database