MIMIC-I (or MIMIC) [93] | 100 | Medical signals and sequential EHR | Patient monitor data, patient-descriptive data (gender, age, record duration), symptoms, fluid balance, diagnoses, progress notes, medications, and laboratory results | Potential hemodynamically unstable |
MIMIC-II [121] | 33,000 | Medical signals and sequential EHR | Patient monitor data, patient-descriptive data (demographics, admissions, transfers, discharge times, dates of death), diagnoses, notes, reports, procedure data, medications, fluid balances, and laboratory test data | diseases of the circulatory system; trauma; diseases of the digestive system; pulmonary diseases; infectious diseases; and neoplasms |
MIMIC-III [65] | 46,520 | Medical signals and sequential EHR | Patient monitor data, patient-descriptive data, diagnoses, reports, notes, interventions, medications, and laboratory tests data. | Diseases of the circulatory system, pulmonary diseases, infectious and parasitic diseases, diseases of the digestive system, diseases of the genitourinary system, neoplasms, diseases of the genitourinary system, and trauma |
MIMIC-IV [64] | 383,220 | Medical signals and sequential EHR | Hosp module contains patient-descriptive data, basic health data (blood pressure, height, weight...), medication, procedure data, and diagnoses. Icu module contains timing information data, patient monitor data, fluid balance, and procedure data. | Diseases of the circulatory system, pulmonary diseases, infectious and parasitic diseases, diseases of the digestive system, diseases of the genitourinary system, neoplasms, diseases of the genitourinary system, and trauma |
eICU-CRD [114] | 139,367 | Sequential EHR | Vital signs, laboratory measurements, medications, APACHE components, care plan information, admission diagnosis, patient history, and time-stamped diagnoses. | pulmonary sepsis, acute myocardial infarction, cerebrovascular accident, congestive heart failure, renal sepsis, diabetic ketoacidosis, coronary artery bypass graft, atrial rhythm disturbance, cardiac arrest, and emphysema |
Amsterdam UMCdb [138] | 20,109 | Medical signals and sequential EHR | Patient monitor and life support device data, laboratory measurements, clinical observation and scores, medical procedures and tasks, medication, fluid balance, diagnosis groups and clinical patient outcomes | Not specified |
UT Physicians clinical database [141] | 5,501,776 | Sequential EHR | Demographic data, vital signs, immunization data (body site, dose), laboratory data, transaction data (evaluation and management, radiology, medicine, surgery, anethesia), appointment data, medications, and invoices | diabetes mellitus, hyperlipidemia, hypertension, and unspecified chest pain |
Breast Cancer Wisconsin dataset (UCI) [34] | 569 | Tabular data | Diagnoses, radiuses, texture data, perimeters, areas, smoothness data, compactness data, concavity data, concave points data, symmetry data, and fractal dimensions. | Breast cancer |
Heart Disease dataset (UCI) [34] | 303 | Tabular data | Demographic data, smoking status data, disease history data, exercise protocols, chart data (blood pressure, heart rate, ECG), pain status data, and diagnoses | Heart disease |
Diabete dataset (UCI) [34] | 70 | Sequential data | Iinsulin dose, blood glucose measurement, hypoglycemic symptoms, meal ingestion, exercise activity | Diabete |