Resources
Database
Credentialed Access
Nils Strodthoff, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp
Dataset that links ECG records from MIMIC-IV-ECG to ED discharge and hospital discharge diagnoses, which enables to train general ECG prediction models based on clinical labels and facilitates the retrieval of further clinical metadata from MIMIC-IV.
machine learning
electrocardiography
mimic
Published: Aug. 30, 2024.
Version: 1.0.1
Database
Credentialed Access
Asad Aali, Vasiliki Bikia, Maya Varma, et al.
MedVAL-Bench is the first large-scale physician-validated benchmark for medical text validation, spanning 6 diverse medical tasks and containing 840 language model-generated outputs annotated by 12 physicians with error assessments and risk grades.
Published: Nov. 14, 2025.
Version: 1.0.1
Database
Credentialed Access
Hanbin Ko
CXR-Align is a benchmark dataset created to evaluate vision-language models' capability to interpret negations in chest X-ray (CXR) reports, featuring systematically modified reports from MIMIC-CXR.
Published: Aug. 21, 2025.
Version: 1.0.0
Database
Open Access
Patrick Wagner, Nils Strodthoff, Ralf-Dieter Bousseljot, et al.
The PTB-XL ECG dataset is a large dataset of 21801 clinical 12-lead ECGs from 18869 patients of 10 second length. The raw signal data has been annotated by up to two cardiologists with 71 different ECG statements and is supplemented by rich metadata.
ptb-xl
ptb
ecg
electrocardiography
Published: Nov. 9, 2022.
Version: 1.0.3
Visualize waveforms
Database
Credentialed Access
Jayetri Bardhan, Anthony Colas, Kirk Roberts, et al.
DrugEHRQA is a QA dataset containing question-answers from MIMIC-III tables and discharge summaries.
question-answer
qa
Published: April 12, 2022.
Version: 1.0.0
Database
Open Access
Alex Bennett, Hannes Ulrich, Joshua Wiedekopf, et al.
The MIMIC-IV Clinical Database Demo on FHIR is a 100 patient subset of the MIMIC-IV v2.2 and MIMIC-IV-ED v2.2 clinical databases converted into the Fast Healthcare Interoperability Resources (FHIR) format.
fhir
electronic health records
mimic
Published: Aug. 27, 2025.
Version: 2.1.0
Database
Credentialed Access
Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, et al.
MS-CXR is a new dataset containing 1162 chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.
vision-language processing
chest x-ray
phrase grounding
localization
Published: Nov. 15, 2024.
Version: 1.1.0
Database
Credentialed Access
Shruthi Bannur, Stephanie Hyland, Qianchu Liu, et al.
The MS-CXR-T is a multimodal benchmark that enhances the MIMIC-CXR v2 dataset by including expert-verified annotations. Its goal is to evaluate biomedical visual-language processing models in terms of temporal semantics extracted from image and text.
disease progression
cxr
vision-language processing
chest x-ray
radiology
multimodal
Published: March 17, 2023.
Version: 1.0.0
Database
Credentialed Access
Chenlong Yin, Weijia Zhang
The processed MIMIC-III dataset for the benchmark of Irregular Multivariate Time Series Forecasting: A Transformable Patching Graph Neural Networks Approach.
Published: April 9, 2025.
Version: 1.0.0
Model
Credentialed Access
Shekoofeh Azizi, Jan Freyberg, Laura Culp, et al.
Medical AI Research Foundations is a repository of medical foundation models.
Published: April 25, 2023.
Version: 1.0.0