Resources
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Published: Sept. 12, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples
large language models, medical question answering, instruction tuning
Published: Sept. 9, 2025. Version: 1.0.0
Database Credentialed Access
Annotated Social Determinants of Health Dataset for Adverse Pregnancy Outcomes
Published: Aug. 4, 2025. Version: 1.0.0
Database Restricted Access
Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports
radiology, mammography, structured reporting, bi-rads
Published: June 24, 2025. Version: 1.0.1
Database Restricted Access
DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology
wearable, sleep disorders, biomedical time series classification
Published: April 30, 2025. Version: 2.1.0
Database Credentialed Access
EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
Published: March 19, 2025. Version: 1.0.1
Database Open Access
A Multimodal Dataset for Investigating Working Memory in Presence of Music
Published: Feb. 26, 2025. Version: 1.0.0
Database Restricted Access
Visual Question Answering evaluation dataset for MIMIC CXR
Published: Jan. 28, 2025. Version: 1.0.0
Database Open Access
CGMacros: a scientific dataset for personalized nutrition and diet monitoring
diabetes, machine learning, continuous glucose monitors, obesity, postprandial glucose response, food macronutrients, metabolic models, food photographs, personalized nutrition
Published: Jan. 28, 2025. Version: 1.0.0
Database Credentialed Access
SCRIPT X2B8 Dataset: per-day clinical features to model successful next-day extubation
Published: Jan. 28, 2025. Version: 1.0.0