Resources
Database Credentialed Access
MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images
question answering machine learning electronic health records evaluation chest x-ray radiology benchmark multimodal deep learning visual question answering
Published: July 19, 2024. Version: 1.0.0
Database Credentialed Access
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
question answering machine learning electronic health records evaluation chest x-ray multi-modal question answering ehr question answering semantic parsing benchmark deep learning visual question answering
Published: July 23, 2024. Version: 1.0.0
Database Restricted Access
EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs
heart failure clinical decision support artificial intelligence health equity ecg machine learning deep learning electrocardiogram aortic stenosis cardiovascular screening valvular heart disease digital health ai model deployment left ventricular dysfunction ai in healthcare population health transthoracic echocardiogram structural heart disease
Published: Sept. 16, 2025. Version: 1.1.0
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Published: Sept. 12, 2025. Version: 1.0.0
Database Open Access
MIMIC-IV Clinical Database Demo on FHIR
fhir electronic health records mimic
Published: Aug. 27, 2025. Version: 2.1.0
Database Restricted Access
Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports
radiology mammography structured reporting bi-rads
Published: June 24, 2025. Version: 1.0.1
Database Open Access
Hillel Yaffe Glaucoma Dataset (HYGD): A Gold-Standard Annotated Fundus Dataset for Glaucoma Detection
ophthalmology retina dfi gold-standard gon fundus glaucoma
Published: June 3, 2025. Version: 1.0.0
Database Credentialed Access
Medical-CXR-VQA dataset: A Large-Scale LLM-Enhanced Medical Dataset for Visual Question Answering on Chest X-Ray Images
Published: Jan. 21, 2025. Version: 1.0.0
Database Credentialed Access
ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation
chest x-rays reinforcement learning hallucination
Published: Aug. 14, 2024. Version: 1.0.0
Database Credentialed Access
RadGraph2: Tracking Findings Over Time in Radiology Reports
chest x-rays relation extraction disease progression information extraction radiology reports named entity recognition
Published: Aug. 8, 2024. Version: 1.0.0