Resources
Database Credentialed Access
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Published: April 12, 2022. Version: 1.0.0
Database Credentialed Access
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
question answering machine learning electronic health records evaluation chest x-ray multi-modal question answering ehr question answering semantic parsing deep learning benchmark visual question answering
Published: July 23, 2024. Version: 1.0.0
Database Credentialed Access
MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images
question answering machine learning electronic health records evaluation chest x-ray radiology deep learning benchmark multimodal visual question answering
Published: July 19, 2024. Version: 1.0.0
Database Credentialed Access
Learning to Ask Like a Physician: a Discharge Summary Clinical Questions (DiSCQ) Dataset
question generation question answering machine learning
Published: July 28, 2022. Version: 1.0
Challenge Credentialed Access
ArchEHR-QA: A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization
question answering electronic health record patient portals clinicians
Published: Jan. 1, 2026. Version: 1.3
Database Credentialed Access
EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems
mimic-iv clinical question-answering medical discharge summaries large language models
Published: Jan. 11, 2024. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples
large language models medical question answering instruction tuning
Published: Sept. 9, 2025. Version: 1.0.0