Resources
Database Credentialed Access
EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems
Konstantin Kotschenreuther
mimic-iv clinical question-answering medical discharge summaries large language models
Published: Jan. 11, 2024. Version: 1.0.0
Model Credentialed Access
Characterization of Stigmatizing Language in Medical Records
Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze
clinical natural language processing domain transfer bias stigmatizing language large language models mimic
Published: Nov. 6, 2023. Version: 1.0.0
Database Credentialed Access
RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports
Sarvesh Soni, Kirk Roberts
machine reading comprehension radiology reports clinical notes question answering electronic health records
Published: Dec. 9, 2022. Version: 1.0.0
Database Restricted Access
Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports
Daniel Reichenpfader, Sandro von Däniken, Harald Marcel Bonel
radiology mammography structured reporting bi-rads
Published: June 24, 2025. Version: 1.0.1
Database Credentialed Access
A Temporal Dataset for Respiratory Support in Critically Ill Patients
Mira Moukheiber, Lama Moukheiber, Dana Moukheiber, Sicheng Hao, Leo Anthony Celi, Hyung-Chul Lee
oberservational data time-series
Published: April 15, 2025. Version: 1.1.0
Database Credentialed Access
EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
Yeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi
Published: March 19, 2025. Version: 1.0.1
Database Restricted Access
OpenOximetry Repository
Nicholas Fong, Michael Lipnick, Philip Bickler, John Feiner, Tyler Law
Published: Feb. 28, 2025. Version: 1.1.1
Database Restricted Access
Application of Med-PaLM 2 in the refinement of MIMIC-CXR labels
Kendall Park, Rory Sayres, Andrew Sellergren, Tom Pollard, Fayaz Jamil, Timo Kohlberger, Charles Lau, Atilla Kiraly
Published: Feb. 4, 2025. Version: 1.0.0
Model Credentialed Access
Shareable Artificial Intelligence to Extract Cancer Outcomes from Electronic Health Records for Precision Oncology Research
Kenneth Kehl, Pavel Trukhanov, Christopher Fong, Justin Jee, Karl Pichotta, Morgan Paul, Chelsea Nichols, Michele Waters, Nikolaus Schultz, Deborah Schrag
Published: Oct. 24, 2024. Version: 1.0.0
Database Credentialed Access
MedDec: Medical Decisions for Discharge Summaries in the MIMIC-III Database
Mohamed Elgaar, Jiali Cheng, Nidhi Vakil, Hadi Amiri, Leo Anthony Celi
natural language processing medical decisions span classification discharge summary mimic
Published: Oct. 16, 2024. Version: 1.0.0