Resources
Database Restricted Access
LATTE-CXR: Locally Aligned TexT and imagE, Explainable dataset for Chest X-Rays
Elham Ghelichkhan, Tolga Tasdizen
eye-tracking chest x-ray dataset automatically generated dataset caption-guided object detection image captioning with region-level description grounded radiology report generation phrase grounding xai multi-modal learning local visual-language models localization
Published: Feb. 4, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples
Zhenbang Wu, Anant Dadu, Mike Nalls, Faraz Faghri, Jimeng Sun
large language models medical question answering instruction tuning
Published: Sept. 9, 2025. Version: 1.0.0
Database Credentialed Access
MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context
Zishan Gu, Jiayuan Chen, Fenglin Liu, Changchang Yin, Ping Zhang
Published: March 11, 2025. Version: 1.0.0
Database Restricted Access
CXRGraph: Using Information Extraction to Normalize the Training Data for Automatic Radiology Report Generation
Yuxiang Liao, Hoisang Heung, Hantao Liu, Irena Spasic
relation extraction information extraction natural language processing named entity recognition structured radiology report
Published: Feb. 3, 2025. Version: 1.0.0
Database Credentialed Access
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang
Published: April 12, 2022. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-GPT-3_5-Generated-Discharge-Summaries-for-Low-Resource-Codes
Matúš Falis, Aryo Pradipta Gema, Hang Dong, Luke Daines, Siddharth Basetti, Michael Holder, Rose Penfold, Alexandra Birch, Beatrice Alex
icd coding large language model data augmentation
Published: Dec. 16, 2024. Version: 1.0.0
Database Credentialed Access
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions
Daeun Kyung, Hyunseung Chung, Seongsu Bae, Jiho Kim, Jae Ho Sohn, Taerim Kim, Soo Kim, Edward Choi
electronic health records multi-turn dialogue llm simulation doctor-patient consultation
Published: Oct. 18, 2025. Version: 1.0.0
Database Credentialed Access
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries
Sunjun Kweon, Jiyoun Kim, Heeyoung Kwak, Dongchul Cha, Hangyul Yoon, Kwang Hyun Kim, Jeewon Yang, Seunghyun Won, Edward Choi
Published: June 26, 2024. Version: 1.0.1
Database Open Access
Respiratory and heart rate monitoring dataset from aeration study
Ella Frances Sophia Guy, Isaac Flett, Jaimey Anne Clifton, Trudy Caljé-van der Klei, Rongqing Chen, Jennifer Knopp, Knut Moeller, James Geoffrey Chase
Published: March 20, 2024. Version: 1.0.0