Resources
Challenge Credentialed Access
ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care
Danielle Mowery
Published: Feb. 15, 2013. Version: 1.0
Database Credentialed Access
EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems
Konstantin Kotschenreuther
mimic-iv clinical question-answering medical discharge summaries large language models
Published: Jan. 11, 2024. Version: 1.0.0
Database Restricted Access
Organ Retrieval and Collection of Health Information for Donation (ORCHID)
Hammaad Adam, Vinith Suriyakumar, Tom Pollard, Benjamin Moody, Jennifer Erickson, Greg Segal, Brad Adams, Diane Brockmeier, Kevin Lee, Ginny McBride, Kelly Ranum, Matthew Wadsworth, Janice Whaley, Ashia Wilson, Marzyeh Ghassemi
organ procurement organizations organ transplantation
Published: Sept. 29, 2025. Version: 2.1.1
Challenge Credentialed Access
ShAReCLEF eHealth Evaluation Lab 2014 (Task 2): Disorder Attributes in Clinical Reports
Danielle Mowery
Published: Nov. 1, 2013. Version: 1.0
Database Credentialed Access
MIMIC-IV-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp
Jing Wang, Xing Niu, Tong Zhang, Jie Shen, Juyong Kim, Jeremy Weiss
mimic clinical event annotation time series temporal annotation
Published: Sept. 29, 2025. Version: 1.0.0
Database Credentialed Access
EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
Yeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi
Published: March 19, 2025. Version: 1.0.1
Database Credentialed Access
Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization
Yanjun Gao, John Caskey, Timothy Miller, Brihat Sharma, Matthew Churpek, Dmitriy Dligach, Majid Afshar
Published: Sept. 30, 2022. Version: 1.0.0
Database Credentialed Access
Learning to Ask Like a Physician: a Discharge Summary Clinical Questions (DiSCQ) Dataset
Eric Lehman
question generation question answering machine learning
Published: July 28, 2022. Version: 1.0
Database Credentialed Access
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang
Published: April 12, 2022. Version: 1.0.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence natural language processing clinical notes electronic health records large language models brief hospital course long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0