Resources
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
deidentification critical care electronic health record natural language processing clinical notes mimic
Published: Jan. 6, 2023. Version: 2.2
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
prescriptions exams tertiary care natural language processing clinical notes
Published: July 14, 2022. Version: 1.1
Database Credentialed Access
Annotated Question-Answer Pairs for Clinical Notes in the MIMIC-III Database
clinical question answering clinical nlp clinical reading comprehension
Published: Jan. 15, 2021. Version: 1.0.0
Database Credentialed Access
SCRIPT CarpeDiem Dataset: demographics, outcomes, and per-day clinical parameters for critically ill patients with suspected pneumonia
Published: Aug. 4, 2025. Version: 1.8.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
artificial intelligence natural language processing clinical notes electronic health records large language models brief hospital course long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
deidentification critical care electronic health record natural language processing clinical notes mimic
Published: Jan. 6, 2023. Version: 2.2
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
prescriptions exams tertiary care natural language processing clinical notes
Published: July 14, 2022. Version: 1.1
Challenge Credentialed Access
ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care
Published: Feb. 15, 2013. Version: 1.0
Database Credentialed Access
MIMIC-IV-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp
mimic clinical event annotation time series temporal annotation
Published: Sept. 29, 2025. Version: 1.0.0
Database Credentialed Access
C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text
clinical notes patient country information race and ethnicity patient language information
Published: Oct. 21, 2024. Version: 1.0.0