Resources
Challenge Credentialed Access
ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care
Published: Feb. 15, 2013. Version: 1.0
Model Credentialed Access
Clinical-T5: Large Language Models Built Using MIMIC Clinical Text
Published: Jan. 25, 2023. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
deidentification critical care natural language processing clinical notes electronic health record mimic
Published: Jan. 6, 2023. Version: 2.2
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Published: Sept. 12, 2025. Version: 1.0.0
Challenge Credentialed Access
ShAReCLEF eHealth Evaluation Lab 2014 (Task 2): Disorder Attributes in Clinical Reports
Published: Nov. 1, 2013. Version: 1.0
Challenge Credentialed Access
Analysis of Clinical Text: Task 14 of SemEval 2015
Published: Dec. 28, 2014. Version: 2.0
Database Credentialed Access
MIMIC-III-Ext-Notes
Published: Feb. 27, 2026. Version: 1.0.0
Database Credentialed Access
Annotation dataset of problematic opioid use and related contexts from MIMIC-III Critical Care Database discharge summaries
opioid use disorder substance use natural language processing clinical notes
Published: Feb. 8, 2023. Version: 1.0.0
Database Restricted Access
Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports
radiology mammography structured reporting bi-rads
Published: June 24, 2025. Version: 1.0.1
Model Credentialed Access
EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain
Published: March 17, 2022. Version: 1.0.1