Resources


Database Credentialed Access

RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports

Jean-Benoit Delbrouck

RadGraph-XL is a large, expert-annotated dataset of 2,300 radiology reports covering multiple modalities and anatomies. It enables accurate extraction of clinical entities and relations for downstream medical AI tasks.

Published: Sept. 12, 2025. Version: 1.0.0


Database Credentialed Access

ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation

Oishi Banerjee, Hong-Yu Zhou, Subathra Adithan, et al.

We propose ReXPref-Prior, an adapted version of MIMIC-CXR where GPT-4 has removed references to prior exams from both findings and impression sections of chest X-ray reports.

chest x-rays reinforcement learning hallucination

Published: Aug. 14, 2024. Version: 1.0.0


Database Credentialed Access

Insulin4RL: Real-Time Insulin Infusions For Offline Reinforcement Learning

Thomas Frost, Steve Harris

Openly available research dataset intended for offline reinforcement learning (ORL) using natively irregular healthcare data. The dataset is intended to encourage further research into ORL methods using naturally sporadic decision intervals.

insulin intensive care semi-markov decision process diabetes blood glucose offline reinforcement learning machine learning

Published: June 15, 2026. Version: 1.0.0


Database Credentialed Access

GOSSIS-1-eICU, the eICU-CRD subset of the Global Open Source Severity of Illness Score (GOSSIS-1) dataset

Jesse Raffa, Alistair Johnson, Tom Pollard, et al.

GOSSIS-1 is an in-hospital mortality prediction algorithm for critical care patients. GOSSIS-1 was trained using data from three countries. This dataset corresponds with the USA subset of the GOSSIS-1 dataset for the 2022 publication below.

icu critical care severity of illness global gossis apache mortality prediction benchmarking

Published: July 20, 2022. Version: 1.0.0