Resources


Database Restricted Access

Organ Retrieval and Collection of Health Information for Donation (ORCHID)

Hammaad Adam, Vinith Suriyakumar, Tom Pollard, Benjamin Moody, Jennifer Erickson, Greg Segal, Brad Adams, Diane Brockmeier, Kevin Lee, Ginny McBride, Kelly Ranum, Matthew Wadsworth, Janice Whaley, Ashia Wilson, Marzyeh Ghassemi

Multi-center dataset on organ procurement in the United States

organ procurement organizations organ transplantation

Published: Sept. 29, 2025. Version: 2.1.1


Database Open Access

MIMIC-IV demo data in the Medical Event Data Standard (MEDS)

Robin Philippus van de Water, Ethan Steinberg, Michael Wornow, Patrick Rockenschaub, Matthew McDermott

MIMIC-IV Clinical Database Demo in MEDS (Medical Event Data Standard) format.

ehr critical care electronic health record machine learning mimic meds medical event data standard

Published: Sept. 29, 2025. Version: 0.0.1


Database Credentialed Access

MIMIC-Ext-DrugDetection

Fabrice Harel-Canada, Nanyun Peng, David Goodman, Ruby Romero, Allan Nguyen, Brandon Moghanian, Anabel Salimian

This project offers a multilabel annotated dataset of clinical note sentences from MIMIC-III/IV for substance use detection. It supports NLP research for identifying various co-occurring drug use mentions in patient records.

ehr mimic-iv substance use clinical notes methamphetamine multi-label cocaine drug detection polysubstance use prescription opioid misuse cannabis benzodiazepine misuse injection drug use heroin mimic-iii

Published: Sept. 25, 2025. Version: 1.0.0


Database Open Access

Myocardial perfusion scintigraphy image database

Wesley Calixto, Solange Nogueira, Fernanda Luz, Thiago Fellipe Ortiz de Camargo

This database provides a collection of myocardial perfusion scintigraphy images. The dataset encompasses a diversity of clinical cases, including various perfusion patterns and underlying cardiac conditions.

nifti artificial intelligence anonymization clinical diagnosis myocardial perfusion systems modeling myocardial perfusion scintigraphy dicom metadata ventricular walls coronary artery disease convolutional neural networks automated segmentation

Published: Sept. 9, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples

Zhenbang Wu, Anant Dadu, Mike Nalls, Faraz Faghri, Jimeng Sun

This dataset contains 450K open-ended instruction-following examples generated using GPT-3.5 based on the MIMIC-IV EHR database.

large language models medical question answering instruction tuning

Published: Sept. 9, 2025. Version: 1.0.0


Software Open Access

Vital File Web Viewer: A Browser-Based Tool for Viewing High-Resolution Vital Signs Data

Eunsun Rachel Lee, Hyung-Chul Lee

A standalone, browser-based viewer for .vital files that enables interactive visualization of high-resolution vital signs data without server-side components.

vitaldb vital file

Published: Sept. 1, 2025. Version: 1.0.0


Database Credentialed Access

Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database

Vijeeth Guggilla, Melissa Bak, Mengjia Kang, Theresa Walunas, Catherine A Gao

This database contains 200 MIMIC-III admission notes with adjudicated labels for histories of various immunosuppressive conditions and usage of various immunosuppressive medications.

Published: Aug. 4, 2025. Version: 1.0.0


Database Credentialed Access

Annotated Social Determinants of Health Dataset for Adverse Pregnancy Outcomes

Nidhi Soley, MaKhaila Bentil, Jash Shah, Masoud Rouhizadeh, Casey Taylor

This project provides a manually annotated dataset of social determinants of health—social support, occupation, and substance use—linked to pregnancy outcomes, extracted from MIMIC-III and MIMIC-IV discharge summary notes.

Published: Aug. 4, 2025. Version: 1.0.0


Challenge Credentialed Access

SNOMED CT Entity Linking Challenge

Will Hardman, Mark Banks, Rory Davidson, Donna Truran, Nindya Widita Ayuningtyas, Hoa Ngo, Alistair Johnson, Tom Pollard

272 discharge notes from the MIMIC-IV-Note dataset annotated with SNOMED CT concepts.

snomed entity linking clinical annotation

Published: July 22, 2025. Version: 1.1.0


Database Open Access

tOLIet: Single-lead Thigh-based Electrocardiography Using Polimeric Dry Electrodes

Aline Santos Silva, Hugo Plácido da Silva, Miguel Correia, Andreia Cristina Gonçalves da Costa, Sérgio Laranjo

We present tOLIet, the first thigh ECG dataset with real signals captured by a toilet seat with electrodes. There are 149 recordings from 86 people, useful for research into cardiovascular assessment using "invisible" ECG.

Published: June 24, 2025. Version: 1.0.0