PhysioNet Index

Database Credentialed Access

CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

James Mullenbach, Yada Pruksachatkun, Sean Adler, et al.

Clinical action items annotated over MIMIC-III. 718 discharge summaries are labeled at the sentence and character level with multiple action labels, including Appointment, Lab, Procedure, Medication, Imaging, Patient Instructions, and Other.

Published: June 21, 2021. Version: 1.0.0

Database Open Access

Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management

Cécile Logé, Emily Ross, David Yaw Amoah Dadey, et al.

Q-Pain is a medical QA dataset designed to enable the substitution of multiple racial and gender "profiles" for patients and to evaluate whether bias is present when deciding whether to prescribe pain medication.

Published: June 11, 2021. Version: 1.0.0

Database Contributor Review

HiRID, a high time-resolution ICU dataset

Martin Faltys, Marc Zimmermann, Xinrui Lyu, et al.

The HiRID database contains a large selection of the routinely collected data relating to patient admissions to the Department of Intensive Care Medicine (ICU) of the Bern University Hospital, Switzerland.

icu clinical high resolution critical care intensive care machine learning

Published: Feb. 18, 2021. Version: 1.1.1

Database Credentialed Access

Maternal fat ultrasound measurement and nutritional assessment during pregnancy: A dataset centered in gestational outcomes

Alexandre da Silva Rocha, Juliana Rombaldi Bernardi, Alice Schoffel, et al.

Dataset collected as part of a prospective study in which abdominal maternal fat tissue measurements were compared with outcomes during hospitalization for labor and delivery.

pregnancy abdominal ultrasound

Published: Dec. 4, 2020. Version: 1.0.0

Database Restricted Access

Kinematic dataset of actors expressing emotions

Mingming Zhang, Lu Yu, Keye Zhang, et al.

1402 kinematic recordings of twenty-two semi-professional actors expressing emotions such as happiness, sadness, anger, fear, disgust, and surprise.

body movement emotion kinematic data motion capture

Published: July 7, 2020. Version: 2.1.0

Database Restricted Access

DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology

Ke Wang, Jiamu Yang, Ayush Shetty, et al.

We present high-resolution multichannel wearable device data along with clinically labeled sleep stage and polysomnography (PSG) recordings from 100 patients with abnormal sleep and sleep apnea.

time series classification sleep disorders wearable biomedical

Published: June 2, 2026. Version: 2.2.0

Database Credentialed Access

Bridge2AI-Voice Pediatric Dataset

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, et al.

A dataset of questionnaire responses, spectrograms, and other information for pediatric participants collected for the Bridge2AI voice as a biomarker of health project.

health pediatric biomarkers bridge2ai voice

Published: May 1, 2026. Version: 1.1.0

Database Credentialed Access

MIMIC-III-Ext-CA: a MIMIC-III Derived Dataset of Cardiac Arrests in Photoplethysmographs

Gerben Hup, Xi Long, Rik Vullings

The MIMIC-III-Ext-CA dataset contains annotations of 31 PPG-captured cardiac arrest episodes from the MIMIC-III clinical and waveform databases.

ppg photoplethysmography mimic-iii cardiac arrest out-of-hospital cardiac arrest ohca

Published: March 10, 2026. Version: 1.0.0

Database Contributor Review

ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room

Mel Molina, Nikita Mehandru, Niloufar Golchini, et al.

The ER-REASON dataset is a longitudinal collection of 25,174 de-identified clinical notes for 3,437 patients admitted to the emergency room (ER) at a large academic medical center between March 1, 2022, and March 31, 2024.

Published: Oct. 23, 2025. Version: 1.0.0

Database Credentialed Access

MIMIC-IV-ECHO-Ext-MIMICEchoQA: A Benchmark Dataset for Echocardiogram-Based Visual Question Answering

Rahul Thapa, Andrew Li, Qingyang Wu, et al.

We present MIMICEchoQA, a benchmark dataset for echocardiogram-based question answering, built from the publicly available MIMIC-IV-ECHO database.

Published: Oct. 7, 2025. Version: 1.0.0
