Resources


Database Credentialed Access

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang

DrugEHRQA is a QA dataset containing question-answer pairs from MIMIC-III tables and discharge summaries.

question-answer qa

Published: April 12, 2022. Version: 1.0.0


Database Credentialed Access

Synthetic Acute Hypotension and Sepsis Datasets Based on MIMIC-III and Published as Part of the Health Gym Project

Nicholas Kuo, Simon Finfer, Louisa Jorm, Sebastiano Barbieri

This repository hosts the original Health Gym datasets of Acute Hypotension and Sepsis.

sepsis machine learning acute hypotension reinforcement learning synthetic dataset generative modelling wasserstein generative adversarial network

Published: Feb. 23, 2022. Version: 1.0.0


Database Credentialed Access

National Institutes of Health Stroke Scale (NIHSS) Annotations for the MIMIC-III Database

Jiayang Wang, Xiaoshuo Huang, Lin Yang, Jiao Li

A dataset of annotated NIHSS scale items and corresponding scores from stroke patients' discharge summaries in MIMIC-III.

Published: Jan. 25, 2021. Version: 1.0.0


Database Restricted Access

Flatten: COVID-19 Survey Data on Symptoms, Demographics and Mental Health in Canada

Shrey Jain, Marie Charpignon, Mathew Samuel, Jaydeep Mistry, Nicholas Frosst, Leo Anthony Celi, Marzyeh Ghassemi

Freely accessible COVID-19 symptom dataset surveying Canadians, gathered by the global humanitarian aid non-profit Flatten. The dataset comprises 294,106 surveys collected from March 23 to July 30, 2020.

public health population statistics covid-19

Published: March 8, 2021. Version: 1.0


Database Open Access

Sleep Heart Health Study PSG Database

Data collected for a prospective cohort study designed to investigate the relationship between sleep disordered breathing and cardiovascular disease.

polysomnogram sleep multiparameter

Published: Oct. 23, 2003. Version: 1.0.0


Database Open Access

Smart Health for Assessing the Risk of Events via ECG Database

Holter recordings of 139 hypertensive patients recruited at the Centre of Hypertension of the University Hospital of Naples Federico II.

risk hypertension holter hrv ecg

Published: May 19, 2015. Version: 1.0.0


Database Credentialed Access

GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization

Xuhai Xu, Han Zhang, Yasaman Sefidgar, Yiyi Ren, Xin Liu, Woosuk Seo, Jennifer Brown, Kevin Kuehn, Mike Merrill, Paula Nurius, Shwetak Patel, Tim Althoff, Margaret Morris, Eve Riskin, Jennifer Mankoff, Anind Dey

GLOBEM comprises the first released multi-year mobile and wearable sensing datasets, collected from 2018 to 2021 and covering 705 person-years and 497 unique participants.

health ubiquitous computing well-being passive mobile sensing human behavior modeling

Published: March 14, 2023. Version: 1.1


Database Credentialed Access

MIMIC-IV-Note: Deidentified free-text clinical notes

Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark

Deidentified free-text clinical notes for patients in the MIMIC-IV Clinical Database.

mimic deidentification critical care electronic health record clinical notes natural language processing

Published: Jan. 6, 2023. Version: 2.2


Database Credentialed Access

NCH Sleep DataBank: A Large Collection of Real-world Pediatric Sleep Studies with Longitudinal Clinical Data

Harlin Lee, Boyue Li, Yungui Huang, Yuejie Chi, Simon Lin

The NCH Sleep DataBank includes 3,984 pediatric sleep studies on 3,673 unique patients conducted at Nationwide Children's Hospital between 2017 and 2019. It contains polysomnography (PSG), clinical annotations, and longitudinal clinical data.

eeg ehr polysomnography pediatrics clinical decision support sleep disorders sleep study electronic health records ecg

Published: Oct. 27, 2021. Version: 3.1.0