Resources


Database Open Access

A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia

Sebastian A Cajas, David Restrepo, Dana Moukheiber, Kuan Ting Kuo, Chenwei Wu, David Santiago Garcia Chicangana, Atika Rahman Paddo, Mira Moukheiber, Lama Moukheiber, Sulaiman Moukheiber, Saptarshi Purkayastha, Diego M Lopez, Po-Chih Kuo, Leo Anthony Celi

Multi-Modal Satellite imagery Dataset in Colombia: A public health analysis with spatiotemporally aligned satellite images and its corresponding metadata across 81 municipalities (2016-2018), facilitating multimodal AI applications.

multimodality satellite imagery

Published: Jan. 30, 2024. Version: 1.0.0


Database Credentialed Access

ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection

Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee Sung, Joel Reisman, Wenjun Li, Robert Kerns, William Becker, Hong Yu

Opioid-related aberrant behaviors (ORABs) detection Dataset (ODD) which is a large-size, expert-annotated, and multi-label classification benchmark dataset corresponding to the task

natural language processing substance use opioid related aberrant behavior

Published: Jan. 11, 2024. Version: 1.0.0


Database Restricted Access

A multimodal dental dataset facilitating machine learning research and clinic services

wenjing liu, Yunyou Huang, Suqin Tang

A new dental dataset that contains 389 patients, three commonly used dental image models, and images of various health conditions of the oral cavity.

Published: Sept. 6, 2023. Version: 1.0.0


Database Open Access

PTB-XL+, a comprehensive electrocardiographic feature dataset

Nils Strodthoff, Temesgen Mehari, Claudia Nagel, Philip Aston, Ashish Sundar, Claus Graff, Jørgen Kanters, Wilhelm Haverkamp, Olaf Doessel, Axel Loewe, Markus Bär, Tobias Schaeffter

ECG feature dataset accompanying the PTB-XL ECG dataset

electrocardiography ptb-xl ptb ecg

Published: June 27, 2023. Version: 1.0.1

Visualize waveforms

Database Credentialed Access

SCRIPT CarpeDiem Dataset: demographics, outcomes, and per-day clinical parameters for critically ill patients with suspected pneumonia

Nikolay Markov, Catherine A Gao, Thomas Stoeger, Anna Pawlowski, Mengjia Kang, Prasanth Nannapaneni, Rogan Grant, Luke Rasmussen, Daniel Schneider, Justin Starren, Richard Wunderink, GR Scott Budinger, Alexander Misharin, Benjamin Singer, NU SCRIPT Study Investigators

SCRIPT seeks to delineate the host/pathogen interactions during pneumonia using multiomic analysis of bronchoalveolar lavage fluid joined with clinical data and physician adjudication.

Published: March 13, 2023. Version: 1.1.0


Database Open Access

A Wearable Exam Stress Dataset for Predicting Cognitive Performance in Real-World Settings

Md Rafiul Amin, Dilranjan Wickramasuriya, Rose T Faghih

The data contains electrodermal activity, heart rate, blood volume pulse, skin surface temperature, inter beat interval and accelerometer data recorded during three exam sessions (midterm 1, midterm 2 and finals) as well as their corresponding grades

stress

Published: May 26, 2022. Version: 1.0.0


Database Restricted Access

VinDr-Mammo: A large-scale benchmark dataset for computer-aided detection and diagnosis in full-field digital mammography

Hieu Huy Pham, Hieu Nguyen Trung, Ha Quy Nguyen

A large-scale benchmark dataset for computer-aided detection and diagnosis in mammography

Published: March 21, 2022. Version: 1.0.0


Database Credentialed Access

CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, T Greg McKelvey, Yi Yang, David Sontag

Clinical action items annotated over MIMIC-III. 718 discharge summaries are labeled at a sentence- and character-level with multiple action labels including Appointment, Lab, Procedure, Medication, Imaging, Patient Instructions, and Other.

Published: June 21, 2021. Version: 1.0.0


Database Open Access

Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management

Cécile Logé, Emily Ross, David Yaw Amoah Dadey, Saahil Jain, Adriel Saporta, Andrew Ng, Pranav Rajpurkar

Q-Pain, a medical QA dataset designed to enable the substitution of multiple different racial and gender "profiles" for patients and to evaluate whether bias is present when deciding whether to prescribe pain medication or not.

Published: June 11, 2021. Version: 1.0.0


Database Contributor Review

HiRID, a high time-resolution ICU dataset

Martin Faltys, Marc Zimmermann, Xinrui Lyu, Matthias Hüser, Stephanie Hyland, Gunnar Rätsch, Tobias Merz

The HiRID database contains a large selection of all routinely collected data relating to patient admissions to the Department of Intensive Care Medicine of the Bern University Hospital, Switzerland (ICU).

intensive care clinical icu high resolution critical care machine learning

Published: Feb. 18, 2021. Version: 1.1.1