Resources


Database Open Access

Multimodal Synchronized Motion Capture, Force Plate, and Radar Dataset of the One-Legged Stand Test for Fall-Risk Assessment

Daniel Copeland, Evan Linton, Xiang Zhang, et al.

A multimodal dataset of 32 participants performing the One-Legged Stand Test (OLST), with synchronized motion capture, force plate, and 24 GHz radar data. Each of 1,241 trials is labeled with foot-lift, stability phases, and foot-touchdown.

motion capture human pose estimation human movement fall risk assessment non-contact sensing one-legged stand test force plate analysis digital biomarkers human balance testing geriatrics radar signal processing postural control multimodal sensing biomechanics aging and mobility

Published: Jan. 25, 2026. Version: 1.0


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, et al.

A dataset of features from voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai

Published: Dec. 16, 2025. Version: 3.0.0


Database Open Access

bigP3BCI: An Open, Diverse and Machine Learning Ready P300-based Brain-Computer Interface Dataset

Boyla Mainsah, Chance Fleeting, Thomas Balmat, et al.

A collection of data from P300-based brain-computer interface studies.

brain-computer interface electroencephalography ieee p2731 working group standard amyotrophic lateral sclerosis p300 speller p300 event related potential oddball paradigm error-related potential

Published: May 19, 2025. Version: 1.0.0


Database Open Access

CheXmask Database: a large-scale dataset of anatomical segmentation masks for chest x-ray images

Nicolas Gaggion, Candelaria Mosquera, Martina Aineseder, et al.

CheXmask Database is a collection of 657,566 uniformly annotated chest radiographs with segmentation masks. Images were segmented using HybridGNet, with automatic quality control indicated by RCA scores.

automatic quality assessment chest x-ray segmentation medical image segmentation

Published: Jan. 22, 2025. Version: 1.0.0


Database Restricted Access

Endoscapes2023, A Critical View of Safety and Surgical Scene Segmentation Dataset for Laparoscopic Cholecystectomy

Pietro Mascagni, Deepak Alapatt, Aditya Murali, et al.

Endoscapes2023 enables the development of models for object detection, semantic and instance segmentation, and Critical View of Safety (CVS) prediction, contributing to safe laparoscopic cholecystectomy.

surgical safety computer assisted interventions semantic segmentation surgical data science medical imaging analysis

Published: Dec. 11, 2024. Version: 1.0.0


Database Credentialed Access

CovIdentify Dataset

Peter Cho, Md Mobashir Hasan Shandhi, Ali Roghanizad, et al.

This dataset contains wearable device data from Fitbit, Garmin, and Apple Watch users. The data spans April 2nd, 2020 to March 21st, 2021 and has been date-shifted. Test dates for each user have been shifted by a corresponding amount.

Published: Nov. 25, 2024. Version: 1.0.0


Database Credentialed Access

ReFiSco: Report Fix and Score Dataset for Radiology Report Generation

Katherine Tian, Sina J Hartung, Andrew A Li, et al.

Preliminary human expert evaluation study on 60 MIMIC-CXR radiology reports

Published: Aug. 23, 2023. Version: 0.0


Challenge Credentialed Access

MIT Critical Datathon 2023: a MIMIC-IV Derived Dataset for Pulse Oximetry Correction Models

João Matos, Tristan Struja, David S Restrepo, et al.

A SaO2-SpO2 Pairs Dataset derived from MIMIC-IV

pulse oximetry health equity machine learning

Published: May 8, 2023. Version: 1.0.0


Software Credentialed Access

Code for generating the HAIM multimodal dataset of MIMIC-IV clinical data and x-rays

Luis R Soenksen, Yu Ma, Cynthia Zeng, et al.

Code for generating the HAIM multimodal dataset of MIMIC-IV clinical data and x-rays

database code multimodality

Published: Aug. 23, 2022. Version: 1.0.1


Database Open Access

Icentia11k Single Lead Continuous Raw Electrocardiogram Dataset

Shawn Tan, Satya Ortiz-Gagné, Nicolas Beaudoin-Gagnon, et al.

This is a dataset of continuous raw electrocardiogram (ECG) signals for representation learning, covering 11 thousand patients and 2 billion labelled beats.

representation learning ecg

Published: April 12, 2022. Version: 1.0
