Resources


Database Restricted Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Alistair Johnson, Jean-Christophe Bélisle-Pipon, David Dorr, Satrajit Ghosh, Philip Payne, Maria Powell, Anais Rameau, Vardit Ravitsky, Alexandros Sigaras, Olivier Elemento, Yael Bensoussan

A dataset of voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai

Published: Jan. 17, 2025. Version: 1.1


Database Restricted Access

A database of hand kinematics, high-density sEMG of forearm and wrist for motion intent recognition

Zeming Zhao, Weichao Guo, Zeyu Zhou

A database of hand kinematics, high-density sEMG of forearm and wrist.

Published: Jan. 17, 2025. Version: 1.0.0


Database Open Access

SensSmartTech database of cardiovascular signals synchronously recorded by an electrocardiograph, phonocardiograph, photoplethysmograph and accelerometer

Aleksandar Lazović, Predrag Tadić, Natalija Đorđević, Vladimir Atanasoski, Masa Tiosavljevic, Marija Ivanovic, Ljupco Hadzievski, Arsen Ristic, Vladan Vukcevic, Jovana Petrovic

SensSmartTech is a unique multiparametric dataset recorded systematically at rest and during the relaxation after activity. It contains the simultaneously recorded electrocardiogram, phonocardiogram, arterial plethysmograms and seismocardiogram.

Published: Dec. 19, 2024. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MIMIC-IV-Ext-GPT-3_5-Generated-Discharge-Summaries-for-Low-Resource-Codes

Matúš Falis, Aryo Pradipta Gema, Hang Dong, Luke Daines, Siddharth Basetti, Michael Holder, Rose Penfold, Alexandra Birch, Beatrice Alex

9,606 Synthetic Discharge Summaries generated by GPT-3.5 based on combinations of ICD-10-code descriptions associated with real discharge summaries in MIMIC-IV. Focus on low resource codes.

icd coding data augmentation large language model

Published: Dec. 16, 2024. Version: 1.0.0


Database Restricted Access

Endoscapes2023, A Critical View of Safety and Surgical Scene Segmentation Dataset for Laparoscopic Cholecystectomy

Pietro Mascagni, Deepak Alapatt, Aditya Murali, Armine Vardazaryan, Alain Garcia Vazquez, Nariaki Okamoto, Guido Costamagna, Didier Mutter, Jacques Marescaux, Bernard Dallemagne, Nicolas Padoy

Endoscapes2023 enables the development of models for object detection, semantic and instance segmentation, and Critical View of Safety (CVS) prediction, contributing to safe laparoscopic cholecystectomy.

surgical safety computer assisted interventions semantic segmentation surgical data science medical imaging analysis

Published: Dec. 11, 2024. Version: 1.0.0


Database Credentialed Access

CovIdentify Dataset

Peter Cho, Md Mobashir Hasan Shandhi, Ali Roghanizad, Jessilyn Dunn

This contains wearable device data from Fitbit, Garmin, and Apple Watch users. The data is from April 2nd, 2020 to March 21st, 2021 and has been date-shifted. An appropriate amount has also shifted test dates for each user.

Published: Nov. 25, 2024. Version: 1.0.0


Database Credentialed Access

Northwestern ICU (NWICU) database

Dana Moukheiber, William Temps, Bhadrappa Molgi, Yikuan Li, Alice Lu, Prasanth Nannapaneni, Abdulrahman Chahin, Sicheng Hao, Felipe Torres Fabregas, Leo Anthony Celi, Adrian Wong, Maxwell Lloyd, Xavier Borrat Frigola, Hyung-Chul Lee, Daniel Schneider, Tom Pollard, Yuan Luo, Abel Kho, Roger Mark

A freely available COVID-rich ICU database comprising de-identified health-related data from Northwestern Memorial Health Center (NHMC).

Published: Nov. 19, 2024. Version: 0.1.0


Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Harshita Sharma, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

vision-language processing localization phrase grounding chest x-ray

Published: Nov. 15, 2024. Version: 1.1.0


Database Credentialed Access

MIMIC-IV on FHIR

Alex Bennett, Joshua Wiedekopf, Hannes Ulrich, Philip van Damme, Piotr Szul, John Grimes, Alistair Johnson

MIMIC-IV and MIMIC-IV-ED data mapped into FHIR resources.

mimic-iv fhir electronic health record us core mimic fast healthcare interoperability resources

Published: Nov. 12, 2024. Version: 2.1


Database Contributor Review

Chest Computed Tomography for patients with sepsis in the Emergency Department

Senjun Jin, Zhongheng Zhang

The database is intended to support a wide array of research studies involving radiomics in sepsis patients, helping to reduce barriers to the reproducibility of clinical research.

sepsis

Published: Oct. 28, 2024. Version: 1.0.0