Resources


Model Credentialed Access

Asclepius-R : Clinical Large Language Model Built On MIMIC-III Discharge Summaries

Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi

Asclepius: Publicly Available Clinical Large Language Models with Synthetic Clinical Notes Asclepius-R: A instruction-finetuned large language model with MIMIC-III clinical notes

clinical notes synthetic clinical notes synthetic notes asclepius open-source llm clinical llm large language model

Published: March 25, 2024. Version: 1.1.0


Database Credentialed Access

VinDr-CXR: An open dataset of chest X-rays with radiologist annotations

Ha Quy Nguyen, Hieu Huy Pham, le tuan linh, Minh Dao, lam khanh

VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations

lesion detection chest x-ray interpretation computer vision disease classification deep learning

Published: June 22, 2021. Version: 1.0.0


Database Credentialed Access

PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions

Daeun Kyung, Hyunseung Chung, Seongsu Bae, Jiho Kim, Jae Ho Sohn, Taerim Kim, Soo Kim, Edward Choi

PatientSim is a patient simulator that simulates realistic and diverse personas for clinical scenarios, enabling robust training and evaluation of doctor-patient interactions in multi-turn dialogues.

electronic health records multi-turn dialogue llm simulation doctor-patient consultation

Published: Oct. 18, 2025. Version: 1.0.0


Database Open Access

Pulse Transit Time PPG Dataset

Philip Mehrgardt, Matloob Khushi, Simon Poon, Anusha Withana

Time synchronised multi-site PPG dataset for PTT including sensors’ attachment pressures, temperatures, inertial data from accelerometer and gyroscope, annotated ECG data, blood pressures, as well as blood oxygenation saturation levels (SpO2)

blood pressure finger attachment pressure ptt imu spo2 gyroscope attachment force pulse transit time accelerometer ppg ecg

Published: March 18, 2022. Version: 1.1.0

Visualize waveforms

Database Open Access

SCG-RHC: Wearable Seismocardiogram Signal and Right Heart Catheter Database

Michael Chan, Liviu Klein, Joanna Fan, Omer Inan

This is the first public dataset that contains simultaneous recordings of Right Heart Catheter data (pressure) and chest-worn wearable patch data (electrocardiogram and seismocardiogram signals).

Published: March 31, 2023. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

GOSSIS-1-eICU, the eICU-CRD subset of the Global Open Source Severity of Illness Score (GOSSIS-1) dataset

Jesse Raffa, Alistair Johnson, Tom Pollard, Omar Badawi

GOSSIS-1 is an in-hospital mortality prediction algorithm for critical care patients. GOSSIS-1 was trained using data from three countries. This dataset corresponds with the USA subset of the GOSSIS-1 dataset for the 2022 publication below.

icu critical care severity of illness global gossis apache mortality prediction benchmarking

Published: July 20, 2022. Version: 1.0.0


Database Open Access

Simultaneous physiological measurements with five devices at different cognitive and physical loads

Marcus Vollmer, Dominic Bläsing, Julian Elias Reiser, Maria Nisser, Anja Buder

Dataset to support comparison of usability and accuracy from simultaneous measurements collected from 13 subjects including five devices: NeXus-10 MKII, eMotion Faros 360°, Hexoskin Hx1, SOMNOTouch NIBP, Polar RS800 Multi.

holter multiparameter photoplethysmogram noise accelerometer heart rate movement temperature hrv respiration ecg

Published: Jan. 18, 2023. Version: 1.0.2

Visualize waveforms

Software Open Access

R-DECO: An open-source Matlab based graphical user interface for the detection and correction of R-peaks

Jonathan Moeyersons, Matthew Amoni, Sabine Van Huffel, Rik Willems, Carolina Varon

An open-source Matlab based graphical user interface for the detection and correction of R-peaks.

algorithms and analysis of algorithms signal processiong visual analysis graphical user interface

Published: Sept. 8, 2020. Version: 1.0.0


Database Restricted Access

HYAMD High-Resolution Fundus Image Dataset for age related macular degeneration (AMD) Diagnosis

Meishar Meisel, Benjamin Alfred Cohen, Meital Baskin, Beatrice Tiosano, Joachim Behar, Eran Berkowitz

The HYAMD dataset comprises 1,560 high-resolution fundus images from 325 patients, aimed at validating machine learning models for age-related macular degeneration (AMD) diagnosis.

Published: Sept. 9, 2025. Version: 1.0.0


Database Contributor Review

COVID Data for Shared Learning (CDSL): A comprehensive, multimodal COVID-19 dataset from HM Hospitales

Álvaro Ritoré, Andreea M Oprescu, Alberto Estirado Bronchalo, Miguel Ángel Armengol de la Hoz

COVID Data for Shared Learning (CDSL) is a multimodal database comprising de-identified structured health data and radiological images from 4,479 patients with COVID-19, as a comprehensive toolkit for developing predictive models.

covid-19 multimodal database radiological images open data healthcare data machine learning and ai

Published: Oct. 25, 2024. Version: 1.0.0