Resources


Database Credentialed Access

RaDialog Instruct Dataset

Chantal Pellegrini, Ege Özsoy, Benjamin Busam, Nassir Navab, Matthias Keicher

Image-based instruct data for Chest X-Ray understanding and analysis.

radiology report generation large vision-language models medical image understaning radiology assistant radiology chatbot

Published: March 25, 2024. Version: 1.0.0


Challenge Open Access

Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022

Matthew Reyna, Yashar Kiarashi, Andoni Elola, Jorge Oliveira, Francesco Renna, Annie Gu, Erick Andres Perez Alday, Nadi Sadr, Sandra Mattos, Miguel Coimbra, Reza Sameni, Ali Bahrami Rad, Zuzana Koscova, Gari Clifford

2022 Physionet Challenge is devoted to detecting the presence or absence of murmurs from multiple heart sound recordings from multiple auscultation locations, as well as detecting the clinical outcomes.

challenge competition cardiac auscultation congenital heart diseases

Published: Sept. 28, 2023. Version: 1.0.0


Database Open Access

Facial and oral temperature data from a large set of human subject volunteers

Quanzeng Wang, Yangling Zhou, Pejman Ghassemi, Dwith Chenna, Michelle Chen, Jon Casamento, Joshua Pfefer, David Mcbride

Data for each subject include temperatures measured at 29 facial locations over four rounds with two IRTs, oral temperatures measured with a thermometer in two modes, subject demographics (gender, age, ethnicity), environmental conditions, etc.

clinical accuracy receiver operating characteristic curve infectious disease epidemics thermography fever screening thermometry inner canthus elevated body temperature facial maximum temperatures infrared thermograph pearson correlation coefficients

Published: May 24, 2023. Version: 1.0.0


Database Credentialed Access

Critical care database comprising patients with infection at Zigong Fourth People's Hospital

Ping Xu, Lin Chen, Zhongheng Zhang

Routinely collected data from critical care units at Zigong Fourth People’s Hospital, Sichuan, China for patients admitted between January 2019 and December 2020 Missing information on temperature are updated in the new version.

Published: June 30, 2022. Version: 1.1


Database Credentialed Access

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang

DrugEHRQA is a QA dataset containing question-answers from MIMIC-III tables and discharge summaries.

question-answer qa

Published: April 12, 2022. Version: 1.0.0


Challenge Open Access

Paroxysmal Atrial Fibrillation Events Detection from Dynamic ECG Recordings: The 4th China Physiological Signal Challenge 2021

Xingyao Wang, Caiyun Ma, Xiangyu Zhang, Hongxiang Gao, Gari D. Clifford, Chengyu Liu

CPSC2021 for paroxysmal atrial fibrillation events detection.

event detection paroxysmal atrial fibrillation

Published: June 21, 2021. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

Curated Data for Describing Blood Glucose Management in the Intensive Care Unit

Aldo Robles Arévalo, Roselyn Mateo-Collado, Leo Anthony Celi

The data subsets consist of time series files that includes all the curated entries of glucose readings and insulin inputs from MIMIC-III database.

insulin replacement therapy glycemic control critical care

Published: April 19, 2021. Version: 1.0.1


Database Credentialed Access

EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems

Konstantin Kotschenreuther

Dataset consisting of question and answer pairs synthetically generated from medical discharge summaries, designed to facilitate the training and development of large language models specifically tailored for healthcare applications

mimic-iv clinical question-answering medical discharge summaries large language models

Published: Jan. 11, 2024. Version: 1.0.0


Model Credentialed Access

Characterization of Stigmatizing Language in Medical Records

Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze

A suite of classifiers for detecting three types of stigmatizing language in electronic medical records. Trained on MIMIC-IV discharge notes.

mimic clinical natural language processing domain transfer bias stigmatizing language large language models

Published: Nov. 6, 2023. Version: 1.0.0


Database Credentialed Access

RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports

Sarvesh Soni, Kirk Roberts

RadQA is an electronic health record question answering dataset containing clinical questions that can be answered using the Findings and Impressions sections of radiology reports

electronic health records clinical notes question answering radiology reports machine reading comprehension

Published: Dec. 9, 2022. Version: 1.0.0