Resources


Challenge Credentialed Access

CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays

Gregory Holste, Mingquan Lin, Song Wang, Yiliang Zhou, Yishu Wei, Hao Chen, Atlas Wang, Yifan Peng

CXR-LT 2024 was a challenge for long-tailed, multi-label, and zero-shot thorax disease classification on chest X-rays, held at MICCAI 2024. This page contains long-tailed labels for 45 diseases from the CXR-LT 2024 and 2023 challenges.

disease classification artificial intelligence chest x-ray deep learning computer-aided diagnosis long-tailed learning cardiopulmonary disease zero-shot learning

Published: March 19, 2025. Version: 2.0.0


Database Restricted Access

Dataset for Segmentation and Classification of Cardiac Implantable Electronic Devices in Chest X-Rays

Keno Bressem, Felix Busch, Andrei Zhukov, Lisa Adams

This dataset comprises 11,094 converted DICOM and smartphone images of Cardiac Implantable Electronic Devices (CIEDs), collected from 897 patients. It aims to facilitate the development of algorithms for CIED detection and classification.

chest x-ray radiology medical imaging cardiac implantable electronic devices

Published: March 4, 2025. Version: 1.0.0


Database Restricted Access

EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs

Pierre Elias, Joshua Finer

EchoNext is a curated dataset of electrocardiograms (ECGs) paired with echocardiogram-confirmed structural heart disease labels, designed to support the development and validation of machine learning models.

heart failure clinical decision support artificial intelligence health equity ecg machine learning deep learning electrocardiogram aortic stenosis cardiovascular screening valvular heart disease digital health ai model deployment left ventricular dysfunction ai in healthcare population health transthoracic echocardiogram structural heart disease

Published: Sept. 16, 2025. Version: 1.1.0


Database Credentialed Access

FDTooth: Intraoral Photographs and Cone-Beam Computed Tomography Images for Fenestration and Dehiscence Detection

Yanqi Yang, Xiaomeng LI, Keyuan Liu, Marawan Elbatel

FDTooth is a dataset containing intraoral photographs and cone-beam computed tomography (CBCT) images with annotations for automated detection of fenestration and dehiscence in anterior teeth.

Published: May 5, 2025. Version: 1.0.0


Database Open Access

SHDB-AF: a Japanese Holter ECG database of atrial fibrillation

Kenta Tsutsui, Shany Biton Brimer, Joachim Behar

Holter ECG database from Japan, containing data from 100 unique patients with paroxysmal AF including expert annotations of Supraventricular arrhythmias at the beat level.

atrial fibrillation ecg holters

Published: April 16, 2025. Version: 1.0.1


Database Restricted Access

Endoscapes2023, A Critical View of Safety and Surgical Scene Segmentation Dataset for Laparoscopic Cholecystectomy

Pietro Mascagni, Deepak Alapatt, Aditya Murali, Armine Vardazaryan, Alain Garcia Vazquez, Nariaki Okamoto, Guido Costamagna, Didier Mutter, Jacques Marescaux, Bernard Dallemagne, Nicolas Padoy

Endoscapes2023 enables the development of models for object detection, semantic and instance segmentation, and Critical View of Safety (CVS) prediction, contributing to safe laparoscopic cholecystectomy.

surgical safety computer assisted interventions semantic segmentation surgical data science medical imaging analysis

Published: Dec. 11, 2024. Version: 1.0.0


Database Credentialed Access

Chest X-ray Dataset with Lung Segmentation

Wimukthi Indeewara, Mahela Hennayake, Kasun Rathnayake, Thanuja Ambegoda, Dulani Meedeniya

CXLSeg dataset: Chest X-ray with Lung Segmentation, a comparatively large dataset of segmented Chest X-ray radiographs based on the MIMIC-CXR dataset. This contains segmentation results of 243,324 frontal view images and corresponding masks.

segmentation medical reports u-net chest radiographs mimic-cxr chest x-ray

Published: Feb. 8, 2023. Version: 1.0.0


Database Credentialed Access

RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain

Pavel Blinov, Aleksandr Nesterov, Galina Zubkova, Arina Reshetnikova, Vladimir Kokh, Chaitanya Shivade

RuMedNLI is the full counterpart dataset of MedNLI in Russian language.

natural language inference recognizing textual entailment russian language

Published: April 1, 2022. Version: 1.0.0


Database Credentialed Access

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Matthew Lungren, Andrew Ng, Curtis Langlotz, Pranav Rajpurkar

RadGraph is a dataset of entities and relations in full-text chest X-ray radiology reports, which are obtained using a novel information extraction (IE) schema to capture clinically relevant information in a radiology report.

entity and relation extraction graph multi-modal natural language processing radiology

Published: June 3, 2021. Version: 1.0.0


Database Restricted Access

Smartphone-Captured Chest X-Ray Photographs

Po-Chih Kuo, ChengChe Tsai, Diego M Lopez, Alexandros Karargyris, Tom Pollard, Alistair Johnson, Leo Anthony Celi

Smartphone-captured CXR images including photographs taken from MIMIC-CXR and CheXpert, photographs taken by resident doctors, and photographs taken with different devices.

smartphone photograph cxr

Published: Sept. 27, 2020. Version: 1.0.0