Resources


Database Restricted Access

VinDr-PCXR: An open, large-scale pediatric chest X-ray dataset for interpretation of common thoracic diseases

Hieu Huy Pham, Tien Thanh Tran, Ha Quy Nguyen

An open, large-scale pediatric chest X-ray dataset that contains both lesion-level labels and image-level labels for multiple findings and diseases for interpretation of common thoracic diseases.

Published: March 21, 2022. Version: 1.0.0


Database Restricted Access

REFLACX: Reports and eye-tracking data for localization of abnormalities in chest x-rays

Ricardo Bigolin Lanfredi, Mingyuan Zhang, William Auffermann, Jessica Chan, Phuong-Anh Duong, Vivek Srikumar, Trafton Drew, Joyce Schroeder, Tolga Tasdizen

This dataset contains 3032 cases of eye-tracking data collected while five radiologists dictated reports for frontal chest x-rays, synchronized timestamped dictation transcription, and manual labels for validation of localization of abnormalities.

eye tracking radiology report reflacx fixations computer vision chest x-rays gaze radiology machine learning deep learning

Published: Sept. 27, 2021. Version: 1.0.0


Database Credentialed Access

VinDr-CXR: An open dataset of chest X-rays with radiologist annotations

Ha Quy Nguyen, Hieu Huy Pham, Le Tuan Linh, Minh Dao, Lam Khanh

VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations

lesion detection chest x-ray interpretation computer vision disease classification deep learning

Published: June 22, 2021. Version: 1.0.0


Database Credentialed Access

Eye Gaze Data for Chest X-rays

Alexandros Karargyris, Satyananda Kashyap, Ismini Lourentzou, Joy Wu, Matthew Tong, Arjun Sharma, Shafiq Abedin, David Beymer, Vandana Mukherjee, Elizabeth Krupinski, Mehdi Moradi

This dataset was collected using an eye-tracking system while a radiologist interpreted and read 1,083 public CXR images. The dataset contains the following aligned modalities: image, transcribed report text, dictation audio, and eye gaze data.

convolutional network heatmap eye tracking explainability audio chest cxr chest x-ray radiology machine learning multimodal deep learning

Published: Sept. 12, 2020. Version: 1.0.0


Database Credentialed Access

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Hyungyung Lee, Geon Choi, Jung Oh Lee, Hangyul Yoon, Hyuk Gi Hong, Edward Choi

CheXStruct is an automated pipeline that derives structured diagnostic reasoning steps from chest X-rays. CXReasonBench builds on this to evaluate whether models perform clinically grounded, multi-step reasoning beyond final diagnoses.

chest x-ray evaluation benchmark structured diagnostic pipeline structured chest x-ray qa diagnostic reasoning intermediate reasoning steps grounded reasoning structured reasoning

Published: Oct. 15, 2025. Version: 1.0.0


Challenge Credentialed Access

CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays

Gregory Holste, Mingquan Lin, Song Wang, Yiliang Zhou, Yishu Wei, Hao Chen, Atlas Wang, Yifan Peng

CXR-LT 2024 was a challenge for long-tailed, multi-label, and zero-shot thorax disease classification on chest X-rays, held at MICCAI 2024. This page contains long-tailed labels for 45 diseases from the CXR-LT 2024 and 2023 challenges.

disease classification artificial intelligence chest x-ray deep learning computer-aided diagnosis long-tailed learning cardiopulmonary disease zero-shot learning

Published: March 19, 2025. Version: 2.0.0


Database Restricted Access

Dataset for Segmentation and Classification of Cardiac Implantable Electronic Devices in Chest X-Rays

Keno Bressem, Felix Busch, Andrei Zhukov, Lisa Adams

This dataset comprises 11,094 converted DICOM and smartphone images of Cardiac Implantable Electronic Devices (CIEDs), collected from 897 patients. It aims to facilitate the development of algorithms for CIED detection and classification.

chest x-ray radiology medical imaging cardiac implantable electronic devices

Published: March 4, 2025. Version: 1.0.0


Database Credentialed Access

Medical-CXR-VQA dataset: A Large-Scale LLM-Enhanced Medical Dataset for Visual Question Answering on Chest X-Ray Images

Xinyue Hu, Lin Gu, Kazuma Kobayashi, Liangchen Liu, Mengliang Zhang, Tatsuya Harada, Ronald Summers, Yingying Zhu

Medical-CXR-VQA provides a large-scale LLM-enhanced dataset for visual question answering in medical chest x-ray images.

Published: Jan. 21, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-Ext-CXR-QBA: A Structured, Tagged, and Localized Visual Question Answering Dataset with Question-Box-Answer Triplets and Scene Graphs for Chest X-ray Images

Philip Müller, Friederike Jungmann, Georgios Kaissis, Daniel Rueckert

We present a large-scale CXR VQA dataset derived from MIMIC-CXR with 42M QA pairs, featuring hierarchical answers, bounding boxes, and structured tags. We generated the QA pairs using LLM-based extraction from radiology reports and localization models.

chest x-rays vqa localization scene graphs

Published: July 22, 2025. Version: 1.0.0