PhysioNet Index

Database Credentialed Access

MIMIC-CXR-Ext-ILS: Lesion Segmentation Masks and Instruction-Answer Pairs for Chest X-rays

Geon Choi, Hangyul Yoon, Hyunju Shin, et al.

Instruction-guided lesion segmentation data for chest X-rays, including 1.1M instruction-answer pairs and 91K segmentation masks covering seven major lesion types.

chest x-ray segmentation text-guided segmentation lesion segmentation

Published: March 25, 2026. Version: 1.0.0

Database Credentialed Access

MIMIC-IV-Ext-MedicalBench: Evaluating Large Language Models Towards Improved Medical Concept Extraction

Zhichao Yang, Gregory Lyng, Sanjit Batra, et al.

This dataset is an evidence‑grounded benchmark built on MIMIC‑IV discharge summaries that evaluates how well large language models can verify ICD‑10 medical concepts, including implicitly documented diagnoses, by identifying supporting text evidence.

Published: March 23, 2026. Version: 1.0.0

Database Open Access

Longitudinal Cylinder Rearing Behavioral Data in a Mouse Stroke Model Across Multiple Drug Treatments

Yunhao Jiang, Shreyas Venkitaraman, Hee Ra Jung, et al.

This dataset provides cylinder rearing video and behavioral scoring data from 59 mice undergoing stroke and drug treatments.

Published: March 4, 2026. Version: 1.0.0

Database Open Access

Respiratory and Pulse Oximetry Waveforms from Healthy Adults During Simulated Apnoea Events

Jordan Hill, Ella Frances Sophia Guy, Jaimey Anne Clifton, et al.

This dataset contains airway pressure, flow and pulse oximetry waveforms from 20 healthy adults during simulated apnoea events, including arterial and venous PPG signals for developing and validating OSA detection and oxygenation models.

pulse oximetry respiratory obstructive sleep apnea

Published: March 4, 2026. Version: 1.0.0

Database Credentialed Access

Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation

Jong Hak Moon, Geon Choi, Paloma Rabaey, et al.

A radiologist-annotated benchmark of structured chest X-ray reports at single and sequential levels, comprising 1,473 reports across 18 relation types and 80 longitudinal cases.

fine-grained structured reports attribute-level clinical reasoning medical text structuring longitudinal clinical reasoning chest x-ray report parsing medical information structuring benchmark dataset for radiology report medical information extraction structured radiology reports temporal relation extraction radiology report benchmarking longitudinal clinical understanding

Published: Jan. 11, 2026. Version: 1.0.0

Database Open Access

PSG-IPA: A PolySomnoGraphic Inter-scorer Performance Assessment database

Diego Alvarez-Estevez

The HMC-IPA dataset comprises 20 PSG recordings, each with manual and computer-assisted scorings by 12 sleep technologists, for studying inter-scorer variability and evaluating automated sleep analysis algorithms

Published: Jan. 8, 2026. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context

Zishan Gu, Jiayuan Chen, Fenglin Liu, et al.

MedVH provides a visual hallucination evaluation benchmark for large language models in the medical context. It formulates tests using chest X-ray images, including multi-choice question answering and long-text generation tasks.

Published: Dec. 10, 2025. Version: 1.0.1

Database Credentialed Access

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Hyungyung Lee, Geon Choi, Jung Oh Lee, et al.

CheXStruct is an automated pipeline that derives structured diagnostic reasoning steps from chest X-rays. CXReasonBench builds on this to evaluate whether models perform clinically grounded, multi-step reasoning beyond final diagnoses.

evaluation chest x-ray benchmark structured chest x-ray qa intermediate reasoning steps structured reasoning grounded reasoning diagnostic reasoning structured diagnostic pipeline

Published: Oct. 23, 2025. Version: 1.0.1

Model Credentialed Access

RadVLM model

Nicolas Deperrois, Hidetoshi Matsuo, Samuel Ruiperez-Campillo, et al.

RadVLM is a 7B-parameter vision-language model fine-tuned on public chest-X-ray data that drafts reports, lists abnormalities, grounds findings, and chats about a CXR through a single image-to-text interface.

Published: Oct. 8, 2025. Version: 1.0.0

Database Restricted Access

Organ Retrieval and Collection of Health Information for Donation (ORCHID)

Hammaad Adam, Vinith Suriyakumar, Tom Pollard, et al.

Multi-center dataset on organ procurement in the United States

organ procurement organizations organ transplantation

Published: Sept. 29, 2025. Version: 2.1.1

Search

Resources

MIMIC-CXR-Ext-ILS: Lesion Segmentation Masks and Instruction-Answer Pairs for Chest X-rays

MIMIC-IV-Ext-MedicalBench: Evaluating Large Language Models Towards Improved Medical Concept Extraction

Longitudinal Cylinder Rearing Behavioral Data in a Mouse Stroke Model Across Multiple Drug Treatments

Respiratory and Pulse Oximetry Waveforms from Healthy Adults During Simulated Apnoea Events

Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation

PSG-IPA: A PolySomnoGraphic Inter-scorer Performance Assessment database

MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

RadVLM model

Organ Retrieval and Collection of Health Information for Donation (ORCHID)