Resources


Database Credentialed Access

EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images

Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi

We present EHRXQA, the first multi-modal EHR QA dataset combining structured patient records with aligned chest X-ray images. EHRXQA contains a comprehensive set of QA pairs covering image-related, table-related, and image+table-related questions.

question answering machine learning evaluation visual question answering electronic health records benchmark multi-modal question answering deep learning chest x-ray ehr question answering semantic parsing

Published: July 23, 2024. Version: 1.0.0


Database Credentialed Access

Chest ImaGenome Dataset

Joy Wu, Nkechinyere Agu, Ismini Lourentzou, Arjun Sharma, Joseph Paguio, Jasper Seth Yao, Edward Christopher Dee, William Mitchell, Satyananda Kashyap, Andrea Giovannini, Leo Anthony Celi, Tanveer Syeda-Mahmood, Mehdi Moradi

The Chest ImaGenome dataset is a scene graph dataset with additional chronological comparison relations for chest X-rays. It is automatically derived from the MIMIC-CXR dataset. A manually annotated gold standard is also available for 500 patients.

scene graph visual dialogue object detection semantic reasoning bounding box relation extraction knowledge graph explainability reasoning chest cxr disease progression multimodal radiology machine learning visual question answering deep learning chest x-ray

Published: July 13, 2021. Version: 1.0.0


Database Credentialed Access

INSPIRE, a publicly available research dataset for perioperative medicine

Leerang Lim, Hyung-Chul Lee

A public dataset that contains information related to surgery, anesthesia, laboratory results, medications, diagnosis, and outcomes from 50% of the patients who received surgery at Seoul National University Hospital between 2011 and 2020.

surgery perioperative medicine multi-center open dataset

Published: Dec. 28, 2023. Version: 1.2


Database Credentialed Access

A Brazilian Multilabel Ophthalmological Dataset (BRSET)

Luis Filipe Nakayama, Mariana Goncalves, Lucas Zago Ribeiro, Helen Santos, Daniel Ferraz, Fernando Malerbi, Leo Anthony Celi, Caio Regatieri

This is the first Brazilian Multilabel Ophthalmological Dataset with demographic information and retinal photos labeled images according to anatomical parameters, quality control, and presumed diagnosis.

dataset retina ophthalmology

Published: March 8, 2023. Version: 1.0.0


Database Credentialed Access

EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems

Konstantin Kotschenreuther

Dataset consisting of question and answer pairs synthetically generated from medical discharge summaries, designed to facilitate the training and development of large language models specifically tailored for healthcare applications

mimic-iv clinical question-answering medical discharge summaries large language models

Published: Jan. 11, 2024. Version: 1.0.0


Database Credentialed Access

INSPIRE, a publicly available research dataset for perioperative medicine

Leerang Lim, Hyung-Chul Lee

A public dataset that contains information related to surgery, anesthesia, laboratory results, medications, diagnosis, and outcomes from 50% of the patients who received surgery at Seoul National University Hospital between 2011 and 2020.

surgery perioperative medicine multi-center open dataset

Published: Dec. 28, 2023. Version: 1.2


Database Credentialed Access

Medical-Diff-VQA: A Large-Scale Medical Dataset for Difference Visual Question Answering on Chest X-Ray Images

Xinyue Hu, Lin Gu, Qiyuan An, Mengliang Zhang, liangchen liu, Kazuma Kobayashi, Tatsuya Harada, Ronald Summers, Yingying Zhu

MIMIC-Diff-VQA provides a large-scale dataset for Difference visual question answering in medical chest x-ray images.

difference vqa vqa difference visual question answering visual question answering chest x-ray

Published: Sept. 15, 2023. Version: 1.0.0


Database Open Access

Electroencephalogram and eye-gaze datasets for robot-assisted surgery performance evaluation

Somayeh B Shafiei, Saeed Shadpour, James Mohler, Mehdi Seilanian Toussi, Philippa Doherty, Zhe Jing

The brain activity and eye gaze data were recorded from 25 participants performing surgical tasks using a robot simulator. The performance score was created by the simulator. Data can be used to evaluate surgical performance.

Published: July 14, 2023. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

Radiology Report Expert Evaluation (ReXVal) Dataset

Feiyang Yu, Mark Endo, Rayan Krishnan, Ian Pan, Andy Tsai, Eduardo Pontes Reis, Eduardo Kaiser Ururahy Nunes Fonseca, Henrique Lee, Zahra Shakeri, Andrew Ng, Curtis Langlotz, Vasantha Kumar Venugopal, Pranav Rajpurkar

The Radiology Report Expert Evaluation (ReXVal) Dataset is a publicly available dataset of radiologist evaluations of errors in automatically generated radiology reports.

Published: June 20, 2023. Version: 1.0.0


Database Credentialed Access

A Brazilian Multilabel Ophthalmological Dataset (BRSET)

Luis Filipe Nakayama, Mariana Goncalves, Lucas Zago Ribeiro, Helen Santos, Daniel Ferraz, Fernando Malerbi, Leo Anthony Celi, Caio Regatieri

This is the first Brazilian Multilabel Ophthalmological Dataset with demographic information and retinal photos labeled images according to anatomical parameters, quality control, and presumed diagnosis.

dataset retina ophthalmology

Published: March 8, 2023. Version: 1.0.0