Resources


Database Open Access

A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia

Sebastian A Cajas, David Restrepo, Dana Moukheiber, Kuan Ting Kuo, Chenwei Wu, David Santiago Garcia Chicangana, Atika Rahman Paddo, Mira Moukheiber, Lama Moukheiber, Sulaiman Moukheiber, Saptarshi Purkayastha, Diego M Lopez, Po-Chih Kuo, Leo Anthony Celi

Multi-Modal Satellite imagery Dataset in Colombia: A public health analysis with spatiotemporally aligned satellite images and its corresponding metadata across 81 municipalities (2016-2018), facilitating multimodal AI applications.

multimodality satellite imagery

Published: Jan. 30, 2024. Version: 1.0.0


Database Restricted Access

A multimodal dental dataset facilitating machine learning research and clinic services

wenjing liu, Yunyou Huang, Suqin Tang

A new dental dataset that contains 389 patients, three commonly used dental image models, and images of various health conditions of the oral cavity.

Published: Sept. 6, 2023. Version: 1.0.0


Database Credentialed Access

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Matthew Lungren, Andrew Ng, Curtis Langlotz, Pranav Rajpurkar

RadGraph is a dataset of entities and relations in full-text chest X-ray radiology reports, which are obtained using a novel information extraction (IE) schema to capture clinically relevant information in a radiology report.

entity and relation extraction graph multi-modal radiology natural language processing

Published: June 3, 2021. Version: 1.0.0


Database Credentialed Access

MS-CXR-T: Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Max Ilse, Daniel Coelho de Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anton Schwaighofer, Maria Teodora Wetscherek, Hannah Richardson, Tristan Naumann, Javier Alvarez Valle, Ozan Oktay

The MS-CXR-T is a multimodal benchmark that enhances the MIMIC-CXR v2 dataset by including expert-verified annotations. Its goal is to evaluate biomedical visual-language processing models in terms of temporal semantics extracted from image and text.

multimodal chest x-ray radiology cxr disease progression vision-language processing

Published: March 17, 2023. Version: 1.0.0


Database Credentialed Access

Eye Gaze Data for Chest X-rays

Alexandros Karargyris, Satyananda Kashyap, Ismini Lourentzou, Joy Wu, Matthew Tong, Arjun Sharma, Shafiq Abedin, David Beymer, Vandana Mukherjee, Elizabeth Krupinski, Mehdi Moradi

This dataset was a collected using an eye tracking system while a radiologist interpreted and read 1,083 public CXR images. The dataset contains the following aligned modalities: image, transcribed report text, dictation audio and eye gaze data.

audio convolutional network heatmap eye tracking multimodal chest x-ray machine learning radiology explainability chest cxr deep learning

Published: Sept. 12, 2020. Version: 1.0.0


Software Credentialed Access

Code for generating the HAIM multimodal dataset of MIMIC-IV clinical data and x-rays

Luis R Soenksen, Yu Ma, Cynthia Zeng, Leonard David Jean Boussioux, Kimberly Villalobos Carballo, Liangyuan Na, Holly Wiberg, Michael Li, Ignacio Fuentes, Dimitris Bertsimas

Code for generating the HAIM multimodal dataset of MIMIC-IV clinical data and x-rays

database code multimodality

Published: Aug. 23, 2022. Version: 1.0.1


Database Credentialed Access

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang

DrugEHRQA is a QA dataset containing question-answers from MIMIC-III tables and discharge summaries.

question-answer qa

Published: April 12, 2022. Version: 1.0.0


Database Open Access

EPHNOGRAM: A Simultaneous Electrocardiogram and Phonocardiogram Database

Arsalan Kazemnejad, Peiman Gordany, Reza Sameni

An open-access database recorded during the EPHNOGRAM project, consisting of simultaneous electrocardiogram (ECG) and phonocardiogram (PCG) recordings from young healthy adults, during stress-test experiments.

stress-test electrocardiogram phonocardiogram

Published: June 11, 2021. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Matthew Lungren, Andrew Ng, Curtis Langlotz, Pranav Rajpurkar

RadGraph is a dataset of entities and relations in full-text chest X-ray radiology reports, which are obtained using a novel information extraction (IE) schema to capture clinically relevant information in a radiology report.

entity and relation extraction graph multi-modal radiology natural language processing

Published: June 3, 2021. Version: 1.0.0