Resources
Database Credentialed Access
MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples
This dataset contains 450K open-ended instruction-following examples generated using GPT-3.5 based on the MIMIC-IV EHR database.
large language models medical question answering instruction tuning
Published: Sept. 9, 2025. Version: 1.0.0
Database Contributor Review
COVID Data for Shared Learning (CDSL): A comprehensive, multimodal COVID-19 dataset from HM Hospitales
COVID Data for Shared Learning (CDSL) is a multimodal database comprising de-identified structured health data and radiological images from 4,479 patients with COVID-19, as a comprehensive toolkit for developing predictive models.
covid-19 multimodal database radiological images open data healthcare data machine learning and ai
Published: Oct. 25, 2024. Version: 1.0.0