Resources


Software Open Access

Random Search Toolbox

A major issue with many signal processing and machine learning algorithms is the lack of optimisation methods for determining the numerous hyper-parameters associated with the model as well as the knowledge of which hyper-parameters are relevant. Th…

Published: Nov. 4, 2014. Version: 1.0.0


Database Contributor Review

Salzburg Intensive Care database (SICdb), a freely accessible intensive care database

Niklas Rodemund, Andreas Kokoefer, Bernhard Wernly, Crispiana Cozowicz

The SICdb dataset, version 1.0.8 contains 27350 admissions to an ICU in an Austrian tertiary care institution.

clinical intensive care critical care open data machine learning

Published: Sept. 10, 2024. Version: 1.0.8


Database Open Access

A multi-camera and multimodal dataset for posture and gait analysis

Manuel Palermo, João Mendes Lopes, João André, Joao Cerqueira, Cristina Santos

Multimodal dataset with 166k samples for vision-based applications with a smart walker used in gait and posture rehabilitation. It is equipped with a pair of Depth cameras with data synchronized with an inertial MoCap system worn by the participant.

computer vision inertial motion capture smart walker human pose estimation gait and posture analysis depth rehabilitation deep learning

Published: Nov. 1, 2021. Version: 1.0.0


Database Open Access

Wide-field calcium imaging sleep state database

Eric Landsness, Xiaohui Zhang, Wei Chen, Hanyang Miao, Michelle Tang, Lindsey Brier, Mark Anastasio, Jin-Moo Lee, Joseph Culver

Wide-field calcium imaging database that consists of annotated sleep recording collected from transgenic mice at Washington University of St Louis School of Medicine.

sleep wide-field calcium imaging sleep state classification sleep staging machine learning

Published: March 17, 2022. Version: 1.0.1


Database Restricted Access

EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs

Pierre Elias, Joshua Finer

EchoNext is a curated dataset of electrocardiograms (ECGs) paired with echocardiogram-confirmed structural heart disease labels, designed to support the development and validation of machine learning models.

heart failure clinical decision support artificial intelligence health equity ecg machine learning deep learning electrocardiogram aortic stenosis cardiovascular screening valvular heart disease digital health ai model deployment left ventricular dysfunction ai in healthcare population health transthoracic echocardiogram structural heart disease

Published: Sept. 16, 2025. Version: 1.1.0


Database Credentialed Access

MIMIC-IV-Ext Triage Instruction Corpus

Qingyang Shen, Quan Guo

MIMIC-IV-Ext Triage Instruction Corpus includes 9,629 ED triage cases organized by the five-level ESI, enabling LLMs to improve triage accuracy. It provides CSV data, generation prompts, expert validation samples, and SQL QC scripts.

nlp clinical decision support large language models machine learning emergency severity index emergency triage

Published: March 4, 2025. Version: 1.0.0


Database Open Access

Synthetic Mention Corpora for Disease Entity Recognition and Normalization

Kuleen Sasse, John David Osborne

We present the Synthetic Mention Corpora for Disease Entity Recognition and Normalization, containing 128000 disease mentions from the UMLS disorder group, generated by an LLM. This corpus aims to improve these tasks in biomedical and clinical texts.

nlp named entity recognition machine learning data augmentation entity normalization

Published: Feb. 3, 2025. Version: 1.0.0


Database Open Access

CGMacros: a scientific dataset for personalized nutrition and diet monitoring

Ricardo Gutierrez-Osuna, David Kerr, Bobak Mortazavi, Anurag Das

CGMacros contains information from two continuous glucose monitors (CGM), food macronutrients, food photographs, physical activity, and anonymized participant demographics, anthropometric measurements and health parameters.

diabetes continuous glucose monitors machine learning obesity postprandial glucose response food macronutrients metabolic models food photographs personalized nutrition

Published: Jan. 28, 2025. Version: 1.0.0


Database Contributor Review

COVID Data for Shared Learning (CDSL): A comprehensive, multimodal COVID-19 dataset from HM Hospitales

Álvaro Ritoré, Andreea M Oprescu, Alberto Estirado Bronchalo, Miguel Ángel Armengol de la Hoz

COVID Data for Shared Learning (CDSL) is a multimodal database comprising de-identified structured health data and radiological images from 4,479 patients with COVID-19, as a comprehensive toolkit for developing predictive models.

covid-19 multimodal database radiological images open data healthcare data machine learning and ai

Published: Oct. 25, 2024. Version: 1.0.0


Database Credentialed Access

MIMIC-IV

Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Brian Gow, Benjamin Moody, Steven Horng, Leo Anthony Celi, Roger Mark

Large database of de-identified health information from patients admitted to Beth Israel Deaconess Medical Center

critical care intensive care unit mimic machine learning

Published: Oct. 11, 2024. Version: 3.1