Resources


Database Credentialed Access

BRAX, a Brazilian labeled chest X-ray dataset

Eduardo Pontes Reis, Joselisa Paiva, Maria Carolina Bueno da Silva, Guilherme Alberto Sousa Ribeiro, Victor Fornasiero Paiva, Lucas Bulgarelli, Henrique Lee, Paulo Victor dos Santos, vanessa brito, Lucas Amaral, Gabriel Beraldo, Jorge Nebhan Haidar Filho, Gustavo Teles, Gilberto Szarf, Tom Pollard, Alistair Johnson, Leo Anthony Celi, Edson Amaro

BRAX contains 24,959 chest radiography exams and 40,967 images acquired in a large general Brazilian hospital. All images have been read by trained radiologists and 14 labels were derived from Brazilian Portuguese reports using NLP.

chest x-ray artificial intelligence dataset

Published: June 17, 2022. Version: 1.1.0


Software Open Access

R-DECO: An open-source Matlab based graphical user interface for the detection and correction of R-peaks

Jonathan Moeyersons, Matthew Amoni, Sabine Van Huffel, Rik Willems, Carolina Varon

An open-source Matlab based graphical user interface for the detection and correction of R-peaks.

algorithms and analysis of algorithms signal processiong visual analysis graphical user interface

Published: Sept. 8, 2020. Version: 1.0.0


Database Credentialed Access

Phenotype Annotations for Patient Notes in the MIMIC-III Database

Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt

Clinical notes, annotated by at least two expert annotators for over ten patient phenotypes, including advanced cancer, substance abuse, and treatment non-adherence.

patient classification natural language processing

Published: March 5, 2020. Version: 1.20.03


Software Open Access

Waveform Database Software Package (WFDB) for Python

Chen Xie, Lucas McCullum, Alistair Johnson, Tom Pollard, Brian Gow, Benjamin Moody

Tools for working with waveforms in Python.

waveform wfdb python

Published: Jan. 24, 2023. Version: 4.1.0


Database Open Access

EPHNOGRAM: A Simultaneous Electrocardiogram and Phonocardiogram Database

Arsalan Kazemnejad, Peiman Gordany, Reza Sameni

An open-access database recorded during the EPHNOGRAM project, consisting of simultaneous electrocardiogram (ECG) and phonocardiogram (PCG) recordings from young healthy adults, during stress-test experiments.

stress-test electrocardiogram phonocardiogram

Published: June 11, 2021. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

Curated Data for Describing Blood Glucose Management in the Intensive Care Unit

Aldo Robles Arévalo, Roselyn Mateo-Collado, Leo Anthony Celi

The data subsets consist of time series files that includes all the curated entries of glucose readings and insulin inputs from MIMIC-III database.

insulin replacement therapy glycemic control critical care

Published: April 19, 2021. Version: 1.0.1


Database Credentialed Access

MedNLI for Shared Task at ACL BioNLP 2019

Chaitanya Shivade

Data for the MedNLI Shared Task at the 2019 ACL BioNLP 2019 Workshop on Biomedical Language Processing

mimic natural language inference recognizing textual entailment

Published: Nov. 28, 2019. Version: 1.0.1


Challenge Contributor Review

BioNLP Workshop 2023 Shared Task 1A: Problem List Summarization

Yanjun Gao, Timothy Miller, Majid Afshar, Dmitriy Dligach

This is the data storage for BioNLP Workshop Shared Task 1A: Problem List Summarization.

clinical natural language processing bionlp electronic health record summarization

Published: Jan. 19, 2023. Version: 1.0.0


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

deep learning pressure injury risk prediction acute kidney injury anemia forecasting natural language processing

Published: Sept. 15, 2022. Version: 1.0


Database Contributor Review

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

prescriptions exams tertiary care clinical notes natural language processing

Published: July 14, 2022. Version: 1.1