Database Credentialed Access
MIMIC-III-Ext-PPG: A PPG Benchmark Dataset for Cardiorespiratory Analysis
Mohammad Moulaeifard , Peter H Charlton , Nils Strodthoff
Published: Feb. 4, 2026. Version: 1.0.0
When using this resource, please cite:
Moulaeifard, M., Charlton, P. H., & Strodthoff, N. (2026). MIMIC-III-Ext-PPG: A PPG Benchmark Dataset for Cardiorespiratory Analysis (version 1.0.0). PhysioNet. RRID:SCR_007345. https://doi.org/10.13026/nmwb-6h34
Please include the standard citation for PhysioNet:
(show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220. RRID:SCR_007345.
Abstract
MIMIC-III-Ext-PPG is a large-scale, annotated, and quality-assessed dataset of photoplethysmogram (PPG) segments extracted from the MIMIC-III Waveform Database Matched Subset. It contains 4.9 million non-overlapping 30-second segments from 6,131 critically-ill patients and serves as the largest public resource for heart rhythm classification. The dataset contains signals recorded during several heart rhythms: sinus rhythm, atrial fibrillation, atrial flutter, heart block, pacing, and several less common rhythms. PPG segments were extracted within the 15-minute period prior to a patient's heart rhythm being recorded in the medical record, providing reference rhythm annotations in-line with best practice from the literature. In addition, simultaneous electrocardiogram (ECG, lead II), arterial blood pressure (ABP), and respiratory (RESP) signals are provided where available. Additional annotations were derived from these signals, including blood pressure, heart rate, and respiratory rate. Signal quality metrics are provided for all signals. Furthermore, a dedicated pipeline was applied to remove mislabeled signals, which, to the best of our knowledge, have previously gone unnoticed. The MIMIC-III-Ext-PPG dataset is a valuable resource for a variety of downstream prediction tasks, enabled by its size, range of reference annotations, and signal quality metrics.
Background
Physiological signals such as PPG, ECG, ABP, and RESP are widely used to monitor health. In the Intensive Care Unit (ICU), these signals are often monitored continuously, whilst wearables such as smartwatches use PPG and ECG for health monitoring. Much research focuses on extracting physiological information from these signals, with tasks including arrhythmia detection [1], blood pressure estimation [2], and signal quality assessment. However, public datasets are often limited in scale or lack support for comprehensive classification and regression tasks (e.g., DeepBeat [3], PulseDB [4], MIMIC-BP [5]).
To fill this gap, we introduce MIMIC-III-Ext-PPG, a large-scale, curated dataset with 4.9 million non-overlapping 30-second PPG segments. Each segment is annotated with a heart rhythm obtained from a rhythm documentation in the medical notes, at most 15 minutes after the waveform data. In addition, simultaneous ECG (lead II), ABP, and RESP signals and derived parameters from them are also included where available.
Methods
Data Processing Pipeline
- Waveform Selection:
We used records from the MIMIC-III Waveform Database Matched Subset [6], which contains digitized Critical Care waveforms (PPG, ECG, ABP, RESP) sampled at 125 Hz.- Inclusion criteria:
A waveform record was included if it met all of the following:- Contains a PPG/PLETH channel with valid sampling metadata.
- Its time interval overlaps with at least one rhythm chart event.
- Can be successfully parsed in WFDB format.
- Contains ≥30 seconds of non-flat, non-empty PPG signal before quality filtering.
A waveform record was excluded if:- It did not contain a PPG signal.
- No rhythm chart events occurred within its waveform time span.
- The WFDB file was corrupted or unreadable.
- The PPG channel was entirely flat, constant, NaN, or missing.
- Remaining usable PPG was <30 seconds after extraction.
- Inclusion criteria:
- Clinical Metadata Extraction:
Chart events, including heart rhythm annotations and demographic measurements, were extracted from the MIMIC-III clinical database using both CareVue and MetaVision systems.- Chart event inclusion criteria:
- The event timestamp falls within the waveform recording.
- Rhythm label text is non-empty and parsable.
- Non-rhythm chart events.
- Rhythm labels that were blank, ambiguous, or malformed.
- Chart event inclusion criteria:
- Event Matching:
For each waveform, we identified rhythm events that occurred within its time span and retrieved:
(i) the current rhythm annotation,
(ii) the previous rhythm event, and
(iii) the nearest height and weight records. - Annotation Harmonization:
Rhythm labels from CareVue and MetaVision were unified into a single standardized vocabulary (see Table: heart-rhythms_before_filter). - Diagnosis Mapping:
ICD-9 discharge diagnoses were mapped to ICD-10 codes [7], expanded hierarchically, and truncated to 3-digit levels to support generalizable analysis. - Demographics:
Patient age, sex, ethnicity, height (in cm), and weight (in kg) were extracted from structured fields in the clinical data. - Data Stratification:
We divided the dataset into 12 non-overlapping folds using unique patient identifiers. Stratification was balanced by rhythm class, age strata, and ICD-10 Chapter IX cardiovascular diagnoses to minimize data leakage. - Signal Segment Extraction:
For each matched event, we extracted waveform data ending at the annotation time and starting from the previous rhythm annotation or a point up to 15 minutes earlier, whichever was more recent. Segments were then split into 30-second windows, and leftover fragments under 30 seconds were discarded. All segments contained PPG (PLETH) signals. In addition, where available, simultaneous ECG (lead II), ABP, and RESP signals were also extracted. - Signal Storage:
All extracted 30-second waveform segments were stored in WFDB format (.heaand.dat) following PhysioNet conventions. - Timestamp Handling:
The event_time (timestamp of the rhythm chart event) and start_segment (start time of each 30-second waveform window) values directly inherit MIMIC-III’s date-shifted timestamps
Enrichment with Additional Annotations and Signal Quality Metrics:
Each 30-second segment was enriched with additional annotations and signal quality metrics based on the available waveforms:
- RESP was analyzed over the full 30-second segment.
- ABP, PPG, and ECG were evaluated in three consecutive 10-second windows, producing a 3-value vector representing the intervals 0–10 seconds, 10–20 seconds, and 20–30 seconds. For these signals, both the vector and aggregated statistics (median with IQR) were reported.
The processes for extracting additional annotations and signal quality metrics are now described.
Enrichment with Additional Annotations: SBP, DBP, HR, and RR
- Respiratory Rate (RR, from RESP, if available):
RR was derived over the full 30-second segment using the algorithm in [8]. An instantaneous RR was calculated from each valid breath duration, and RRs were summarized for each segment as the median and IQR. - Heart Rate (HR, from ECG, if available):
HR was calculated from beat-to-beat intervals within each 10-second window using the Neurokit QRS detector [9], with the final HR obtained from the median RR interval [10]. For each segment, a 3-element vector (one per window) was extracted, along with the segment-level median and IQR. - Systolic and Diastolic Blood Pressure (SBP and DBP from ABP, if available):
SBP and DBP were computed using the slope sum function (SSF) beat detection algorithm [11] within each 10-second window. Similarly to HR, a 3-element vector and segment-level median and IQR were reported.
Enrichment with Signal Quality Metrics
We implemented a signal quality assessment pipeline for RESP, PPG, ECG, and ABP signals. Quality was quantified using an integer-based signal quality index (SQI) code.
SQI codes:
| Code | Stage / Source | Signal | Meaning |
|---|---|---|---|
| 1 | SQI Calculation | All | Signal quality is high, suitable for reliable analysis |
| 0 | SQI Calculation | All | Signal quality is low but still usable for downstream processing |
| -2 | Validation | All | Signal is flatlined or shows repeated extreme values |
| -3 | Validation | All | Signal is empty or contains NaN values |
| -11 | SQI Calculation (based on [6]) | RESP | No valid respiratory peaks or troughs were detected |
| -12 | SQI Calculation (based on [6]) | RESP | Insufficient number of breathing cycles to compute the SQI |
| -13 | SQI Calculation (based on [6]) | RESP | Too few valid respiratory segments to perform template-based assessment |
| -14 | SQI Calculation (based on [8]) | ECG/PPG | No valid RR intervals detected, typically due to missing or unreliable beats. |
| -15 | SQI Calculation (based on [8]) | ECG/PPG | RR-interval sample window is too short for meaningful evaluation |
| -16 | SQI Calculation (based on [8]) | ECG/PPG | The number of usable segments is insufficient for template matching |
| -17 | SQI Calculation (based on [8]) | ECG | R-peak detections are invalid or too few for quality estimation. |
| -18 | SQI Calculation (based on [8]) | PPG | Too few or invalid pulse peaks were detected in the PPG signal |
| -19 | SQI Calculation (based on [9]) | ABP | Beat onset detection failed; ABP waveform cannot be evaluated |
| -20 | Dispatcher | All | Signal type is not recognized by the SQI module |
| -21 | Exception Handler | All | Unexpected error occurred during SQI computation |
Detection of Labeling Errors
During the signal inspection, we identified mislabeled or corrupted segments in the WFDB files. To address this, we applied a filtering procedure using the following features:
- Mean derivative of PPG peaks
- R-peak derivatives in PPG
- Dominant frequency in RESP
- SBP and DBP range validity
Approximately 30% of the segments were identified as possibly mislabeled or affected by signal transitions, based on established criteria. We excluded all the identified segments. We intentionally took this conservative approach to ensure the quality of the final dataset, even though this procedure might have excluded not only incorrectly labelled segments, but also some correctly labeled segments.
Data Description
Dataset Summary
The following table summarizes the data available for each task (heart rhythm classification, RR, HR, and BP estimation) in the MIMIC-III-Ext-PPG dataset. It provides the number of subjects, segments, and demographic statistics available.
| Metric | Heart Rhythm | RR | HR | BP |
|---|---|---|---|---|
| Number of subjects | 6,131 | 4,650 | 5,850 | 2,371 |
| Number of 30s segments | 4,920,487 | 4,346,432 | 3,991,992 | 1,890,846 |
| Total Duration (hours) | ~41,004 | ~36,220 | ~33,266 | ~15,757 |
| Age (years, mean ± SD) | 64.1 ± 17.0 | 65.0 ± 16.1 | 63.9 ± 16.9 | 63.2 ± 16.1 |
| Weight (kg, mean ± SD) | 87.3 ± 6.8 | 87.2 ± 6.8 | 87.4 ± 6.7 | 87.6 ± 6.7 |
| Height (cm, mean ± SD) | 169.4 ± 10.6 | 169.6 ± 10.5 | 169.4 ± 10.6 | 169.7 ± 10.4 |
| Gender (% female) | 44.1% | 43.6% | 44.3% | 42.3% |
| Ethnicity | ||||
| -white | 71.9% | 73.3% | 71.9% | 71.6% |
| -black | 9.2% | 9.4% | 9.2% | 7.5% |
| -hispanic | 4.1% | 3.8% | 4.1% | 3.9% |
| -asian | 2.9% | 2.9% | 2.9% | 3.0% |
| -other | 11.9% | 10.6% | 11.9% | 14.0% |
We also summarized the dataset in terms of heart rhythm labels (obtained from the 'event_rhythm' column of the 'metadata.csv' file).
| Acronym | Description | Patients | Samples |
|---|---|---|---|
| SR | Sinus rhythm | 4,988 | 3,095,378 |
| STACH | Sinus tachycardia | 2,148 | 841,720 |
| AF | Atrial fibrillation | 981 | 449,784 |
| SBRAD | Sinus bradycardia | 982 | 164,183 |
| VPACE | Ventricular pacing | 234 | 102,547 |
| 1AVB | First degree AV block | 263 | 91,274 |
| AVPACE | Atrioventricular pacing | 144 | 54,033 |
| APACE | Atrial pacing | 112 | 40,207 |
| AFLT | Atrial flutter | 139 | 29,774 |
| LBBB | Left bundle branch block | 43 | 18,999 |
| RBBB | Right bundle branch block | 39 | 10,703 |
| JR | Junctional rhythm | 49 | 5,650 |
| SVTACH | Supraventricular tachycardia | 63 | 5,067 |
| SARRH | Sinus arrhythmia | 60 | 4,636 |
| 2AVBM1 | 2nd degree AV block Mobitz I | 12 | 1,503 |
| 3AVB | Third degree AV block | 19 | 1,347 |
| VTACH | Ventricular tachycardia | 16 | 1,146 |
| MATACH | Multifocal atrial tachycardia | 6 | 1,114 |
| 2AVBM2 | 2nd degree AV block Mobitz II | 12 | 397 |
| WAPACE | Wandering atrial pacemaker | 5 | 392 |
| JTACH | Junctional tachycardia | 6 | 311 |
| OTHER | Other | 3 | 141 |
| VFIB | Ventricular fibrillation | 2 | 98 |
| IDIOV | Idioventricular rhythm | 4 | 44 |
| ASYS | Asystole | 2 | 33 |
| PATACH | Paroxysmal atrial tachycardia | 2 | 6 |
| Total | 10,334 | 4,920,487 |
Waveforms
This dataset is structured in WFDB format, making it compatible with a wide range of signal processing tools, including PhysioNet’s WFDB software, Python (wfdb, biosppy, neurokit2), MATLAB, and Julia packages. Each record contains preprocessed waveform data (e.g., PPG, ECG, ABP, RESP) segmented into 30-second waveform segments, along with a rich set of metadata provided in accompanying .csv files.
Metadata
This dataset includes a rich set of segment-level metadata for each signal sample, provided in a structured metadata.csv file.
Signal Information
| Variable Name | Type | Units | Description | Allowed Values / Notes |
|---|---|---|---|---|
| record_id | string | — | Original MIMIC-III waveform record ID (WFDB record name). | e.g. "3238451_0005" |
| event_id | integer | — | Sequential index of the rhythm chart event within the record. | 0, 1, 2, … |
| segment_id | integer | — | Index of the 30-s segment, counted backwards from the event time (0 = segment ending exactly at event_time). |
0, 1, 2, … |
| signal_file_name | string | — | Name of the WFDB file storing the 30-s segment. | <record>_<event_id>_<segment_id>.hea/dat |
| folder_path | string | — | Relative path to the WFDB directory containing the segment. | pXX/pYYYYYYY/ |
| start_segment | datetime | seconds | Start timestamp of the 30-s segment. | Inherits MIMIC-III date-shifted timestamps |
| event_time | datetime | seconds | Timestamp of the underlying heart rhythm chart event. | Inherits MIMIC-III date-shifted timestamps |
| signal_length_s | integer | seconds | Total waveform duration extracted before segmentation. | typically between 30–900 s (max 15 min) |
Heart Rhythm
| Variable Name | Type | Units | Description | Allowed Values |
|---|---|---|---|---|
| event_rhythm | string | — | Harmonized rhythm label associated with the chart event. | 26 allowable codes: SR, STACH, SBRAD, AF, AFLT, APACE, VPACE, AVPACE, 1AVB, 2AVBM1, 2AVBM2, 3AVB, RBBB, LBBB, SARRH, JTACH, JR, SVTACH, MATACH, VTACH, VFIB, IDIOV, ASYS, WAPACE, OTHER, PATACH |
BP
| Variable Name | Type | Units | Description |
|---|---|---|---|
| vector_10s_median_sbp | float list (3) | mmHg | Median SBP in each 10-s window |
| vector_10s_iqr_sbp | float list (3) | mmHg | IQR of SBP in each 10-s window |
| vector_10s_median_dbp | float list (3) | mmHg | Median DBP in each 10-s window |
| vector_10s_iqr_dbp | float list (3) | mmHg | IQR of DBP in each 10-s window |
| median_30s_sbp | float | mmHg | Median SBP across the whole 30-s segment |
| iqr_30s_sbp | float | mmHg | IQR across the whole 30-s segment |
| median_30s_dbp | float | mmHg | Median DBP across the whole 30-s segment |
| iqr_30s_dbp | float | mmHg | IQR across the whole 30-s segment |
HR
| Variable Name | Type | Units | Description |
|---|---|---|---|
| vector_10s_hr | float list (3) | bpm | HR calculated from ECG-derived RR intervals in each 10-s window |
| median_30s_hr | float | bpm | HR median for the whole 30-s segment |
| iqr_30s_hr | float | bpm | HR IQR across the whole 30-s segment |
RR
| Variable Name | Type | Units | Description |
|---|---|---|---|
| median_30s_rr | float | breaths/min | RR derived from RESP across the whole 30-s segment |
| iqr_30s_rr | float | breaths/min | RR variability across the whole 30-s segment |
PPG, ECG, ABP; 10-second Window SQIs
| Variable name | Type | Definition | Allowed Values |
|---|---|---|---|
| vector_10s_pleth_sqi | list(int, length 3) | PPG SQI for each 10-s window (0–10, 10–20, 20–30s). | Must be one of: 1, 0, -2, -3, -14, -15, -16, -18, -20, -21 |
| vector_10s_ecg_sqi | list(int, length 3) | ECG SQI for each 10-s window. | Must be one of: 1, 0, -2, -3, -14, -15, -16, -17, -20, -21 |
| vector_10s_abp_sqi | list(int, length 3) | ABP SQI for each 10-s window. | Must be one of: 1, 0, -2, -3, -19, -20, -21 |
RESP; 30-second Window SQIs
| Variable name | Type | Definition | Allowed Values |
|---|---|---|---|
| resp_sqi | integer | SQI of the entire 30-s RESP signal. | Must be one of: 1, 0, -2, -3, -11, -12, -13, -20, -21 |
Morphology-Based Features
| Variable Name | Type | Units | Description | Allowed Values |
|---|---|---|---|---|
| vector_10s_mean_rpeak_deriv_pleth | float list (3) | unit/s | Derivative of ECG R-peaks when PPG is incorrectly detected as ECG (used for mislabeled-channel detection). | NaN for true PPG; finite values for mislabeled ECG |
| vector_10s_mean_peak_deriv_pleth | float list (3) | unit/s | Mean derivative around true PPG peaks in each 10–second window. | from –8.86 to 0 unit/s after filtering |
| f_peak_rr | float | Hz | Fundamental frequency of respiration derived from PSD of RESP (if available). | from 0.07 to 0.99 Hz after filtering |
Subject Demographics
| Variable Name | Type | Units | Description | Allowed Values |
|---|---|---|---|---|
| subject_id | integer | — | Unique pseudonymized subject ID. | 6,131 unique subjects |
| hadm_id | integer | — | Hospital admission ID associated with the chart event. | matches MIMIC-III |
| age | float | years | Patient age at admission | matches MIMIC-III |
| weight | float | kg | Weight measurement based on subject_id and hadm_id | matches MIMIC-III |
| height | float | cm | Height measurement based on subject_id and hadm_id | matches MIMIC-III |
| gender | string | — | Biological sex recorded in MIMIC-III. | "M" or "F" |
| ethnicity | string | — | Harmonized ethnicity category. | White, Black, Hispanic, Asian, Other |
Diagnostic Information
| Variable Name | Type | Units | Description |
|---|---|---|---|
| icd9 | string list | — | List of ICD-9 discharge diagnoses for the admission. |
| icd10_truncated | string list | — | Mapped 3-digit ICD-10 codes |
Stratification
| Variable Name | Type | Units | Description | Allowed Values |
|---|---|---|---|---|
| strat_fold | integer | — | Stratified fold index for reproducible ML benchmarking. | 0–11 |
Usage Notes
Folder structure
We organized the data in a folder structure analogous to the MIMIC III Waveform Matched Subset:
MIMIC-III-Ext-PPG
p00
├── p0000052
│ ├── 3238451_0005_0_1.hea
│ ├── 3238451_0005_0_1.dat
│ ├── ...
├── p0000107
│ ├── 3805787_0011_0_4.hea
│ ├── 3805787_0011_0_4.dat
│ ├── ...
├── ...
p01
├── p0100045
│ ├── 3120713_0001_0_0.hea
│ ├── 3120713_0001_0_0.dat
│ ├── ...
├── p0100049
│ ├── 3456766_0016_0_1.hea
│ ├── 3456766_0016_0_1.dat
│ ├── ...
├── ...
...
p09
├── p0900012
│ ├── 3522957_0007_0_0.hea
│ ├── 3522957_0007_0_0.dat
│ ├── ...
├── ...
Access Logic
Locating WFDB Files and Corresponding Metadata:
I) From Metadata → WFDB Signal → Metadata Lookup
- Use the path in the
folder_pathcolumn of the metadata. - Use the following logic based on
subject_idandsignal_file_name:
- Files are organized as
pXX/pXXXXXX/, where: pXXis the first two digits ofsubject_id(zero-padded).pXXXXXXissubject_idzero-padded to 6 digits.- Inside each
pXXXXXXfolder, the WFDB files are named usingsignal_file_name(signal_file_name.heaandsignal_file_name.dat). - Minimal Code Example:
import wfdb
import pandas as pd
import os
# --------------------------------------------------------
# 1. Load metadata
# --------------------------------------------------------
df = pd.read_csv("metadata.csv")
# Select a specific segment (e.g., first row)
meta_row = df.iloc[X]
# --------------------------------------------------------
# 2. Build the WFDB path from metadata fields
# --------------------------------------------------------
folder = meta_row["folder_path"]
signal_file = meta_row["signal_file_name"]
wfdb_path = os.path.join(folder, signal_file)
# --------------------------------------------------------
# 3. Load the waveform segment using WFDB
# --------------------------------------------------------
record = wfdb.rdrecord(wfdb_path)
II) From WFDB File → Find Metadata Entry
To retrieve metadata for a given .hea or .dat file in the physionet/ folder:
- Extract the 6-digit subject ID from the folder name
pXXXXXX; this gives yousubject_id. - Extract the file name without the extension; this gives you
signal_file_name. - Use these two values to look up the corresponding row in
metadata.csv. - Minimal Code Example:
import wfdb
import pandas as pd
# Load metadata
df = pd.read_csv("metadata.csv")
# Suppose we manually pick a WFDB file path:
example_path = "p03/p0300123/3238451_0005_0_1" # without .hea/.dat
signal_file = example_path.split("/")[-1] # "3238451_0005_0_1"
# Look up metadata row
row = df[df["signal_file_name"] == signal_file]
Intended Uses and Limitations
The dataset supports various research and machine learning tasks, including:
- Arrhythmia detection and classification
- Cuffless blood pressure estimation
- Heart rate and respiratory rate estimation
- Signal quality assessment
Additionally, demographic and diagnostic metadata (e.g., age, gender, weight, height, ICD-9, and ICD-10 codes) enable stratified and subgroup analyses. This dataset is well-suited for benchmarking deep learning models.
Release Notes
v1.0.0 first public release
Ethics
This dataset is derived entirely from the MIMIC-III database, a publicly available critical care resource generated at the Beth Israel Deaconess Medical Center.
MIMIC-III was released under the oversight of the Institutional Review Boards (IRBs) of both Beth Israel Deaconess Medical Center and the Massachusetts Institute of Technology (MIT), with all data collection performed under the same IRB approvals and HIPAA-compliant de-identification procedures.
All patient data in MIMIC-III were included under the IRB-approved waiver of informed consent, due to the retrospective nature of the data and the use of rigorous de-identification protocols.
The MIMIC-III-Ext-PPG dataset does not introduce any new patient data; it consists solely of reprocessed and curated outputs derived from the existing MIMIC-III waveform and clinical records.
Accordingly, it inherits the same IRB approvals, consent waivers, and ethical framework as MIMIC-III. No additional data collection, patient contact, or identifiable information was involved in the creation of this dataset.
Acknowledgements
The project (22HLT01 QUMPHY) has received funding from the European Partnership on Metrology, co-financed from the European Union’s Horizon Europe Research and Innovation Programme and by the Participating States. Funding for the University of Cambridge was provided by Innovate UK under the Horizon Europe Guarantee Extension, grant number 10091955. PHC acknowledges funding from the British Heart Foundation (BHF) grant [FS/20/20/34626].
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Moulaeifard M, Coquelin L, Rinkevičius M, Sološenko A, Pfeffer O, Bench C, Hegemann N, Vardanega S, Nandi M, Alastruey J, Heiss C. Machine-learning for photoplethysmography analysis: Benchmarking feature, image, and signal-based approaches. arXiv preprint arXiv:2502.19949. 2025 Feb 27.
- Moulaeifard M, Charlton PH, Strodthoff N. Generalizable deep learning for photoplethysmography-based blood pressure estimation--A Benchmarking Study. arXiv preprint arXiv:2502.19167. 2025 Feb 26.
- Torres-Soto J, Ashley EA. Multi-task deep learning for cardiac rhythm detection in wearable devices. NPJ digital medicine. 2020 Sep 9;3(1):116.
- Wang W, Mohseni P, Kilgore KL, Najafizadeh L. PulseDB: A large, cleaned dataset based on MIMIC-III and VitalDB for benchmarking cuff-less blood pressure estimation methods. Frontiers in Digital Health. 2023 Feb 8;4:1090854.
- Sanches I, Gomes VV, Caetano C, Cabrera LS, Cene VH, Beltrame T, Lee W, Baek S, Penatti OA. MIMIC-BP: A curated dataset for blood pressure estimation. Scientific Data. 2024 Nov 15;11(1):1233.
- Moody B, Moody G, Villarroel M, Clifford G, Silva I. MIMIC-III waveform database matched subset. MIMIC-III Waveform Database Matched Subset v1. 0. 2020 Apr.
- World Health Organization. International statistical classification of diseases and related health problems: 10th revision (ICD-10). https://icd.who.int/browse10/2019/en. 1992.
- Charlton PH, Bonnici T, Tarassenko L, Clifton DA, Beale R, Watkinson PJ, Alastruey J. An impedance pneumography signal quality index: Design, assessment and application to respiratory rate monitoring. Biomedical signal processing and control. 2021 Mar 1;65:102339.
- Makowski D, Pham T, Lau ZJ, Brammer JC, Lespinasse F, Pham H, Schölzel C, Chen SA. NeuroKit2: A Python toolbox for neurophysiological signal processing. Behavior research methods. 2021 Aug;53(4):1689-96.
- Orphanidou C, Bonnici T, Charlton P, Clifton D, Vallance D, Tarassenko L. Signal-quality indices for the electrocardiogram and photoplethysmogram: Derivation and applications to wireless monitoring. IEEE journal of biomedical and health informatics. 2014 Jul 23;19(3):832-8.
- Sun JX. Cardiac output estimation using arterial blood pressure waveforms (Doctoral dissertation, Massachusetts Institute of Technology).
Parent Projects
Access
Access Policy:
Only credentialed users who sign the DUA can access the files.
License (for files):
PhysioNet Credentialed Health Data License 1.5.0
Data Use Agreement:
PhysioNet Credentialed Health Data Use Agreement 1.5.0
Required training:
CITI Data or Specimens Only Research
Discovery
DOI (version 1.0.0):
https://doi.org/10.13026/nmwb-6h34
DOI (latest version):
https://doi.org/10.13026/fcvm-qs78
Topics:
blood pressure
critical care
photoplethysmography
electrocardiogram
heart rhythm
signal quality
respiratory rate
Project Views
5
Current Version5
All VersionsCorresponding Author
Versions
Files
- be a credentialed user
- complete required training:
- CITI Data or Specimens Only Research You may submit your training here.
- sign the data use agreement for the project