Database Open Access

MIMIC-III Waveform Database

Benjamin Moody George Moody Mauricio Villarroel Gari D. Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database contains 67,830 record sets for approximately 30,000 ICU patients. Almost all record sets include a waveform record containing digitized signals (typically including ECG, ABP, respiration, and PPG, and frequently other signals) and a “numerics” record containing time series of periodic measurements, each presenting a quasi-continuous recording of vital signs of a single patient throughout an ICU stay (typically a few days, but many are several weeks in duration). A subset of this database contains waveform and numerics records that have been matched and time-aligned with MIMIC-III Clinical Database records.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

The MIMIC-III Waveform Database is a companion to the MIMIC-III Clinical Database, which contains detailed clinical information about most of the patients represented in the Waveform Database [1]. Since the contents of each database were collected independently, in partially deidentified form, matching the clinical data with the waveform data is a non-trivial task, and only a subset of Waveform Database records has been matched with Clinical Database records. See the MIMIC-III Waveform Database Matched Subset for more information.


Methods

Unlike the original MIMIC Database, waveforms were collected in a largely automated fashion, from all of the bedside monitors in certain adult and neonatal ICUs. Not all of the ICUs in the hospital were included, and the data archiving process did not run continuously, but while it was running, all waveforms from those ICUs were captured and archived. As a result, these records represent a random sample of patients in those specific ICUs.

Recorded waveforms and numerics vary depending on choices made by the ICU staff. Waveforms almost always include one or more ECG signals, and often include continuous arterial blood pressure (ABP) waveforms, fingertip photoplethysmogram (PPG) signals, and respiration, with additional waveforms (up to 8 simultaneously) as available. Numerics typically include heart and respiration rates, SpO2, and systolic, mean, and diastolic blood pressure, together with others as available. Recording lengths also vary; most are a few days in duration, but some are shorter and others are several weeks long.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

Each recording comprises two records (a waveform record and a matching numerics record) in a single record directory (“folder”) with the name of the record. To reduce access time, the record directories have been distributed among ten intermediate-level directories (listed below). The names of these intermediate directories (30, 31, ..., 39) match the first two digits of the record directories they contain.

In almost all cases, the waveform records comprise multiple segments, each of which can be read as a separate record. Each segment contains an uninterrupted recording of a set of simultaneously observed signals, and the signal gains do not change at any time during the segment. Whenever the ICU staff changed the signals being monitored or adjusted the amplitude of a signal being monitored, this event was recorded in the raw data dump, and a new segment begins at that time.

Each composite waveform record includes a list of the segments that comprise it in its master header file. The list begins on the second line of the master header with a layout header file that specifies all of the signals that are observed in any segment belonging to the record. Each segment has its own header file and (except for the layout header) a matching (binary) signal (.dat) file. Occasionally, the monitor may be disconnected entirely for a short time; these intervals are recorded as gaps in the master header file, but there are no header or signal files corresponding to gaps.

The numerics records (designated by the letter n appended to the record name) are not divided into segments, since the storage savings that would be achieved by doing so would be relatively little.

Physiologic waveform records in this database contain up to eight simultaneously recorded signals digitized at 125 Hz with 8-, 10-, or (occasionally) 12-bit resolution. Numerics records typically contain 10 or more time series of vital signs sampled once per second or once per minute.

Technical Limitations

Waveforms or numerics missing:
Occasionally, technical limitations of the data acquisition system make it possible to create a physiologic waveform record but not a numerics record, or vice versa.
A given signal may not be available throughout an entire record:
Records in the MIMIC-III Waveform Database vary in length; some are several weeks in duration. It is common for the physiologic signals to be interrupted or changed occasionally during recordings of such long duration. When using a viewer such as LightWAVE, all signals available at any time during a record are listed, although in most cases only a subset is visible at any given time.
Gaps and patient identification:
The waveform and numerics records have been extracted from raw data dumps collected from the bedside monitors using a facility provided by the monitor manufacturer. The raw data dumps contain files of data collected from a single patient monitor during a single monitoring session (which may last days or weeks). Usually the monitoring session ends when the patient is discharged, so that the data in a single file come from a single patient. Occasionally, however, the monitor is not reset when the patient is discharged, and the session continues after a new patient has been admitted; in this case the raw data file contains data from two (or more) patients, with a gap (an interval during which no waveforms or numerics are recorded) that is typically an hour or more in duration. Such gaps may also appear if the monitor is temporarily disconnected (for example, for a laboratory test) and then reconnected to the same patient. Since the raw data files do not usually contain patient identifiers, it is not trivial to determine with certainty if the data before and after a gap were collected from the same patient.
Ideally, each MIMIC-III Waveform Database record should contain data from only one patient. All raw data files containing gaps of an hour or more have been split into separate records in order to decrease the likelihood that any record contains data from multiple patients. An ongoing project is to examine the sets of records created this way, matching them with MIMIC-III Clinical Database records when possible, to determine if and how they should be reassembled.
Inter-waveform alignment problems:
The method used for MIMIC waveform data extraction was not designed for inter-waveform analysis. The waveform data contain unspecified/unknown filtering delays and/or unknown inter-channel delays, which may not be constant in a given record. Therefore, although the ECGs are time-aligned with each other, there may be a (changing) delay of up to 500ms between any of the other waveforms in the data. For example, the pulse transit time measured between different waveforms may be unreliable (either in absolute or relative terms).
ECG limitations:
The ECG signals in the waveform records were originally sampled with 12-bit precision at a high sampling rate, and were then scaled and decimated to 500 samples per second (per signal). The scaling reduced the effective amplitude resolution from 12 bits to 9 or 10 bits in typical cases, and as little as 7 bits in some cases. From each set of 4 consecutive decimated samples of the same ECG signal, one was recorded (chosen using a turning-point compressor, a technique sometimes called “peak-picking”). The result is an ECG signal sampled 125 times per second, but at intervals that vary between 2 and 14 ms (averaging 8 ms). Since the interval between any given pair of samples was not available to us, the reconstructions of the ECG signals assume uniform 8 ms intervals. These signals with reduced time and amplitude resolution, and sampling jitter introduced by the “peak-picking”, were the only ECG signals that were possible to capture from the ICU monitors. Although ECGs reconstructed in this way can be readily interpreted visually, they are unsuitable as input for certain algorithms for ECG analysis, particularly those that are sensitive to frequency-domain features of the signal. Note that these limitations apply only to the ECG signals, not to the other signals, which were originally sampled at uniform 8 ms intervals (125 samples per second) and were not scaled prior to capture.

Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory 31 contains all records with names that begin with 31.
  • Record directory 3141595 is contained within intermediate directory 31.
  • All files associated with physiologic waveform record 3141595 and its companion numerics record 3141595n are contained within record directory 31/3141595.
    • The first line of the master header file for waveform record 314595 (31/3141595/3141595.hea) indicates that the record is 242353557 sample intervals (about 22 days at 125 samples per second) in duration, and that it contains 427 segments and gaps. (See header(5) in the WFDB Applications Guide for details on the format of this text file.) The first segment is named 3141595_0001, and it is 2888500 sample intervals (6 hours, 15 minutes, and 8 seconds, at 125 samples per second) in duration. At the end of the master header file, a comment (# Location: nicu) specifies the ICU in which the recording was made (the neonatal ICU, in this case).
    • The layout header file for this record (31/3141595/3141595_layout.hea) indicates that five ECG signals (I, II, III, AVR, and “V”), a respiration signal, and a PPG signal are available during portions of the record. (The five ECG signals are not all available simultaneously.)
    • The header file for the first segment of this record (31/3141595/3141595_0001.hea) shows that a PPG signal (“PLETH”), a respiration signal, and ECG leads II and AVR are available throughout this initial segment.
  • The matching numerics record is named 3141595n, and its header file (31/3141595/3141595n.hea) shows that it is 1938730 sample intervals (about 22 days at 1 sample per second) in duration, and that it contains heart rate (“HR”, which is measured from the ECG, as well as “PULSE”, measured from one or more pulsatile signals), noninvasive blood pressure (raw as well as systolic, diastolic, and mean), respiration rate, and SpO2.

Any WFDB application can read any waveform record from this database directly from the PhysioNet web server (i.e., without downloading the record first) using a record name of the form mimic3wdb/3x/3xyyyyy/. Numerics records can be read using the longer form mimic3wdb/3x/3xyyyyy/3xyyyyyn (note that the final 3xyyyyy must be repeated and followed by n to specify the numerics record).

For example, if you have installed the WFDB Software Package, you can read the first 10 seconds of waveform record 3141595 using this rdsamp command:

rdsamp -r mimic3wdb/31/3141595/ -p -v -t 10

To read the first 10 seconds of the matching numerics record 3141595n, use this command instead:

rdsamp -r mimic3wdb/31/3141595/3141595n -p -v -t 10

Notice that the first command produces 1250 samples of each waveform (125 samples per second, for 10 seconds), but the second command produces only 10 samples of each vital sign (1 sample per second, for 10 seconds).


Release Notes

Version 1.0 of the MIMIC-III Waveform Database supersedes previously-released versions of the MIMIC-II Waveform Database. The numbered records (3000003 to 3999988) are identical to those in version 3.2 of the MIMIC-II Waveform Database. The Matched Subset, however, uses different subject IDs and surrogate dates, corresponding to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2607m

DOI (latest version):
https://doi.org/10.13026/gs83-bd50

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 6.7 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/matched/p00
Name Size Modified
Parent Directory
p000020
p000030
p000033
p000052
p000079
p000085
p000107
p000109
p000123
p000124
p000125
p000135
p000138
p000145
p000154
p000160
p000177
p000184
p000188
p000194
p000208
p000214
p000217
p000222
p000262
p000263
p000271
p000279
p000283
p000292
p000298
p000301
p000302
p000308
p000317
p000318
p000328
p000333
p000357
p000369
p000377
p000379
p000402
p000406
p000408
p000409
p000416
p000422
p000427
p000439
p000462
p000470
p000491
p000495
p000507
p000515
p000518
p000521
p000523
p000543
p000549
p000550
p000565
p000571
p000586
p000593
p000600
p000605
p000608
p000618
p000625
p000631
p000634
p000638
p000639
p000650
p000652
p000666
p000668
p000670
p000672
p000682
p000689
p000695
p000700
p000703
p000708
p000710
p000719
p000735
p000736
p000743
p000747
p000749
p000770
p000772
p000773
p000776
p000784
p000787
p000793
p000798
p000801
p000808
p000818
p000822
p000834
p000843
p000849
p000852
p000865
p000870
p000871
p000875
p000878
p000886
p000891
p000894
p000895
p000901
p000906
p000907
p000925
p000946
p000948
p000952
p000963
p000974
p000981
p000992
p001002
p001004
p001006
p001012
p001021
p001028
p001029
p001030
p001033
p001038
p001042
p001044
p001046
p001049
p001072
p001075
p001083
p001092
p001097
p001104
p001121
p001123
p001135
p001143
p001144
p001158
p001160
p001170
p001174
p001182
p001190
p001192
p001200
p001207
p001217
p001222
p001224
p001226
p001241
p001244
p001257
p001279
p001280
p001313
p001331
p001337
p001338
p001347
p001354
p001357
p001378
p001396
p001398
p001408
p001409
p001414
p001418
p001430
p001438
p001449
p001453
p001457
p001459
p001474
p001476
p001485
p001501
p001502
p001521
p001524
p001526
p001528
p001531
p001546
p001551
p001557
p001563
p001569
p001578
p001586
p001604
p001606
p001613
p001650
p001673
p001679
p001693
p001709
p001744
p001747
p001754
p001758
p001761
p001763
p001778
p001785
p001791
p001795
p001802
p001818
p001824
p001840
p001854
p001855
p001861
p001885
p001888
p001892
p001898
p001900
p001908
p001924
p001931
p001932
p001935
p001941
p001944
p001949
p001950
p001973
p001978
p001979
p001986
p001991
p001995
p002014
p002029
p002034
p002045
p002049
p002052
p002063
p002066
p002067
p002075
p002090
p002092
p002100
p002104
p002148
p002154
p002157
p002172
p002185
p002187
p002200
p002211
p002213
p002224
p002228
p002229
p002237
p002240
p002246
p002251
p002261
p002264
p002265
p002274
p002280
p002301
p002305
p002317
p002326
p002332
p002340
p002343
p002361
p002362
p002369
p002374
p002389
p002395
p002397
p002403
p002442
p002458
p002466
p002467
p002477
p002479
p002480
p002488
p002492
p002498
p002502
p002513
p002514
p002530
p002536
p002549
p002561
p002577
p002578
p002586
p002589
p002610
p002611
p002619
p002636
p002639
p002653
p002659
p002664
p002672
p002686
p002700
p002703
p002722
p002725
p002742
p002744
p002747
p002754
p002755
p002773
p002784
p002787
p002791
p002798
p002827
p002830
p002834
p002846
p002858
p002906
p002917
p002919
p002921
p002946
p002968
p002974
p002981
p002990
p002996
p003021
p003024
p003026
p003039
p003052
p003057
p003066
p003084
p003097
p003099
p003129
p003133
p003138
p003158
p003165
p003171
p003174
p003176
p003192
p003214
p003218
p003221
p003242
p003245
p003250
p003261
p003266
p003267
p003272
p003278
p003279
p003286
p003287
p003290
p003301
p003302
p003321
p003330
p003340
p003345
p003351
p003358
p003360
p003365
p003372
p003386
p003404
p003424
p003441
p003462
p003473
p003474
p003490
p003491
p003495
p003498
p003506
p003512
p003513
p003515
p003516
p003521
p003530
p003533
p003543
p003552
p003554
p003555
p003566
p003570
p003571
p003586
p003593
p003606
p003612
p003617
p003619
p003622
p003623
p003633
p003635
p003640
p003642
p003652
p003654
p003673
p003674
p003675
p003680
p003695
p003744
p003745
p003746
p003748
p003759
p003764
p003768
p003780
p003792
p003794
p003798
p003821
p003830
p003853
p003860
p003863
p003866
p003883
p003884
p003886
p003889
p003914
p003917
p003920
p003929
p003932
p003935
p003939
p003949
p003952
p003957
p003977
p003986
p003987
p003992
p003995
p004018
p004041
p004053
p004059
p004064
p004068
p004076
p004077
p004109
p004113
p004115
p004136
p004142
p004175
p004180
p004188
p004194
p004248
p004249
p004252
p004254
p004261
p004266
p004270
p004286
p004290
p004292
p004308
p004313
p004317
p004324
p004329
p004331
p004338
p004346
p004347
p004348
p004350
p004351
p004356
p004369
p004393
p004401
p004404
p004405
p004406
p004409
p004413
p004420
p004431
p004436
p004439
p004448
p004451
p004457
p004462
p004465
p004474
p004477
p004481
p004490
p004520
p004533
p004538
p004565
p004566
p004568
p004587
p004588
p004593
p004599
p004618
p004630
p004632
p004633
p004641
p004655
p004656
p004664
p004679
p004685
p004688
p004713
p004738
p004742
p004770
p004771
p004778
p004784
p004786
p004787
p004788
p004800
p004802
p004804
p004805
p004807
p004808
p004829
p004833
p004837
p004847
p004852
p004853
p004859
p004860
p004862
p004865
p004870
p004893
p004894
p004900
p004903
p004904
p004906
p004909
p004915
p004923
p004935
p004944
p004951
p004955
p004958
p004966
p004968
p004974
p004987
p005023
p005030
p005037
p005056
p005058
p005062
p005071
p005078
p005080
p005107
p005114
p005124
p005126
p005163
p005171
p005175
p005193
p005195
p005196
p005199
p005201
p005205
p005223
p005237
p005239
p005254
p005259
p005272
p005274
p005277
p005282
p005289
p005292
p005307
p005321
p005336
p005343
p005345
p005348
p005349
p005354
p005362
p005369
p005382
p005400
p005407
p005417
p005442
p005451
p005453
p005459
p005476
p005478
p005485
p005493
p005494
p005496
p005506
p005521
p005525
p005548
p005549
p005569
p005574
p005591
p005604
p005606
p005607
p005609
p005612
p005618
p005619
p005620
p005637
p005642
p005645
p005646
p005672
p005675
p005683
p005685
p005686
p005696
p005701
p005709
p005710
p005712
p005714
p005719
p005722
p005727
p005738
p005742
p005748
p005766
p005772
p005784
p005786
p005791
p005808
p005818
p005821
p005830
p005832
p005841
p005847
p005850
p005871
p005875
p005879
p005885
p005896
p005901
p005908
p005909
p005911
p005913
p005933
p005937
p005957
p005960
p005995
p006000
p006010
p006017
p006028
p006039
p006042
p006052
p006053
p006063
p006064
p006069
p006070
p006075
p006078
p006085
p006089
p006090
p006116
p006131
p006132
p006145
p006158
p006174
p006178
p006179
p006180
p006194
p006195
p006202
p006204
p006206
p006214
p006229
p006233
p006254
p006256
p006262
p006279
p006288
p006294
p006299
p006309
p006314
p006317
p006321
p006323
p006335
p006338
p006358
p006359
p006365
p006374
p006378
p006381
p006382
p006398
p006407
p006411
p006428
p006437
p006440
p006449
p006455
p006464
p006470
p006475
p006478
p006485
p006497
p006519
p006522
p006533
p006534
p006535
p006539
p006553
p006555
p006557
p006561
p006566
p006581
p006583
p006598
p006601
p006602
p006604
p006605
p006607
p006621
p006636
p006637
p006649
p006652
p006659
p006667
p006669
p006673
p006687
p006688
p006691
p006692
p006702
p006708
p006718
p006728
p006729
p006749
p006800
p006804
p006809
p006839
p006841
p006850
p006868
p006875
p006876
p006889
p006892
p006901
p006914
p006917
p006933
p006939
p006940
p006944
p006945
p006953
p006958
p006967
p006981
p006983
p006988
p007009
p007023
p007048
p007051
p007084
p007095
p007102
p007105
p007107
p007115
p007125
p007136
p007138
p007149
p007153
p007160
p007172
p007175
p007183
p007184
p007192
p007212
p007213
p007217
p007224
p007225
p007232
p007234
p007241
p007251
p007253
p007262
p007263
p007265
p007289
p007299
p007303
p007320
p007328
p007339
p007347
p007360
p007365
p007371
p007381
p007389
p007397
p007400
p007410
p007415
p007422
p007427
p007432
p007438
p007442
p007445
p007448
p007452
p007468
p007470
p007472
p007477
p007478
p007479
p007482
p007487
p007490
p007492
p007497
p007512
p007517
p007519
p007521
p007522
p007528
p007529
p007532
p007533
p007542
p007567
p007585
p007612
p007614
p007618
p007629
p007632
p007644
p007650
p007651
p007654
p007655
p007666
p007681
p007683
p007685
p007688
p007695
p007704
p007705
p007720
p007728
p007755
p007758
p007760
p007782
p007784
p007786
p007798
p007799
p007809
p007819
p007825
p007842
p007849
p007860
p007866
p007874
p007881
p007886
p007894
p007897
p007908
p007910
p007944
p007960
p007965
p007966
p007968
p007969
p007977
p007979
p007981
p007985
p007996
p008009
p008013
p008040
p008057
p008061
p008062
p008068
p008070
p008072
p008084
p008087
p008099
p008105
p008109
p008115
p008120
p008121
p008122
p008126
p008138
p008141
p008142
p008154
p008167
p008170
p008186
p008198
p008207
p008221
p008228
p008231
p008249
p008258
p008259
p008267
p008269
p008272
p008273
p008274
p008275
p008281
p008298
p008318
p008336
p008347
p008363
p008368
p008393
p008396
p008406
p008415
p008422
p008426
p008432
p008445
p008450
p008451
p008452
p008461
p008466
p008467
p008471
p008489
p008493
p008509
p008516
p008524
p008532
p008533
p008546
p008548
p008557
p008566
p008568
p008569
p008573
p008608
p008654
p008670
p008674
p008698
p008718
p008723
p008726
p008734
p008735
p008748
p008749
p008779
p008780
p008795
p008799
p008814
p008822
p008832
p008848
p008870
p008871
p008879
p008890
p008896
p008897
p008905
p008906
p008915
p008917
p008929
p008932
p008936
p008945
p008947
p008949
p008964
p008970
p008984
p008985
p008989
p008990
p008996
p009001
p009005
p009016
p009021
p009031
p009036
p009043
p009048
p009058
p009062
p009070
p009105
p009112
p009124
p009128
p009130
p009139
p009148
p009170
p009176
p009178
p009225
p009226
p009233
p009238
p009249
p009251
p009253
p009258
p009268
p009269
p009271
p009274
p009278
p009286
p009289
p009295
p009297
p009300
p009308
p009311
p009324
p009330
p009332
p009335
p009338
p009341
p009354
p009356
p009358
p009361
p009363
p009364
p009366
p009372
p009389
p009393
p009397
p009398
p009425
p009430
p009434
p009460
p009473
p009486
p009494
p009498
p009518
p009523
p009526
p009537
p009555
p009569
p009575
p009607
p009615
p009630
p009637
p009642
p009648
p009664
p009667
p009672
p009675
p009676
p009678
p009685
p009686
p009687
p009705
p009708
p009714
p009732
p009753
p009783
p009798
p009844
p009847
p009870
p009871
p009882
p009889
p009891
p009920
p009923
p009949
p009950
p009951
p009958
p009962
p009965
p009967
p009968
p009971
p009973
p009987
p009991
p009993
p009998