.LP .af PN i .ds LH MIT-BIH A\s-2RRHYTHMIA\s+2 D\s-2ATABASE\s+2 .ds RH F\s-2OREWORD\s+2 .bp 3 \s+2 \fB .ce 1 FOREWORD .PP For a number of years our group has been investigating methods for real-time ECG rhythm analysis. In the course of this work, we have developed an extensive annotated digital ECG database. The database has been enormously helpful to us in algorithm development and evaluation. The creation of this resource required a major effort, and was funded, in part, by both government and industry. We feel it is highly desirable to make this database available to other academic and industrial groups, and hence have prepared it for distribution. This catalog contains detailed descriptions of the database tapes. .PP We acknowledge with gratitude the many dedicated hours of work which went into this project on the part of cardiologists, Holter technicians, laboratory assistants, and engineers in our laboratories at MIT and at Beth Israel Hospital. We also acknowledge the help of our colleagues at Washington University, St. Louis (particularly Russell Hermes) in assuring a compatible data format. We especially wish to recognize: .IP \(bu \fIPaul Schluter\fR, who did the original design and implementation of the database. .IP \(bu \fIScott Peterson\fR, who supervised the detailed data selection, digitization, annotation, and editing at Beth Israel Hospital. He also contributed substantially to the development of software needed for using the database in our evaluation. .IP \(bu \fIGeorge Moody\fR, who converted the database to a format compatible with that used by the AHA database, and who contributed in a major way to the directory. .IP \(bu \fILarry Siegal\fR, who contributed to development of the waveform editor system. .IP \(bu \fICheryl Jackson\fR, who was responsible for most of the detailed transcription of cardiologist annotations, the comparison and quality control functions, and who assembled the final manuscript for the directory. .IP \(bu \fIDiane Perry\fR, who as chief technician in the Arrhythmia Laboratory helped to identify suitable data, and who helped in the annotation process. .IP \(bu The group of physicians who helped with the difficult task of beat-by-beat annotation of the ECGs: Dr. Esmerey Acarturk, Dr. John Aumiller, Dr. Sidney Blake, Dr. Alvin Blaustein, Dr. Chester Conrad, Dr. Gary Heller, Dr. Michael Malagold, Dr. Roger Mark, and Dr. Candice Miklozek. .nf .in +3i Roger Mark Walter Olson Cambridge, Massachusetts September, 1980 .in -3i .fi \s+2 \fB .ce 1 NOTES ON THE SECOND EDITION .PP During the eight years since we first published this book, nearly one hundred academic and industrial research groups worldwide have used the MIT-BIH Arrhythmia Database. We thank these organizations for their support. .PP In addition to those listed above, we also wish to recognize the contributions of: .IP \(bu \fIJoe Mietus\fR, who handled correspondence as well as production and distribution of the database from the Beth Israel Hospital, and who analyzed the mechanical sources of analog tape wow and flutter. .IP \(bu \fITed Baker\fR, who helped in the first ``port'' of the database from our homebrew 8080-based systems. .PP In lieu of reprinting the first edition of this volume, we have made use of modern printing technology to produce a far more readable and complete record of the contents of the database. .PP In June, 1987, the Association for the Advancement of Medical Instrumentation published its \fIRecommended Practice for Testing and Reporting Performance Results of Ventricular Arrhythmia Detection Algorithms\fR (AAMI ECAR-1987), which may be obtained from AAMI, 3330 Washington Boulevard, Suite 400, Arlington, VA 22201. The MIT-BIH and AHA Databases provide developers and evaluators of arrhythmia detectors with standard test data; the AAMI Recommended Practice provides guidelines for using these databases in a standard way, and for describing detector performance in a manner that facilitates comparisons between detectors. We urge all users of our database to follow the AAMI recommendations when preparing performance statistics for publication. .PP A package of C-language software for using the MIT-BIH and AHA Databases is available from MIT. The package includes programs for plotting ECGs with annotations, sampling rate conversion, and beat-by-beat comparison of annotation files following the AAMI recommended practice, as well as a variety of other useful programs. All the programs access the database via a common library of subroutines, which are also provided as part of the package and which may be used with user programs. .PP We have made several other sets of ECG recordings available, including specialized databases for ventricular fibrillation, atrial fibrillation, and ST segment changes. These databases are not annotated beat-by-beat. .nf .in +3i George Moody Roger Mark Cambridge, Massachusetts August, 1988 .in -3i .fi \s+2 \fB .ce 1 NOTES ON THE THIRD EDITION .PP Since the publication of the second edition, the database has been made available in CD-ROM format. Continued strong interest in the database has made it possible to prepare a second edition of the CD-ROM and a third edition of this book. The CD-ROM includes the additional specialized databases mentioned above, and the second edition of the CD-ROM also contains the software package mentioned above. .PP Compatible databases of ECGs and other physiologic signals are beginning to appear. Of particular interest to users of our database is the European ST-T Database, which consists of ninety two-hour ECG recordings with beats, rhythms, and signal quality annotated as in the MIT-BIH Arrhythmia Database, and with additional annotations to indicate ST and T-wave morphology changes. For information, write to: CNR Institute of Clinical Physiology, Computer Laboratory, via Trieste, 41, 56100 Pisa, Italy. .PP I wish to thank all of those who have supported this project over these years, especially Roger Mark, who has guided it from its inception in 1975. .nf .in +3i George Moody Cambridge, Massachusetts July, 1992 .in -3i .fi .ds RH I\s-2NTRODUCTION\s+2 .bp .SH INTRODUCTION .PP This introduction describes how the database records were obtained, and discusses the characteristics of the recorded signals. Following these notes are annotated ``full disclosure'' plots of the entire database. These can be useful for obtaining an overall impression of the contents of individual records. Following the ``full disclosure'' plots are sample ECG strips. These strips were chosen to illustrate the salient features of each record. Next are notes on the important features of each record. These notes also include background information on the subjects, including their medications. At the end of the book are tables of rhythms and annotations, which summarize the contents of the database. These tables can be helpful in finding a record with a specific set of characteristics. .SH Selection criteria .PP The source of the ECGs included in the MIT-BIH Arrhythmia Database is a set of over 4000 long-term Holter recordings that were obtained by the Beth Israel Hospital Arrhythmia Laboratory between 1975 and 1979. Approximately 60% of these recordings were obtained from inpatients. The database contains 23 records (numbered from 100 to 124 inclusive with some numbers missing) chosen at random from this set, and 25 records (numbered from 200 to 234 inclusive, again with some numbers missing) selected from the same set to include a variety of rare but clinically important phenomena that would not be well-represented by a small random sample of Holter recordings. Each of the 48 records is slightly over 30 minutes long. .PP The first group is intended to serve as a representative sample of the variety of waveforms and artifact that an arrhythmia detector might encounter in routine clinical use. A table of random numbers was used to select tapes, and then to select half-hour segments of them. Segments selected in this way were excluded only if neither of the two ECG signals was of adequate quality for analysis by human experts. .PP Records in the second group were chosen to include complex ventricular, junctional, and supraventricular arrhythmias and conduction abnormalities. Several of these records were selected because features of the rhythm, QRS morphology variation, or signal quality may be expected to present significant difficulty to arrhythmia detectors; these records have gained considerable notoriety among database users. .PP The subjects were 25 men aged 32 to 89 years, and 22 women aged 23 to 89 years. (Records 201 and 202 came from the same male subject.) .SH ECG lead configuration .PP In most records, the upper signal is a modified limb lead II (MLII), obtained by placing the electrodes on the chest. The lower signal is usually a modified lead V1 (occasionally V2 or V5, and in one instance V4); as for the upper signal, the electrodes are also placed on the chest. This configuration is routinely used by the BIH Arrhythmia Laboratory. Normal QRS complexes are usually prominent in the upper signal. The lead axis for the lower signal may be nearly orthogonal to the mean cardiac electrical axis, however (i.e., normal beats are usually biphasic and may be nearly isoelectric). Thus normal beats are frequently difficult to discern in the lower signal, although ectopic beats will often be more prominent (see, for example, record 106). A notable exception is record 114, for which the signals were reversed. Since this happens occasionally in clinical practice, arrhythmia detectors should be equipped to deal with this situation. In records 102 and 104, it was not possible to use modified lead II because of surgical dressings on the patients; modified lead V5 was used for the upper signal in these records. .SH Analog recording and playback .PP The original analog recordings were made using nine Del Mar Avionics model 445 two-channel recorders, designated \fIA\fP through \fII\fP: .TS center box; c | l. \fIRecorder Records\fP _ A 102, 107, 111, 115, 121 B 212 C 203 D 118, 124, 217 E 101, 103, 106, 108, 112, 117, 119, 122, 209, 219, 220, 223, 233 F 104, 109, 123, 205, 207, 210, 215, 221 G 100, 105, 114, 116, 213, 214, 222, 228 H 113, 201, 202, 231 I 200, 230, 232, 234 .TE (It is not known which recorder was used for record 208.) .PP During the digitization process, the analog recordings were played back on a Del Mar Avionics model 660 unit. The analog tapes used for records 112, 115 through 124, 205, 220, 223, and 230 through 234 were played back and digitized at twice real time; the rest were played back at real time using a specially constructed capstan for the model 660 unit. Skew between the two signals was found to be as great as 40 milliseconds for some of the analog recorders. In addition to the fixed skew that results from extremely small differences in the orientations of the tape heads on the recorder and the playback unit, microscopic vertical wobbling of the tape, either during recording or playback, introduces a variable skew, which may be comparable in magnitude to the fixed skew. This problem (which also occurs on the AHA database) may present difficulties for certain two-channel analysis methods designed for real-time applications. .PP Minor tape speed variations should not pose problems for typical arrhythmia detectors. It is difficult to avoid tape sticking or slippage during low-speed playback, and several episodes of tape slippage were noted and marked with comment annotations. Wow and flutter should be studied carefully in the context of heart-rate variability studies, since flutter compensation was not possible in these recordings. A number of frequency-domain artifacts have been identified and related to specific mechanical components of the recorders and the playback unit: .TS center box; c | l. \fIFrequency (Hz) Source\fR _ 0.042 Recorder pressure wheel 0.083 Playback unit capstan (for twice real-time playback) 0.090 Recorder capstan 0.167 Playback unit capstan (for real-time playback) 0.18\-0.10 Takeup reel (frequency decreases over time) 0.20\-0.36 Supply reel (frequency increases over time) .TE The most significant of these artifacts by far is the 0.167 Hz artifact on recordings that were played back at real time. The next largest is the 0.090 Hz artifact; the 0.083 Hz artifact on recordings that were played back at twice real-time is of roughly the same magnitude as the 0.090 Hz artifact. The 0.042 Hz artifact is of much lower magnitude. Other frequencies related to the drive train (at 0.42 Hz, 1.96 Hz, 9.1 Hz, and 42 Hz) do not appear as noticeable artifacts. The frequencies of the last two artifacts listed in the table depend on how much tape is on the supply and takeup reels; the supply reel causes a much more noticeable artifact than does the takeup reel. Other frequency-domain artifacts generated by the supply reel appear in the 0.10\-0.18 Hz and 0.30\-0.54 Hz bands. .PP Four of the 48 records (102, 104, 107, and 217) include paced beats. The original analog recordings do not represent the pacemaker artifacts with sufficient fidelity to permit them to be recognized by pulse amplitude (or slew rate) and duration alone, the method commonly used for real-time processing. The database records reproduce the analog recordings with sufficient fidelity to permit use of pacemaker artifact detectors designed for tape analysis, however. .SH Digitization .PP The analog outputs of the playback unit were filtered to limit analog-to-digital converter (ADC) saturation and for anti-aliasing, using a passband from 0.1 to 100 Hz relative to real time, well beyond the lowest and highest frequencies recoverable from the recordings. The bandpass-filtered signals were digitized at 360 Hz per signal relative to real time using hardware constructed at the MIT Biomedical Engineering Center and at the BIH Biomedical Engineering Laboratory. The sampling frequency was chosen to facilitate implementations of 60 Hz (mains frequency) digital notch filters in arrhythmia detectors. Since the recorders were battery-powered, most of the 60 Hz noise present in the database arose during playback. In those records that were digitized at twice real time, this noise appears at 30 Hz (and multiples of 30 Hz) relative to real time. .PP Samples were acquired from each signal almost simultaneously (the intersignal sampling skew was on the order of a few microseconds). As noted above, analog tape skew was several orders of magnitude larger. The ADCs were unipolar, with 11-bit resolution over a \(+-5 mV range. Sample values thus range from 0 to 2047 inclusive, with a value of 1024 corresponding to zero volts. .PP The 11-bit samples were recorded in 8-bit first difference format (this was necessary because of limited mass storage capacity). Given the sampling frequency and the resolution of the ADC, the difference encoding implies a maximum recordable slew rate of \(+-225 mV/s. In practice, this limit was exceeded by the input signals very infrequently, only during severe noise on a small number of records. The effect on the quality of the recorded signals is totally negligible. Since the ECG data files are reconstructed from difference data, no information is lost and considerable savings in storage may be obtained by converting them back into difference form; a program for doing so is included in the package of C-language software available from the BIH Biomedical Engineering Laboratory. .SH Annotations .PP An initial set of beat labels was produced by a simple slope-sensitive QRS detector, which marked each detected event as a normal beat. Two identical 150-foot chart recordings were printed for each 30-minute record, with these initial beat labels in the margin. For each record, the two charts were given to two cardiologists, who worked on them independently. The cardiologists added additional beat labels where the detector missed beats, deleted false detections as necessary, and changed the labels for all abnormal beats. They also added rhythm labels, signal quality labels, and comments. .PP The annotations were transcribed from the paper chart recordings. Once both sets of cardiologists' annotations for a given record had been transcribed and verified, they were automatically compared beat-by-beat, and another chart recording was printed. This chart showed the cardiologists' annotations in the margin, with all discrepancies highlighted. Each discrepancy was reviewed and resolved by consensus. The corrections were transcribed, and the annotations were then analyzed by an auditing program, which checked them for consistency and which located the ten longest and shortest R-R intervals in each record (to identify possible missing or falsely detected beats). .PP In early copies of the database, most beat labels were placed at the R-wave peak, but manually inserted labels were not always located precisely at the peak. In copies of the database made since 1983, the beat labels have been shifted from their original locations. The ECG (usually the upper signal) was digitally bandpass-filtered to emphasize the QRS complexes, and each beat label was moved to the major local extremum, after correction for phase shift in the filter. A few noisy beats were manually realigned. The result is that annotations generally appear at the R-wave peak, and are located with sufficient accuracy to make the reference annotation files usable for studies requiring waveform averaging and for heart rate variability studies (but note the comments with respect to analog tape wow and flutter above). In the annotated ECG plots in this book, each label is placed so that the fiducial mark for the annotation corresponds to the left edge of the label. .PP The database contains approximately 109,000 beat labels. Sixteen have been corrected in the eight years since the database was first released (in records 104, 108, 114, 203, 207, 217, and 222); in addition, all of the left bundle branch block beats in record 214 were originally labelled as normal beats. The rhythm labels have been more substantially revised and now include notations for paced rhythm, bigeminy, and trigeminy, which were missing in early copies. .ds RH S\s-2YMBOLS\s+2 .bp .ce 1 \fB\s+2SYMBOLS USED IN ECG PLOTS\s-2\fR .TS center box; c | lw(4.5i). \fISymbol Meaning\fR _ \(bu Normal beat L Left bundle branch block beat R Right bundle branch block beat A Atrial premature beat a Aberrated atrial premature beat J Nodal (junctional) premature beat S Supraventricular premature beat V Premature ventricular contraction F Fusion of ventricular and normal beat [ Start of ventricular flutter/fibrillation ! Ventricular flutter wave ] End of ventricular flutter/fibrillation e Atrial escape beat j Nodal (junctional) escape beat E Ventricular escape beat P Paced beat f Fusion of paced and normal beat p Non-conducted P-wave (blocked APB) Q Unclassifiable beat | Isolated QRS-like artifact _ T{ Rhythm annotations appear \fIbelow\fP the level used for beat annotations: T} (AB Atrial bigeminy (AFIB Atrial fibrillation (AFL Atrial flutter (B Ventricular bigeminy (BII 2\(de heart block (IVR Idioventricular rhythm (N Normal sinus rhythm (NOD Nodal (A-V junctional) rhythm (P Paced rhythm (PREX Pre-excitation (WPW) (SBR Sinus bradycardia (SVTA Supraventricular tachyarrhythmia (T Ventricular trigeminy (VFL Ventricular flutter (VT Ventricular tachycardia _ T{ Signal quality and comment annotations appear \fIabove\fR the level used for beat annotations: T} \fIqq\fR T{ Signal quality change: the first character (`c' or `n') indicates the quality of the upper signal (clean or noisy), and the second character indicates the quality of the lower signal T} U Extreme noise or signal loss in both signals: ECG is unreadable M Missed beat (`MISSB' in examples, pp. 99\-177) P Pause (`PSE' in examples) T Tape slippage (`TS' in examples) .TE