Tappy Keystroke Data

Version 1 - October 2017

When referencing this material, please include the standard citation for PhysioNet:

Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PCh, Mark RG, Mietus JE, Moody GB, Peng C-K, Stanley HE. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation 101(23):e215-e220 [Circulation Electronic Pages; http://circ.ahajournals.org/content/101/23/e215]; 2000 (June 13). [bib]
  author    = {Goldberger, Ary L. and Amaral, Luis A. N.
               and Glass, Leon and Hausdorff, Jeffrey M.
               and Ivanov, Plamen Ch. and Mark, Roger G.
               and Mietus, Joseph E. and Moody, George B.
               and Peng, Chung-Kang and Stanley, H. Eugene},
  title     = {{PhysioBank}, {PhysioToolkit}, and {PhysioNet}:
               Components of a New Research Resource for Complex
               Physiologic Signals},
  journal   = {Circulation},
  publisher = {American Heart Association, Inc.},
  volume    = {101},
  number    = {23},
  year      = {2000},
  month     = {June},
  pages     = {e215--e220},
  doi       = {10.1161/01.CIR.101.23.e215},
  issn      = {0009-7322},
  url       = {http://circ.ahajournals.org/content/101/23/e215}


This is the keystroke dataset for the study titled 'High-accuracy detection of early Parkinson's Disease using multiple characteristics of finger movement while typing'. This research report is currently under review for publication by PLOS ONE.

The dataset contains keystroke logs collected from over 200 subjects, with and without Parkinson's Disease (PD), as they typed normally on their own computer (without any supervision) over a period of weeks or months (having initially installed a custom keystroke recording app, Tappy). This dataset has been collected and analyzed in order to indicate that the routine interaction with computer keyboards can be used to detect changes in the characteristics of finger movement in the early stages of PD.


The participants, from the U.S., Canada, UK and Australia, had visited the project website and agreed to participate in the study. The research was approved by the Human Research Ethics Committee at Charles Sturt University, Australia, protocol number H17013.

Each data file collected includes the timing information from typing activity as the participants used their various Windows applications (such as email, word processing, web searches and the like). The keystroke acquisition software ('Tappy') provided timing accuracy of key press and release timestamps to within several milliseconds.

The data files comprise two Zip archives, one with the participant detail files and the other with the keystroke data files for each user.

Archived Users.zip

The filename of each user file contains a 10 character code, used to cross reference to the keystroke data files for that user. The fields are:

Archived Data.zip

Each file contains comma separated keystroke data for one month for a particular user. The filename comprises the 10 character code (matching the user details file) and the YYMM of the data. The fields are:

Icon  Name                    Last modified      Size  Description
[PARENTDIR] Parent Directory - [   ] Archived-Data.zip 2017-11-13 09:55 85M [   ] Archived-users.zip 2017-10-03 16:15 71K

Questions and Comments

If you would like help understanding, using, or downloading content, please see our Frequently Asked Questions.

If you have any comments, feedback, or particular questions regarding this page, please send them to the webmaster.

Comments and issues can also be raised on PhysioNet's GitHub page.

Updated Friday, 28 October 2016 at 16:58 EDT

PhysioNet is supported by the National Institute of General Medical Sciences (NIGMS) and the National Institute of Biomedical Imaging and Bioengineering (NIBIB) under NIH grant number 2R01GM104987-09.