ECG Datasets — Request Electrocardiogram Data

Electrocardiogram (ECG, also EKG) datasets record the heart's electrical activity as voltage-versus-time signals captured from electrodes placed on the body. They are foundational for training and validating machine-learning models in cardiology, remote patient monitoring, and wearable health technology. A typical ECG dataset spans multiple acquisition formats: standard resting 12-lead ECGs, single-lead and reduced-lead recordings from smartwatches and patch monitors, continuous ambulatory Holter recordings lasting 24 to 48 hours, and short rhythm strips captured during symptomatic episodes.

Signals are commonly stored as WFDB, DICOM-ECG, HL7 aECG, or CSV waveform files, sampled at 250 to 1000 Hz, and accompanied by lead configuration, sampling rate, and calibration metadata. Clinically meaningful ECG datasets include expert annotations of the P wave, QRS complex, and T wave, along with interval measurements such as PR, QRS duration, QT, and corrected QT (QTc). Label schemas often cover normal sinus rhythm and a wide range of abnormalities: atrial fibrillation and flutter, premature atrial and ventricular contractions, supraventricular and ventricular tachycardia, first, second, and third degree AV block, bundle branch blocks, ST-segment elevation and depression, T-wave inversion, and signs of myocardial ischemia or infarction.

High-quality cohorts are demographically balanced across age, sex, and comorbidities, and are de-identified to remove protected health information while preserving diagnostic fidelity. On GetDATA, researchers and medtech companies post requests describing the exact modality, lead set, sampling rate, label taxonomy, class balance, and minimum sample counts they need, and verified hospitals and labs fulfill those requests with compliant, quality-scored electrocardiogram data. Whether you are building arrhythmia classifiers, QT-prolongation screening tools, or wearable rhythm-detection algorithms, sourcing well-annotated ECG datasets is the difference between a model that generalizes and one that fails silently in production.

Common benchmarks and standards include the AAMI EC57 protocol, the PhysioNet/Computing in Cardiology challenges, and SNOMED-coded diagnostic statements, which help align labels across institutions and make datasets interoperable for federated training. Browse the open electrocardiogram data requests below, or explore related cardiac and imaging categories.

ECG Datasets — Request Electrocardiogram Data

Open ECG requests

50,000 Single-Lead Wearable ECG Strips for Large-Scale Atrial Fibrillation Population Screening

6,000 Pediatric 12-Lead ECGs Across Age Groups from Neonates to Adolescents with Diagnostic Labels

5,000 Serial 12-Lead ECGs for QT-Interval Prolongation and Drug-Induced Arrhythmia Safety Monitoring

15,000 12-Lead ECGs with STEMI and NSTEMI Labels for Acute MI Detection AI

8,000 Holter 24-Hour Ambulatory ECG Recordings for Arrhythmia Burden Quantification

12,000 Resting 12-Lead ECG Recordings with Expert Atrial Fibrillation Annotations

Related categories