ECG Benchmark

ECGBench

Reproducible ECG benchmark data from open-access PhysioNet datasets. A PyTorch-native library with standardised fold splits, quality-checked metadata, and a curated index of 64 publicly available ECG datasets.

64
Total Datasets
24
12-Lead (PhysioNet)
15
12-Lead (Other)
10
2-Lead
10
1-Lead
11M+
Max Records
13+
Countries

Installation
Install ECGBench from PyPI or source to start loading ECG benchmark data in PyTorch

Quick Install

# Using pip pip install ecgbench # Using uv (recommended) uv pip install ecgbench

From Source (Development)

git clone https://github.com/vlbthambawita/ECGBench.git cd ECGBench uv pip install -e ".[dev]"
Quick Start
Load ECG data with a standardised PyTorch Dataset in just a few lines

PyTorch DataLoader Example

from ecgbench import ECGDataset, ecg_collate_fn from torch.utils.data import DataLoader # Load PTB-XL training data (folds 1-8, 100 Hz) dataset = ECGDataset( physionet_path="/path/to/physionet.org/files/ptb-xl/1.0.3/", dataset_name="ptbxl", split="train", frequency="100", ) # Create DataLoader with custom collate for mixed metadata loader = DataLoader(dataset, batch_size=32, collate_fn=ecg_collate_fn) for batch in loader: signals = batch["signal"] # (B, channels, samples) ecg_ids = batch["ecg_id"] # list of IDs break

Dataset Catalogue

12-Lead ECG Datasets

24 datasets
# Dataset Format Patients Records Access Status Origin Paper
1 PTB-XL physionet.org ↗ 12-lead · 10 s · 500 Hz (also 100 Hz) 18,869 21,799 Open
CC BY 4.0
Completed Physikalisch-Technische Bundesanstalt
Germany
PTB-XL: A Large Publicly Available ECG Dataset
2 PTB-XL+ physionet.org ↗ 12-lead · 10 s · 500 Hz · features & median beats 18,869 21,799 Open
CC BY 4.0
Not started Karlsruhe Institute of Technology
Germany
PTB-XL+: A Comprehensive ECG Feature Dataset
3 PTB Diagnostic ECG Database physionet.org ↗ 15-lead (12 + 3 Frank) · variable · 1,000 Hz 290 549 Open
ODC Attribution
Not started Univ. Clinic Benjamin Franklin
Germany — Berlin
Bousseljot et al., Biomedizinische Technik, 1995
4 MIMIC-IV-ECG physionet.org ↗ 12-lead · 10 s · 500 Hz ~160,000 ~800,000 Credentialed
PhysioNet DUA
Not started Beth Israel Deaconess Medical Center
USA — Boston, MA
Gow et al.
5 MIMIC-IV-ECG Demo physionet.org ↗ 12-lead · 10 s · 500 Hz 92 659 Open Not started Beth Israel Deaconess Medical Center
USA — Boston, MA
Dataset DOI
6 MIMIC-IV-ECG-Ext-ICD physionet.org ↗ 12-lead · 10 s · 500 Hz · 1,076 ICD-10-CM codes Subset of MIMIC-IV-ECG Subset of MIMIC-IV-ECG Credentialed Not started MIT LCP + collaborators
USA / Germany
Eur Heart J Digital Health, 2024
7 Chapman-Shaoxing (Arrhythmia) physionet.org ↗ 12-lead · 10 s · 500 Hz 45,152 45,152 Open
CC BY 4.0
Not started Chapman Univ.; Shaoxing People's Hospital & Ningbo First Hospital
China / USA
Zheng et al., Scientific Reports, 2020
8 St Petersburg INCART 12-Lead Arrhythmia Database physionet.org ↗ 12-lead · 30 min · 257 Hz 32 75 Open Not started St. Petersburg Institute of Cardiological Technics (INCART)
Russia
Dataset DOI
9 Lobachevsky University ECG Database (LUDB) physionet.org ↗ 12-lead · 10 s · 500 Hz · manually annotated waves 200 200 Open
ODC Attribution
Not started Nizhny Novgorod City Hospital No. 5 / Lobachevsky University
Russia
IEEE Access, 2020
10 Brugada-HUCA physionet.org ↗ 12-lead · 12 s · 100 Hz 363 363 Open
CC BY-SA 4.0
Not started Hospital Universitario Central de Asturias (HUCA)
Spain
Dataset DOI
11 KURIAS-ECG physionet.org ↗ 12-lead · 10 s · 500 Hz · SNOMED CT + OMOP-CDM 13,862 20,000 Restricted
Pending audit
Not started Korea University Anam Hospital
South Korea — Seoul
Dataset DOI
12 Leipzig Heart Center ECG Database physionet.org ↗ 12-lead + intracardiac EGM · variable · 977 Hz 39 39 Open
ODC Attribution
Not started Leipzig Heart Center
Germany
Dataset DOI
13 Norwegian Endurance Athlete ECG Database physionet.org ↗ 12-lead · 10 s · 500 Hz 28 28 Open
CC BY 4.0
Not started University of Oslo
Norway
Dataset DOI
14 MHD Effect on 12-Lead ECGs in MRI Scanners physionet.org ↗ 12-lead + 3-lead · variable · 1,024 Hz 23 43 Open Not started Otto-von-Guericke University of Magdeburg
Germany
Krug et al., CinC 2017
15 Wilson Central Terminal ECG Database physionet.org ↗ 37 signals (12 std + WCT + limb) · 10 s 92 540 Open
ODC Attribution
Not started MARCS Institute, Western Sydney Univ.; Campbelltown Hospital
Australia
Machines, 2016
16 CiPA ECG Validation Study physionet.org ↗ 12-lead · 10 s 60 5,749 segments Open
ODC Attribution
Not started Phase I Clinical Pharmacology Study (NCT03070470)
USA
Clin Pharmacol Ther, 2018
17 ECG Effects of Dofetilide, Moxifloxacin and Combinations (ECGDMMLD) physionet.org ↗ 12-lead · 10 s · 500 Hz (upsampled to 1 kHz) 22 4,211 segments Open
ODC Attribution
Not started NCT02308748
USA
Clin Pharmacol Ther, 2016
18 ECG Effects of Ranolazine, Dofetilide, Verapamil, Quinidine (ECGRDVQ) physionet.org ↗ 12-lead · 10 s · 500 Hz 22 5,232 segments Open
ODC Attribution
Not started Clinical Pharmacology Study
USA
Clin Pharmacol Ther, 2014
19 Eye Tracking Dataset for 12-Lead ECG Interpretation physionet.org ↗ 12-lead ECG images · eye tracking at 60 Hz 63 interpreters 630 sessions Open
ODC ODbL
Not started Qatar Biomedical Research Institute, Hamad bin Khalifa Univ.
Qatar
JMIR, 2022
20 EchoNext physionet.org ↗ 12-lead · 10 s · 250 Hz Not disclosed 100,000 Restricted Not started Columbia University Irving Medical Center
USA — New York, NY
Poterucha et al., Nature, 2025
21 Symile-MIMIC physionet.org ↗ 12-lead · 10 s · 500 Hz · multimodal (ECG + CXR + labs) 9,573 11,622 Credentialed Not started MIT LCP / BIDMC
USA
Saporta et al., NeurIPS 2024
22 PhysioNet/CinC Challenge 2020 physionet.org ↗ 12-lead · 6–60 s · 257–1,000 Hz ~52,501 Open
CC BY 4.0
Not started CPSC, INCART, PTB, PTB-XL, Georgia
Multi-national (China, Russia, Germany, USA)
Physiol Meas, 2020
23 PhysioNet/CinC Challenge 2021 physionet.org ↗ 12-lead (+ reduced-lead) · 5–144 s · 250–1,000 Hz ~130,862 Open
CC BY 4.0
Not started CPSC, INCART, PTB-XL, Georgia, Chapman-Shaoxing, Ningbo, UMich
Multi-national (China, Russia, Germany, USA)
CinC 2021
24 STAFF III Database physionet.org ↗ 12-lead · variable duration · 1,000 Hz · 0.625 µV resolution · WFDB 104 152 inflations Open
ODC Attribution
Not started Charleston Area Medical Center; Blekinge Hospital
USA / Sweden
Martínez et al., CinC 2017

12-Lead ECG Datasets (Other Repositories)

15 datasets
# Dataset Format Patients Records Access Status Origin Paper
1 CPSC 2018 (China Physiological Signal Challenge 2018) icbeb.org ↗ 12-lead · 6–60 s · 500 Hz · MATLAB .mat ~6,877 6,877 Open Not started 11 hospitals (ICBEB, Nanjing)
China
Liu et al., J. Med. Imaging Health Inform., 2018
2 Chapman-Shaoxing ECG Database (10,646 patients) figshare.com ↗ 12-lead · 10 s · 500 Hz · CSV 10,646 10,646 Open
CC BY 4.0
Not started Chapman University; Shaoxing People's Hospital
China / USA
Zheng et al., Scientific Data, 2020
3 Ningbo First Hospital ECG Database (Idiopathic Ventricular Arrhythmia) figshare.com ↗ 12-lead · 10 s · 500 Hz · CSV 334 334 Open
CC BY 4.0
Not started Chapman University; Ningbo First Hospital, Zhejiang University
China / USA
Zheng et al., Scientific Data, 2020
4 Shandong Provincial Hospital ECG Database (SPHDB) figshare.com ↗ 12-lead · 10–60 s · 500 Hz · HDF5 24,666 25,770 Open
CC BY 4.0
Not started Shandong Provincial Hospital
China
Liu et al., Scientific Data, 2022
5 CODE-15% (Telehealth Network of Minas Gerais, 15% subset) zenodo.org ↗ 12-lead · ~10 s · 400 Hz · HDF5 233,770 345,779 Open
CC BY 4.0
Not started Telehealth Network of Minas Gerais (TNMG)
Brazil
Ribeiro et al., Nature Communications, 2020
6 CODE-test (827-record hold-out test set) zenodo.org ↗ 12-lead · 7–10 s · 400 Hz · HDF5 827 827 Open
CC BY 4.0
Not started Universidade Federal de Minas Gerais / TNMG
Brazil
Ribeiro et al., Nature Communications, 2020
7 CODE (Full Dataset, ~2.3M records) scilifelab.se ↗ 12-lead · 400 Hz · HDF5 ~1,676,384 ~2,322,513 Restricted
DUA required
Not started Telehealth Network of Minas Gerais (TNMG)
Brazil
Ribeiro et al., Nature Communications, 2020
8 SaMi-Trop (Chagas Cardiomyopathy Cohort) zenodo.org ↗ 12-lead · 400 Hz · HDF5 1,631 1,631 Open
CC BY 4.0
Not started UFMG; Uppsala University; EPFL
Brazil / Sweden / Switzerland
Lima et al., medRxiv, 2021
9 IKEM Dataset (Institute for Clinical and Experimental Medicine, Prague) zenodo.org ↗ 12-lead (stored as 8 reduced leads) · 10 s · 500 Hz · HDF5 30,290 98,130 Open
CC BY 4.0
Not started IKEM (Institute for Clinical and Experimental Medicine)
Czech Republic — Prague
Seják et al., Knowledge-Based Systems, 2023
10 MedalCare-XL (Synthetic 12-Lead ECGs from Simulations) zenodo.org ↗ 12-lead · 10 s · 500 Hz · CSV (raw/noise/filtered variants) 0 (synthetic) 16,900 Open
CC BY 4.0
Not started Medical Univ. of Graz; KIT; PTB; Univ. of Edinburgh
Austria / Germany / UK
Gillette et al., Scientific Data, 2023
11 Harvard-Emory ECG Database (HEEDB) bdsp.io ↗ 12-lead · 10 s · 250/500 Hz · WFDB 2,167,795 11,607,261 Credentialed
DUA (BDSP)
Not started Massachusetts General Hospital; Emory University Hospital
USA — Boston & Atlanta
Koscova et al., Scientific Data, 2026
12 Nightingale BWH Emergency Dept ECG Dataset ngsci.org ↗ 12-lead · 100 Hz · NumPy arrays 30,933 103,952 Restricted
Institutional credentials
Not started Brigham and Women's Hospital
USA — Boston, MA
Mullainathan & Obermeyer, QJE, 2021
13 Nightingale NTUH Cardiac Arrest ECG Dataset ngsci.org ↗ 12-lead · ~500 Hz · XML/array 10,950 18,072 Restricted
Institutional credentials
Not started National Taiwan University Hospital, Emergency Dept
Taiwan
Obermeyer et al., Nature Medicine, 2022
14 GU-ECG (Gazi University, PTCA-induced Ischaemia) mendeley.com ↗ 12-lead continuous · 8,800 Hz · 24-bit · .ekg format 74 222 Open
CC BY 4.0
Not started Gazi University Faculty of Medicine; Bilkent University
Turkey
Dataset DOI
15 ZZU pECG (Zhengzhou University Pediatric ECG Database) figshare.com ↗ 12-lead + 9-lead · 5–120 s · 500 Hz · WFDB 11,643 children 14,190 Open Not started First Affiliated Hospital of Zhengzhou University
China
Scientific Data, 2025

2-Lead ECG Datasets

12 datasets
# Dataset Format Patients Records Access Status Origin Paper
1 MIT-BIH Arrhythmia Database physionet.org ↗ 2-lead (MLII + V1) · 30 min · 360 Hz · WFDB 47 48 Open
ODC Attribution
Not started Beth Israel Hospital / MIT
USA
Moody & Mark, IEEE EMBS 2001
2 MIT-BIH Atrial Fibrillation Database physionet.org ↗ 2-lead · 10 h · 250 Hz · WFDB 25 25 Open
ODC Attribution
Not started Beth Israel Hospital
USA
Moody & Mark, CinC 1983
3 Long-Term AF Database (LTAFDB) physionet.org ↗ 2-lead · 24–25 h · 128 Hz · WFDB 84 84 Open
ODC Attribution
Not started Northwestern University; MEDICALgorithmics
USA / Poland
Petrutiu et al., Europace 2007
4 MIT-BIH Normal Sinus Rhythm Database physionet.org ↗ 2-lead · ~24 h · 128 Hz · WFDB 18 18 Open
ODC Attribution
Not started Beth Israel Hospital
USA
Dataset DOI
5 MIT-BIH Supraventricular Arrhythmia Database physionet.org ↗ 2-lead (MLII + V1) · 30 min · 360 Hz · WFDB 78 Open
ODC Attribution
Not started MIT / Harvard-MIT HST
USA
Greenwald, PhD thesis, Harvard-MIT 1990
6 European ST-T Database (EDB) physionet.org ↗ 2-lead ambulatory · 2 h · 250 Hz · WFDB 79 90 Open
ODC Attribution
Not started CNR Institute for Clinical Physiology, Pisa; European Society of Cardiology
Italy
Taddei et al., Eur Heart J 1992
7 BIDMC Congestive Heart Failure Database physionet.org ↗ 2-lead · ~20 h · 250 Hz · WFDB 15 15 Open
ODC Attribution
Not started Beth Israel Deaconess Medical Center
USA — Boston, MA
Baim et al., J Am Coll Cardiol 1986
8 Sudden Cardiac Death Holter Database physionet.org ↗ 2-lead · 4–25 h · 250 Hz · WFDB 23 23 Open
ODC Attribution
Not started MIT
USA
Greenwald, MS thesis, MIT 1986
9 QT Database (QTDB) physionet.org ↗ 2-lead · 15 min · various Hz · WFDB 105 Open
ODC Attribution
Not started MIT / PhysioNet
USA
Laguna et al., CinC 1997
10 SHDB-AF (Saitama Holter Database — Atrial Fibrillation) physionet.org ↗ 2-lead (CC5 + NASA) · ~24 h · 125 Hz · WFDB 122 128 Open
ODC Attribution
Not started Saitama Medical University International Medical Center
Japan
Tsutsui et al., Scientific Data 2025
11 MIT-BIH ST Change Database physionet.org ↗ 2-lead · variable length · 360 Hz · WFDB · mostly exercise stress ECGs 28 Open
ODC Attribution
Not started MIT / PhysioNet
USA
Dataset DOI
12 Long-Term ST Database (LTSTDB) physionet.org ↗ 2–3 lead · 21–24 h · 250 Hz · WFDB · annotated ST episodes 80 86 Open
ODC Attribution
Not started Ljubljana; Pisa; Cambridge
Multi-national (EU)
Jager et al., Med Biol Eng Comput, 2003

1-Lead ECG Datasets

10 datasets
# Dataset Format Patients Records Access Status Origin Paper
1 Icentia11k Single Lead Continuous ECG physionet.org ↗ 1-lead (modified Lead I) · ~70 min/seg · 250 Hz 11,000 541,794 segments Open
CC BY-NC-SA 4.0
Not started Université de Montréal; Icentia Inc.
Canada
Tan et al., CinC 2021
2 PhysioNet/CinC Challenge 2017 (AF Classification) physionet.org ↗ 1-lead (AliveCor) · 9–61 s · 300 Hz 12,186 Open
ODC Attribution (training)
Not started AliveCor Inc. / MIT-Harvard PhysioNet
USA
Clifford et al., CinC 2017
3 Apnea-ECG Database physionet.org ↗ 1-lead · 7–10 h overnight · 100 Hz ~70 70 Open
ODC Attribution
Not started Philipps-University Marburg
Germany
Penzel et al., CinC 2000
4 ECG-ID Database physionet.org ↗ 1-lead (Lead I, wrist) · 20 s · 500 Hz 90 310 Open
ODC Attribution
Not started Electrotechnical University "LETI"
Russia — St. Petersburg
Lugovaya, MSc thesis, 2005
5 Post-Ictal Heart Rate Oscillations in Partial Epilepsy physionet.org ↗ 1-lead · overnight continuous · 200 Hz 5 7 Open
ODC Attribution
Not started Beth Israel Deaconess Medical Center / Harvard
USA — Boston, MA
Al-Aweel et al., Neurology 1999
6 tOLIet (Thigh-based ECG, toilet seat) physionet.org ↗ 1-lead (thigh, dry polymer electrodes) · up to 5 min · 1,000 Hz 86 149 Open
CC BY 4.0
Not started Centro Hospitalar Universitário de Lisboa Central (CHULC)
Portugal — Lisbon
Silva et al., Scientific Data 2026
7 Brno University of Technology ECG Quality Database (BUT QDB) physionet.org ↗ 1-lead (Bittium Faros 180) + 3-axis accel. · ≥24 h · 1,000 Hz 15 18 Open
CC BY 4.0
Not started Brno University of Technology
Czech Republic
Smital et al., IEEE TBME 2020
8 VitalDB Arrhythmia Database physionet.org ↗ 1-lead (Lead II, intraoperative) · ~20 min median · 500 Hz 482 482 Open
CC BY 4.0
Not started Seoul National University Hospital
South Korea
Eun et al., Scientific Data 2026
9 Preterm Infant Cardio-Respiratory Signals Database (PICSDB) physionet.org ↗ 1-lead (single channel from bedside monitor) · 20–70 h · 500 Hz 10 infants 10 Open
ODC Attribution
Not started UMass Memorial Healthcare NICU
USA — Worcester, MA
Shamout et al., IEEE TBME 2017
10 ECG-Capable Smartwatches Dataset physionet.org ↗ 1-lead (Lead I) · 10 s · 4 smartwatch models + reference (synthetic) 0 (synthetic) 915 Restricted
DUA required
Not started Instituto Ramón y Cajal de Investigación Sanitaria
Spain
Recas et al. (pending)

3-Lead ECG Datasets

2 datasets
# Dataset Format Patients Records Access Status Origin Paper
1 St. Vincent's / UCD Sleep Apnea Database (UCDDB) physionet.org ↗ 3-lead Holter (V5, CC5, V5R) · overnight PSG · 128 Hz · EDF 25 25 Open
ODC Attribution
Not started St. Vincent's University Hospital / University College Dublin
Ireland
Dataset DOI
2 MIMIC-III Waveform Database Matched Subset physionet.org ↗ 1–5 ECG leads · typically 3-lead ICU (Lead II, V, AVR) · continuous · 125 Hz · WFDB 10,282 22,317 Open
ODbL
Not started Beth Israel Deaconess Medical Center
USA — Boston, MA
Johnson et al., Scientific Data, 2016

BSPM / ECGI Datasets

1 dataset
# Dataset Format Patients Records Access Status Origin Paper
1 EDGAR (Experimental Data & Geometric Analysis Repository) ecg-imaging.org ↗ BSPM (64+ leads) + torso geometry + CT · human, canine & simulation · MATLAB/SCIRun Multiple Multiple datasets Open
Free registration
Not started Univ. of Utah; Charles Univ. Hospital; Karlsruhe Institute of Technology
Multi-national (USA / Czech Republic / Germany)
Aras et al., J Electrocardiol, 2015

Dataset Statistics
Interactive visualisations derived from the full 64-dataset catalogue

Datasets by Lead Category

Distribution across the four lead-count groups

Access Type Breakdown

Open vs credentialed vs restricted across all datasets

Datasets by Country of Origin

Primary country attributed to each dataset (multi-national datasets counted once)

Top Datasets by Record Count (log scale)

Largest datasets ranked by number of records or segments

Access Type by Lead Category

How open access varies across lead-count categories

Sampling Frequency Distribution

How many datasets use each common sampling rate