10,000 echocardiogram cine loops with view labels and chamber segmentation masks for foundation model pre-training
OpenWe are pre-training a large cardiac ultrasound foundation model intended to serve as a general-purpose feature extractor for a broad range of downstream echocardiography AI tasks, including LVEF regression, valvular disease severity grading, diastolic function classification, and structural congenital anomaly detection. Diversity of acquisition views, patient demographics, scanner vendors, image quality levels, and disease states is the primary data requirement; this dataset is explicitly designed to span the full real-world distribution of clinical echo data rather than a curated high-quality subset, ensuring robust representation learning across the full spectrum of clinical practice.
Required acquisitions cover all standard transthoracic echocardiography views: apical 4-chamber (A4C), apical 2-chamber (A2C), and apical 3-chamber (A3C); parasternal long-axis (PLAX); parasternal short-axis at aortic valve level (PSAX-AV), mitral valve level (PSAX-MV), and papillary muscle level (PSAX-PM); subcostal 4-chamber (SC4C) and subcostal inferior vena cava (SCIVS); and suprasternal notch (SSN). B-mode cine loops are the primary modality. Color Doppler overlays, pulsed-wave Doppler spectral tracings, continuous-wave Doppler recordings, and M-mode sweeps through the left ventricle and mitral valve are also accepted and must be labelled by modality. Cine loop duration may range from one to ten cardiac cycles; single-frame still images without temporal context are excluded from this request. DICOM format is mandatory to preserve acquisition metadata embedded in standard tags — imaging depth, transducer frequency, mechanical index, scanner manufacturer and model, gain and time-gain compensation settings — all of which will serve as auxiliary conditioning inputs during foundation model pre-training.
De-identification must comply with the DICOM PS3.15 Annex E Basic Application Level Confidentiality Profile, with explicit removal of patient name, date of birth, institution name, referring physician, device serial number, and any burned-in annotation text overlaying pixel data. A per-batch de-identification certificate confirming the method and software version used is required. GDPR-compliant pseudonymisation is acceptable for European institutions in lieu of full anonymisation, provided a subject pseudonym key is retained securely by the contributing institution and never transferred.
Annotation requirements are intentionally lightweight to enable dataset scale: each cine loop requires only a view-classification label drawn from the controlled vocabulary (A4C, A2C, A3C, PLAX, PSAX-AV, PSAX-MV, PSAX-PM, SC4C, SCIVS, SSN, or Other) and a sonographer-assigned image quality score on a three-point scale (1 poor, 2 adequate, 3 good). For a 20% random stratified subsample of 2,000 studies, pixel-level segmentation masks of the left ventricle endocardium and epicardium, right ventricle endocardium, and left atrial endocardium at end-diastole are required to enable supervised fine-tuning experiments in parallel with self-supervised pre-training. Anatomical keypoints — medial and lateral mitral annular hinge points and the LV apex — are requested for the same 2,000-study subsample to facilitate alignment-based data augmentation. Scanner vendor diversity targets: minimum 2,000 studies each from GE, Philips, Siemens, and Canon platforms, with remaining studies from other vendors or mixed sources.
Medical imagingUltrasoundDICOMJSONPNG / JPG
0 / 10000 scans0%