GetDATA

Frequently Asked Questions

Everything you need to know about sourcing and providing de-identified medical data and expert labels on GetDATA.

FAQ

How does medical data crowdsourcing work?
A researcher posts a request describing the data they need; verified hospitals and labs then upload matching de-identified data and expert labels, and every batch is quality-scored before it is accepted.
How can I get medical data for AI training?
Post a data request specifying modality, anatomy, label schema, class balance and quantity. Verified providers fulfill it with de-identified, expert-labeled datasets tailored to your model — instead of scraping limited public sets.
Where can I source de-identified medical imaging datasets?
GetDATA is a marketplace where you source de-identified imaging datasets — X-ray, CT, MRI, ultrasound, ECG and pathology — directly from verified healthcare providers, with provenance and compliance built in.
How do I request a custom dataset like chest X-rays or ECG waveforms?
Use the guided request form to define the modality (for example chest X-ray or 12-lead ECG), label schema, cohort criteria, quantity and budget. You can also browse category pages such as the ECG or chest X-ray catalogs to see live requests.
What imaging modalities and data types are available?
X-ray and chest radiography, CT, MRI, ultrasound and echocardiography, ECG, and pathology — across anatomical regions including chest, brain, abdomen and heart.
What file formats can I download?
Standard medical formats including DICOM, NIfTI, PNG/JPG, EDF, CSV and JSON, depending on the modality and what you specify in your request.
What labeling and annotation formats are supported?
Classification, bounding boxes, segmentation masks, measurement tables, and custom JSON schemas, delivered alongside the source data.
Who can label the data?
Verified radiologists, clinicians, and trained annotators. Low-quality labels are flagged and rejected by reviewer consensus and automated checks.
Who can provide data on GetDATA?
Verified hospitals, imaging centers and laboratories. Each provider is reviewed before they can fulfill requests, and uploads undergo provenance checks.
How is patient data de-identified and anonymized?
Providers remove protected health information (PHI) from DICOM headers and pixel data before upload, automated PHI scanning flags residual identifiers, and brain scans can be defaced or skull-stripped where required.
Is the medical data HIPAA and GDPR compliant?
Yes. De-identification is required before upload, and the platform supports HIPAA/GDPR, Data Processing Agreements (DPAs) and Business Associate Agreements (BAAs), with audit trails on every transfer.
Is my uploaded data secure?
Uploads are transferred securely with provenance checks, automated PHI scanning, staged deliveries and rolling approvals; access is controlled and fully audited.
How is data quality ensured?
Both data and labels are quality-scored with reviewer consensus, consistency checks and automated QC for drift, and acceptance gates enforce a minimum quality before any payout.
How much does it cost to source medical data?
You set a budget per request and choose pay-per-sample or a fixed dataset price. Final cost depends on modality, labeling depth and quantity — see the pricing page for plan tiers.
How do data providers get paid?
Payouts split proportionally by approved volume and quality score across both data providers and annotators.
How long does it take to fulfill a data request?
Typical fulfillment completes in weeks, depending on modality, cohort criteria, labeling depth and quantity.
Can I source data for rare diseases or specific cohorts?
Yes. Specify inclusion and exclusion criteria, demographics and class balance in your request; the multi-site provider network helps assemble rarer cohorts that a single institution cannot supply alone.
How do I make sure a dataset is diverse and generalizable?
Source from multiple verified institutions and scanners. Models trained on a single site often fail to generalize, so multi-site sourcing reduces domain shift.