Richard Osuala
Universitat de Barcelona, Spain
Challenge Co-LeadDynamic contrast-enhanced MRI (DCE-MRI) plays a central role in breast cancer management, but its reliance on gadolinium-based contrast agents raises various concerns. MAMA-SYNTH introduces a standardized, clinically informed benchmark for evaluating generative models, with the goal of advancing the development of contrast-reduced and contrast-free breast MRI protocols.
Why?Gadolinium deposits, even from chelated agents, can lead to long-term accumulation, potential neurotoxicity and trigger nephrogenic systemic fibrosis.
Gadolinium is detected in drinking water supplies worldwide and beverages, raising concerns about long-term exposure and environmental health impacts.
Gadolinium contrast agents significantly increase the cost of MRI examinations, limiting accessibility in resource-constrained settings.
The task of the challenge is to synthesize single-timepoint 2-dimensional post-contrast breast DCE-MRI slices from corresponding pre-contrast T1-weighted MRI inputs using paired clinical data. Participating algorithms operate on pre-contrast images and generate synthetic peak enhancement post-contrast output.
Pre-contrast
Peak-enhancement
The challenge utilizes diverse datasets to ensure algorithmic generalizability across different scanners and populations:
This dataset contains pre-treatment DCE-MRI from 1,506 patients from 25 + centers across the United States. The MAMA-MIA dataset was utilized as the training set in the first edition of MAMA Challenges for Primary Tumor Segmentation and Pathologic Complete Response Prediction. Learn more about the MAMA-MIA benchmark at the MAMA-MIA Challenge 2025 and access the complete dataset at MAMA-MIA Dataset with key properties shown below:
We note that participants are allowed to train their models on any further dataset, as long as said dataset is publicly available. To ensure fair evaluation across participating teams, the usage of private data is not allowed in this challenge. We further note that by participating in this challenge, participants agree to comply with EO 14117, 28 CFR Part 202, and Guide Notice NOT-OD-25-083 and acknowledge that the usage of NIH Controlled-access Data Repositories (CADRs) is prohibited in this challenge.
The test data were acquired from two external centers located in the Netherlands and Argentina. Each test case refers to a 2D slice extracted from a patientβs DCE scan.
For each patient, the slice containing the largest malignant tumor area is selected from the peak enhancement phase.
The peak-enhancement phase is defined as the time point with the highest signal intensity within the tumor region.
All images are fat-suppressed and acquired in the axial plane. The main statistics are summarized below:
| Field | Radboud UMC The Netherlands |
Instituto Alexander Fleming Argentina |
|---|---|---|
| Number of Cases | 200 | 100 |
| Image Dimension | 416 Γ 416 px | 512 Γ 512 px |
| Contrast Agent | DOTAREM (99%), GADOVIST (0.5%) | DOTAREM, GADOVIST |
| Manufacturer | Siemens | GE |
| Magnetic Field Strength | 3T | 1.5T |
| Molecular Subtype | ||
| β³ Luminal | 165 (85.7 %) | 37 (37 %) |
| β³ Triple Negative | 23 (9.4 %) | 30 (30 %) |
| β³ Other | 12 (4.9 %) | 20 (20 %) |
Submissions are evaluated across four metric groups spanning pixel-level fidelity, perceptual realism, diagnostic classification performance, and segmentation accuracy.
MSE measures the average squared difference between each pixel in the synthesized output and its corresponding pixel in the ground-truth post-contrast image. Lower values indicate greater pixel-level fidelity.
LPIPS computes perceptual distance between images using deep network feature activations, capturing texture and structural similarity closer to human perception than pixel-wise metrics. Lower values indicate more realistic synthesis.
SSIM evaluates image quality by jointly measuring luminance, contrast, and structural similarity within local patches. Applied to the tumor ROI, it captures how well the synthesized enhancement texture matches the reference. Values range from 0β1; higher is better.
FRD adapts the FID framework to radiomic feature space, measuring the FrΓ©chet distance between the feature distributions of the synthesized ROI patches and real post-contrast patches. Lower values indicate that the synthesized tumors are more statistically indistinguishable from real enhancement patterns.
Measures the classifier's ability to distinguish Luminal A/B tumors from all other subtypes on synthesized images. A score of 1.0 indicates perfect separation; 0.5 is chance.
Measures the classifier's ability to distinguish Triple-Negative tumors from all other subtypes on synthesized images. A score of 1.0 indicates perfect separation; 0.5 is chance.
The Dice coefficient measures voxel-level overlap between the predicted segmentation mask on synthesized post-contrast and the ground-truth mask. It is the harmonic mean of precision and recall over the segmented region. Values range from 0β1; higher values indicate better spatial overlap.
The 95th-percentile Hausdorff Distance (HD95) measures the worst-case boundary deviation between the predicted segmentation mask on synthesized post-contrast and reference segmentation contours, excluding the top 5% of outlier distances for robustness. Lower values indicate more precise boundary delineation.
| May 1 | Validation Phase Opens | |
| June 15 | Test Phase Opens | |
| June 30 | Last Submission Deadline | |
| August 1 | Official Results Release | |
| October 8 | Winners Announcement at Deep-Breath Workshop (MICCAI 2026) | |
Universitat de Barcelona, Spain
Challenge Co-Lead
Universitat de Barcelona, Spain
Challenge Co-Lead
Radboud University Medical Centre, Netherlands
Challenge Co-Lead
Radboud University Medical Centre, Netherlands
Universitat de Barcelona, Spain
Universitat de Barcelona & CVC, Spain
Instituto Alexander Fleming, Argentina
The Netherlands Cancer Institute (NKI), Netherlands
Universitat de Barcelona & CVC, Spain
Universitat de Barcelona & ICREA, Spain
Radboud University Medical Centre, Netherlands
Radboud University Medical Centre, Netherlands
Radboud University Medical Centre, Netherlands
Instituto Alexander Fleming, Argentina
For inquiries about the MAMA-SYNTH challenge, feel free to reach out to Richard Osuala (richard.osuala[at]ub.edu), and Smriti Joshi (smriti.joshi[at]ub.edu).