Nguyen et al. (2023) - VinDr-Mammo: A Large-Scale Benchmark Dataset for CAD in Full-Field Digital Mammography
Summary
VinDr-Mammo is a large-scale Vietnamese full-field digital Mammography (FFDM), which consists of 5,000 exams (20,000 images) created to support Computer-Aided Detection (CADe) and Computer-Aided Diagnosis (CADx) research.
Contains 4 standard views for each patient (L/R CC and MLO) and provides:
- Breast-level BI-RADS assessment.
- Breast density (ACR categories A-D)
- Extensive lesion-level bounding-box annotations for masses, calcifications, asymmetries, distortions and other suspicious features.
- Double-reading with abritration. That is, two radiologists read each exam, if they disagree, a third one resolve the conflict.
One of the largest, publically accessible FFDM datasets.
Dataset
- Time window: 2018–2020
- Equipment vendors: Siemens, IMS, Planmed
- All images are for-presentation DICOMs (no raw for-processing images saved by hospitals).
- Images de-identified:
- DICOM PHI scrubbed
- Text burned into corners masked out
- Source:
- Images sourced from two major hospitals in Hanoi, Vietnam:
- Hospital 108 (H108)
- Hanoi Medical University Hospital (HMUH)
- Images sourced from two major hospitals in Hanoi, Vietnam:
- Format: DICOM
- Views: CC & MLO (left & right)
- Total: 20,000 images / 5,000 exams
- Stored in per-study folders with encoded IDs.
Labels and Annotations
- Breast-level (per image)
- BI-RADS scale:
- 1, 2, 3, 4, 5 (no 6; pathology unavailable)
- Breast density: A-D
- Metadata also has: laterality, view, image size, split.
- BI-RADS scale:
- Lesion level:
- Bounding box coordinates
- Categories:
- mass
- suspicious calcification
- asymmetry (global / focal)
- architectual distortion
- associated findings: skin thickening, skin retraction, nipple retraction, suspicious lymph node
- Each lession includes own BI-RADS assessment.
- Annotation protocol:
- Double-reading by experienced radiologists.
- Arbitraty third read if disagreement.
- Readers average 19 years clinical experience and interpret ~10k-15k mammograms/year
- Web-based tool: VinDR Lab (OHIF-based viewer)
- Stratified Data Split
- 4,000 training exams
- 1,000 test exams
- Iterative stratification preserving distributions across:
- BI-RADS
- density
- lesion categories
Technical Validation
- Privacy & Data Quality
- DICOM metadata reviewed manually post-scrubbing
- Image inspection to ensure absence of PHI
- Annotation errors prevented by built-in QA rules (e.g., cannot mark a lesion while selecting BI-RADS 1)
- Dataset Characteristics (Selected)
- BI-RADS distribution (overall 10,000 breasts):
- BI-RADS 1: 67.0%
- BI-RADS 2: 23.4%
- BI-RADS 3: 4.7%
- BI-RADS 4: 3.8%
- BI-RADS 5: 1.1%
- BI-RADS distribution (overall 10,000 breasts):
- Density distribution:
- A: 0.5%
- B: 9.5%
- C: 76.5% (dominant)
- D: 13.5%
- Lesion counts (entire dataset):
- Masses: 1,226
- Suspicious calcifications: 543
- Asymmetries (global, focal): 392
- Architectural distortions: 119
- Lymph nodes / retractions: <60 each
Significance
VinDr-Mammo addresses several key gaps in mammography AI research:
1. Scale & diversity:
* Largest public FFDM dataset with rich annotation detail.
2. Population representation:
* Introduces Southeast Asian data—critical given known domain shift across ethnicities, vendors, and clinical pipelines.
3. Realistic screening distribution:
* Mix of diagnostic + screening exams, and clinically realistic prevalence.
4. Bounding-box–level lesion annotations:
* Suitable for CADe, object detection, and weakly supervised methods.
5. Standardized split ensures reproducibility across publications.
Limitations:
- No pathology-proven labels → BI-RADS 4/5 are only suspicious, not confirmed malignancies.
- Some findings have very low sample counts (<40).
- DICOMs are not fully compliant with all processing libraries.
Comparisons
Moreira et al. (2012) - INbreast: Toward a Full-field Digital Mammographic Database
| Feature | INbreast | VinDr-Mammo |
|---|---|---|
| Size | 115 cases | 5,000 exams — ~40× larger |
| Population | Portuguese | Vietnamese (underrepresented population) |
| Annotation precision | Pixel-accurate contours | Bounding boxes (faster + scalable) |
| Pathology labels | Included for many lesions | Not available |
| Imaging | Siemens FFDM | Siemens / IMS / Planmed |
| Use cases | CADx, segmentation | CADe, detection, multi-label classification |
Key distinction:
INbreast remains the gold standard for pixel-level precision, but is tiny; VinDr-Mammo is designed for scale, population diversity, and bounding-box detection workflows.
| Feature | CBIS-DDSM | VinDr-Mammo |
|---|---|---|
| Imaging | Digitised film | Full-field digital mammography |
| Size | ~1,644 cases | 5,000 exams |
| Lesion types | Mass, calcification | Mass, calcification, distortion, multiple asymmetries, associated features |
| Annotations | Updated segmentations | Bounding boxes |
| Pathology labels | Yes | No |
| Image quality | Variable (film artefacts, scanner noise) | High-quality clinical FFDM |
CBIS-DDSM is rich and pathology-linked but suffers from film-scanner limitations.
VinDr-Mammo provides modern FFDM, higher consistency, vastly larger scale, and more comprehensive lesion categories.
Reference
- Nguyen, H. T., Nguyen, H. Q., Pham, H. H., et al. (2023). VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography. Scientific Data, 10(1), 277. https://doi.org/10.1038/s41597-023-02100-7