Machine learning can reliably predict malignancy of BI-RADS 4a and 4b breast lesions based on clinical and ultrasonographic features

Authors

  • Isabela Panzeri Carlotti Buzatto Department of Obstetrics and Gynecology, Breast Disease Division, Ribeirão Preto Medical School, Universidade de São Paulo – Ribeirão Preto (SP), Brazil.
  • Daniel Guimarães Tiezzi Department of Obstetrics and Gynecology, Breast Disease Division, Ribeirão Preto Medical School, Universidade de São Paulo – Ribeirão Preto (SP), Brazil.
  • Sarah Abud Recife Department of Gynecology & Obstetrics, Women’s Health Reference Center of Ribeirão Preto (MATER), Ribeirão Preto Medical School, Universidade de São Paulo – Ribeirão Preto (SP), Brazil.
  • Ruth Morais Bonini Department of Radiology, Hospital de Amor de Campo Grande – Campo Grande (MS), Brazil.
  • Licerio Miguel Department of Gynecology & Obstetrics, Women’s Health Reference Center of Ribeirão Preto (MATER), Ribeirão Preto Medical School, Universidade de São Paulo – Ribeirão Preto (SP), Brazil.
  • Liliane Silvestre Department of Obstetrics and Gynecology, Breast Disease Division, Ribeirão Preto Medical School, Universidade de São Paulo – Ribeirão Preto (SP), Brazil
  • Nilton Onari Department of Radiology, Hospital de Amor de Campo Grande – Campo Grande (MS), Brazil.
  • Ana Luiza Peloso Araujo Faim Department of Radiology, Hospital de Amor de Campo Grande – Campo Grande (MS), Brazil.

DOI:

https://doi.org/10.29289/259453942023V33S1002

Keywords:

ultrasonography, machine learning, artificial intelligence, image-guided biopsy

Abstract

Objective: The objectives of this study were to establish the most reliable machine learning model to predict malignancy in BI-RADS 4a and 4b breast lesions and optimize the negative predictive value to minimize unnecessary biopsies.
Methodology: We included clinical and ultrasonographic attributes from 1,250 breast lesions from four institutions classified as BI-RADS 3, 4a, 4b, 4c, 5, and 6. We selected the most informative attributes to train the models in order to make
inferences about the diagnosis of BI-RADS 4a and 4b lesions (validation dataset). Using the best parameters and hyperparameters selected, we tested the performance of nine models and 1,530 ensemble models. Results: The most informative attributes were shape, margin, orientation, and size of the lesions, the resistance index of the internal vessel, the age
of the patient, and the presence of a palpable lump. The highest mean NPV was achieved with XGBoost (93.6%). The final
performance of the best ensemble model was NPV=96.4%, sensitivity=81.5%, specificity=84.1%, PPV=46.8%, f1-score=59.5%,
and the final accuracy=83.7%. Age was the most important attribute to predict malignancy. The use of the final model
associated with the patient’s age would reduce by 51% the number of biopsies in women with BI-RADS 4a or 4b lesions.
Conclusion: Machine learning can predict malignancy in BI-RADS 4a and 4b breast lesions identified by ultrasonography, based on clinical and ultrasonographic features. Our final prediction model would be able to avoid 51% of the 4a and
4b breast biopsies, without missing any cancers.

Downloads

Download data is not yet available.

Downloads

Published

2026-03-12

How to Cite

Buzatto, I. P. C., Tiezzi, D. G., Recife, S. A., Bonini, R. M., Miguel, L., Silvestre, L., … Faim, A. L. P. A. (2026). Machine learning can reliably predict malignancy of BI-RADS 4a and 4b breast lesions based on clinical and ultrasonographic features. Mastology, 33(suppl.1). https://doi.org/10.29289/259453942023V33S1002

Issue

Section

Oral Presentation