Artificial intelligence and machine learning for breast cancer recurrence prediction: a data-centric approach
DOI:
https://doi.org/10.29289/259453942025V35S1090Keywords:
breast neoplasms, artificial intelligence, machine learning, survival analysis, medical oncologyAbstract
Introduction: Breast cancer recurrence remains a significant clinical challenge, particularly in resource-constrained
environments where long-term patient monitoring is often limited. Objective: Leveraging recent advances in artificial
intelligence and machine learning, this study proposes a data-driven methodology to predict recurrence in breast cancer
patients based on clinical and histopathological data extracted from electronic medical records. Methods: Through the
development of a structured data processing pipeline, unstructured medical records were transformed into a high-quality
dataset suitable for predictive modeling. The study introduced a stratified modeling approach, beginning with unsupervised clustering to identify distinct patient subgroups based on tumor aggressiveness. Supervised learning models were
subsequently applied to each group, aiming to tailor predictions according to subgroup-specific characteristics. In parallel, survival analysis methods were employed to enhance interpretability and assess long-term outcomes. The ethics committee at Dr. Arnaldo Cancer Institute approved this study. Results: The models developed demonstrated promising performance in predicting recurrence, with results supporting the clinical applicability of the proposed stratified modeling
approach. Notably, the integration of survival analysis enhanced the interpretability of predictions, allowing the identification of patterns related to long-term outcomes. Conclusion: These findings highlight the potential of artificial intelligence-driven tools to support clinical decision-making and personalized follow-up strategies for breast cancer patients.
Ongoing work includes expanding validation across multiple institutions and exploring the integration of molecular biomarkers and advanced deep learning techniques.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Mariana Zuliani Theodoro de Lima, Jaqueline Alvarenga Silveira, Alexandre Ray da Silva, Daniela Gregolin Giannotti, André Luiz da Silva, Rodrigo Macedo da Silva, Jefferson Mazzei, Fabio Francisco Oliveira Rodrigues

This work is licensed under a Creative Commons Attribution 4.0 International License.




