Artificial Intelligence-Driven Prediction and Diagnosis of Breast Cancer using Core Needle Biopsy: A Systematic Review

Dr. (mrs.) R. Srivaramangai; Ms. Siddhi Rajgude; Ms. Manjiri Tamboli

doi:10.17577/IJERTV15IS070008

Volume 15, Issue 07 (July 2026)

Artificial Intelligence-Driven Prediction and Diagnosis of Breast Cancer using Core Needle Biopsy: A Systematic Review

DOI : 10.17577/IJERTV15IS070008

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 0
Authors : Dr. (Mrs.) R. Srivaramangai, Ms. Siddhi Rajgude, Ms. Manjiri Tamboli
Paper ID : IJERTV15IS070008
Volume & Issue : Volume 15, Issue 07 , July – 2026
Published (First Online): 05-07-2026
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Artificial Intelligence-Driven Prediction and Diagnosis of Breast Cancer using Core Needle Biopsy: A Systematic Review

Dr. (Mrs.) R. Srivaramangai

Research Guide, Department of Information Technology

University of Mumbai 2nd Floor, IT Park

Vidyanagari, Santacruz-East Mumbai-400098

Ms. Siddhi Rajgude.

Department of Information Technology, University of Mumbai, India

Ms. Manjiri Tamboli.

Department of Information Technology, University of Mumbai, India

Abstract – Innovations in artificial intelligence (AI) and machine learning have profoundly impacted breast cancers researches, especially in the area of core needle biopsy specimens analysis for diagnostic, histopathological characterization, treatment effectiveness prediction, and tumor prognosis risk stratification. Some researches have gave evidence that deep learning algorithms that analyzed digital histopathology whole-slide images from core needle biopsy tissues could effectively perform tumor identification, subtype classification and pathological complete remission prediction after neoadjuvant chemotherapy, when used alone or in combination/ with complementary imaging methods such as mammography, ultrasound, and magnetic resonance imaging. Besides, AI- based techniques have been highly effective in distinguishing breast lesions that are morphologically similar in biopsy samples, enhancing the reproducibility of histological grading, and addressing the variability between individual pathologists. Besides this, machine learning methods that combine biopsy-derived characteristics with structured clinical data and molecular information have been very helpful in improving the prediction accuracy and helping to plan personalized treatment.. Detailed literature review papers reflect the pros and cons of implementing AI in biopsy- based clinical workflows such as data heterogeneity, improving the prediction

accuracy and helping to plan personalized treatment.Detailed literature review papers reflect the pros and cons of implementing AI in biopsy-based clinical workflows such as data heterogeneity, limited tissue sampling, model interpretability, generalizability, and need for strong clinical validation. On the whole, existing studies are strongly in favour of AI being increasingly used as a decision-support tool in core needle biopsy based breast cancer assessment but also underline the need for large- scale, multi-center studies and prospective validation for dependable clinical translation.

Keywords: Breast cancer; Core needle biopsy; Diagnostic accuracy / Concordance; Artificial intelligence; Deep learning; Radiomics; Digital pathology Neoadjuvant chemotherapy.

INTRODUCTION
1. Background
  
  Breast cancer ranks as the most commonly diagnosed type of cancer among women globally and remains one of the top causes of cancer deaths yet the number of deaths has been significantly reduced due to the development of new diagnostic and treatment methods. An early and precise
  
  diagnosis is very important for effective treatment and better results. Core needle biopsy (CNB) is the main method of sampling tissue for histopathological diagnosis in a minimally invasive fashion. On one hand, the core needle biopsy (CNB) remains an invaluable clinical tool. On the other hand, the correct diagnosis of CNB specimens is not a simple task at all. Small tissue volume, sampling error, tumor heterogeneity, and staining inconsistency are some of the reasons that may lead to the loss of diagnostic accuracy and reproducibility. The differences in pathological interpretation resulting from these human factors can be so significant as to result in a completely different diagnosis or a postponement of clinical decisions. Since conditions diagnosed by biopsy are mainly used to tailor personalized treatment regimens, the availability of objective and reproducible diagnostic tools is a must. [3] [8] [11]
  Artificial intelligence (AI) and machine learning (ML) have come a long way in recent years – these are the main reasons why we now have many new ways to tackle the above challenges. Using deep learning techniques for analyzing digital histopathology whole-slide images related to core needle biopsies, researchers have achieved good results in identifying tumours, classifying subtypes, helping grading, and even predicting pathological complete response after neoadjuvant chemotherapy. Similarly, AI-based examination of other imaging methods and the combination of clinical and pathological information have significantly improved our ability to diagnose and prognosticate. [10][13][17][14][18][22]
  Despite some very promising results, AI-based biopsy analysis has seen only a limited clinical translation so far. The performance reported in the studies varies a lot because of differences in dataset size, annotation strategies used, pre- processing pipelines, and evaluation metrics. Besides, many models are trained on single center or retrospective datasets, which leads to robustness and generalizability concerns across institutions and patient populations. These problems highlight the necessity of well-validated biopsy-based AI systems that can be used with confidence for clinical decision-making in breast cancer diagnosis. [15] [19] [24] [16] [21].
2. Statement of Problem
  
  Even with huge progress in breast cancer diagnostic techniques, interpreting core needle biopsy (CNB) specimens, is still a major challenge. This is due to the fact that only a small amount of tissue is sampled, tumour heterogeneity, sampling variability, and staining inconsistencies. In particular, these restrictions frequently cause inter-observer differences, inconsistencies in diagnosis, and postponement of clinical decisions, most especially in difficult cases. AI-based deep learning models have been able to help biopsy analysis however their clinical use is very limited due to their inconsistent results in different studies, dependence on single- center retrospective datasets, and absence of standard methods for preprocessing and validation. Therefore, a robust AI framework, mainly focused on biopsy, is of utmost importance to improve diagnostic accuracy, reproducibility as well as generalizability across different institutions and patient populations.
3. Purpose of the Research
The main aim of this research is to propose an AI-based system that can provide precise and dependable analysis of breast core needle biopsy (CNB) whole-slide images (WSIs). The objective here is to increase diagnostic accuracy by combining deep learning techniques with sophisticated image pre- processing and feature extraction tools, such as convolutional neural networks (CNN), textural analysis, and stain normalization procedures. The system will help pathologists not only in identifying tumours but also in tumour grading and molecular subtype classification, which are bound to result in better treatment planning and clinical outcomes for patients.
[10] [13] [17] [14] [18] [22]
Besides, this research also aims at overcoming the limitations of existing AI models by enhancing the robustness and generalizability. Standardized data augmentation, evaluation of multi-center datasets, and rigorous validation strategies to reduce bias and prevent over fitting will be the ways to achieve this. The research is also geared towards reducing variability between observers and improving reproducibility, thereby leading to more consistent diagnostic decisions in breast cancer pathology.

The authors of this study also plan to examine how the combination of clinical and pathological metadata with image- based AI analysis can be used to create a comprehensive diagnostic support system. The authors of the study intend to combine image features with clinical variables such as age, hormone receptor status, and tumour markers so that the framework can be used to make prognostic predictions with greater accuracy and to make recommendations for personalized treatment. The integration of different modalities is expected to make the results of AI research more accessible and acceptable to clinicians, thereby facilitating the widespread use of AI in pathology. [14] [18] [22] [10] [13] [17]
LITERATURE REVIEW
[1] One reason why core needle biopsy (CNB) of ductal carcinoma in situ (DCIS) frequently underestimates it is due to the missing invasive component and the current limited performance of clinic pathological and imaging based predictors that lead to the necessity of dependable preoperative modeling. [2] Lymph vascular invasion (LVI) is the major prognostic factor that is linked to tumor recurrence/local and metastasis; however, the identification and incorporation of LVI in the prognostic models is inconsistent which explains the considerable improvement of the models needed to accurately predict the prognostic value of LVI. [3] The major risk factor for breast cancer is breast density without quite a clear histological basis, at the same time, machine learning provides the way to quantitatively characterize morphological changes that are not feasible by the pathologist visually. [4] Although core needle biopsy (CNB) is our standard method for obtaining diagnostic tissue in female breast cancer, the level of accuracy of CNB in diagnosing male breast lesions is still uncertain because of their rarity which is why the importance of very careful sampling and follow-up is emphasized. [5] An excellent prognosis is predicted by a pathological complete response (pCR) to neoadjuvant chemotherapy (NAC). However, pathology remains the reference standard for pCR assessment and none of the imaging modalities is sufficiently reliable, while the reliability of core needle biopsy (CNB) after NAC remains a subject of debate. [6] The main limitation of conventional breast cancer diagnosis relying on pathological assessment is the high level of subjectivity whereas machine learning based diagnostic approaches have the potential of providing higher diagnostic accuracy by recognizing and analyzing very complex patterns of images that humans cannot perceive. [7] As a surrogacy of the multigene assays, immunohistochemical (IHC) based surrogate markers are suitable nevertheless the discordance between biopsy and surgical specimen may lead to the change of subtype classification and hence an impact on the disease outcome. [8] Despite the fact that CNB is widely used, very little evidence exists proving that CNB is superior to FNAC, therefore the necessity of performing diagnostic studies comparing these two methods becomes very evident. [9] Machine learning based on the radiomics approach not only assists the radiologist in accurate assessment of lesions, but also helps better categorize of BI-RADS and reduction of benign biopsies. [10] Digital pathology and artificial intelligence have the capability to overcome the challenges of the conventional histopathology, but the first generation of the algorithms suffered from the lack of big annotated data sets, validation and limited diagnostic capability. [11] Inevitably, there is a lack of standardization in methodological designs in breast cancer machine learning studies, and besides, such studies are often constrained by limited datasets, class imbalance, inconsistent metrics, and lack of external validation.
[12] Deep learning techniques have a lot of potential in the field of digital pathology. However, due to the great similarity in the appearance of fibroadenoma and phyllodes tumor, it is still very difficult to differentiate the two on CNB. [13] While CNB is the standard method in developed countries, its costly nature means it cannot be widely used in LMICs. As a result,

the development of low-cost biopsy methods that are validated has been a priority. [14] The rate at which HER2 results on CNB match those of surgery is very much dependent on different factors. Considering the HER2-low category, it changes the examination for the cases that were previously negative according to the current cutoff. [15] High costs of CNB in LMICs are the major reason for the increased use of affordable biopsy methods that deliver accurate diagnostics without expensive systems. [16] Very little is known about how effective biopsy techniques used before surgery are in diagnosis because almost all studies are done on surgical specimens. [17] The rate at which breast cancer is found following the diagnosis of atypical ductal hyperplasia is highly varied, so it makes sense that things such as prediction tools might be used to decrease the number of unnecessary operations. [18] The bimodal AI systems that are capable of utilising pathology images along with clinical data give rise to breast cancer prediction performance that is at a higher level than that achieved by earlier single-modal methods. [19] Innovations in AI have helped to better detect classify predict treatment response to, and analyze the outcomes of triple- negative breast cancer through the mining of large datasets.
[20] The level of androgen receptor expression might be very significant in the case of breast cancer, especially in TNBC. However, the assessment of these cases, as well as that of CNB, has not yet been standardized, and accuracy has not been examined. [21] The prediction models for DCIS upgrade that were developed at the time were based on either basic imaging techniques or deep learning methods, and their accuracy was quite limited.
[22] Since the diagnostic accuracy of FNAC varies greatly, especially for the nodules that are indeterminate, researchers have been evaluating CNB as a possibility. [23] Using antithrombotic drugs, patients undergoing CNB face a risk of bleeding which is still a matter of debate however recent studies have come up with solutions by using large multicenter data. [24] US-guided CNB well serves the purpose of the sampling of the axillary lymph nodes but still, a systematic review of the comparative study with FNA is required.
[25] Leveraging AI to analyze longitudinal axillary ultrasound data could potentially not only predict the response of a patient after NAC better but also help in less invasive staging. [26] Creating AI-based histopathology is very dependent on having large, quality, and well-annotated datasets however the existing public datasets are known to be biased and not very reproducible. [27] Applying deep learning on HE-stained biopsy images could get us digital biomarkers of early chemotherapeutic response (NAC) which combined with clinicopathological data, would considerably increase the power of the prediction. [28] Radiomics and machine learning are non-invasive tools for Ki-67 prediction; however, the role of features outside but near to the tumor has to be investigated.
[29] Despite the limited patient cohorts and validation, deep learning radiomics in conjunction with clinical data significantly enhances the non-invasive prediction of axillary lymph node status. [30] Observers hardly agree on Nottingham grading, especially in the case of biopsies, which necessitates a deep learning approach to be used for early grading. [31] Genomic tests help in understanding the risk of recurrence

though there is hardly any evidence that these tests can predict axillary involvement which would then lead to imagining and personalizing the surgical treatment. [32] Managing LCIS on CNB continues to be a puzzle for clinicians as the risks of progression to invasive carcinoma have not been consistently reported and LCIS resembles atypical lobular hyperplasia on many occasions. [33] Studies on CNB accuracy are comprehensive, but fare exceptionally low in terms of performance comparison across hospital services.

The latest AI technologies have made a big impact on breast cancer diagnosis in various radiology and pathology modalities such as mammography ultrasound MRI, and histopathology. Analyzing whole-slide images without manual labeling can help reduce bias and annotation burden of neural network- based response prediction models for neoadjuvant chemotherapy (NAC). Though triple-negative breast cancer (TNBC) diagnosis mainly uses core needle biopsy (CNB) and MRI, very few researchers have combined MRI features with machine learning for predictive modeling. Current histopathological AI systems face challenges on extracting global features and working efficiently as they are very complex models. There is still biomarker inconsistency between CNB and surgical specimens especially for hormone receptors and molecular subtypes. Even though the significance of HER2-low tumors is increasing, there is still insufficient evidence on the accuracy of CNB for these tumors. The Wisconsin Breast Cancer Diagnostic dataset is commonly used as a machine learning benchmark for classifying breast tumors.
OBJECTIVE
1. Main Objective:
  
  The key goal of this research is to thoroughly review and integrate prior investigations on employing AI and machine learning methods for breast cancer diagnosis and prediction, especially using mammography, core needle biopsy, histopathology, and multi-modal imaging data. With this review, the authors intend to assess the diagnostic precision, predictive effectiveness, and clinical dependability of AI systems on various datasets and at the same time pinpoint the drawbacks, inconsistencies, and obstacles that hamper their practical clinical usage.
2. Specific Objectives:
  - To assess different AI models and deep learning frameworks in breast cancer detection and diagnosis- Research AI implementations in breast biopsy and imaging such as CNN-based models, radiomics, and ensemble learning.
  - To compare, evaluate datasets and diagnostic methods- Debate the advantages and disadvantages of public datasets (e.g. Wisconsin Breast Cancer Dataset) and clinical datasets obtained from core needle biopsy ultrasound MRI, and histopathology images.
  - To look at diagnostic accuracy and prediction outcomes in different papers- Assess results of lesion classification, BI- RADS prediction, HER2 status detection, lymph node
    
    metastasis prediction, and pathological complete response (pCR) prediction.
  - To find reasons for limitations and discrepancies between biopsy and surgical findings- Explore, for example, the phenomenon of biopsy underestimation, the discordance between core needle biopsy and surgical specimen, and the risk of malignant upgrade.
  - To investigate challenges for clinical applicability and integration of AI in breast cancer workflows- Analyze studies of AI validation in real-world, ethical issues, and clinical limitations for AI implementation in breast screening and diagnosis.

METHODOLOGY/ SURVEY FRAMEWORK

It goes without saying that core needle biopsy remains the gold standard for breast cancer diagnosis, since it can provide samples of cancer cells, tissue architecture and their environment. The earliest cores were examined using conventional methods like hematoxylin and eosin-stained sections viewed under a microscope. Still, in a big data era, we can leverag the multi-modal properties of breast cancer like a cluster of cell types, immune profiling, tumor morphology, genomic alterations, proteomic characteristics, and other forms of tumor heterogeneity. Numerous other methods that focus on specific tumor features have been developed to diagnose breast cancer.

Literature Identification and Selection

The content was gathered from a wide view of academic journals – articles that examined old cases, followed patients over time, compared different testing methods, or reviewed findings about the use of needle biopsies for detection and prognosis of breast cancer. These studies touched on statistics, traditional algorithm-based tools as well as recent neural networks, all utilized on tissue slides, imaging data, and integrated patient records.

Core needle biopsies were examined in some studies as a diagnostic tool in conjunction with surgery, while others investigated the potential of these biopsies to provide data for various prediction models such as CNNs SVMs random forests, combined models, or mixed-method setups [1], [3], [9],
[10], [18], [27].

Review papers were part of the selection as well – these allowed for the identification of frequently used methods, shortcomings, and recurring patterns in different datasets [11], [24], [26].
Inclusion and Exclusion Criteria Some studies made the cut by meeting just one of these conditions
1. Doctors were able to determine the type of breast cancer by examining a tiny piece of tissue. Using the same piece, they figured out the speed of the cancer growth. The laboratory carried out tests for certain
  
  markers to classify it into a category. Results yielded some predictions about the future as well.
2. Accuracy, sensitivity, and specificity are figures that indicate the degree of the model's success. They also examined performance by using AUC scores along with concordance values. Kappa statistics help understand how much two things agree beyond mere chance. Each metric shows results from a slightly different angle. Such measures allow the comparison between models in a standardized manner.
3. Works stat has, ML, or deep learning methods that should have been used on tissue slices, scans, or patient files taken from biopsies. Even though the individual methods were different, repeating models under newly altered conditions again and again brought about common features of datasets.
  
  Re-examining old cases while also monitoring new ones contributed to molding the evaluation, extracting information either from a single hospital or from multiple ones [4], [5], [7], [14], [24]. Studies that compiled the previously obtained results or simply described the ways in which AI was used have helped in locating the current achievements as well as the difficulties which are yet to be addressed [11], [26], [34].
  
  Some studies were excluded because there was no information about tissue samples from biopsies. Also, articles that lacked the essential diagnostic or predictive details were excluded. Only research based on surgical samples was considered if the core needle biopsy was part of the procedure. Patient case reports, personal views, and incomplete descriptions of the methods? They were removed as well [11], [26], [34].
Data Extraction

Sequentially, the qualifying papers revealed preparations methodologically one by one. Study layout was leading the way, then how many people took part, followed by their medical profiles. Rather than just naming tools, detailed information about the kind of scan or tissue check used was carefully recorded. The manner in which data features were extracted had a significant impact – hence it was documented as

well. Since result analysis techniques help in forming understanding, those methods have been included. Artificial Intelligence (AI) used concept was reflected in the records as the model's blueprint. Training routines were presented along with testing plans. Data group splits varied, but each division was documented. Performance statistics, once made available to the public, were incorporated here as well [3], [9], [10], [12],
[27], [29].

Measures suchas diagnosis agreement, post-surgery changes in diagnosis, matching of biopsy and operation samples, and molecular type consistency, among others were recorded [7], [8], [14], [38], [39].

Certain papers listed limitations – few patients, use of only historical data, lack of external validation, problems with interpreting results – which also got documented for reference [1], [5], [12], [18], [34].
Comparative Analysis Strategy

A comparative study was done to evaluate trends and variations in methodologies. Statistical and conventional machine learning methods were compared with deep learning and multimodal fusion models in terms of prediction efficacy, robustness, and clinical readiness [2], [6], [9], [27], [36]. The diagnostic accuracy of CNB has been verified across several clinical situations, such as the detection of invasive cancer, the involvement of lymph nodes, molecular subtyping, and the prediction of treatment response [4], [5], [14], [24], [31].

Particularly emphasized was how the size and diversity of the dataset, as well as the method of validation, affected the results communicated. Besides, the consistent tendency of hybrid models combining clinical and imaging characteristics to outperform single-modality methods was highlighted [18], [27], [29], [35]. The comparison of limitations like inter- observer variability, domain shift, and no use of external validation helped in pinpointing the enduring challenges and areas of research still needing focus [11], [26], [34].

Figures and Tables

Ref. No.	Study Focus	Data / Modality	Methodology/ Model	Key Performance Metrics
4	CNB diagnostic accuracy	US-guided CNB vs excision	Retrospective diagnostic accuracy analysis	Accuracy 99.6%, Sensitivity 98.9%, Specificity 100%
9	BI-RADS classification	Breast ultrasound radiomics	Ensemble SVM (majority voting)	Sensitivity>94%,PPV, unnecessary biopsies 1518%
10	Invasive carcinoma & DCIS detection	Whole-slide histopathology images (H&E/HES)	CNN ensemble (Inception + ResNet)	AUC > 0.98, high sensitivity & specificity

13	Low-cost biopsy validation (LMICs)	Vacuum-assisted CNB	Diagnostic accuracy evaluation	Accuracy 92.98%, Sensitivity 92%, Specificity 100%
15	Frugal VACN Btechnique	CNB(low-cost system)	Prospective diagnostic study	Accuracy 92.98%, PPV 100%
18	Axillary LN metastasis prediction	WSI+ clinic pathological data	Multimodal MIL-based deep learning	AUC0.809,outperforming single- modality models
24	Axillary LN diagnosis	US-guided CNB	Systematic review &meta- analysis	Sensitivity0.90,AUC 0.98, Specificity 0.99
25	NAC response prediction	Longitudinal ultrasound radiomics	SVM fusion model	AUC0.899,Accuracyto 79.2%
29	Axillary LN metastasis	US radiomics + clinical factors	Deep learning radiomics nomogram	AUCupto0.95(external validation)
30	Tumor grading (NHG)	Biopsy whole-slide images	CNN ensemble (DeepGrade)	AUC0.91,significant prognostic stratification
35	Imageclinical fusion	Imaging + clinical metadata	Deep learning fusion model	Accuracy 92.3%, performance dropped

DISCUSSION
1. Performance Trends Across AI Models
  
  It is clear from looking at previous research how far we have come – the transition from traditional machine learning techniques to advanced deep learning tools when detecting breast cancer. Not very long ago, simple classifiers achieved relatively good results but faced difficulties mainly because they were dependent on manually selected features and had weak pattern recognition capabilities [6 11 37, 40]. So what happened? It was the neural networks that analyze images and tissue samples which started identifying complex shapes and structures independently, thus outperforming the older techniques [10 12 18 27 30]. Nevertheless, results vary widely
  
  – different architectures, types of data, methods of testing continue to make results inconsistent [11,26]. When multi- modal data sources and state-of-the-art models combine, they outperform single modality model setups every time – although there is still a lot of variation in working across the board [18 27 37].
2. Influence of Dataset Size, Diversity, and Quality
Comprehensive sets of varied and thoroughly labeled examples determine a major factor in the performance of artificial intelligence systems. Often, a small dataset from a single location is used in research that causes the system to memorize instead of learning, and the results become less transferable to other areas [11 26 29]. Publically available data that are used to assess models vary considerably – while some have very thorough information, others lack vital details or have images of poor quality; all these aspects result in tests yielding very

different outcomes and also pose difficulties in trying to repeat them [26,40]. Insufficient labeling, unequal proportions, and lack of verification by independent sources are some of the major obstacles that these AI tools continue to face before they can be used in real medical environments [11 26 29]. What is increasingly evident is that robust, reliable AI requires a vast, diverse, and well-documented data source.
1. Role of Transfer Learning and Data Augmentation
  
  Generally, obtaining meaningful results from small breast cancer image datasets depends heavily on transfer learning of pre-trained models. When one re-uses older network architectures, they usually extract the most distinguishing features with greater ease, require less training time and eventually achieve higher accuracies. These improvements are strongly measurable in operations like determining tumor stages, assessing responses to chemo, and classifying microscopic tissue types [18 27
  
  30 35 37]. Performing a controlled transformation of the data helps the model to better deal with unbalanced distributions and to be less sensitive to noise [11,28]. Incorporating prior knowledge and re-processed data together results not only in a reduction of annotated data requirement, but also in a very promising performance in limited data scenario such as biopsy or histopathology images [27,35].
2. Interpretability and Clinical Applicability
  
  Doctors often stall even when AI makes accurate predictions because the inner workings stay hidden – like a black box with no clear path to answers. Instead of logical steps, the models spit out results based on code they don't explain. In areas like tissue review and scans, clinicians need to trace decisions step by step. Adding real-world patient info helps ground the
  
  machine's findings in something tangible. Trust builds not just from being right, but from fitting inside built thinking. Tools that mirror how experts normally decide feel less foreign over time. When tech adapts to current practices instead of forcing change, it slides into orkflows easier. Clarity isn't secondary, it's just as vital as precision here.
3. Limitations and Future Challenges
Results vary widely due to inconsistent sample divisions and patchy tracking protocols across departments [11,26,29]. Some labs catch things others miss, leading to erratic model outputs during cross-test evaluations [1,7,14,38]. Centralized workflows among medical centers, uniform auditing processes. Openly shared datasets built on strict norms could shift outcomes [26]. What matters more than individual tech upgrades is combining patient images, clinical logs, and clear algorithm designs through repeated analysis [18,25,27,29]. That said, progress ends up being messy without alignment across teams.
CONCLUSION & FUTURE SCOPE
1. Conclusion
  
  Decisions about treatment severity often rely on outdated data from single hospitals. That information tends to date back years, feature limited patient groups, or skip external validation steps. Even as algorithms can now review tissue slides, tumor images, and lab outcomes to catch key signs of cancer progression and possible drug responses, the tools don't yet operate evenly across care centers. So far, digital analysis supports doctors in the lab but doesn't show strong performance in daily clinical settings. In the end, broader testing and reliable standards are needed for true adoption. For now, trust still rests heavily on small-scale studies
2. Future Scope
Through combining scans, tissue samples, medical records. Genetic signs, researchers aim to create sharper forecasts for diseases. As it happens, tracking patients over years in multiple hospital settings helps build stronger patterns. With models showing step-by-step logic, clinicians can trust the outputs more easily. In real-world clinics, tools must prove reliable under daily pressures. That said, fairness and safety end up being just as vital as hitting high accuracy levels.
REFERENCES

Y. Park, H. M. Gweon, E. J. Son, M. Yoo, J. A. Kim, and J. H. Youk, Ductal carcinoma in situ diagnosed at US-guided 14-gauge core-needle biopsy for breast mass: preoperative predictors of invasive breast cancer, Eur. J. Radiol., vol. 83, pp. 654659, Apr. 2014, doi:10.1016/j.ejrad.2014.01.010.
S. Shen, G. Wu, G. Xiao, R. Du, N. Hu, X. Xia and H. Zhou, Prediction model of lymphovascular invasion based on clinicopathological factors in Chinese patients with invasive breast cancer, Medicine (Baltimore), vol. 97, no. 43, p. e12973, Oct. 2018, doi:10.1097/MD.0000000000012973.
M. Mullooly, B. Ehteshami Bejnordi, R. M. Pfeiffer, S. Fan, M. Palakal,

M. Hada, P. M. Vacek, D. L. Weaver, J. A. Shepherd, B. Fan, A. P. Mahmoudzadeh, J. Wang, S. Malkov, J. M. Johnson, S. D. Herschorn, B.

L. Sprague, S. Hewitt, L. A. Brinton, N. Karssemeijer, J. van der Laak, A. Beck, M. E. Sherman, and G. L. Gierach, Application of convolutional

neural networks to breast biopsies to delineate tissue correlates of mammographic breast density, npj Breast Cancer, vol. 5, Art. 43, 2019, doi:10.1038/s41523-019-0134-6.
Y. Yang, S. Li, G. Liu, and Z. Shao, Is Core Needle Biopsy Effective at Diagnosing Male Breast Lesions? Breast Cancer Research and Treatment, vol. 177, pp. 507511, 2019.
K. Narui et al., Prediction of pathological complete response after neoadjuvant chemotherapy in breast cancer by combining magnetic resonance imaging and core needle biopsy, Surgical Oncology, vol. 35, 2020, doi: 10.1016/j.suronc.2020.10.002.
C. G. Yedjou, S. S. Tchounwou, R. A. Aló, R. Elhag, B. Mochona, and L. Latinwo, Application of machine learning algorithms in breast cancer diagnosis and classification, International Journal of Science Academic Research, vol. 2, no. 1, pp. 30813086, Jan. 2021, PMID: 34825131, PMCID: PMC8612371.
M.Pölcher et al., Concordance of the molecular subtype classification between core needle biopsy and surgical specimen in primary breast cancer, Archives of Gynecology and Obstetrics, vol. 304, no. 2, pp. 405413, 2021, doi: 10.1007/s00404-021-05996-x.
P. Verma et al., Fine-Needle Aspiration Cytology versus Core-Needle Biopsy for Breast Lesions: A Dilemma of Superiority between the Two, Clin. Breast Cancer Res., 2021. doi: 10.1159/000517005.
M. Interlenghi, C. Salvatore, V. Magni, G. Caldara, E. Schiavon, A. Cozzi et al., A Machine Learning Ensemble Based on Radiomics to Predict BI-RADS Category and Reduce the Biopsy Rate of Ultrasound- Detected Suspicious Breast Masses, Diagnostics, vol. 12, no. 1, p. 187, Jan. 2022, doi:10.3390/diagnostics12010187.
J. Sandbank, G. Bataillon, A. Nudelman, I. Krasnitsky, R. Mikulinsky, L. Bien, L. Thibault, A. Albrecht Shach, G. Sebag, D. P. Clark, D. Laifenfeld, S. J. Schnitt, C. Linhart, and M. Vecsler, Validation and real- world clinical application of an artificial intelligence algorithm for breast cancer detection in biopsies, npj Breast Cancer, vol. 8, Art. no. 129, Dec. 2022, doi:10.1038/s41523-022-00496-w.
C. Mazo, C. Aura, A. Rahman, W. M. Gallagher, and C. Mooney, Application of artificial intelligence techniques to predict risk of recurrence of breast cancer: A systematic review, J. Pers. Med., vol. 12, no. 9, art. 1496, 2022, doi:10.3390/jpm12091496.
C. L. Cheng, N. D. Md Nasir, G. J. Z. Ng, K. W. J. Chua, Y. Li, J.

Rodrigues, A. A. Thike, S. Y. Heng, V. C. Y. Koh, J. X. Lim, V. J. N.

Hiew, R. Shi, B. Y. Tan, T. K. Y. Tay, S. Ravi, K. H. Ng and K. S. L. Oh, Artificial intelligence modelling in differentiating core biopsies of fibroadenoma from phyllodes tumor, Lab. Invest., vol. 102, no. 3, pp. 245252, Mar. 2022, doi: 10.1038/s41374-021-00689-0.
S. K. Yadav, A. Carpenter, R. Rai, C. K. Jha, and D. Sharma, Low-cost vacuum assisted core needle biopsy technique for breast lumps, World Journal of Surgery, 2022, doi: 10.1007/s00268-022-06475-3.
R. Chen et al., Diagnostic value of core needle biopsy for determining HER2 status in breast cancer, especially in the HER2-low population, BMC Cancer, vol. 22, 2022, Article 1325, doi: 10.1186/s12885-022-

10344-9.
S. K. Yadav, A. Carpenter, R. Rai, C. K. Jha, and D. Sharma, Low-Cost Vacuum Assisted Core Needle Biopsy Technique for Breast Lumps, World J Surg, vol. 46, pp. 14531460, 2022.https://doi.org/10.1007/s00268-022-06475-3
M. Lia et al., The diagnostic value of core needle biopsy in cervical cancer: A retrospective analysis, PLoS One, vol. 17, no. 1, 2022.
T. S. H. Bong et al., Atypical Ductal Hyperplasia of the Breast on Core Needle Biopsy: Risk of Malignant Upgrade on Surgical Excision, Breast J., 2022. doi: 10.1007/s10549-022-06789-0.
Y. Ding, F. Yang, M. Han, C. Li, Y. Wang, X. Xu, M. Zhao, M. Zhao, H. Deng, H. Yang, J. Yao and Y. Liu, Multi-center study on predicting breast cancer lymph node status from core needle biopsy specimens using multi-modal and multi-instance deep learning, npj Breast Cancer, vol. 9, Art. no. 58, Jul. 2023, doi: 10.1038/s41523-023-00562-x.
J. Guo, J. Hu, Y. Zheng, S. Zhao, and J. Ma, Artificial intelligence: opportunities and challenges in the clinical applications of triple-negative breast cancer, British Journal of Cancer, vol. 128, pp. 21412149, 2023, doi: 10.1038/s41416-023-02215-z.
M M. dos Santos et al., Core Needle Biopsy Accuracy for Androgen Receptor Expression in Invasive Breast Cancer, Revista Brasileira de Ginecologia e Obstetrícia, 2023, PMID: 37846186.
L.-N. Do et al., Predicting underestimation of invasive cancer in patients with core-needle-biopsy-diagnosed ductal carcinoma in situ using deep learning algorithms, Diagnostics, vol. 13, no. 2, p. 320, 2023.
W. Li et al., Comparison of fine-needle aspiration and core-needle biopsy in the pathological evaluation on thyroid lesions, Diagnostic Cytopathology, 2023.
V. A. Loving et al., Antithrombotic Therapy and Hematoma Risk during Image-guided Core-Needle Breast Biopsy, Radiology, vol. 306, no. 1, 2023. doi: 10.1148/radiol.220548.
Q. Xu et al., The Effectiveness of Ultrasound-Guided Core Needle Biopsy in Detecting Lymph Node Metastases in the Axilla in Patients with Breast Cancer: Systematic Review and Meta-Analysis, Clin. Investig. Med., 2023. doi: 10.1016/j.clinsp.2023.100207.
Y. Fu, Y.-T. Lei, Y.-H. Huang, F. Mei, S. Wang, K. Yan, Y.-H. Wang,

Y.-H. Ma and L.-G. Cui, Longitudinal ultrasound-based AI model predicts axillary lymph node response to neoadjuvant chemotherapy in breast cancer: a multicenter study, Eur. Radiol., vol. 34, no. 11, pp. 70807089, Nov. 2024, doi:10.1007/s00330-024-10786-5.
M. Tafavvoghi, L. A. Bongo, N. Shvetsov, L. T. R. Busund and K. Møllersen, Publicly available datasets of breast histopathology H&E whole-slide images: A scoping review, J. Pathol. Inform., vol. 15, p. 100363, 2024, doi:10.1016/j.jpi.2024.100363.
H. Zeng, S. Qiu, S. Zhuang, X. Wei, J. Wu, R. Zhang, K. Chen, Z. Wu, and Z. Zhuang, Deep learning-based predictive model for pathological complete response to neoadjuvant chemotherapy in breast cancer from biopsy pathological images: a multicenter study, Front. Physiol., vol. 15,

p. 1279982, Jan. 2024, doi:10.3389/fphys.2024.1279982.
F. Li, T.-W. Zhu, M. Lin, X.-T. Zhang, Y.-L. Zhang, A.-L. Zhou, and D.-

Y. Huang, Enhancing Ki-67 prediction in breast cancer: integrating intratumoral and peritumoral radiomics from automated breast ultrasound via machine learning, Acad. Radiol., vol. 31, no. 7, pp. 26632673, Jul. 2024. doi:10.1016/j.acra.2023.12.036.
H. Liu, L. Zou, N. Xu, H. Shen, Y. Zhang, P. Wan, B. Wen, X. Zhang, Y. He, L. Gui, and W. Kong, Deep learning radiomics based prediction of axillary lymph node metastasis in breast cancer, npj Breast Cancer, vol. 10, art. 22, Mar. 2024, doi:10.1038/s41523-024-00628-4.
C. Boissin, Y. Wang, A. Sharma, P. Weitz, E. Karlsson, S. Robertson, J. Hartman, M. Rantalainen et al., Deep learning-based risk stratification of preoperative breast biopsies using digital whole slide images, Breast Cancer Research, vol. 26, article no. 90, 2024, doi: 10.1186/s13058-024- 01840-7.
L. M. Drapalik et al., Using MammaPrint on core needle biopsy to guide the need for axillary staging during breast surgery, Surgery, vol. 175, no. 3, 2024.
R. M. Pride, R. E. Jimenez, T. L. Hoskin, A. C. Degnim, and T. J. Hieken, Upgrade at excisional biopsy after a core needle biopsy diagnosis of classic lobular carcinoma in situ, Surgery, vol. 169, no. 3, 2024.

[33]G. J. Ziegler-Rodriguez et al., Core Needle Biopsy of Breast Tumours: Comparison of Diagnostic Performance between Surgery and Radiology Services at a National Cancer Centre in Latin America, ecancermedicalscience, 2024. doi: 10.3332/ecancer.2024.1766.

A. Ali, M. Alghamdi, S. S. Marzuki, T. A. D. A. Tengku Din, M. S. Yamin, M. Alrashidi, I. S. Alkhazi, and N. Ahmed, Exploring AI Approaches for Breast Cancer Detection and Diagnosis: A Review Article, Breast Cancer: Targets and Therapy, vol. 17, pp. 927947, Oct. 2025, doi: 10.2147/BCTT.S550307.
MIGYEONG YANG, JINYOUNG HAN, HYUNSEON WON, SEOUNG WAN CHAE, HYUN-SOO KIM and SUNG-IM DO,

"Automated Prediction of Pathological Complete Response to Neoadjuvant Chemotherapy in Breast Carcinoma Using Deep Learning on Pretreatment Core Needle Biopsy Samples," Anticancer Research, vol. 45, no. 11, pp. 51675176, 2025, doi:10.21873/anticanres.17856.
Cai Y, Li Y, Wang W, Zhou Y, Wang J, Zhang L, Lu H , Prediction of triple-negative breast cancer using multiparametric MRI and machine learning, DOVEPRESS Taylorand Francis Group, https://doi.org/10.2147/BCTT.S513779
T. Mahmood, T. Saba, and A. Rehman, Breast cancer diagnosis with MFF-HistoNet: a multi-modal feature fusion network integrating CNNs and quantum tensor networks, Journal of Big Data, vol. 12, art. no. 60, Mar. 2025. doi:10.1186/s40537-025-01114-9.
M. Xu et al., Expression and subtype discordance between core needle biopsy and surgical specimen in breast cancer, Journal of Surgical Research, 2025, doi: 10.1016/j.jss.2025.01.010.
C. M. Ciniselli et al., Reliability of Core Needle Biopsy for HER2-Low Early-Stage Breast Cancer, Clin. Transl. Oncol., 2025. doi: 10.1007/s12094-025-03877-2.
W. H. Wolberg, O. L. Mangasarian, and W. N. Street, Breast Cancer Wisconsin (Diagnostic) Dataset, UCI Machine Learning Repository, 1995. Available: https://www.kaggle.com/datasets/uciml/breast-cancer- wisconsin-data?select=data.csv.