Breast Cancer Diagnosis Using Feature Selection Approaches and Bayesian Optimization

Akkur, E.; Türk, F.; Eroğul, Osman

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11851/10517

Title:	Breast Cancer Diagnosis Using Feature Selection Approaches and Bayesian Optimization
Authors:	Akkur, E. Türk, F. Eroğul, Osman
Keywords:	Bayesian optimization Breast cancer feature selection machine learning Computer aided diagnosis Decision trees Diseases Learning algorithms Learning systems Nearest neighbor search Support vector machines Bayesian optimization Breast Cancer Breast cancer diagnosis Features selection Least absolute shrinkage and selection operators Machine learning algorithms Machine learning models Machine-learning Mammographic Optimization approach Feature Selection
Publisher:	Tech Science Press
Abstract:	Breast cancer seriously affects many women. If breast cancer is detected at an early stage, it may be cured. This paper proposes a novel classification model based improved machine learning algorithms for diagnosis of breast cancer at its initial stage. It has been used by combining feature selection and Bayesian optimization approaches to build improved machine learning models. Support Vector Machine, K-Nearest Neighbor, Naive Bayes, Ensemble Learning and Decision Tree approaches were used as machine learning algorithms. All experiments were tested on two different datasets, which are Wisconsin Breast Cancer Dataset (WBCD) and Mammographic Breast Cancer Dataset (MBCD). Experiments were implemented to obtain the best classification process. Relief, Least Absolute Shrinkage and Selection Operator (LASSO) and Sequential Forward Selection were used to determine the most relevant features, respectively. The machine learning models were optimized with the help of Bayesian optimization approach to obtain optimal hyperparameter values. Experimental results showed the unified feature selection-hyperparameter optimization method improved the classification performance in all machine learning algorithms. Among the various experiments, LASSO-BO-SVM showed the highest accuracy, precision, recall and F1-score for two datasets (97.95%, 98.28%, 98.28%, 98.28% for MBCD and 98.95%, 97.17%, 100%, 98.56% for MBCD), yielding outperforming results compared to recent studies. © 2023 CRL Publishing. All rights reserved.
URI:	https://doi.org/10.32604/csse.2023.033003 https://hdl.handle.net/20.500.11851/10517
ISSN:	0267-6192
Appears in Collections:	Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Show full item record

CORE Recommender

SCOPUS^TM
Citations

10

checked on Sep 6, 2025

Page view(s)

290

checked on Sep 8, 2025

Google Scholar^TM

Check

SCOPUSTM Citations

Page view(s)

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

Google Scholar^TM