Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11851/10517
Title: Breast Cancer Diagnosis Using Feature Selection Approaches and Bayesian Optimization
Authors: Akkur, E.
Türk, F.
Eroğul, Osman
Keywords: Bayesian optimization
Breast cancer
feature selection
machine learning
Computer aided diagnosis
Decision trees
Diseases
Learning algorithms
Learning systems
Nearest neighbor search
Support vector machines
Bayesian optimization
Breast Cancer
Breast cancer diagnosis
Features selection
Least absolute shrinkage and selection operators
Machine learning algorithms
Machine learning models
Machine-learning
Mammographic
Optimization approach
Feature Selection
Publisher: Tech Science Press
Abstract: Breast cancer seriously affects many women. If breast cancer is detected at an early stage, it may be cured. This paper proposes a novel classification model based improved machine learning algorithms for diagnosis of breast cancer at its initial stage. It has been used by combining feature selection and Bayesian optimization approaches to build improved machine learning models. Support Vector Machine, K-Nearest Neighbor, Naive Bayes, Ensemble Learning and Decision Tree approaches were used as machine learning algorithms. All experiments were tested on two different datasets, which are Wisconsin Breast Cancer Dataset (WBCD) and Mammographic Breast Cancer Dataset (MBCD). Experiments were implemented to obtain the best classification process. Relief, Least Absolute Shrinkage and Selection Operator (LASSO) and Sequential Forward Selection were used to determine the most relevant features, respectively. The machine learning models were optimized with the help of Bayesian optimization approach to obtain optimal hyperparameter values. Experimental results showed the unified feature selection-hyperparameter optimization method improved the classification performance in all machine learning algorithms. Among the various experiments, LASSO-BO-SVM showed the highest accuracy, precision, recall and F1-score for two datasets (97.95%, 98.28%, 98.28%, 98.28% for MBCD and 98.95%, 97.17%, 100%, 98.56% for MBCD), yielding outperforming results compared to recent studies. © 2023 CRL Publishing. All rights reserved.
URI: https://doi.org/10.32604/csse.2023.033003
https://hdl.handle.net/20.500.11851/10517
ISSN: 0267-6192
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Show full item record



CORE Recommender

Page view(s)

56
checked on Dec 16, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.