A Feature Selection Approach Based on Archimedes’ Optimization Algorithm for Optimal Data Classification.
DOI:
https://doi.org/10.9781/ijimai.2023.01.005Keywords:
Optimization, Classification, Feature Selection, Machine Learning ClassifierAbstract
Feature selection is an active research area in data mining and machine learning, especially with the increase in the amount of numerical data. FS is a search strategy to find the best subset of features among a large number of subsets of features. Thus, FS is applied in most modern applications and in various domains, which requires the search for a powerful FS technique to process and classify high-dimensional data. In this paper, we propose a new technique for dimension reduction in feature selection. This approach is based on a recent metaheuristic called Archimedes’ Optimization Algorithm (AOA) to select an optimal subset of features to improve the classification accuracy. The idea of the AOA is based on the steps of Archimedes' principle in physics. It explains the behavior of the force exerted when an object is partially or fully immersed in a fluid. AOA optimization maintains a balance between exploration and exploitation, keeping a population of solutions and studying a large area to find the best overall solution. In this study, AOA is exploited as a search technique to find an optimal feature subset that reduces the number of features to maximize classification accuracy. The K-nearest neighbor (K-NN) classifier was used to evaluate the classification performance of selected feature subsets. To demonstrate the superiority of the proposed method, 16 benchmark datasets from the UCI repository are used and also compared by well-known and recently introduced meta-heuristics in this context, such as: sine-cosine algorithm (SCA), whale optimization algorithm (WOA), butterfly optimization algorithm (BAO), and butterfly flame optimization algorithm (MFO). The results prove the effectiveness of the proposed algorithm over the other algorithms based on several performance measures used in this paper.
Downloads
References
M. A. Khan et al., “Cucumber leaf diseases recognition using multi level deep entropy-ELM feature selection,” Applied Sciences, vol. 12, no. 2, p. 593, 2022.
M. A. Khan et al., “A fused heterogeneous deep neural network and robust feature selection framework for human actions recognition,” Arabian Journal for Science and Engineering, pp. 1–16, 2021.
A. Mehmood, U. Tariq, C. W. Jeong, Y. Nam, R. R. Mostafa, and A. Elaeiny, “Human Gait Recognition: A Deep Learning and Best Feature Selection Framework,” Computers, Materials & Continua, vol. 70, pp. 343–360, 2022.
N. Hussain et al., “Multiclass cucumber leaf diseases recognition using best feature selection,” Computers, Materials & Continua, vol. 70, pp. 3281–3294, 2022.
F. Zia et al., “A multilevel deep feature selection framework for diabetic retinopathy image classification,” 2022.
A. Rehman, M. A. Khan, T. Saba, Z. Mehmood, U. Tariq, and N. Ayesha, “Microscopic brain tumor detection and classification using 3D CNN and feature selection architecture,” Microscopy Research and Technique, vol. 84, no. 1, pp. 133–149, 2021.
M. U. Khan et al., “Expert hypertension detection system featuring pulse plethysmograph signals and hybrid feature selection and reduction scheme,” Sensors, vol. 21, no. 1, p. 247, 2021.
M. A. Khan et al., “Multimodal brain tumor classification using deep learning and robust feature selection: A machine learning application for radiologists,” Diagnostics, vol. 10, no. 8, p. 565, 2020.
L. Khrissi, H. Satori, K. Satori, and N. el Akkad, “An Efficient Image Clustering Technique based on Fuzzy C-means and Cuckoo Search Algorithm,” International Journal of Advanced Computer Science and Applications, vol. 12, no. 6, pp. 423–432, 2021, doi: 10.14569/IJACSA.2021.0120647.
L. Khrissi, N. E. Akkad, H. Satori, and K. Satori, “Color image segmentation based on hybridization between Canny and k-means,” in 7th Mediterranean Congress of Telecommunications 2019, CMT 2019, 2019. doi: 10.1109/CMT.2019.8931358.
D. Yousri, M. Abd Elaziz, L. Abualigah, D. Oliva, M. A. A. Al-Qaness, and A. A. Ewees, “COVID-19 X-ray images classification based on enhanced fractional-order cuckoo search optimizer using heavy-tailed distributions,” Applied Software Computing, vol. 101, p. 107052, 2021.
Z. Faska, L. Khrissi, K. Haddouch, and N. el Akkad, “A Powerful and Efficient Method of Image Segmentation Based on Random Forest Algorithm,” in Digital Technologies and Applications, 2021, pp. 893–903.
L. Khrissi, N. El Akkad, H. Satori, and K. Satori, “Simple and Efficient Clustering Approach Based on Cuckoo Search Algorithm,” 2020 Fourth International Conference On Intelligent Computing in Data Sciences (ICDS), pp. 1–6, Oct. 2020, doi: 10.1109/ICDS50568.2020.9268754.
S. Cheng, L. Ma, H. Lu, X. Lei, and Y. Shi, “Evolutionary computation for solving search-based data analytics problems,” Artificial Intelligence Review, vol. 54, no. 2, pp. 1321–1348, 2021, doi: 10.1007/s10462-020-09882-x.
D. S. A. Elminaam, S. A. Ibrahim, E. H. Houssein, and S. M. Elsayed, “An Efficient Chaotic Gradient-Based Optimizer for Feature Selection,” IEEE Access, vol. 10, pp. 9271–9286, 2022, doi: 10.1109/ACCESS.2022.3143802.
H. Moussaoui, N. el Akkad, and M. Benslimane, “Moroccan Carpets Classification Based on SVM Classifier and ORB Features,” in International Conference on Digital Technologies and Applications, 2022, pp. 446–455.
C. C. Aggarwal, X. Kong, Q. Gu, J. Han, and P. S. Yu, “Active learning: A survey,” Data Classification: Algorithms and Applications, pp. 571–605, 2014, doi: 10.1201/b17320.
E. H. Houssein, A. G. Gad, Y. M. Wazery, and P. N. Suganthan, “Task scheduling in cloud computing based on meta-heuristics: Review, taxonomy, open challenges, and future trends,” Swarm and Evolutionary Computation, vol. 62, p. 100841, 2021.
F. A. Hashim, E. H. Houssein, K. Hussain, M. S. Mabrouk, and W. Al-Atabany, “A modified Henry gas solubility optimization for solving motif discovery problem,” Neural Computing & Applicationsvol. 32, no. 14, pp. 10759–10771, 2020.
N. Neggaz, E. H. Houssein, and K. Hussain, “An efficient henry gas solubility optimization for feature selection,” Expert Systems With Applications, vol. 152, p. 113364, 2020.
L. Khrissi, N. el Akkad, H. Satori, and K. Satori, “Image Segmentation Based on K-means and Genetic Algorithms,” in Advances in Intelligent Systems and Computing, 2020, vol. 1076, pp. 489–497. doi: 10.1007/978-981-15-0947-6_46.
L. Khrissi, N. el Akkad, H. Satori, and K. Satori, “Clustering method and sine cosine algorithm for image segmentation,” Evolutionary Intelligence, Jan. 2021, doi: 10.1007/s12065-020-00544-z.
L. Khrissi, N. el Akkad, H. Satori, and K. Satori, “A Performant ClusteringApproach Based on An Improved Sine Cosine Algorithm,” International Journal of Computing, pp. 159–168, Jun. 2022, doi: 10.47839/ijc.21.2.2584.
M. Merras, N. el Akkad, A. Saaidi, A. G. Nazih, and K. Satori, “Camera calibration with varying parameters based on improved genetic algorithm,” WSEAS Transactions onComputersvol. 13, pp. 129–137, 2014.
N. el Akkad, M. Merras, A. Saaidi, and K. Satori, “Robust method for self-calibration of cameras having the varying intrinsic parameters,” Journal of Theoretical and Applied Information Technology, vol. 50, no. 1, pp. 57 – 67. 2013.
H. M. Zawbaa, E. Emary, and B. Parv, “Feature selection based on antlion optimization algorithm,” Proceedings of 2015 IEEE World Conference on Complex Systems, WCCS 2015, 2016, doi: 10.1109/ICoCS.2015.7483317.
M. Mafarja et al., “Binary dragonfly optimization for feature selection using time-varying transfer functions,” 2018, doi: 10.1016/j.knosys.2018.08.003.
A. G. Hussien, A. E. Hassanien, E. H. Houssein, S. Bhattacharyya, and M. Amin, “S-shaped binary whale optimization algorithm for feature selection,” vol. 727. Springer Singapore, 2019. doi: 10.1007/978-981-10-8863-6_9.
A. T. Sahlol, D. Yousri, A. A. Ewees, M. A. A. Al-Qaness, R. Damasevicius, and M. A. Elaziz, “COVID-19 image classification using deep features and fractional-order marine predators algorithm,” Scientific Reports , vol. 10, no. 1, pp. 1–15, 2020.
A. I. Hafez, H. M. Zawbaa, E. Emary, and A. E. Hassanien, “Sine cosine optimization algorithm for feature selection,” in 2016 international symposium on innovations in intelligent systems and applications (INISTA), 2016, pp. 1–5.
P. C. Chiu, A. Selamat, O. Krejcar, K. K. Kuok, E. Herrera-Viedma, and G. Fenza, “Imputation of Rainfall Data Using the Sine Cosine Function Fitting Neural Network.,” International Journal of Interactive Multimedia & Artificial Intelligence, vol. 6, no. 7, 2021.
Y. Zhang, R. Liu, X. Wang, H. Chen, and C. Li, “Boosted binary Harris hawks optimizer and feature selection,” Engineering with Computers, vol. 37, no. 4, pp. 3741–3770, 2021.
N. Neggaz, A. A. Ewees, M. Abd Elaziz, and M. Mafarja, “Boosting salp swarm algorithm by sine cosine algorithm and disrupt operator for feature selection,” Expert Systems With Applications, vol. 145, p. 113103, 2020.
M. M. Mafarja and S. Mirjalili, “Hybrid Whale Optimization Algorithm with simulated annealing for feature selection,” Neurocomputing, vol. 260, pp. 302–312, Oct. 2017, doi: 10.1016/j.neucom.2017.04.053.
M. Abdel-Basset, W. Ding, and D. El-Shahat, “A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection,” Artificial Intelligence Review, vol. 54, no. 1, pp. 593–637, 2021.
F. A. Hashim, K. Hussain, E. H. Houssein, M. S. Mabrouk, and W. Al-Atabany, “Archimedes optimization algorithm: a new metaheuristic algorithm for solving optimization problems,” Applied Intelligence, vol. 51, no. 3, pp. 1531–1551, 2021, doi: 10.1007/s10489-020-01893-z.
I. Neggaz and H. Fizazi, “An Intelligent handcrafted feature selection using Archimedes optimization algorithm for facial analysis,” Soft Computing, vol. 26, pp. 10435–10464, 2022.
V. Janamala and K. Radha Rani, “Optimal allocation of solar photovoltaic distributed generation in electrical distribution networks using Archimedes optimization algorithm,” Clean Energy, vol. 6, no. 2, pp. 271–287, 2022.
L. Zhang, J. Wang, X. Niu, and Z. Liu, “Ensemble wind speed forecasting with multi-objective Archimedes optimization algorithm and sub-model selection,” Applied Energy, vol. 301, p. 117449, 2021.
R. A. Khan et al., “Archimedes Optimization Algorithm Based Selective Harmonic Elimination in a Cascaded H-Bridge Multilevel Inverter,” Sustainability, vol. 14, no. 1, p. 310, 2021.
A. S. Desuky, S. Hussain, S. Kausar, M. A. Islam, and L. M. el Bakrawy, “EAOA: An Enhanced Archimedes Optimization Algorithm for Feature Selection in Classification,” IEEE Access, vol. 9, pp. 120795–120814, 2021.
E. H. Houssein, B. E. Helmy, H. Rezk, and A. M. Nassef, “An enhanced Archimedes optimization algorithm based on Local escaping operator and Orthogonal learning for PEM fuel cell parameter identification,” Engineering Applications of Artificial Intelligence, vol. 103, p. 104309, 2021, doi: https://doi.org/10.1016/j.engappai.2021.104309.
O. Akdag, “A Improved Archimedes Optimization Algorithm for multi/single-objective Optimal Power Flow,” Electric Power Systems Research, vol. 206, p. 107796, 2022.
D. Izzo, M. Märtens, and B. Pan, “A survey on artificial intelligence trends in spacecraft guidance dynamics and control,” Astrodynamics, vol. 3, no. 4, pp. 287–299, 2019, doi: 10.1007/s42064-018-0053-6.
A. A. Ewees, M. A. el Aziz, and A. E. Hassanien, “Chaotic multi-verse optimizer-based feature selection,” Neural Computing and Applications, vol. 31, no. 4, pp. 991–1006, 2019, doi: 10.1007/s00521-017-3131-4.
Y. Zhang, R. Liu, X. Wang, H. Chen, and C. Li, “Boosted binary Harris hawks optimizer and feature selection,” Engineering with Computers, vol. 37, no. 4, pp. 3741–3770, 2021.
G. Tikhe, T. Joshi, A. Lahorkar, A. Sane, and J. Valadi, “Feature selection using equilibrium optimizer,” in Data Engineering and Intelligent Computing, Springer, 2021, pp. 307–315.
C. Rorres, “Completing book II of Archimedes’s on floating bodies,” The mathematical intelligencer, vol. 26, no. 3, pp. 32–42, 2004.
Y. Y. Yiming, “An Evaluation of Statistical Approaches to Text Categorization,” Journal of Information Retrieval, vol. 1, pp. 67–88, 1999.
M. Mafarja, I. Aljarah, H. Faris, A. I. Hammouri, A. M. Al-Zoubi, and S. Mirjalili, “Binary grasshopper optimisation algorithm approaches for feature selection problems,” Expert Systems with Applications, vol. 117, pp. 267–286, 2019, doi: 10.1016/j.eswa.2018.09.015.
T. Thaher, A. A. Heidari, M. Mafarja, J. S. Dong, and S. Mirjalili, “Binary Harris Hawks Optimizer for High-Dimensional, Low Sample Size Feature Selection,” in Evolutionary Machine Learning Techniques: Algorithms and Applications, S. Mirjalili, H. Faris, and I. Aljarah, Eds. Singapore: Springer Singapore, 2020, pp. 251–272. doi: 10.1007/978-981-32-9990-0_12.
A. S. Desuky and L. M. el Bakrawy, “Improved prediction of post-operative life expectancy after thoracic surgery,” Advances in Systems Science and Application, vol. 16, no. 2, pp. 70–80, 2016.
M. Gong, “A Novel Performance Measure for Machine Learning Classification,” International Journal of Managing Information Technology, vol. 13, no. 1, pp. 11–19, 2021, doi: 10.5121/ijmit.2021.13101.
D. Dua and C. Graff, “UCI machine learning repository,” 2017.
D. Rey and M. Neuhäuser, “Wilcoxon-signed-rank test,” in International encyclopedia of statistical science, Springer, 2011, pp. 1658–1659.
Downloads
Published
-
Abstract163
-
PDF67