651
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Feature Selection-based Machine Learning Comparative Analysis for Predicting Breast Cancer

, , , , ORCID Icon &
Article: 2340386 | Received 31 Jul 2023, Accepted 29 Mar 2024, Published online: 10 Apr 2024

References

  • Aavula, R., and R. Bhramaramba. 2019. XBPF: An extensible breast cancer prognosis framework for predicting susceptibility, recurrence and survivability. International Journal of Engineering and Advanced Technology 8(5):159–23. https://www.ijeat.org/portfolio-item/C5808028319/.
  • Abdar, M., M. Zomorodi-Moghadam, X. Zhou, R. Gururajan, X. Tao, P. D. Barua, and R. Gururajan. 2020. A new nested ensemble technique for automated diagnosis of breast cancer. Pattern recognition letters 132 (April):123–31. doi:10.1016/j.patrec.2018.11.004.
  • Ahn, J. S., S. Shin, S.-A. Yang, E. Kyung Park, K. Hwan Kim, S. Ick Cho, C.-Y. Ock, and S. Kim. 2023. Artificial intelligence in breast cancer diagnosis and personalized medicine. Journal of Breast Cancer 26 (5):405–35. doi:10.4048/jbc.2023.26.e45.
  • Akay, M. F. 2009. Support vector machines combined with feature selection for breast cancer diagnosis. Expert Systems with Applications 36 (2, Part 2):3240–47. doi:10.1016/j.eswa.2008.01.009.
  • Ali, M., R. Ali, M. Ali, and R. Ali. 2017. Satisfiability in big boolean algebras via boolean-equation solving. Journal of King Abdulaziz University Engineering Sciences 28 (1):3–18. doi:10.4197/Eng.28-1.1.
  • Ali, H., K. Shaban, A. Erradi, A. Mohamed, S. Khan Rumi, and F. D. Salim. 2021. ‘Spatiotemporal data mining: A survey on challenges and open problems’. Artificial Intelligence Review 55 (2):1441–88. https://link.springer.com/article/10.1007/s10462-021-09994-y.
  • Amin, M. S., Y. Kia Chiam, and K. Dewi Varathan. 2019. Identification of significant features and data mining techniques in predicting heart disease. Telematics and Informatics 36 (March):82–93. doi:10.1016/j.tele.2018.11.007.
  • Amrane, M., S. Oukid, I. Gagaoua, and T. Ensari. 2018. Breast cancer classification using machine learning. In 2018 Electric Electronics, Computer Science, Biomedical Engineerings’ Meeting (EBBT), 1–4. Istanbul, IEEE. doi:10.1109/EBBT.2018.8391453.
  • Anand, P., A. B. Kunnumakara, C. Sundaram, K. B. Harikumar, S. T. Tharakan, O. S. Lai, B. Sung, and B. B. Aggarwal. 2008. Cancer is a preventable disease that requires major lifestyle changes. Pharmaceutical Research 25 (9):2097–116. doi:10.1007/s11095-008-9661-9.
  • Anari, S., T. S. Nazanin, M. Negin, D. Shadi, and R. Amirali. 2022. Review of deep learning approaches for thyroid cancer diagnosis. In Mathematical Problems in Engineering, ed. A. Darba, (August): 1–8. doi:10.1155/2022/5052435.
  • Arbaiy, N., S. Eliza Sulaiman, N. Hassan, and Z. Afizah Afip. 2017. Integrated knowledge based expert system for disease diagnosis system. In IOP Conference Series: Materials Science and Engineering 226 (August):012097. doi:10.1088/1757-899X/226/1/012097.
  • Arunadevi, J., and K. Ganeshamoorthi. 2019. Feature selection facilitated classification for breast cancer prediction. In 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), 560–63. doi:10.1109/ICCMC.2019.8819752.
  • Ataollahi, M. R., J. Sharifi, M. R. Paknahad, and A. Paknahad. 2015. Breast cancer and associated factors: A review. Journal of Medicine and Life 8 (Spec Iss 4):6–11.
  • Balogh, E. P., B. T. Miller, and J. R. Ball. 2015. Committee on diagnostic error in health care, board on health care services, institute of medicine, and engineering the national academies of sciences. In Improving Diagnosis in Health Care, National Academies Press (US). https://www.ncbi.nlm.nih.gov/books/NBK338594/. ‘Overview of Diagnostic Error in Health Care’
  • Barrett, C. W., D. L. Dill, and A. Stump. 2002. Checking satisfiability of first-order formulas by incremental translation to SAT. In Computer Aided Verification, ed. E. Brinksma and K. G. Larsen, 236–49. US: Springer. doi:10.1007/3-540-45657-0_18.
  • Bethesda. 2023. ‘PDQ Breast Cancer Screening’. pdqCancerinfosummary. Nciglobal,ncienterprise. 24 November 2023. https://www.cancer.gov/types/breast/patient/breast-screening-pdq.
  • Bolboacă, S. D. 2019. Medical diagnostic tests: A review of test anatomy, phases, and statistical treatment of data. Computational and Mathematical Methods in Medicine 2019 (May):1891569. doi:10.1155/2019/1891569.
  • Budh, D. P., and A. Sapra. 2024. Breast cancer screening. In StatPearls, US: StatPearls Publishing. http://www.ncbi.nlm.nih.gov/books/NBK556050/.
  • CDC. 2023. Breast and Ovarian Cancers and Family Health History | CDC. Accessed October 11, 2023. https://www.cdc.gov/genomics/disease/breast_ovarian_cancer/breast_ovarian_cancer.htm.
  • Dana, B., and R. Shubair. 2016. Comparative study of machine learning algorithms for breast cancer detection and diagnosis. 2016. https://ieeexplore.ieee.org/document/7818560.
  • Elhazmi, A., A. Al-Omari, H. Sallam, H. N. Mufti, A. A. Rabie, M. Alshahrani, A. Mady, A. Alghamdi, A. Altalaq, M. H. Azzam, et al. 2022. Machine learning decision tree algorithm role for predicting mortality in critically Ill adult COVID-19 patients admitted to the ICU. Journal of Infection and Public Health 15(7):826–34. doi:10.1016/j.jiph.2022.06.008.
  • Ellis, R. J., R. M. Sander, and A. Limon. 2022. Twelve key challenges in medical machine learning and solutions. Intelligence-Based Medicine 6 (January):100068. doi:10.1016/j.ibmed.2022.100068.
  • Embi, P. J., and P. R. O. Payne. 2009. Clinical research informatics: challenges, opportunities and definition for an emerging domain. Journal of the American Medical Informatics Association 16 (3):316–27. doi:10.1197/jamia.M3005.
  • Fatin Kadhim, N. 2022. ‘Breast cancer detection using decision tree and k-nearest neighbour classifiers’. 2022. https://ijs.uobaghdad.edu.iq/index.php/eijs/article/view/5386.
  • Ganggayah, M. D., N. Aishah Taib, Y. Cheng Har, P. Lio, and S. Kaur Dhillon. 2019a. Predicting factors for survival of breast cancer patients using machine learning techniques. BMC Medical Informatics & Decision Making 19 (1):48. doi:10.1186/s12911-019-0801-4.
  • Ganggayah, M. D., N. Aishah Taib, Y. Cheng Har, P. Lio, and S. Kaur Dhillon. 2019b. Predicting factors for survival of breast cancer patients using machine learning techniques. BMC Medical Informatics & Decision Making 19 (1):48. doi:10.1186/s12911-019-0801-4.
  • Gopal, V. N., F. Al-Turjman, R. Kumar, L. Anand, and M. Rajesh. 2021. Feature selection and classification in breast cancer prediction using IoT and machine learning. Measurement 178 (June):109442. doi:10.1016/j.measurement.2021.109442.
  • Hajiabadi, H., V. Babaiyan, D. Zabihzadeh, and M. Hajiabadi. 2020. Combination of loss functions for robust breast cancer prediction. Computers & Electrical Engineering 84 (June):106624. doi:10.1016/j.compeleceng.2020.106624.
  • Hartama, D., A. Perdana Windarto, and A. Wanto. 2019. The application of data mining in determining patterns of interest of high school graduates. Journal of Physics: Conference Series 1339 (1):012042. doi:10.1088/1742-6596/1339/1/012042.
  • Hikmatul, F. S., N. Nazifah Yu Zaini, M. Shareduwan, M. Kasihmuddin, M. A. Mansor, and S. Sathasivam. 2020. Hybrid ant colony optimization for even-2 Satisfiability Programming in Hopfield Neural Network | AIP Conference Proceedings, https://pubs.aip.org/aip/acp/article/2266/1/040014/1007336/Hybrid-ant-colony-optimization-for-even-2.
  • Huang, S., C. NIanguang, P. P. Pacheco, S. Narandes, Y. Wang, and X. Wayne. 2017. Applications of Support Vector Machine (SVM) learning in cancer genomics. Cancer Genomics & Proteomics 15 (1):41–51. doi:10.21873/cgp.20063.
  • Huang, X., X. Tang, W. Zhang, S. Pei, J. Zhang, M. Zhang, Z. Liu, R. Chen, and Y. Huang. 2023. A generic knowledge based medical diagnosis expert system. arXiv. doi:10.48550/arXiv.2110.04439.
  • Jaglan, P., R. Dass, and M. Duhan. 2019. Breast Cancer Detection Techniques: Issues and Challenges. Journal of the Institution of Engineers (India): Series B 100 (March):379–86. doi:10.1007/s40031-019-00391-2.
  • Jalal Abdullah, A., T. Mohammed Hasan, and J. Waleed. 2019. An expanded vision of breast cancer diagnosis approaches based on machine learning techniques. https://ieeexplore.ieee.org/document/8950530.
  • Kasgari, A. B., S. Safavi, M. Nouri, J. Hou, N. Tataei Sarshar, and R. Ranjbarzadeh. 2023. Point-of-interest preference model using an attention mechanism in a convolutional neural network. Bioengineering 10 (4):495. doi:10.3390/bioengineering10040495.
  • Kasihmuddin, M. S. M., S. Sathasivam, and M. Asyraf Mansor. 2017. Hybrid genetic algorithm in the hopfield network for maximum 2-satisfiability problem’ 1870. (August):050001. doi:10.1063/1.4995911.
  • Kattan, M. W. 2001. Expert systems in medicine. In International Encyclopedia of the social & behavioral sciences, ed. J. S. Neil and B. B. Paul, 5135–39. US: Pergamon. doi:10.1016/B0-08-043076-7/00556-8.
  • Khan, N., F. Afaq, and H. Mukhtar. 2010. Lifestyle as risk factor for cancer: evidence from human studies. Cancer Letters 293 (2):133–43. doi:10.1016/j.canlet.2009.12.013.
  • Khandezamin, Z., M. Naderan, and M. Javad Rashti. 2020. Detection and classification of breast cancer using logistic regression feature selection and GMDH Classifier. Journal of Biomedical Informatics 111 (November):103591. doi:10.1016/j.jbi.2020.103591.
  • Lahoura, V., H. Singh, A. Aggarwal, B. Sharma, M. Abed Mohammed, R. Damaševičius, S. Kadry, and K. Cengiz. 2021. Cloud computing-based framework for breast cancer diagnosis using extreme learning machine. Diagnostics (Basel, Switzerland) 11 (2):241. doi:10.3390/diagnostics11020241.
  • Mughal, M. J. H. 2018. Data mining: Web data mining techniques, tools and algorithms: An overview. International Journal of Advanced Computer Science and Applications (IJACSA) 9 (6). doi:10.14569/IJACSA.2018.090630.
  • Muhammad, H. M., J. Ping Li, A. Ul Haq, M. Hunain Memon, and W. Zhou. 2019. Breast cancer detection in the iot health environment using modified recursive feature selection’. https://www.hindawi.com/journals/wcmc/2019/5176705/.
  • Mümine, K. K. 2019. Breast cancer prediction and detection using data mining classification algorithms: A comparative study. Tehnicki Vjesnik – Technical Gazette 26:1. doi:10.17559/TV-20180417102943.
  • National Council Institute. 2015. Risk factors: Hormones – NCI. cgvArticle. Nciglobal,ncienterprise. Accessed April 29, 2015. https://www.cancer.gov/about-cancer/causes-prevention/risk/hormones.
  • Nazanin, T. S., R. Ranjbarzadeh, S. Jafarzadeh Ghoushchi, G. G. de Oliveira, S. Anari, M. Parhizkar, and M. Bendechache. 2022. ‘Glioma brain tumor segmentation in four MRI modalities using a convolutional neural network and based on a transfer learning method’. 2022. https://link.springer.com/chapter/10.1007/978-3-031-04435-9_39.
  • Ogundokun, R. O., S. Misra, M. Douglas, R. Damaševičius, and R. Maskeliūnas. 2022. Medical Internet-of-things based breast cancer diagnosis using hyperparameter-optimized neural networks. Future Internet 14 (5):153. doi:10.3390/fi14050153.
  • Omondiagbe, D. A., S. Veeramani, and A. S. Sidhu. 2019. Machine learning classification techniques for breast cancer diagnosis. In IOP Conference Series: Materials Science and Engineering 495 (June):012033. doi:10.1088/1757-899X/495/1/012033.
  • Parsa, N. 2012. Environmental factors inducing human cancers. Iranian Journal of Public Health 41 (11):1–9. doi:10.1186/1479-5876-4-14.
  • Petrucelli, N., B. D. Mary, and P. Tuya 1993. BRCA1- and BRCA2-associated hereditary breast and ovarian cancer. In GeneReviews®, ed. P. Margaret. J. F. Adam, G. M. Mirzaa, R. A. Pagon, S. E. Wallace, L. J. Bean, K. W. Gripp, and A. Amemiya, US: University of Washington, Seattle. http://www.ncbi.nlm.nih.gov/books/NBK1247/.
  • Pietro, T. 1985. Knowledge based expert systems for medical diagnosis. Statistics in Medicine 4 (3):317–25. doi:10.1002/sim.4780040311.
  • Rabiei, R., S. Mohammad Ayyoubzadeh, S. Sohrabei, M. Esmaeili, and A. Atashi. 2022. Prediction of breast cancer using machine learning approaches. Journal of Biomedical Physics & Engineering 12 (3):297–308. doi:10.31661/jbpe.v0i0.2109-1403.
  • Ramin, R., S. Jafarzadeh Ghoushchi, N. Tataei Sarshar, E. Babaee Tirkolaee, S. Samar Ali, T. Kumar, and M. Bendechache. 2023. ‘ME-CCNN: Multi-encoded images and a cascade convolutional neural network for breast tumor segmentation and recognition’. Artificial Intelligence Review 56 (9):10099–136. https://link.springer.com/article/10.1007/s10462-023-10426-2.
  • Reyes, L. T., J. Klöckner Knorst, F. Ruffo Ortiz, and T. Machado Ardenghi. 2021. Scope and challenges of machine learning-based diagnosis and prognosis in clinical dentistry: A literature review. Journal of Clinical and Translational Research 7 (4):523–39.
  • Sathasivam, S., W. A. Tajuddin, and W. Abdullah. 2011. Logic mining in neural network: Reverse analysis method. Computing 91 (2):119–33. doi:10.1007/s00607-010-0117-9.
  • Serhat, S., U. Kursuncu, E. Kibis, M. Anis Abdellatif, and A. Dag. 2020. ‘A hybrid data mining approach for identifying the temporal effects of variables associated with breast cancer survival’. Expert Systems with Applications 139:112863. https://www.sciencedirect.com/science/article/abs/pii/S0957417419305731.
  • Shadi, A., A. Anguera, J. William Atwood, J. A. Lara, and D. Lizcano. 2019. Particularities of data mining in medicine: lessons learned from patient medical time series data analysis. EURASIP Journal on Wireless Communications and Networking 2019 (1):260. doi:10.1186/s13638-019-1582-2.
  • Shravya, C., K. Pravalika, and S. Subhani. 2019. Prediction of breast cancer using supervised machine learning techniques. International Journal of Innovative Technology and Exploring Engineering 8(6):1106–1110. https://www.ijitee.org/portfolio-item/F3384048619/.
  • Sohail, M. N., R. Jiadong, M. Musa Uba, and M. Irshad. 2019. A comprehensive looks at data mining techniques contributing to medical data growth: a survey of researcher reviews. In Recent developments in intelligent computing, communication and devices, ed. S. Patnaik and V. Jain, 21–26. US: Springer. doi:10.1007/978-981-10-8944-2_3.
  • Takkar, N., S. Kochhar, P. Garg, A. K. Pandey, U. R. Dalal, and U. Handa. 2017. Screening methods (clinical breast examination and mammography) to detect breast cancer in women aged 40–49 years. Journal of Mid-Life Health 8 (1):2–10. doi:10.4103/jmh.JMH_26_16.
  • Tan, X., A. T. Su, H. Hajiabadi, M. Tran, and Q. Nguyen. 2021. Applying machine learning for integration of multi-modal genomics data and imaging data to quantify heterogeneity in tumour tissues. In Artificial neural networks, ed. H. Cartwright, 209–28. US: Springer US. doi:10.1007/978-1-0716-0826-5_10.
  • Tomislav, H., M. Nussbaum, M. N. Wright, G. B. M. Heuvelink, and B. Gräler. 2018. ‘Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables’. 2018. https://pubmed.ncbi.nlm.nih.gov/30186691/.
  • UCI. 2019. ‘UCI machine learning repository’. https://archive.ics.uci.edu/.
  • Uddin, K., M. Mohi, N. Biswas, S. Tasmin Rikta, and S. Kumar Dey. 2023. Machine learning-based diagnosis of breast cancer utilizing feature optimization technique. Computer Methods and Programs in Biomedicine Update 3 (January):100098. doi:10.1016/j.cmpbup.2023.100098.
  • Velev, M. N. 2004. Efficient translation of boolean formulas to cnf in formal verification of microprocessors. In Proceedings of the 2004 Asia and South Pacific Design Automation Conference, 310–15. ASP–DAC ’04. Yokohama, Japan, IEEE Press.
  • Vikas, C., S. Pal, and B. B. Tiwari. 2018. ‘Prediction of benign and malignant breast cancer using data mining techniques’. SSRN Electronic Journal 2018. https://journals.sagepub.com/doi/10.1177/1748301818756225.
  • Vivek, K., B. Kishore Mishra, M. Mazzara, D. N. H. Thanh, and A. Verma. 2020. ‘Prediction of malignant and benign breast cancer: A data mining approach in healthcare applications’. 2020. https://link.springer.com/chapter/10.1007/978-981-15-0978-0_43.
  • Wang, S., J. Cao, and P. S. Yu. 2022. ‘Deep learning for spatio-temporal data mining: a survey |. IEEE Journals & Magazine | IEEE Xplore’ 2022. https://ieeexplore.ieee.org/document/9204396.
  • William, W., and O. Mangasarian. 1993. ‘Breast cancer Wisconsin (Diagnostic)’. [ object Object]. doi:10.24432/C5DW2B.
  • Yang, J., Y. Li, Q. Liu, L. Li, A. Feng, T. Wang, S. Zheng, A. Xu, and J. Lyu. 2020. Brief introduction of medical database and data mining technology in big data era. Journal of Evidence-Based Medicine 13 (1):57–69. doi:10.1111/jebm.12373.
  • Yusupova, N., O. Smetanina, A. Agadullina, and E. Sazonova. 2020. Knowledge Identification by structured data for decision making in project teams 385–90. Atlantis Press. doi:10.2991/aisr.k.201029.072.
  • Zhang, Y.-B., X.-F. Pan, J. Chen, A. Cao, Y.-G. Zhang, L. Xia, J. Wang, H. Li, G. Liu, and A. Pan. 2020. Combined lifestyle factors, incident cancer, and cancer mortality: a systematic review and meta-analysis of prospective cohort studies |. British Journal of Cancer’ 2020. https://www.nature.com/articles/s41416-020-0741-x.