Transactions on Machine Intelligence

Transactions on Machine Intelligence

Detection of Benign and Malignant Breast Cancer Using Data Shuffling Ensemble Method

Document Type : Original Article

Authors
1 M.Sc. Student, Department of Electrical Engineering, Payam Noor Higher Education Institute, Golpayegan, Iran
2 Faculty Member, Department of Electrical Engineering, Payam Noor Higher Education Institute, Golpayegan, Iran
3 Faculty Member, Payam Noor Higher Education Institute, Golpayegan, Iran
Abstract
Breast cancer remains one of the most serious health concerns affecting women worldwide. Early detection of malignant tumors significantly improves survival rates and enhances patients' quality of life. As a result, the development of accurate and efficient diagnostic systems for breast cancer is of critical importance. Artificial neural networks (ANNs) have been widely applied in medical diagnosis, particularly in data classification tasks. The backpropagation algorithm is a commonly used technique for training neural networks; however, it suffers from certain limitations, such as slow convergence, susceptibility to local minima, and sensitivity to initial weight selection. To address these challenges, this study employs a data-shuffling ensemble method to improve classification accuracy. The effectiveness of both approaches is assessed in distinguishing between benign and malignant breast tumors using a well-established dataset. Experimental results indicate that the neural network utilizing the data-shuffling ensemble method achieves an accuracy of 99.3%, outperforming the backpropagation algorithm. These findings highlight the potential of ensemble learning techniques in enhancing diagnostic accuracy and reliability in medical applications. The study contributes to the ongoing advancements in artificial intelligence-based diagnostic tools, emphasizing the importance of robust classification techniques in improving breast cancer detection and patient outcomes.
Keywords

  • American Cancer Society (2013). Breast Cancer: Facts and Figures 2013. Atlanta: ACS.
  • UCI Machine Learning Repository. (n.d.). Retrieved from http://archive.ics.uci.edu/ml/machine-learningdatabases/
  • Mitra, S., & Hayashi, Y. (2000). Neuro-fuzzy rule generation: Survey in soft computing framework. IEEE Transactions on Neural Networks, 11(3), 748-768. https://doi.org/10.1109/72.846746
  • Mitra, S., & Acharya, T. (2005). Data mining: Multimedia, soft computing, and bioinformatics. John Wiley & Sons. https://doi.org/10.1117/1.2179076
  • Bellazzi, R., & Zupan, B. (2008). Predictive data mining in clinical medicine: Current issues and guidelines. International Journal of Medical Informatics, 77(2), 81-97. https://doi.org/10.1016/j.ijmedinf.2006.11.006
  • Marcano-Cedeño, A., Quintanilla-Domínguez, J., & Andina, D. (2011). WBCD breast cancer database classification applying artificial metaplasticity neural network. Expert Systems with Applications, 38(8), 9573-9579. https://doi.org/10.1016/j.eswa.2011.01.167
  • Malmir, H., Farokhi, F., & Sabbaghi-Nadooshan, R. (2013). Optimization of data mining with evolutionary algorithms for cloud computing application. 3rd International Conference on Computer and Knowledge Engineering (ICCKE), 343-347. IEEE. https://doi.org/10.1109/ICCKE.2013.6682821
  • Hamilton, H. J., Shan, N., & Cercone, N. (1996). RIAC: A rule induction algorithm based on approximate classification. Computer Science Department, University of Regina.
  • Quinlan, J. R. (1996). Improved use of continuous attributes in C4.5. Journal of Artificial Intelligence Research, 4, 77-909. https://doi.org/10.1613/jair.279
  • Pena-Reyes, C. A., & Sipper, M. (1999). A fuzzy-genetic approach to breast cancer diagnosis. Artificial Intelligence in Medicine, 17(2), 131-155. https://doi.org/10.1016/S0933-3657(99)00019-6
  • Abonyi, J., & Szeifert, F. (2003). Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recognition Letters, 24(14), 2195-2207. https://doi.org/10.1016/S0167-8655(03)00047-3
  • Paulin, F. (2011). Classification of breast cancer by comparing backpropagation training algorithm. International Journal on Computer Science and Engineering (IJCSE).
  • Lavanya, D., & Rani, D. K. U. (2011). Analysis of feature selection with classification: Breast cancer datasets. Indian Journal of Computer Science and Engineering (IJCSE), 2(5), 756-763.
  • Bhardwaj, A., & Tiwari, A. (2015). Breast cancer diagnosis using genetically optimized neural network model. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2015.01.065
  • Zadeh, H. G., Haddadnia, J., Seryasat, O. R., & Isfahani, S. M. M. (2016). Segmenting breast cancerous regions in thermal images using fuzzy active contours. EXCLI Journal, 15, 532.
  • Seryasat, O. R., & Haddadnia, J. (2018). Evaluation of a new ensemble learning framework for mass classification in mammograms. Clinical Breast Cancer, 18(3), e407-e420. https://doi.org/10.1016/j.clbc.2017.05.009
  • Rahmani-Seryasat, O., Haddadnia, J., & Ghayoumi-Zadeh, H. (2015). A new method to classify breast cancer tumors and their fractionation. Ciência e Natura, 37(4), 51-57. https://doi.org/10.5902/2179460X19428
  • Rahmani Seryasat, O., Kor, I., Ghayoumi Zadeh, H., & Shams Taleghani, A. (2021). Predicting the number of comments on Facebook posts using an ensemble regression model. International Journal of Nonlinear Analysis and Applications12(Special Issue), 49-62.
  • Kondo, T., & Ueno, J. (2007). Logistic GMDH-type neural network and its application to identification of X-ray film characteristic curve. Journal of Advanced Computational Intelligence and Intelligent Informatics, 11(3), 312-318. https://doi.org/10.20965/jaciii.2007.p0312
  • Kondo, T. (1998). GMDH neural network algorithm using the heuristic self-organization method and its application to the pattern identification problem. Proceedings of the 37th SICE Annual Conference, 1143-1148. https://doi.org/10.1109/SICE.1998.742993
  • Ivakhnenko, A. G. (1970). Heuristic self-organization in problems of engineering cybernetics. Automatica, 6(2), 207-219. https://doi.org/10.1016/0005-1098(70)90092-0
  • Farlow, S. J. (Ed.). (1984). Self-organizing methods in modeling: GMDH type algorithms. Marcel Dekker Inc.
  • Madala, H. R., & Ivakhnenko, A. G. (1994). Inductive learning algorithms for complex systems modeling. CRC Press Inc.
  • Müller, J.-A., Lemke, F., & Ivakhnenko, A. G. (1998). GMDH algorithms for complex systems modeling. Mathematical and Computer Modeling of Dynamical Systems, 4, 275-315. https://doi.org/10.1080/13873959808837083
  • Paulin, F., & Santhakumaran, A. (2011). Classification of breast cancer by comparing backpropagation training algorithms. International Journal on Computer Science and Engineering (IJCS). https://doi.org/10.5120/656-923
  •  
Volume 5, Issue 4
Autumn 2022
Pages 256-264

  • Receive Date 05 May 2022
  • Revise Date 29 August 2022
  • Accept Date 20 December 2022