Transactions on Machine Intelligence

Transactions on Machine Intelligence

Improving Accuracy in Breast Cancer Diagnosis Using Data Mining Techniques

Document Type : Original Article

Author
Department of Computer Engineering, Faculty of Technical and Engineering, Kermanshah Branch, Islamic Azad University, Kermanshah, Iran.
Abstract
This study introduces a novel, integrated approach for breast cancer diagnosis, addressing one of the most critical challenges in medical sciences: the lack of timely and precise detection. Breast cancer remains a leading cause of mortality worldwide, and early diagnosis plays a pivotal role in improving survival rates. Currently, diagnostic practices heavily rely on physicians' expertise, supported by complex and time-consuming laboratory tests, which are prone to human error and often lead to delays in treatment. To overcome these limitations, this research proposes a comprehensive methodology that combines principal component analysis (PCA) for dimensionality reduction, decision trees for feature selection, and artificial neural networks (ANNs) for classification and prediction. By integrating these techniques, the proposed system optimizes the use of database features, offering an adaptable, efficient, and accurate solution for breast cancer detection. The results demonstrate that this method achieves superior diagnostic accuracy compared to conventional techniques and existing artificial intelligence-based methods referenced in related studies. Furthermore, the system significantly reduces diagnostic costs and time without compromising performance. This research highlights the potential of combining machine learning and data mining techniques to enhance diagnostic precision, providing researchers and clinicians with an effective tool for improving early detection, treatment planning, and patient outcomes.
Keywords

  • World Health Organization. (2014). World cancer report 2014 (Chapter 5.2). Geneva: World Health Organization. ISBN: 92-832-0429-8.
  • National Cancer Institute (NCI). (2014). Breast cancer. Retrieved June 29, 2014, from https://www.cancer.gov
  • Yeh, J.-Y., Chan, S.-W., & Wu, T.-H. (2016). Mining breast cancer classification rules from mammograms. Journal of Intelligent Systems, 25(1), 19–36. https://doi.org/10.1515/jisys-2014-0122
  • National Cancer Institute (NCI). (2014, June 26). Breast cancer treatment (PDQ®). Retrieved June 29, 2014, from https://www.cancer.gov
  • Office for National Statistics. (2013, October 29). Cancer survival in England: Patients diagnosed 2007–2011 and followed up to 2012. Retrieved June 29, 2014, from https://www.ons.gov.uk
  • National Cancer Institute (NCI). (2014). SEER stat fact sheets: Breast cancer. Retrieved June 18, 2014, from https://seer.cancer.gov/statfacts/html/breast.html
  • National Cancer Institute (NCI). (2014). Male breast cancer treatment. Retrieved June 29, 2014, from https://www.cancer.gov
  • University of California, Irvine. (n.d.). Breast cancer Wisconsin (original) dataset. Retrieved from http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin(Original)
  • Alizadeh, S., & Malek Mohammadi, S. (2014). Data mining and knowledge discovery step by step with Clementine software (3rd ed.). Tehran: Khajeh Nasir Toosi University of Technology Press.
  • Siahi, M., & Ashir, A. (2015). Breast cancer recurrence prediction using data mining. National Electronic Conference on Recent Advances in Engineering and Basic Sciences, Islamic Azad University, Dezful Branch.
  • Nazarian, M., Abbasi Dezfuli, M., & Haroon Abadi, A. (2013). A method for breast cancer diagnosis using data mining techniques and virtual bee colony algorithm. 5th National Conference on Electrical and Electronic Engineering of Iran, Islamic Azad University, Gonabad Branch.
  • Ahmed, L. Q. (2013). Using data mining techniques for prediction of breast cancer recurrence. Iranian Journal of Breast Disease, 5(4), 23–34.
  • Kiani, B., & Atashi, A. (2014). A prognostic model based on data mining techniques to predict breast cancer recurrence. Journal of Health and Biomedical Informatics, 1(1), 26–31.
  • Delen, D., Walker, G., & Kadam, A. (2005). Predicting breast cancer survivability: A comparison of three data mining methods. Artificial Intelligence in Medicine, 4(2), 113–127. https://doi.org/10.1016/j.artmed.2004.07.002
  • Zadeh, H. G., Haddadnia, J., Seryasat, O. R., & Isfahani, S. M. M. (2016). Segmenting breast cancerous regions in thermal images using fuzzy active contours. EXCLI Journal, 15, 532.
  • Seryasat, O. R., & Haddadnia, J. (2018). Evaluation of a new ensemble learning framework for mass classification in mammograms. Clinical Breast Cancer, 18(3), e407–e420. https://doi.org/10.1016/j.clbc.2017.05.009
  • Rahmani-Seryasat, O., Haddadnia, J., & Ghayoumi-Zadeh, H. (2015). A new method to classify breast cancer tumors and their fractionation. Ciência e Natura, 37(4), 51–57. https://doi.org/10.5902/2179460X19428
  • Seryasat, O. R., & Haddadnia, J. (2017). Assessment of a novel computer-aided mass diagnosis system in mammograms. Biomedical Research, 28(7), 3129–3135.
Volume 5, Issue 4
Autumn 2022
Pages 277-285

  • Receive Date 04 August 2022
  • Revise Date 19 November 2022
  • Accept Date 29 December 2022