Abstract
Medical professionals need a reliable methodology to predict diseases. The process of Machine Learning is used to identify unknown and useful patterns to assist in important tasks of disease prediction and treatment. The techniques that combine multiple classifiers are used for classifying the data sets. Each feature of data sets in the Wisconsin Breast Cancer Dataset (WBCD) collected from fine needle ambitious from human breast tissue. This data set was used to develop a predictive model for the classification and prediction of breast cancer. Support Vector Machine algorithm exhibited good performance when differentiating to other algorithms in such a way that it could be confirmed as the effective classification algorithm with respect to the accuracy, sensitivity, and mean absolute error when applied to diabetes, data sets. Classification and prediction accuracy varied with the quality of the data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 29(2), 103–130 (1997). https://doi.org/10.1023/A:1007413511361
Kohavi, R., Provost, F.: On applied research in machine learning. In: Editorial for the Special Issue on Applications of Machine Learning and the Knowledge Discovery Process, vol. 30. Columbia University, New York (1998)
Rajendra Acharya, U., Ng, E.Y.K., Chang, Y.H., Yang, J., Kaw, G.J.L.: Computer-based identification of breast cancer using digitized mammograms. J. Med. Syst. 32(6), 499–507 (2008). https://doi.org/10.1007/s10916-008-9156-6
Ali, A., Tufail, A., Khan, U., Kim, M.: A survey of prediction models for breast cancer survivability. In: Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human, pp. 1259–1262 (2009)
American Cancer Society: Facts and figures 2015–2016. www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/breast-cancer-facts-and-figures/breast-cancer-facts-and-figures-2015-2016.pdf
Aljarullah, A.A.: Decision tree discovery for the diagnosis type-2 diabetes. In: IEEE International Conference on Innovation in Information Technology, pp. 303–307 (2011)
Barnum, S.R.: Biotechnology: An Introduction, Cengage Learning (2006)
Batista, G.E., Monard, M.C.: An analysis of four missing data treatment methods for supervised learning. Appl. Artif. Intell. Int. J. 17(5), 519–533 (2003)
Berry, M.J.A., Linoff, G.S.: Data Mining Techniques for Marketing, Sales, and Customer Relationship Management. Wiley, Indianapolis (2004)
Ghosh, B.: Using fuzzy classification for chronic disease management. Indian J. Econ. Bus. 11(1), 231–240 (2012)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995). https://doi.org/10.1007/BF00994018
Cruz-Ramírez, N., Acosta-Mesa, G.H., Carrillo-Calvet, H., Nava-Fernández, L.A., Barrientos-Martínez, R.E.: Diagnosis of using Bayesian networks: a case study. Comput. Biol. Med. 37(11), 1553–1564 (2007)
European Public Health Alliance. [20]. http://www.epha.org/a/2352
Kandwal, R., Garg, P.K., Garg, R.D.: Health GIS and HIV/AIDS studies: perspective and retrospective. J. Biomed. Inform. 42(4), 748–755 (2009)
Kleissner, C.: Data mining for the enterprise. In: Proceeding of the 31st Annual Hawaii International Conference on System Science, CA, US, vol. 7, pp. 295–304. IEEE Computer Society (1998)
Liao, S.C., Lee, I.N.: Appropriate medical data categorization for data mining classification techniques. Med. Inform 27(1), 59–67 (2002)
Michie, D., Spiegelhalter, D., Taylor, C.: Machine learning: neural and statistical classification, Ellis Horwood, NJ, USA (1994)
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1998). University of California, Department of Information and Computer Science, Irvine. http://www.ics.uci.edu/~mlearn/MLRepository.html
Ali, P.U.S., Ventakeswaran, C.J.: Improved evidence theoretic kNN classifier based on theory of evidence. Int. J. Comput. Appl. 15(5), 37–41 (2011)
Vijiyarani, S., Sudha, S.: Disease prediction in data mining technique – a survey. Int. J. Comput. Appl. Inf. Technol. 2(1), 17–21 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Rao, S.V.A., Rao, P.R.K. (2021). A Predictive Model for Classification of Breast Cancer Data Sets. In: Singh, M., Tyagi, V., Gupta, P.K., Flusser, J., Ören, T., Sonawane, V.R. (eds) Advances in Computing and Data Sciences. ICACDS 2021. Communications in Computer and Information Science, vol 1441. Springer, Cham. https://doi.org/10.1007/978-3-030-88244-0_36
Download citation
DOI: https://doi.org/10.1007/978-3-030-88244-0_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88243-3
Online ISBN: 978-3-030-88244-0
eBook Packages: Computer ScienceComputer Science (R0)