
A Predictive Model for Classification of Breast Cancer Data Sets

  • Conference paper
Advances in Computing and Data Sciences (ICACDS 2021)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1441)


Abstract

Medical professionals need a reliable methodology to predict diseases. Machine learning is used to identify previously unknown, useful patterns that assist in the important tasks of disease prediction and treatment. Techniques that combine multiple classifiers are used to classify the data sets. The features in the Wisconsin Breast Cancer Dataset (WBCD) were collected from fine needle aspirates of human breast tissue. This data set was used to develop a predictive model for the classification and prediction of breast cancer. The Support Vector Machine algorithm performed well in comparison with the other algorithms, to the extent that it could be confirmed as an effective classification algorithm with respect to accuracy, sensitivity, and mean absolute error when applied to diabetes data sets. Classification and prediction accuracy varied with the quality of the data set.
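
The three evaluation measures named in the abstract (accuracy, sensitivity, and mean absolute error) can be illustrated with a minimal Python sketch. This is not the authors' code: the function name `binary_metrics`, the 0/1 label encoding (1 = malignant), and the toy label vectors are all assumptions made for illustration, and no results from the paper are reproduced here.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, sensitivity, and mean absolute error for 0/1 labels.

    Assumption (not from the paper): 1 encodes the positive class
    (e.g. malignant) and 0 the negative class (e.g. benign).
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    n = len(y_true)
    pairs = list(zip(y_true, y_pred))

    # True positives and false negatives are all that sensitivity needs.
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    accuracy = sum(1 for t, p in pairs if t == p) / n
    # Sensitivity (recall of the positive class); undefined with no positives.
    sensitivity = tp / (tp + fn) if (tp + fn) > 0 else float("nan")
    # With 0/1 labels, |t - p| is 1 exactly when the prediction is wrong.
    mae = sum(abs(t - p) for t, p in pairs) / n

    return {"accuracy": accuracy, "sensitivity": sensitivity, "mae": mae}


# Hypothetical labels and predictions, made up purely for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Under this 0/1 encoding the mean absolute error equals the misclassification rate (1 minus accuracy), so it adds information beyond accuracy only when labels or predictions are not strictly binary, for example when a classifier emits scores.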



Copyright information

© 2021 Springer Nature Switzerland AG

About this paper


Cite this paper

Rao, S.V.A., Rao, P.R.K. (2021). A Predictive Model for Classification of Breast Cancer Data Sets. In: Singh, M., Tyagi, V., Gupta, P.K., Flusser, J., Ören, T., Sonawane, V.R. (eds) Advances in Computing and Data Sciences. ICACDS 2021. Communications in Computer and Information Science, vol 1441. Springer, Cham. https://doi.org/10.1007/978-3-030-88244-0_36


  • DOI: https://doi.org/10.1007/978-3-030-88244-0_36


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-88243-3

  • Online ISBN: 978-3-030-88244-0

  • eBook Packages: Computer Science, Computer Science (R0)
