Generative Models for Class Imbalance Problem on BreakHis Dataset: A Case Study

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1149))

150 Accesses

Abstract

In real-world classification tasks, it is common to find class imbalance issues in the training datasets, i.e. an unequal number of examples among the different classes. The class imbalance problem biases the performance of predictive models by overlooking minority classes; this is because predictive models employ learning rules with accuracy-based cost functions, thus favoring majority classes. In this work, the class imbalance issue is tackled through generative models, using the BreakHis dataset, a histopathologic image set intended for breast cancer classification, as a case study. The BreasHis’ minority class is balanced by adding synthetic images obtained by means of different generative methods, including variational autoencoders and two different generative adversarial networks. The quality of the image sets created by the different generative models, and their effects in balancing the BreakHis dataset, are evaluated through several quantitative metrics computed from classification tasks. Statistical analysis is performed and the results indicate that the DCGAN network is superior to the other evaluated models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 129.99; Price includes VAT (United Kingdom)

Hardcover Book: GBP 159.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Enhanced balancing GAN: minority-class image generation

Article 17 June 2021

eGAN: Unsupervised Approach to Class Imbalance Using Transfer Learning

SGBGAN: minority class image generation for class-imbalanced datasets

Article 29 January 2024

References

Latif, J., Xiao, C., Imran, A., Tu, S.: Medical imaging using machine learning and deep learning algorithms: a review, pp. 1–5 (2019)
Google Scholar
Spanhol, F.A., Oliveira, L.S., Petitjean, C., Heutte, L.: A dataset for breast cancer histopathological image classification. IEEE Trans. Biomed. Eng. 63, 1455–1462 (2016)
Article Google Scholar
Benhammou, Y., Achchab, B., Herrera, F., Tabik, S.: BreakHis based breast cancer automatic diagnosis using deep learning: taxonomy, survey and insights. Neurocomputing 375, 9–24 (2020)
Article Google Scholar
Langr, J., Bok, V.: GANs in action: deep learning with generative adversarial networks. Manning (2019)
Google Scholar
Foster, D.: Generative Deep Learning. O’Reilly Media (2022)
Google Scholar
Kingma, D.P., Welling, M.: Auto-encoding varational Bayes (2013). arXiv:1312.6114
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. Cornell University (2015). http://export.arxiv.org/pdf/1511.06434
Goodfellow, I., et al.: Generative adversarial networks. Commun. ACM 63, 139–144 (2020)
Article Google Scholar
Karras, T., Aila, T., Laine, S., Lehtinen, J.: Progressive growing of gans for improved quality, stability, and variation (2017)
Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift, pp. 448–456. pmlr (2015)
Google Scholar
Ulyanov, D., Vedaldi, A., Lempitsky, V.: Instance normalization: the missing ingredient for fast stylization (2016). arXiv:1607.08022
Tan, M., Le, Q.V.: Efficientnet: rethinking model scaling for convolutional neural networks (2019)
Google Scholar
Glassner, A.: Deep Learning: A Visual Approach. No Starch Press (2021)
Google Scholar
Zhao, S., Song, J., Ermon, S.: Towards deeper understanding of variational autoencoding models (2017). arXiv:1702.08658
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., Hochreiter, S.: Gans trained by a two time-scale update rule converge to a local nash equilibrium. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Google Scholar

Download references

Acknowledgements

This work was partially supported by the National Council of Humanities, Science and Technology (CONAHCYT) of Mexico, via Postgraduate Scholarship 824473 (A. Rosales) and Grant CÁTEDRAS-2598 (A. Rojas).

Author information

Authors and Affiliations

Div. de Estudios de Posgrado e Investigación, Tecnológico Nacional de México - Campus León, Av. Tecnológico S/N, 37290, León, Gto., México
Angel E. Rosales-Morales, Manuel Ornelas-Rodríguez, Alfonso Rojas-Domínguez, Héctor J. Puga-Soberanes & J. Martín Carpio
Centro de Investigación en Computación, Instituto Politécnico Nacional, Av. Juan de Dios Bátiz, Gustavo A. Madero, CDMX, 7738, México
Alfredo Gutiérrez-Alfaro
Departamento de Estudios Organizacionales, Universidad de Guanajuato, Fracc. 1, Guanajuato, 36250, Gto., México
Andrés Espinal

Authors

Angel E. Rosales-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Alfredo Gutiérrez-Alfaro
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Ornelas-Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Espinal
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Rojas-Domínguez
View author publications
You can also search for this author in PubMed Google Scholar
Héctor J. Puga-Soberanes
View author publications
You can also search for this author in PubMed Google Scholar
J. Martín Carpio
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrés Espinal .

Editor information

Editors and Affiliations

Division of Graduate Studies and Research, Tijuana Institute of Technology, Tijuana, Baja California, Mexico
Oscar Castillo
Graduate Studies and Research, Tijuana Institute of Technology, Tijuana, Mexico
Patricia Melin

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rosales-Morales, A.E. et al. (2024). Generative Models for Class Imbalance Problem on BreakHis Dataset: A Case Study. In: Castillo, O., Melin, P. (eds) New Horizons for Fuzzy Logic, Neural Networks and Metaheuristics. Studies in Computational Intelligence, vol 1149. Springer, Cham. https://doi.org/10.1007/978-3-031-55684-5_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-55684-5_8
Published: 22 May 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-55683-8
Online ISBN: 978-3-031-55684-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Generative Models for Class Imbalance Problem on BreakHis Dataset: A Case Study

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Enhanced balancing GAN: minority-class image generation

eGAN: Unsupervised Approach to Class Imbalance Using Transfer Learning

SGBGAN: minority class image generation for class-imbalanced datasets

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Generative Models for Class Imbalance Problem on BreakHis Dataset: A Case Study

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Enhanced balancing GAN: minority-class image generation

eGAN: Unsupervised Approach to Class Imbalance Using Transfer Learning

SGBGAN: minority class image generation for class-imbalanced datasets

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation