Spam Email Detection Using Machine Learning and Neural Networks

Manoj Sethi¹⁸,
Sumesha Chandra¹⁸,
Vinayak Chaudhary¹⁸ &
…
Yash Dahiya¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1408))

1713 Accesses

Abstract

Spam emails are junk emails which are unrequested deceptive emails sent or forwarded to any person or a company which may contain malware and has access to confidential information of any individual. A lot of research work has been done in this area of spam detection which is limited to some specific domains. Machine learning is generally used to classify whether an email is valid (ham) or unwanted (spam). Two feature sets are introduced namely stopwords and word count to determine an email is spam or ham on the basis of textual information and fields of an email file. The entire process involves the comparison of two different feature sets on Multinomial Naïve Bayes, Logistic Regression, Linear Support Vector Machine, and Artificial Neural Network Algorithms to determine a more reliable method for spam detection. For this purpose, we use benchmark datasets as well as real time evaluation to experimentally evaluate the proposed work. Detection of a spam email on basis of content, malware, and sender’s information can reduce the threat to user’s confidential information to a great extent.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 159.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 199.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Email Spam Detection Using Machine Learning and Feature Optimization Method

Detection and Classification of Spam Email: A Machine Learning-Based Experimental Analysis

Spam Email Detection Using Deep Support Vector Machine, Support Vector Machine and Artificial Neural Network

References

Mohammed, M. A., Mostafa, S. A., & Obaid, O. I. An anti-spam detection model for emails of multi-natural language.
Google Scholar
Mallampati, D., & Hegde, N. P. (2020). A machine learning based email spam classification framework model. IJITEE, ISSN, 9(4), 2278–3075.
Google Scholar
Cormack, G. V. (2006). Email spam filtering: A systematic review. Foundations and Trends® in Information Retrieval, 1(4), 335–455.
Google Scholar
Chen, J. I. Z., & Smys, S. (2020). Social multimedia security and suspicious activity detection in SDN using hybrid deep learning technique. Journal of Information Technology, 2(02), 108–115.
Google Scholar
Siponen, M., & Stucke, C. (2006). Effective anti-spam strategies in companies: An international study. In Proceedings of the 39th Annual Hawaii international conference on system sciences (HICSS’06).
Google Scholar
Mallampati, D., Chandra Shekar, K., & Ravikanth, K. Supervised machine learning classifier for email spam filtering, © Springer Nature Singapore Pte Ltd. 2019 and Engineering. https://doi.org/10.1007/978-981-13-7082-341.
Gupta, H., Jamal, M. S., Madisetty, S., & Desarkar, M. S. (2018, January). A framework for real-time spam detection in Twitter. In 2018 10th international conference on communication systems & networks (COMSNETS) (pp. 380–383).
Google Scholar
Mahmoud, T. M., & Mahfouz, A. M. (2012). SMS spam filtering technique based on artificial immune system. International Journal of Computer Science Issues (IJCSI), 9(2), 589.
Google Scholar
Akinyelu, A. A., & Adewumi, A. O. (2014). Classification of phishing email using random forest machine learning technique. Journal of Applied Mathematics.
Google Scholar
Yüksel, A. S., Cankaya, S. F., & Üncü, İ. S. (2017). Design of a machine learning based predictive analytics system for spam problem. Acta Physica Polonica, A., 132(3); Goodman, J. (2004, July). IP Addresses in Email Clients. CEAS.
Google Scholar
Androutsopoulos, J. Koutsias, K. Chandrinos and C. D. Spyropoulos, “An experimental comparison of naive Bayesian and keyword-based anti-spam filtering with personal email messages,” Computation and Language, pp. 160–167, 2000.
Google Scholar
Huang, L., Jia, J., Ingram, E., & Peng, W. Enhancing the naive bayes spam filter through intelligent text modification detection. In 2018 17th IEEE international conference on trust, security and privacy in computing and communications.
Google Scholar
Apache. (2019). “open-source Apache SpamAssassin Dataset”, https://spamassassin.apache.org/old/publiccorpus/
Vinodhini, M., Prithvi, D., Balaji, S. (2020, March). Spam detection framework using ML algorithm. IJRTE, 8(6). ISSN: 2277-3878.
Google Scholar
Brownlee, J. (2016, April 1). Logistic regression for machine learning. The Machine Learning Mastery. https://machinelearningmastery.com/logistic-regression-for-machine-learning/
Zavvar, M., Rezaei, M., & Garavand, S. (2016) Email spam detection using combination of particle swarm optimization and artificial neural network and support vector machine. International Journal of Model Education and Computer Science 68–74.
Google Scholar
Gandhi, R. (2018, June 7). Support vector machine. The Machine Learning Mastery. https://towardsdatascience.com/support-vector-machine-introduction-to-machine-learning-algorithms-934a444fca47
Smys, S., Basar, A., & Wang, H. (2020). Artificial neural network based power management for smart street lighting systems. Journal of Artificial Intelligence, 2(01), 42–52.
Google Scholar
Li, X. M., & Kim, U. M. (2012, June). A hierarchical framework for content-based image spam filtering. In 8th international conference on information science and digital content technology (ICIDT) (pp. 149–155). Jeju.
Google Scholar
Mukherjee, A., Venkataraman, V., Liu, B., & Glance, N. S. (2013). What yelp fake review filter might be doing? In ICWSM.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Delhi Technological University, New Delhi, India
Manoj Sethi, Sumesha Chandra, Vinayak Chaudhary & Yash Dahiya

Authors

Manoj Sethi
View author publications
You can also search for this author in PubMed Google Scholar
Sumesha Chandra
View author publications
You can also search for this author in PubMed Google Scholar
Vinayak Chaudhary
View author publications
You can also search for this author in PubMed Google Scholar
Yash Dahiya
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Engineering, Tribhuvan University, Pulchowk Campus, Lalitpur, Nepal
Subarna Shakya
Intelligent Systems Research Centre, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Songkla University, Songkhla, Thailand
Sinchai Kamolphiwong
Department of Electrical and Computer Engineering, Concordia University, Montreal, QC, Canada
Ke-Lin Du

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sethi, M., Chandra, S., Chaudhary, V., Dahiya, Y. (2022). Spam Email Detection Using Machine Learning and Neural Networks. In: Shakya, S., Balas, V.E., Kamolphiwong, S., Du, KL. (eds) Sentimental Analysis and Deep Learning. Advances in Intelligent Systems and Computing, vol 1408. Springer, Singapore. https://doi.org/10.1007/978-981-16-5157-1_22

Download citation

DOI: https://doi.org/10.1007/978-981-16-5157-1_22
Published: 26 October 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-5156-4
Online ISBN: 978-981-16-5157-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Spam Email Detection Using Machine Learning and Neural Networks

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Email Spam Detection Using Machine Learning and Feature Optimization Method

Detection and Classification of Spam Email: A Machine Learning-Based Experimental Analysis

Spam Email Detection Using Deep Support Vector Machine, Support Vector Machine and Artificial Neural Network

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Spam Email Detection Using Machine Learning and Neural Networks

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Email Spam Detection Using Machine Learning and Feature Optimization Method

Detection and Classification of Spam Email: A Machine Learning-Based Experimental Analysis

Spam Email Detection Using Deep Support Vector Machine, Support Vector Machine and Artificial Neural Network

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation