
Enhancing Web Spam Detection Through a Blockchain-Enabled Crowdsourcing Mechanism

  • Conference paper
Web Information Systems Engineering – WISE 2024 (WISE 2024)

Abstract

The proliferation of spam on the Web has necessitated machine learning models that automate its detection. However, the dynamic nature of spam and the sophisticated evasion techniques employed by spammers often lead to low accuracy in these models. Traditional machine-learning approaches struggle to keep pace with spammers’ constantly evolving tactics, which makes it persistently difficult to maintain high detection rates. To address this, we propose blockchain-enabled incentivized crowdsourcing as a novel solution to enhance spam detection systems. Leveraging blockchain’s decentralized and transparent framework, we create an incentive mechanism for data collection and labeling. Contributors are rewarded for accurate labels and penalized for inaccurate ones, ensuring high-quality data. A smart contract governs the submission and evaluation process, and participants stake cryptocurrency as collateral to guarantee integrity. Simulations show that incentivized crowdsourcing improves data quality, leading to more effective machine-learning models for spam detection. This approach offers a scalable and adaptable solution to the challenges of traditional methods.
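The stake, reward, and penalty flow described in the abstract can be illustrated with a small off-chain sketch. This is not the authors' smart contract: the class name `LabelingContract`, the amounts `reward`, `penalty`, and `min_stake`, and the use of a reference-label dictionary to decide whether a label is accurate are all assumptions made for illustration, since the abstract does not specify them.

```python
from dataclasses import dataclass, field


@dataclass
class LabelingContract:
    """Toy in-memory stand-in for a labeling smart contract (hypothetical)."""
    reward: int = 2      # assumed payout for an accurate label
    penalty: int = 5     # assumed amount slashed for an inaccurate label
    min_stake: int = 10  # assumed collateral required before submitting
    stakes: dict = field(default_factory=dict)
    pending: list = field(default_factory=list)

    def deposit(self, contributor: str, amount: int) -> None:
        """Lock collateral for a contributor."""
        if amount <= 0:
            raise ValueError("deposit must be positive")
        self.stakes[contributor] = self.stakes.get(contributor, 0) + amount

    def submit(self, contributor: str, url: str, label: str) -> None:
        """Queue a label; only sufficiently staked contributors may submit."""
        if self.stakes.get(contributor, 0) < self.min_stake:
            raise PermissionError("insufficient stake")
        self.pending.append((contributor, url, label))

    def evaluate(self, reference_labels: dict) -> list:
        """Reward matching labels, slash mismatches, return accepted data.

        Assumption: accuracy is judged against `reference_labels`; items
        without a reference label stay pending.
        """
        accepted, still_pending = [], []
        for contributor, url, label in self.pending:
            if url not in reference_labels:
                still_pending.append((contributor, url, label))
            elif label == reference_labels[url]:
                self.stakes[contributor] += self.reward
                accepted.append((url, label))
            else:
                remaining = self.stakes[contributor] - self.penalty
                self.stakes[contributor] = max(0, remaining)
        self.pending = still_pending
        return accepted


contract = LabelingContract()
contract.deposit("alice", 10)
contract.deposit("bob", 10)
contract.submit("alice", "http://a.example", "spam")
contract.submit("bob", "http://a.example", "benign")
clean_data = contract.evaluate({"http://a.example": "spam"})
# alice's stake grows to 12, bob's shrinks to 5, and only alice's label is kept
```

With these assumed values, a slashed contributor falls below `min_stake` and cannot submit again until they deposit more collateral, which is one simple way a stake could discourage careless labeling.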

Notes

  1. https://www.unb.ca/cic/datasets/url-2016.html

  2. https://www.kaggle.com/datasets/aman9d/phishing-data

Author information

Corresponding author

Correspondence to Oshani Seneviratne.

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Kader, N., Kang, I., Seneviratne, O. (2025). Enhancing Web Spam Detection Through a Blockchain-Enabled Crowdsourcing Mechanism. In: Barhamgi, M., Wang, H., Wang, X. (eds) Web Information Systems Engineering – WISE 2024. WISE 2024. Lecture Notes in Computer Science, vol 15440. Springer, Singapore. https://doi.org/10.1007/978-981-96-0576-7_35

  • DOI: https://doi.org/10.1007/978-981-96-0576-7_35

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-96-0575-0

  • Online ISBN: 978-981-96-0576-7

  • eBook Packages: Computer Science, Computer Science (R0)
