Abstract
The proliferation of spam on the Web has necessitated the development of machine learning models to automate their detection. However, the dynamic nature of spam and the sophisticated evasion techniques employed by spammers often lead to low accuracy in these models. Traditional machine-learning approaches struggle to keep pace with spammers’ constantly evolving tactics, resulting in a persistent challenge to maintain high detection rates. To address this, we propose blockchain-enabled incentivized crowdsourcing as a novel solution to enhance spam detection systems. We create an incentive mechanism for data collection and labeling by leveraging blockchain’s decentralized and transparent framework. Contributors are rewarded for accurate labels and penalized for inaccuracies, ensuring high-quality data. A smart contract governs the submission and evaluation process, with participants staking cryptocurrency as collateral to guarantee integrity. Simulations show that incentivized crowdsourcing improves data quality, leading to more effective machine-learning models for spam detection. This approach offers a scalable and adaptable solution to the challenges of traditional methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Buterin, V., et al.: A next-generation smart contract and decentralized application platform. White Paper 3(37), 2–1 (2014)
Chellapilla, K., Maykov, A.: A taxonomy of Javascript redirection spam. In: Proceedings of the 3rd International Workshop on Adversarial Information Retrieval on the Web, pp. 81–88 (2007)
Choudhari, S., Das, S.: Spam e-mail identification using blockchain technology. IEEE 1, 1–5 (2021)
Crawford, M., Khoshgoftaar, T.M., Prusa, J.D., Richter, A.N., Al Najada, H.: Survey of review spam detection using machine learning techniques. J. Big Data 2(1), 1–24 (2015). https://doi.org/10.1186/s40537-015-0029-9
Dave, V., Guha, S., Zhang, Y.: Measuring and fingerprinting click-spam in ad networks. In: Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, pp. 175–186 (2012)
Farooq, S.: A survey on adversarial information retrieval on the web. arXiv preprint arXiv:1911.11060 (2019)
Gyöngyi, Z., Garcia-Molina, H.: Link spam alliances. In: VLDB, vol. 5, pp. 517–528 (2005)
Gyongyi, Z., Garcia-Molina, H.: Web spam taxonomy. In: First International Workshop on Adversarial Information Retrieval on the Web (2005)
Harris, J.D., Waggoner, B.: Decentralized and collaborative AI on blockchain. In: 2019 IEEE International Conference on Blockchain (Blockchain). IEEE (2019). https://doi.org/10.1109/blockchain.2019.00057
Harvey, C.R., Moorman, C., Toledo, M.: How blockchain can help marketers build better relationships with their customers. Harv. Bus. Rev. 9, 6–13 (2018)
Howe, J.: Crowdsourcing: why the power of the crowd is driving the future of business. Crown Currency (2009)
Jelodar, H., Wang, Y., Yuan, C., Jiang, X.: A systematic framework to discover pattern for web spam classification. In: 2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), pp. 32–39. IEEE (2017)
Kadadha, M., Otrok, H., Mizouni, R., Singh, S., Ouali, A.: On-chain behavior prediction machine learning model for blockchain-based crowdsourcing. Futur. Gener. Comput. Syst. 136, 170–181 (2022)
Kim, B., Abuadbba, S., Kim, H.: Deepcapture: image spam detection using deep learning and data augmentation. In: Information Security and Privacy: 25th Australasian Conference, ACISP 2020, Perth, 30 November–2 December 2020, Proceedings 25, pp. 461–475. Springer (2020)
Mamun, M.S.I., Rathore, M.A., Lashkari, A.H., Stakhanova, N., Ghorbani, A.A.: Detecting malicious URLs using lexical analysis. In: International Conference on Network and System Security, pp. 467–482. Springer (2016)
Markines, B., Cattuto, C., Menczer, F.: Social spam detection. In: Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web, pp. 41–48 (2009)
Matthew Pisano, C.P., Seneviratne, O.: Predictchain: empowering collaboration and data accessibility for AI in a decentralized blockchain-based marketplace. In: ChainScience 2023, Ledger Journal (2023)
Mühlberger, R., et al.: Foundational oracle patterns: connecting blockchain to the off-chain world. In: Business Process Management: Blockchain and Robotic Process Automation Forum: BPM 2020 Blockchain and RPA Forum, Seville, 13–18 September 2020, Proceedings 18, pp. 35–51. Springer (2020)
Nakamoto, S.: Bitcoin: a peer-to-peer electronic cash system. bitcoin.org (2008)
Nguyen, C.T., Hoang, D.T., Nguyen, D.N., Niyato, D., Nguyen, H.T., Dutkiewicz, E.: Proof-of-stake consensus mechanisms for future blockchain networks: fundamentals, applications and opportunities. IEEE Access 7, 85727–85745 (2019)
Nguyen, K., Ghinita, G., Naveed, M., Shahabi, C.: A privacy-preserving, accountable and spam-resilient geo-marketplace. In: Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 299–308 (2019)
Ntoulas, A., Najork, M., Manasse, M., Fetterly, D.: Detecting spam web pages through content analysis. In: Proceedings of the 15th International Conference on World Wide Web, pp. 83–92 (2006)
Sahmoud, T., Mikki, D.M.: Spam detection using bert. arXiv preprint arXiv:2206.02443 (2022)
Seneviratne, O.: Blockchain for social good: combating misinformation on the web with AI and blockchain. In: Proceedings of the 14th ACM Web Science Conference, pp. 435–442 (2022)
Sheikh, S.A., Banday, M.T.: A cryptocurrency-based e-mail system for spam control. Int. J. Adv. Comput. Sci. Appl. 12(1) (2021)
Spirin, N., Han, J.: Survey on web spam detection: principles and algorithms. ACM SIGKDD Explor. Newsl. 13(2), 50–64 (2012)
Urvoy, T., Chauveau, E., Filoche, P., Lavergne, T.: Tracking web spam with html style similarities. ACM Trans. Web 2(1), 1–28 (2008)
Wu, B., Davison, B.D.: Identifying link farm spam pages. In: SpeciaL Interest Tracks and Posters of the 14th International Conference on World Wide Web, pp. 820–829 (2005)
Xu, H., Wei, W., Qi, Y., Qi, S.: Blockchain-based crowdsourcing makes training dataset of machine learning no longer be in short supply. Wireless Commun. Mobile Comput. 2022, 1–13 (2022)
Xu, X., Tian, M., Li, Z.: Improving spam filtering in enterprise email systems with blockchain-based token incentive mechanism. In: The 22nd International Conference on Electronic Business (2022)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kader, N., Kang, I., Seneviratne, O. (2025). Enhancing Web Spam Detection Through a Blockchain-Enabled Crowdsourcing Mechanism. In: Barhamgi, M., Wang, H., Wang, X. (eds) Web Information Systems Engineering – WISE 2024. WISE 2024. Lecture Notes in Computer Science, vol 15440. Springer, Singapore. https://doi.org/10.1007/978-981-96-0576-7_35
Download citation
DOI: https://doi.org/10.1007/978-981-96-0576-7_35
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-96-0575-0
Online ISBN: 978-981-96-0576-7
eBook Packages: Computer ScienceComputer Science (R0)