Abstract
In present scenario, social networking and microblogging sites have become dynamic and widely used media for communication. Here people share information on various topics through which they express their likes and interests. Analyzing the process of information diffusion on these platforms not only helps in understanding the underlying social dynamics, but is also important for various applications like marketing and advertising. In this paper, we explore a novel problem in social network analysis which is to identify the active edges in the diffusion of a message in the social network. We cast this task as a binary classification problem of detecting whether a link in the social network participates in the propagation of a given message. We propose a learning-based framework which uses user interests and content similarity modeled using latent topic information, along with the features related to the social network. We evaluate our model on data obtained from a well-known social network platform - Twitter. The experiments show a significant improvement over the existing methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nail, J.: The consumer advertising backlash. Forrester Research and Intelliseek Market Research Report (2004)
Kempe, D., Kleinberg, J., Tardos, E.: Maximizing the spread of influence through a social network. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2003)
Golder, S.A., Wilkinson, D.M., Huberman, B.A.: Rhythms of social interaction: Messaging within a massive online network. In: Communities and Technologies 2007. Springer (2007)
Fei, H., Jiang, R., Yang, Y., Luo, B., Huan, J.: Content based social behavior prediction: a multi-task learning approach. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management. ACM (2011)
Romero, et al.: Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter
Petrovic, S., Osborne, M., Lavrenko, V.: Rt to win! predicting message propagation in twitter. In: ICWSM (2011)
Lin, C., Mei, Q., Jiang, Y., Han, J., Qi, S.: Inferring the diffusion and evolution of topics in social communities. Social Network Mining and Analysis 3(d4), d5 (2011)
Zhu, J., Xiong, F., Piao, D., Liu, Y., Zhang, Y.: Statistically modeling the effectiveness of disaster information in social media. In: 2011 IEEE Global Humanitarian Technology Conference (GHTC). IEEE (2011)
Kuo, T.-T., Hung, S.-C., Lin, W.-S., Peng, N., Lin, S.-D., Lin, W.-F.: Exploiting latent information to predict diffusions of novel topics on social networks. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2. Association for Computational Linguistics (2012)
Domingos, P., Richardson, M.: Mining the network value of customers. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2001)
Richardson, M., Domingos, P.: Mining knowledge-sharing sites for viral marketing. In: Proceedings of the eighth ACM SIGKDD International Conference on Knowledge Discovery and Data mining. ACM (2002)
Kimura, M., Saito, K.: Tractable models for information diffusion in social networks. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 259–271. Springer, Heidelberg (2006)
Chen, W., Wang, C., Wang, Y.: Scalable influence maximization for prevalent viral marketing in large-scale social networks. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1029–1038. ACM (2010)
Leskovec, et al.: Costeffective outbreak detection in networks
Chen, et al.: Efficient influence maximization in social networks
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)
Hoffman, M.D., Blei, D.M., Bach, F.R.: Online learning for latent dirichlet allocation. NIPS 2(3), 5 (2010)
Banerjee, N., Chakraborty, D., Dasgupta, K., Mittal, S., Joshi, A., Nagar, S., Rai, A., Madan, S.: User interests in social media sites: an exploration with micro- blogs. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 1823–1826. ACM (2009)
Java, A., Song, X., Finin, T., Tseng, B.: Why we twitter: understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, pp. 56–65. ACM (2007)
Suh, B., Hong, L., Pirolli, P., Chi, E.H.: Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In: 2010 IEEE Second International Conference on Social Computing (socialcom), pp. 177–184. IEEE (2010)
Chang, C.-C., Lin, C.-J.: Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 27 (2011)
Schaul, T., Bayer, J., Wierstra, D., Sun, Y., Felder, M., Sehnke, F., Rückstie, T., Schmidhuber, J.: PyBrain. Journal of Machine Learning Research 11, 743–746 (2010)
Davis, J., Goadrich, M.: The relationship between precision-recall and roc curves. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 233–240. ACM (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Varshney, D., Kumar, S., Gupta, V. (2014). Modeling Information Diffusion in Social Networks Using Latent Topic Information. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theory. ICIC 2014. Lecture Notes in Computer Science, vol 8588. Springer, Cham. https://doi.org/10.1007/978-3-319-09333-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-09333-8_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09332-1
Online ISBN: 978-3-319-09333-8
eBook Packages: Computer ScienceComputer Science (R0)