Abstract
Today, social media services and multiplatform applications such as microblogs, forums and social networks gives people the ability to communicate, interact and generate content which establish social and collaborative backgrounds. These services now embodies the leading and biggest repository containing millions of Big social Data that can be useful for many applications such as measure public sentiment, trends monitoring, reputation management and marketing campaigns. But social media data are essentially unstructured that’s what makes it so interesting and so hard to analyze. Making sense of it and understanding what it means will require all new technologies and techniques, including the emerging field of big data. In addition, social media is a key model of the velocity and variety which are main characteristics of Big Data. In this paper, we propose a new approach to retrieve conversation on microblogging sites that combine Big Data environment and social media analytics solutions. The goal of our approach is to present a more informatives result and solve the information overload problem within Big Data environment. The proposed approach has been implemented and evaluated by comparing it with Google and Twitter Search engines and we obtained very promising results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Belkaroui, R., Faiz, R.: Towards events tweet contextualization using social influence model and users conversations. In: Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, WIMS 2015, Larnaca, Cyprus, p. 3, July 13–15, 2015. http://doi.acm.org/10.1145/2797115.2797134
Bernstein, M., Hong, L., Kairam, S., Chi, H., Suh, B.: A torrent of tweets: managing information overload in online social streams. In. In Workshop on Microblogging: What and How Can We Learn From It? (CHI’10 (2010)
Bollier, D., Firestone, C.M.: The Promise and Peril of Big Data. Aspen Institute, Communications and Society Program Washington, DC, USA (2010)
Boyd, d., Crawford, K.: Six provocations for Big Data. In: A Decade in Internet Time: Symposium on the Dynamics of the Internet and Society (2011)
Bruns, A., Burgess, J.E.: #Ausvotes: how twitter covered the 2010 australian federal election. Commun. Polit. Cult. 44(2), 37–56 (2011), http://search.informit.com.au/documentSummary;dn=627330171744964;res=IELHSS
Cha, M., Mislove, A., Gummadi, K.P.: A measurement-driven analysis of information propagation in the flickr social network. In: Proceedings of the 18th International Conference on World Wide Web, WWW’09, pp. 721–730. ACM, New York, NY, USA (2009). http://doi.acm.org/10.1145/1526709.1526806
Chen, W., Wang, Y., Yang, S.: Efficient influence maximization in social networks. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’09, pp. 199–208. ACM, New York, NY, USA (2009). http://doi.acm.org/10.1145/1557019.1557047
Cogan, P., Andrews, M., Bradonjic, M., Kennedy, W.S., Sala, A., Tucci, G.: Reconstruction and analysis of twitter conversation graphs. In: Proceedings of the First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks Research, HotSocial’12, pp. 25–31. ACM, New York, NY, USA (2012). http://doi.acm.org/10.1145/2392622.2392626
Cuzzocrea, A., Song, I.Y., Davis, K.C.: Analytics over large-scale multidimensional data: the big data revolution! In: Proceedings of the ACM 14th International Workshop on Data Warehousing and OLAP, DOLAP’11, pp. 101–104. ACM, New York, NY, USA (2011). http://doi.acm.org/10.1145/2064676.2064695
Efron, M., Winget, M.: Questions are content: a taxonomy of questions in a microblogging environment. In: Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem, ASIS&T ’10, vol. 47, pp. 27:1–27:10, American Society for Information Science, Silver Springs, MD, USA (2010). http://dl.acm.org/citation.cfm?id=1920331.1920371
Gómez, V., Kappen, H.J., Kaltenbrunner, A.: Modeling the structure and evolution of discussion cascades. In: Proceedings of the 22Nd ACM Conference on Hypertext and Hypermedia, HT’11, pp. 181–190. ACM, New York, NY, USA (2011). http://doi.acm.org/10.1145/1995966.1995992
Huang, J., Thornton, K.M., Efthimiadis, E.N.: Conversational tagging in twitter. In: Proceedings of the 21st ACM Conference on Hypertext and Hypermedia, HT’10, pp. 173–178. ACM, New York, NY, USA (2010). http://doi.acm.org/10.1145/1810617.1810647
Jabeur, L.B., Tamine, L., Boughanem, M.: Uprising microblogs: A bayesian network retrieval model for tweet search. In: Proceedings of the 27th Annual ACM Symposium on Applied Computing, SAC’12, pp. 943–948. ACM, New York, NY, USA (2012). http://doi.acm.org/10.1145/2245276.2245459
Kempe, D., Kleinberg, J., Tardos, E.: Maximizing the spread of influence through a social network. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’03, pp. 137–146. ACM, New York, NY, USA (2003). http://doi.acm.org/10.1145/956750.956769
Kumar, R., Mahdian, M., McGlohon, M.: Dynamics of conversations. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’10, pp. 553–562. ACM, New York, NY, USA (2010). http://doi.acm.org/10.1145/1835804.1835875
Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web, WWW’10, pp. 591–600. ACM, New York, NY, USA (2010). http://doi.acm.org/10.1145/1772690.1772751
Lee, C., Kwak, H., Park, H., Moon, S.: Finding influentials based on the temporal order of information adoption in twitter. In: Proceedings of the 19th International Conference on World Wide Web, WWW’10, pp. 1137–1138. ACM, New York, NY, USA (2010). http://doi.acm.org/10.1145/1772690.1772842
Magnani, M., Montesi, D., Rossi, L.: Information propagation analysis in a social network site. In: International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 296–300, Aug 2010
Magnani, M., Montesi, D., Nunziante, G., Rossi, L.: Conversation retrieval from Twitter. In: Clough, P., Foley, C., Gurrin, C., Jones, G., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 780–783. Springer, Heidelberg (2011)
Magnani, M., Montesi, D., Rossi, L.: Conversation Retrieval for Microblogging Sites, vol. 15, pp. 354–372. Springer, Netherlands (2012). http://dx.doi.org/10.1007/s10791-012-9189-9
Manovich, L.: Trending: The Promises and the Challenges of Big Social Data. Debates in the Digital Humanities, pp. 460–475 (2011)
Sagiroglu, S., Sinanc, D.: Big data: a review. In: International Conference on Collaboration Technologies and Systems (CTS), pp. 42–47 (2013)
Scellato, S., Mascolo, C., Musolesi, M., Crowcroft, J.: Track globally, deliver locally: improving content delivery networks by tracking geographic social cascades. In: Proceedings of the 20th International Conference on World Wide Web, WWW’11, pp. 457–466. ACM, New York, NY, USA (2011). http://doi.acm.org/10.1145/1963405.1963471
Smith, M., Szongott, C., Henne, B., von Voigt, G.: Big data privacy issues in public social media. In: 6th IEEE International Conference on Digital Ecosystems Technologies (DEST), pp. 1–6, June 2012
Song, S., Li, Q., Zheng, N.: A Spatio-temporal Framework for related topic search in micro-blogging. In: An, A., Lingras, P., Petty, S., Huang, R. (eds.) AMT 2010, LNCS, vol. 6335, pp. 63–73. Springer, Heidelberg (2010)
Teevan, J., Ramage, D., Morris, M.R.: #twittersearch: a comparison of microblog search and web search. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, WSDM’11, pp. 35–44. ACM, New York, NY, USA (2011) http://doi.acm.org/10.1145/1935826.1935842
Wang, D., Wen, Z., Tong, H., Lin, C.Y., Song, C., Barabási, A.L.: Information spreading in context. In: Proceedings of the 20th International Conference on World Wide Web, WWW’11, pp. 735–744. ACM, New York, NY, USA (2011). http://doi.acm.org/10.1145/1963405.1963508
Yang, J., Leskovec, J.: Patterns of temporal variation in online media. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, WSDM’11, pp. 177–186. ACM, New York, NY, USA (2011) http://doi.acm.org/10.1145/1935826.1935863
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Belkaroui, R., Jemal, D., Faiz, R. (2016). Exploring Big Data Environment for Conversation Data Analysis and Mining on Microblogs. In: Gaber, T., Hassanien, A., El-Bendary, N., Dey, N. (eds) The 1st International Conference on Advanced Intelligent System and Informatics (AISI2015), November 28-30, 2015, Beni Suef, Egypt. Advances in Intelligent Systems and Computing, vol 407. Springer, Cham. https://doi.org/10.1007/978-3-319-26690-9_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-26690-9_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26688-6
Online ISBN: 978-3-319-26690-9
eBook Packages: Computer ScienceComputer Science (R0)