[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3227609.3227665acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

Organized Behavior Classification of Tweet Sets using Supervised Learning Methods

Published: 25 June 2018 Publication History

Abstract

There is an increasing incidence in negative propaganda and fake news, which has recently gained lots of attention during the 2016 elections in United States, France, and United Kingdom. Bots and hired users collaborate to make messages seen and persist so they may spread and gain support. Assuming that most Twitter users post without predetermined, malicious intent, there is a need for automated detection of organized behavior to protect users from manipulation. This work proposes an automated approach to classify tweets with organized behavior. Supervised learning methods are used to classify the tweets by using a training data set with 850 records based on the analysis of over 200 million tweets. Our model gave promising results for detection of organized behavior and this motivated us to proceed with the generation of two more classifiers such as ["political", "non-political"] and ["pro-Trump", "pro-Hillary","neither"]. In each cases, the random forest algorithm consistently results in high scores with an average accuracy and f-measure above 0.95.

References

[1]
{n. d.}. Weka Description. http://www.cs.waikato.ac.nz/ml/weka/. ({n. d.}). Accessed: 2018-02-21.
[2]
Norah Abokhodair, Daisy Yoo, and David W McDonald. 2015. Dissecting a social botnet: Growth, content and influence in Twitter. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15). ACM, ACM, New York, NY, USA, 839--851. 2675133.2675208.
[3]
Ethem Alpaydin. 2010. Introduction to Machine Learning (2nd ed.). The MIT Press.
[4]
Michael Ashcroft, Ali Fisher, Lisa Kaati, Enghin Omer, and Nico Prucha. 2015. Detecting Jihadist Messages on Twitter. In Proceedings of the 2015 European Intelligence and Security Informatics Conference (EISIC). IEEE Computer Society, Washington, DC, USA, 161--164.
[5]
Mohamed Bakillah, Ren-Yu Li, and Steve HL Liang. 2015. Geo-located community detection in Twitter with enhanced fast-greedy optimization of modularity: the case study of typhoon Haiyan. International Journal of Geographical Information Science 29, 2 (2015), 258--279.
[6]
J.M. Berger. 2016. Nazis vs. ISIS on Twitter: A Comparative Study of White Nationalist and ISIS Online Social Media Networks. (2016).
[7]
J.M. Berger and Jonathon Morgan. 2015. The ISIS Twitter Census, Defining and describing the population of ISIS supporters on Twitter. (2015). https://www.brookings.edu/wp-content/uploads/2016/06/isis_twitter_census_berger_morgan.pdf.
[8]
Alessandro Bessi and Emilio Ferrara. 2016. Social bots distort the 2016 U.S. Presidential election online discussion. First Monday 21, 11 (2016). fm.v21i11.7090.
[9]
Cheng Cao and James Caverlee. 2015. Detecting Spam URLs in Social Media via Behavioral Analysis. Springer International Publishing, 703--714.
[10]
Cheng Cao, James Caverlee, Kyumin Lee, Hancheng Ge, and Jinwook Chung. 2015. Organic or Organized?: Exploring URL Sharing Behavior. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, New York, NY, USA, 513--522.
[11]
Zi Chu, Steven Gianvecchio, Haining Wang, and Sushil Jajodia. 2012. Detecting automation of twitter accounts: Are you a human, bot, or cyborg? IEEE Transactions on Dependable and Secure Computing 9, 6 (2012), 811--824.
[12]
Frank Edwards and Philip N. Joyce Howard. 2013. Digital Activism and Non Violent Conflict. (2013). Available at SSRN: htttps://ssrn.com/abstract=2595115.
[13]
Emilio Ferrara, Onur Varol, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2016. The Rise of Social Bots. Commun. ACM 59, 7 (June 2016), 96--104.
[14]
Nicolas Foucault and Antoine Courtin. 2016. Automatic Classification of Tweets for Analyzing Communication Behavior of Museums. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (Eds.). European Language Resources Association (ELRA), Paris, France.
[15]
Shahla Ghobadi and Stewart Clegg. 2015. "These Days Will Never Be Forgotten A Critical Mass Approach to Online Activism. Inf. Organ. 25, 1 (Jan. 2015), 20. j.infoandorg.2014.12.002.
[16]
Philip N. Howard and Bence Kollanyi. 2016. Bots, #StrongerIn, and #Brexit: Computational Propaganda during the UK-EU Referendum. (2016). http://arxiv.org/abs/1606.06356
[17]
Bence Kollanyi and Philip N. Howard. 2016. Bots and Automation over Twitter during the Second U.S. Presidential Debate. (2016). http://politicalbots.org/wp-content/uploads/2016/10/Data-Memo-Second-Presidential-Debate.pdf.
[18]
Bence Kollanyi, Philip N. Howard, and Samuel C. Wooley. 2016. Bots and Automation over Twitter during the First U.S. Presidential Debate. (2016). https://www.oii.ox.ac.uk/blog/bots-and-automation-over-twitter-during-the-first-u-s-presidential-debate
[19]
Georgiy Levchuk, Lise Getoor, and Marc Smith. 2014. Classification of group behaviors in social media via social behavior grammars. In SPIE Defense+ Security. International Society for Optics and Photonics, 909707--909707. 12.2050823.
[20]
Jonathan A Obar. 2013. Canadian Advocacy 2.0: An Analysis of Social Media Adoption and Perceived Affordances by Advocacy Groups Looking to Advance Activism in Canada. (2013). Available at SSRN: https://ssrn.com/abstract=2254742.
[21]
Jacob Ratkiewicz, Michael Conover, Mark R Meiss, Bruno Gonçalves, Alessandro Flammini, and Filippo Menczer. 2011. Detecting and Tracking Political Abuse in Social Media. ICWSM 11 (2011), 297--304.
[22]
Craig Silverman and Lawrence Alexander. 2016. How Teens In The Balkans Are Duping Trump Supporters With Fake News. https://www.buzzfeed.com/craigsilverman/how-macedonia-became-a-global-hub-for-pro-trump-misinfo. (November 2016). Accessed: 2018-02-21.
[23]
Apache Spark. {n. d.}. Machine Learning Library (MLlib). https://spark.apache.org/docs/1.1.0/mllib-guide.html. ({n. d.}). Accessed: 2018-02-21.
[24]
Prashanth Vijayaraghavan, Soroush Vosoughi, and Deb Roy. 2016. Automatic Detection and Categorization of Election-Related Tweets. In Tenth International AAAI Conference on Web and Social Media.
[25]
Ahmet Yıldırım, Suzan Üsküdarli, and Arzucan Özgür. 2016. Identifying topics in microblogs using Wikipedia. PloS one 11, 3 (2016), e0151885. journal.pone.0151885.
[26]
Matei Zaharia, Reynold S. Xin, Patrick Wendell, Tathagata Das, Michael Armbrust, Ankur Dave, Xiangrui Meng, Josh Rosen, Shivaram Venkataraman, Michael J. Franklin, Ali Ghodsi, Joseph Gonzalez, Scott Shenker, and Ion Stoica. 2016. Apache Spark: A Unified Engine for Big Data Processing. Commun. ACM 59, 11 (Oct. 2016), 56--65.
[27]
Nina Zumel and John Mount. 2014. Practical Data Science with R (1st ed.). Manning Publications Co., Greenwich, CT, USA.

Cited By

View all
  • (2023)Friendship Preference: Scalable and Robust Category of Features for Social Bot DetectionIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2022.315900720:2(1516-1528)Online publication date: 1-Mar-2023
  • (2023)Systematic Literature Review of Social Media Bots Detection SystemsJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2023.04.00435:5(101551)Online publication date: May-2023
  • (2023)Machine learning-based social media bot detection: a comprehensive literature reviewSocial Network Analysis and Mining10.1007/s13278-022-01020-513:1Online publication date: 5-Jan-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WIMS '18: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics
June 2018
398 pages
© 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 2016 US presidential elections
  2. Political propaganda
  3. Twitter
  4. big data
  5. organized behavior detection
  6. social media analysis
  7. supervised learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WIMS '18

Acceptance Rates

Overall Acceptance Rate 140 of 278 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Friendship Preference: Scalable and Robust Category of Features for Social Bot DetectionIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2022.315900720:2(1516-1528)Online publication date: 1-Mar-2023
  • (2023)Systematic Literature Review of Social Media Bots Detection SystemsJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2023.04.00435:5(101551)Online publication date: May-2023
  • (2023)Machine learning-based social media bot detection: a comprehensive literature reviewSocial Network Analysis and Mining10.1007/s13278-022-01020-513:1Online publication date: 5-Jan-2023
  • (2022)A Systematic Comparison of Machine Learning and NLP Techniques to Unveil Propaganda in Social MediaJournal of Information Technology Research10.4018/JITR.29938415:1(1-14)Online publication date: 1-Jan-2022
  • (2022)Identify Twitter Data from Humans or Bots Using Machine Learning Algorithms with Kendalls CorrelationEvolution in Computational Intelligence10.1007/978-981-16-6616-2_19(203-212)Online publication date: 24-Apr-2022
  • (2021)An Intelligent Multicriteria Model for Diagnosing Dementia in People Infected with Human Immunodeficiency VirusApplied Sciences10.3390/app11211045711:21(10457)Online publication date: 7-Nov-2021
  • (2021)Social Media and Microblogs Credibility: Identification, Theory Driven Framework, and RecommendationIEEE Access10.1109/ACCESS.2021.31144179(137744-137781)Online publication date: 2021
  • (2021)Propaganda analysis in social media: a bibliometric reviewInformation Discovery and Delivery10.1108/IDD-06-2020-006549:1(57-70)Online publication date: 29-Jan-2021
  • (2021)Neue Öffentlichkeitsdynamiken: Zu selbstverstärkenden, plattformübergreifenden Effekten von ‚Popularität‘Digitaler Strukturwandel der Öffentlichkeit10.1007/978-3-658-32133-8_19(339-359)Online publication date: 2-Apr-2021
  • (2020)Intelligent Fake News Detection: A Systematic MappingJournal of Applied Security Research10.1080/19361610.2020.1761224(1-22)Online publication date: 14-May-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media