[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3592571.3592971acmconferencesArticle/Chapter ViewAbstractPublication PagesicdarConference Proceedingsconference-collections
short-paper

A Joint Scene Text Recognition and Visual Appearance Model for Protest Issue Classification

Published: 19 July 2023 Publication History

Abstract

Social movements have been a crucial means of political participation in democratic and non-democratic societies. In recent years, the Arab Spring in 2011 as a notable example, such movements aggressively used social media as a platform for organizing protesters. Visual images and texts spread around on the internet played vital roles in bonding and attracting citizens to the movements. Political scientists and sociologists attempted to exploit the vast amount of information on the internet to analyze social movements. Their use of artificial intelligence techniques, however, has been limited. In this paper, we newly introduce a joint scene-text recognition and visual appearance model. Specifically, our model employs a character-level scene text detection and an n-gram character embedding model. Our model can extract semantic information from scene texts including handwritten words on signs and placards, which past automated studies of social movements by social scientists have not yet utilized. By employing such semantic information, our model can, in contrast to past research, classify protest images into semantic categories such as types of protest issues (e.g., gender, race, green). We newly created the Protest Issue Image Dataset consisting of 2,859 images with five protest issue categories to evaluate our model. Our model scored F1 score and provided a significant boost in performance over the conventional visual model used by social scientists, which scored . Our model can be applied to various types of social movement studies, including recent research which examines how different types of protest issues relate to other political and social factors.

References

[1]
Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, and Hwalsuk Lee. 2019. Character region awareness for text detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9365–9374.
[2]
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the association for computational linguistics 5 (2017), 135–146.
[3]
Larry Buchanan, Quoctrung Bui, and Jugal K Patel. 2020. Black Lives Matter may be the largest movement in US history. The New York Times 3 (2020).
[4]
Erica Chenoweth, E Perkoski, J Pressman, and C Thurber. 2017. New data shows a sharp increase in US protest activity in April. The Washington Post 22 (2017).
[5]
Kevin Crowston. 2012. Amazon mechanical turk: A research tool for organizations and information systems scholars. In Shaping the future of ict research. methods and approaches. Springer, 210–221.
[6]
Alia E Dastagir. 2014. Largest-ever” climate-change march rolls through NYC. USA Today 22 (2014).
[7]
The Economist. 2020. Political protests have become more widespread and more frequent. https://www.economist.com/graphic-detail/2020/03/10/political-protests-have-become-more-widespread-and-more-frequent
[8]
Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 580–587.
[9]
Dan Glaister and Ewen MacAskill. 2006. US counts cost of day without immigrants. The Guardian 2 (2006).
[10]
Alex Graves, Santiago Fernández, Faustino Gomez, and Jürgen Schmidhuber. 2006. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In Proceedings of the 23rd international conference on Machine learning. 369–376.
[11]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
[12]
Max Jaderberg, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2016. Reading text in the wild with convolutional neural networks. International journal of computer vision 116, 1 (2016), 1–20.
[13]
Dimosthenis Karatzas, Lluis Gomez-Bigorda, Anguelos Nicolaou, Suman Ghosh, Andrew Bagdanov, Masakazu Iwamura, Jiri Matas, Lukas Neumann, Vijay Ramaseshan Chandrasekhar, Shijian Lu, 2015. ICDAR 2015 competition on robust reading. In 2015 13th international conference on document analysis and recognition (ICDAR). IEEE, 1156–1160.
[14]
Sandra Laville and Jonathan Watts. 2019. Across the globe, millions join biggest climate protest ever. The Guardian 21 (2019), 09–10.
[15]
Jeff Mapua. 2019. Can Anyone Own a Gun?Enslow Publishing, LLC.
[16]
Juan Ramos 2003. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, Vol. 242. Citeseer, 29–48.
[17]
Baoguang Shi, Xiang Bai, and Cong Yao. 2016. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE transactions on pattern analysis and machine intelligence 39, 11 (2016), 2298–2304.
[18]
Charles Tilly and Sidney G Tarrow. 2015. Contentious politics. Oxford University Press.
[19]
Time. 2011. The March for Women’s Lives. http://content.time.com/time/specials/packages/article/0, 28804,2088114_2087975_2087969,00.html
[20]
Time. 2018. The Million Mom March. https://web.archive.org/web/20130823234322http://www.time.com/time/specials/packages/article/0, 28804,2088114_2087975_2087974,00.html
[21]
Sebastián Valenzuela, Arturo Arriagada, and Andrés Scherman. 2012. The social media basis of youth protest behavior: The case of Chile. Journal of communication 62, 2 (2012), 299–314.
[22]
Joris Verhulst and Stefaan Walgrave. 2009. The first time is the hardest? A cross-national and cross-issue comparison of first-time protest participants. Political Behavior 31, 3 (2009), 455–484.
[23]
Kaveh Waddell. 2017. The exhausting work of tallying America’s largest protest. The Atlantic 23 (2017).
[24]
Stefaan Walgrave, Jeroen Van Laer, Joris Verhulst, and Ruud Wouters. 2013. Why do people protest? Comparing demonstrators’ motives across issues and nations. Media, movements and politics (2013).
[25]
Dan J Wang and Sarah A Soule. 2016. Tactical innovation in social movements: The effects of peripheral and multi-issue protest. American Sociological Review 81, 3 (2016), 517–548.
[26]
John Wilkerson and Andreu Casas. 2017. Large-scale computerized text analysis in political science: Opportunities and challenges. Annual Review of Political Science 20 (2017), 529–544.
[27]
Nora Webb Williams, Andreu Casas, and John D Wilkerson. 2020. Images as data for social science research: An introduction to convolutional neural nets for image classification. Cambridge University Press.
[28]
Donghyeon Won, Zachary C Steinert-Threlkeld, and Jungseock Joo. 2017. Protest activity detection and perceived violence estimation from social media images. In Proceedings of the 25th ACM international conference on Multimedia. 786–794.
[29]
Han Zhang and Jennifer Pan. 2019. Casm: A deep-learning approach for identifying collective action events with text and image data from social media. Sociological Methodology 49, 1 (2019), 1–57.
[30]
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, and Jiajun Liang. 2017. East: an efficient and accurate scene text detector. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 5551–5560.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICDAR '23: Proceedings of the 4th ACM Workshop on Intelligent Cross-Data Analysis and Retrieval
June 2023
69 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Fine-grained image classification
  2. contentious politics
  3. protest issue classification
  4. scene text detection
  5. scene text recognition
  6. social media
  7. social movement

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

ICMR '23
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 46
    Total Downloads
  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media