[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Vartalaap: What Drives #AirQuality Discussions: Politics, Pollution or Pseudo-science?

Published: 22 April 2021 Publication History

Abstract

Air pollution is a global challenge for cities across the globe. Understanding the public perception of air pollution can help policymakers engage better with the public and appropriately introduce policies. Accurate public perception can also help people to identify the health risks of air pollution and act accordingly. Unfortunately, current techniques for determining perception are not scalable: it involves surveying few hundred people with questionnaire-based surveys. Using the advances in natural language processing (NLP), we propose a more scalable solution called Vartalaap to gauge public perception of air pollution via the microblogging social network Twitter. We curated a dataset of more than 1.2M tweets discussing Delhi-specific air pollution. We find that (unfortunately) the public is supportive of unproven mitigation strategies to reduce pollution, thus risking their health due to a false sense of security. We also find that air quality is a year-long problem, but the discussions are not proportional to the level of pollution and spike up when pollution is more visible. The information required by Vartalaap is publicly available and, as such, it can be immediately applied to study different societal issues across the world.

References

[1]
Sofiane Abbar, Tahar Zanouda, Laure Berti-Equille, and Javier Borge-Holthoefer. 2016. Using twitter to understand public interest in climate change: The case of qatar. In Tenth International AAAI Conference on Web and Social Media.
[2]
Rishiraj Adhikary and Nipun Batra. 2020 a. Computational tools for understanding air pollution. In Adjunct Proceedings of the 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2020 ACM International Symposium on Wearable Computers. 199--203.
[3]
Rishiraj Adhikary and Nipun Batra. 2020 b. Do we breathe the same air?. In Adjunct Proceedings of the 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2020 ACM International Symposium on Wearable Computers. 1--4.
[4]
Jeff Alstott and Dietmar Plenz Bullmore. 2014. powerlaw: a Python package for analysis of heavy-tailed distributions. PloS one, Vol. 9, 1 (2014).
[5]
Tawfiq Ammari, Sarita Schoenebeck, and Daniel Romero. 2019. Self-declared throwaway accounts on Reddit: How platform affordances and shared norms enable parenting disclosure and support. Proceedings of the ACM on Human-Computer Interaction, Vol. 3, CSCW (2019), 1--30.
[6]
Xiaoran An, Auroop R Ganguly, Yi Fang, Steven B Scyphers, Ann M Hunter, and Jennifer G Dy. 2014. Tracking climate change opinions from twitter data. In Workshop on Data Science for Social Good.
[7]
Joshua S Apte, Michael Brauer, Aaron J Cohen, Majid Ezzati, and C Arden Pope III. 2018. Ambient PM2. 5 reduces global and regional life expectancy. Environmental Science & Technology Letters, Vol. 5, 9 (2018), 546--551.
[8]
Joshua S Apte and Pallavi Pant. 2019. Toward cleaner air for a billion Indians. Proceedings of the National Academy of Sciences, Vol. 116, 22 (2019), 10614--10616.
[9]
Dominic Odwa Atari, Isaac N Luginaah, and Karen Fung. 2009. The relationship between odour annoyance scores and modelled ambient air pollution in Sarnia,?Chemical Valley", Ontario. International Journal of Environmental Research and Public Health, Vol. 6, 10 (2009), 2655--2675.
[10]
Noureddine Azzouza, Karima Akli-Astouati, and Roliana Ibrahim. 2020. TwitterBERT: Framework for Twitter Sentiment Analysis Based on Pre-trained Language Model Representations. In Emerging Trends in Intelligent Computing and Informatics, Faisal Saeed, Fathey Mohammed, and Nadhmi Gazem (Eds.). Springer International Publishing, Cham, 428--437.
[11]
Arnab Jana Rounaq Basu, Aparup Khatua, and Saptarshi Ghosh. 2017. Harnessing Twitter Data for Analyzing Public Reactions to Transportation Policies: Evidences from the Odd-Even Policy in Delhi, India. Proceedings of the Eastern Asia Society For Transportation Studies (EASTS) (2017).
[12]
Eric PS Baumer, David Mimno, Shion Guha, Emily Quan, and Geri K Gay. 2017. Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence? Journal of the Association for Information Science and Technology, Vol. 68, 6 (2017), 1397--1410.
[13]
Karen Bickerstaff and Gordon Walker. 2001. Public understandings of air pollution: the ?localisation'of environmental risk. Global Environmental Change, Vol. 11, 2 (2001), 133--145.
[14]
David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research, Vol. 3, Jan (2003), 993--1022.
[15]
Ilias Bougoudis, Konstantinos Demertzis, and Lazaros Iliadis. 2016. HISYCOL a hybrid computational intelligence system for combined machine learning: the case of air pollution modeling in Athens. Neural Computing and Applications, Vol. 27, 5 (2016), 1191--1206.
[16]
Youngchul Cha and Junghoo Cho. 2012. Social-network analysis using topic models. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval. 565--574.
[17]
K Chandramouli, N Pannirselvam, D Vijaya Kumar, Sagar Reddy Avuthu, and V Anitha. 2019. A STUDY ON SMOG FILTERING TOWER. Journal of Advanced Cement & Concrete Technology, Vol. 2, 1, 2 (2019).
[18]
Jonathan Chang, Sean Gerrish, Chong Wang, Jordan L Boyd-Graber, and David M Blei. 2009. Reading tea leaves: How humans interpret topic models. In Advances in neural information processing systems. 288--296.
[19]
Sourangsu Chowdhury, Sagnik Dey, Sarath Guttikunda, Ajay Pillarisetti, Kirk R Smith, and Larry Di Girolamo. 2019. Indian annual ambient air quality standard is achievable by completely mitigating emissions from household sources. Proceedings of the National Academy of Sciences, Vol. 116, 22 (2019), 10711--10716.
[20]
Sourangsu Chowdhury, Sagnik Dey, Sachchida Nand Tripathi, Gufran Beig, Amit Kumar Mishra, and Sumit Sharma. 2017. "Traffic intervention" policy fails to mitigate air pollution in megacity Delhi. Environmental science & policy, Vol. 74 (2017), 8--13.
[21]
Jason Chuang, Daniel Ramage, Christopher Manning, and Jeffrey Heer. 2012. Interpretation and trust: Designing model-driven visualizations for text analysis. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 443--452.
[22]
Anna-Sara Claeson, Edvard Lidén, Maria Nordin, and Steven Nordin. 2013. The role of perceived pollution and health risk perception in annoyance and health symptoms: a population-based study of odorous air pollution. International archives of occupational and environmental health, Vol. 86, 3 (2013), 367--374.
[23]
Biraj Dahal, Sathish AP Kumar, and Zhenlong Li. 2019. Topic modeling and sentiment analysis of global climate change tweets. Social Network Analysis and Mining, Vol. 9, 1 (2019), 24.
[24]
Michael A DeVito, Jeremy Birnholtz, and Jeffery T Hancock. 2017. Platforms, people, and perception: Using affordances to understand self-presentation on social media. In Proceedings of the 2017 ACM conference on computer supported cooperative work and social computing. 740--754.
[25]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018a. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR, Vol. abs/1810.04805 (2018). arxiv: 1810.04805 http://arxiv.org/abs/1810.04805
[26]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018b. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
[27]
Daxin Dong, Xiaowei Xu, Wen Xu, and Junye Xie. 2019. The Relationship Between the Actual Level of Air Pollution and Residents' Concern about Air Pollution: Evidence from Shanghai, China. International Journal of Environmental Research and Public Health, Vol. 16, 23 (2019), 4784.
[28]
Thaddaeus Egondi, Catherine Kyobutungi, Nawi Ng, Kanyiva Muindi, Samuel Oti, Steven Van de Vijver, Remare Ettarh, and Joacim Rocklöv. 2013. Community perceptions of air pollution and related health risks in Nairobi slums. International journal of environmental research and public health, Vol. 10, 10 (2013), 4851--4868.
[29]
Colin S. Gillespie. 2015. Fitting Heavy Tailed Distributions: The poweRlaw Package. Journal of Statistical Software, Vol. 64, 2 (2015), 1--16. http://www.jstatsoft.org/v64/i02/
[30]
Clive WJ Granger. 1969. Investigating causal relations by econometric models and cross-spectral methods. Econometrica: journal of the Econometric Society (1969), 424--438.
[31]
Supraja Gurajala, Suresh Dhaniyala, and Jeanna N Matthews. 2019. Understanding public response to air quality using tweet analysis. Social Media Society, Vol. 5, 3 (2019), 2056305119867656.
[32]
Sarath Guttikunda and Puja Jawahar. 2020. Can We Vacuum Our Air Pollution Problem Using Smog Towers? Atmosphere, Vol. 11, 9 (2020), 922.
[33]
Sarath K Guttikunda, Pallavi Pant, KA Nishadh, and Puja Jawahar. 2019. Particulate Matter Source Contributions for Raipur-Durg-Bhilai Region of Chhattisgarh, India. Aerosol and Air Quality Research, Vol. 19, 3 (2019), 528--540.
[34]
Lei Huang, Chao Rao, Tsering Jan van der Kuijp, Jun Bi, and Yang Liu. 2017. A comparison of individual exposure, perception, and acceptable levels of PM2. 5 with air pollution policy objectives in China. Environmental research, Vol. 157 (2017), 78--86.
[35]
Doaa Mohey El-Din Mohamed Hussein. 2016. A survey on sentiment analysis challenges. https://www.sciencedirect.com/science/article/pii/S1018363916300071
[36]
Aarti Israni, Sheena Erete, and Che L Smith. 2017. Snitches, Trolls, and Social Norms: Unpacking Perceptions of Social Media Use for Crime Prevention. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing. 1193--1209.
[37]
Wei Jiang, Yandong Wang, Ming-Hsiang Tsou, and Xiaokang Fu. 2015. Using social media to detect outdoor air pollution and monitor air quality index (AQI): a geo-targeted spatiotemporal analysis framework with Sina Weibo (Chinese Twitter). PloS one, Vol. 10, 10 (2015), e0141185.
[38]
B Kahng, I Yang, H Jeong, and A-L Barabási. 2004. Emergence of power-law behaviors in online auctions. In The Application of Econophysics. Springer, 204--209.
[39]
Denis Kwiatkowski, Peter CB Phillips, Peter Schmidt, Yongcheol Shin, et almbox. 1992. Testing the null hypothesis of stationarity against the alternative of a unit root. Journal of econometrics, Vol. 54, 1--3 (1992), 159--178.
[40]
F. Long, K. Zhou, and W. Ou. 2019. Sentiment Analysis of Text Based on Bidirectional LSTM With Multi-Head Attention. IEEE Access, Vol. 7 (2019), 141960--141969.
[41]
Mary L McHugh. 2012. Interrater reliability: the kappa statistic. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3900052/
[42]
Susheel K Mittal, Nirankar Singh, Ravinder Agarwal, Amit Awasthi, and Prabhat K Gupta. 2009. Ambient air quality during wheat and rice crop stubble burning episodes in Patiala. Atmospheric Environment, Vol. 43, 2 (2009), 238--244.
[43]
Dinesh Mohan, Geetam Tiwari, Rahul Goel, and Paranjyoti Lahkar. 2017. Evaluation of odd--even day traffic restriction experiments in Delhi, India. Transportation Research Record, Vol. 2627, 1 (2017), 9--16.
[44]
Minggang Peng, Hui Zhang, Richard D Evans, Xiaohui Zhong, and Kun Yang. 2019. Actual air pollution, environmental transparency, and the perception of air pollution in China. The Journal of Environment & Development, Vol. 28, 1 (2019), 78--105.
[45]
H. T. Phan, V. C. Tran, N. T. Nguyen, and D. Hwang. 2020. Improving the Performance of Sentiment Analysis of Tweets Containing Fuzzy Sentiment Using the Feature Ensemble Model. IEEE Access, Vol. 8 (2020), 14630--14641.
[46]
Prithviraj Pramanik, Tamal Mondal, Subrata Nandi, and Mousumi Saha. 2020. AirCalypse: Can Twitter Help in Urban Air Quality Measurement and Who are the Influential Users?. In Companion Proceedings of the Web Conference 2020. 540--545.
[47]
Melissa Pujazon-Zazik and M Jane Park. 2010. To tweet, or not to tweet: gender differences and potential positive and negative health outcomes of adolescents' social internet use. American journal of men's health, Vol. 4, 1 (2010), 77--85.
[48]
Manish Rana and Mohammad Atique. 2019. Language Translation: Enhancing Bi-Lingual Machine Translation Approach Using Python. i-Manager's Journal on Computer Science, Vol. 7, 2 (2019), 36.
[49]
Khaiwal Ravindra, Maninder Kaur Sidhu, Suman Mor, Siby John, and Saumyadipta Pyne. 2016. Air pollution in India: bridging the gap between science and policy. Journal of Hazardous, Toxic, and Radioactive Waste, Vol. 20, 4 (2016), A4015003.
[50]
Radim v Rehr uv rek and Petr Sojka. 2010. Software Framework for Topic Modelling with Large Corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. ELRA, Valletta, Malta, 45--50. http://is.muni.cz/publication/884893/en.
[51]
Markus Reichstein, Gustau Camps-Valls, Bjorn Stevens, Martin Jung, Joachim Denzler, Nuno Carvalhais, et almbox. 2019. Deep learning and process understanding for data-driven Earth system science. Nature, Vol. 566, 7743 (2019), 195--204.
[52]
Michael Röder, Andreas Both, and Alexander Hinneburg. 2015. Exploring the space of topic coherence measures. In Proceedings of the eighth ACM international conference on Web search and data mining. 399--408.
[53]
Arif Mohaimin Sadri, Samiul Hasan, Satish V Ukkusuri, and Juan Esteban Suarez Lopez. 2018. Analysis of social interaction network properties and growth on Twitter. Social Network Analysis and Mining, Vol. 8, 1 (2018), 56.
[54]
K. Sarkar and M. Bhowmick. 2017. Sentiment polarity detection in bengali tweets using multinomial Naïve Bayes and support vector machines. In 2017 IEEE Calcutta Conference (CALCON). 31--36.
[55]
Kamal Sarkar and Saikat Chakraborty. 2015. A Sentiment Analysis System for Indian Language Tweets. In Mining Intelligence and Knowledge Exploration, Rajendra Prasath, Anil Kumar Vuppala, and T. Kathirvalavakumar (Eds.). Springer International Publishing, Cham, 694--702.
[56]
Jan C Semenza, Daniel J Wilson, Jeremy Parra, Brian D Bontempo, Melissa Hart, David J Sailor, and Linda A George. 2008. Public perception and behavior change in relationship to hot weather and air pollution. Environmental research, Vol. 107, 3 (2008), 401--411.
[57]
Aliaksei Severyn and Alessandro Moschitti. 2015. Twitter Sentiment Analysis with Deep Convolutional Neural Networks. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (Santiago, Chile) (SIGIR '15). Association for Computing Machinery, New York, NY, USA, 959--962. https://doi.org/10.1145/2766462.2767830
[58]
Kaushik K Shandilya, Mukesh Khare, and Akhilendra Bhushan Gupta. 2007. Suspended particulate matter distribution in rural-industrial Satna and in urban-industrial South Delhi. Environmental monitoring and assessment, Vol. 128, 1--3 (2007), 431--445.
[59]
Carson Sievert and Kenneth Shirley. 2014. LDAvis: A method for visualizing and interpreting topics. In Proceedings of the workshop on interactive language learning, visualization, and interfaces. 63--70.
[60]
Jabrinder Singh, Naveen Singhal, Shailey Singhal, Madhu Sharma, Shilpi Agarwal, and Shefali Arora. 2018. Environmental implications of rice and wheat stubble burning in north-western states of India. In Advances in health and environment safety. Springer, 47--55.
[61]
Torsten Skov, Torben Cordtz, Lilli Kirkeskov Jensen, Peter Saugman, Kirsten Schmidt, and Peter Theilade. 1991. Modifications of health behaviour in response to air pollution notifications in Copenhagen. Social Science & Medicine, Vol. 33, 5 (1991), 621--626.
[62]
Alison Smith, Varun Kumar, Jordan Boyd-Graber, Kevin Seppi, and Leah Findlater. 2018. Closing the loop: User-centered design and evaluation of a human-in-the-loop topic modeling system. In 23rd International Conference on Intelligent User Interfaces. 293--304.
[63]
Yuguo Tao, Feng Zhang, Chunyun Shi, and Yun Chen. 2019. Social Media Data-Based Sentiment Analysis of Tourists' Air Quality Perceptions. Sustainability, Vol. 11, 18 (2019), 5070.
[64]
Hiro Y Toda and Taku Yamamoto. 1995. Statistical inference in vector autoregressions with possibly integrated processes. Journal of econometrics, Vol. 66, 1--2 (1995), 225--250.
[65]
Alexandre Trilla and Francesc Alias. 2012. Sentiment analysis of Twitter messages based on multinomial Naive Bayes. Comput. Surv, Vol. 34 (2012), 1--47.
[66]
Rachana Vidhi and Prasanna Shrivastava. 2018. A review of electric vehicle lifecycle emissions and policy recommendations to increase EV penetration in India. Energies, Vol. 11, 3 (2018), 483.
[67]
Yang Wang, Shi Feng, Daling Wang, Yifei Zhang, and Ge Yu. 2016. Context-Aware Chinese Microblog Sentiment Classification with Bidirectional LSTM. In Web Technologies and Applications, Feifei Li, Kyuseok Shim, Kai Zheng, and Guanfeng Liu (Eds.). Springer International Publishing, Cham, 594--606.
[68]
Xiaojing Zhang, Pawel Wargocki, Zhiwei Lian, and Camilla Thyregod. 2017. Effects of exposure to carbon dioxide and bioeffluents on perceived air quality, self-assessed acute health symptoms, and cognitive performance. Indoor air, Vol. 27, 1 (2017), 47--64.

Cited By

View all
  • (2024)Understanding Delhi's Diwali Emission LoadsSSRN Electronic Journal10.2139/ssrn.5003624Online publication date: 2024
  • (2024)Air pollution perception for air quality management: a systematic review exploring research themes and future perspectivesEnvironmental Research Letters10.1088/1748-9326/ad3bd019:5(053002)Online publication date: 26-Apr-2024
  • (2023)What Is Polluting Delhi’s Air? A Review from 1990 to 2022Sustainability10.3390/su1505420915:5(4209)Online publication date: 26-Feb-2023
  • Show More Cited By

Index Terms

  1. Vartalaap: What Drives #AirQuality Discussions: Politics, Pollution or Pseudo-science?

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the ACM on Human-Computer Interaction
    Proceedings of the ACM on Human-Computer Interaction  Volume 5, Issue CSCW1
    CSCW
    April 2021
    5016 pages
    EISSN:2573-0142
    DOI:10.1145/3460939
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 April 2021
    Published in PACMHCI Volume 5, Issue CSCW1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. air pollution
    2. perception
    3. social media

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)50
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Understanding Delhi's Diwali Emission LoadsSSRN Electronic Journal10.2139/ssrn.5003624Online publication date: 2024
    • (2024)Air pollution perception for air quality management: a systematic review exploring research themes and future perspectivesEnvironmental Research Letters10.1088/1748-9326/ad3bd019:5(053002)Online publication date: 26-Apr-2024
    • (2023)What Is Polluting Delhi’s Air? A Review from 1990 to 2022Sustainability10.3390/su1505420915:5(4209)Online publication date: 26-Feb-2023
    • (2023)Plugging the ambient air monitoring gaps in India's national clean air programme (NCAP) airshedsAtmospheric Environment10.1016/j.atmosenv.2023.119712301(119712)Online publication date: May-2023
    • (2022)Samachar: Print News Media on Air Pollution in IndiaProceedings of the 5th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies10.1145/3530190.3534812(401-413)Online publication date: 29-Jun-2022

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media