[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A Deep Content-Based Model for Persian Rumor Verification

Published: 29 November 2021 Publication History

Abstract

During the development of social media, there has been a transformation in social communication. Despite their positive applications in social interactions and news spread, it also provides an ideal platform for spreading rumors. Rumors can endanger the security of society in normal or critical situations. Therefore, it is important to detect and verify the rumors in the early stage of their spreading. Many research works have focused on social attributes in the social network to solve the problem of rumor detection and verification, while less attention has been paid to content features. The social and structural features of rumors develop over time and are not available in the early stage of rumor. Therefore, this study presented a content-based model to verify the Persian rumors on Twitter and Telegram early. The proposed model demonstrates the important role of content in spreading rumors and generates a better-integrated representation for each source rumor document by fusing its semantic, pragmatic, and syntactic information. First, contextual word embeddings of the source rumor are generated by a hybrid model based on ParsBERT and parallel CapsNets. Then, pragmatic and syntactic features of the rumor are extracted and concatenated with embeddings to capture the rich information for rumor verification. Experimental results on real-world datasets demonstrated that the proposed model significantly outperforms the state-of-the-art models in the early rumor verification task. Also, it can enhance the performance of the classifier from 2% to 11% on Twitter and from 5% to 23% on Telegram. These results validate the model's effectiveness when limited content information is available.

References

[1]
A. Zubiaga, A. Aker, K. Bontcheva, M. Liakata, R. Procter, M. Liakata, and R. Procter. 2018. Detection and resolution of rumours in social media: A survey. ACM Comput. Surv. 51 (2018), 32.
[2]
T. Mikolov, K. Chen, G. Corrado, and J. Dean. 2013. Efficient estimation of word representations in vector space. In Proceedings of the 1st International Conference on Learning and Representation (ICLR’13) - Work. Track Proc., 2013. http://ronan.collobert.com/senna/. (accessed December 29, 2019).
[3]
J. Pennington, R. Socher, and C. Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods of the Natural Language Processors. Association for Computational Linguistics, Stroudsburg, PA, 2014, pp. 1532–1543.
[4]
P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov. 2017. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5 (2017), 135–146.
[5]
C. Castillo, M. Mendoza, and B. Poblete. 2011. Information credibility on twitter. In Proceedings of the 20th International Conference on the World Wide Web (WWW’11). ACM Press, New York, 2011, p. 675.
[6]
F. Yang, Y. Liu, X. Yu, and M. Yang. 2012. Automatic detection of rumor on Sina Weibo. In Proceedings of the ACM SIGKDD Workshop on the Minimum Data Semantics (MDS’12). ACM Press, New York, 2012, pp. 1–7.
[7]
A. Y. K. Chua and S. Banerjee. 2016. Linguistic predictors of rumor veracity on the Internet. In Proceedings of the 24th International MultiConference of Engineers and Computer Scientists (IMECS'16), Vol I. 387--391. http://www.iaeng.org/IMECS2016/.
[8]
Y. R. Tausczik and J. W. Pennebaker. 2010. The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 29 (2010), 24–54.
[9]
S. Kwon, M. Cha, K. Jung, W. Chen, and Y. Wang. 2013. Prominent features of rumor propagation in online social media. In Proceedings of the 2013 IEEE 13th International Conference on Data Mining (ICDM’13). IEEE, 2013, pp. 1103–1108.
[10]
K. Wu, S. Yang, and K. Q. Zhu. 2015. False rumors detection on Sina Weibo by propagation structures. In Proceedings of the 2015 IEEE 31st International Conference on Data Engineering (ICDE’15). IEEE, 2015, pp. 651–662.
[11]
Z. Yong, H. Yao, and Y. Wu. 2018. Rumors detection in Sina Weibo based on text and user characteristics. In Proceedings of the 2018 2nd IEEE Advances in Information and Managing Automated Control Conference (IMCEC’18). IEEE, 2018, pp. 1380–1386.
[12]
D. Lin, Y. Lv, and D. Cao. 2012. Rumor diffusion purpose analysis from social attribute to social content. In Proceedings of the 2015 International Conference on Asian Language Processes (IALP’15), IEEE, 2015, pp. 107–110.
[13]
Z. Zhao, P. Resnick, and Q. Mei. 2015. Enquiring minds. In Proceedings of the 24th International Conference on the World Wide Web (WWW’15). ACM, New York, 2015, pp. 1395–1405.
[14]
S. Vosoughi. 2015. Automatic detection and verification of rumors on Twitter. https://dspace.mit.edu/handle/1721.1/98553.
[15]
V. Qazvinian, E. Rosengren, D. R. Radev, and Q. Mei. 2011. Rumor has it: Identifying misinformation in microblogs, In Proc. Conf. Empir. Methods Nat. Lang. Process., Association for Computational Linguistics. 1589--1599.
[16]
S. Hamidian and M. Diab. 2016. Rumor identification and belief investigation on Twitter. In Proceedings of the 7th Annual Workshop on Computer Approaches to Subjective Sentiment and Social Media Analysis. Association for Computational Linguistics, Stroudsburg, PA, 2016, pp. 3–8.
[17]
A. Y. M. Floos, A. Yahya, and M. Floos. 2016. Arabic rumours identification by measuring the credibility of Arabic tweet content. Int. J. Knowl. Soc. Res. 7 (2016), 72–83.
[18]
R. Moin, ZU. Rehman, K. Mahmood, M. E. Alzahrani, and M. Q. Saleem. 2018. Framework for rumors detection in social media. Int. J. Adv. Comput. Sci. Appl. 9 (2018), 439--444. https://doi.org/10.14569/IJACSA.2018.090557
[19]
E. Kochkina, M. Liakata, and A. Zubiaga. 2018. All-in-one: Multi-task learning for rumour verification. In Proceedings of the 27th International Conference on Computer Linguistics (2018), pp. 3402–3413.
[20]
A. Zubiaga, M. Liakata, and R. Procter. 2017. Exploiting context for rumour detection in social media. In Proceedings of the International Conference on Social Informatics. Springer, 2017, pp. 109–123.
[21]
A. Kumar, S. R. Sangwan, and A. Nayyar. 2019. Rumour veracity detection on Twitter using particle swarm optimized shallow classifiers. Multimed. Tools Appl. 78 (2019), 24083–24101.
[22]
A. Dang, A. Moh'd, A. Islam, and E. Milios. 2019. Early detection of rumor veracity in social media. In Proceedings of the 52nd Hawaii International Conference on System Science, 2019.
[23]
S. Zamani, M. Asadpour, and D. Moazzami. 2017. Rumor detection for Persian Tweets. In Proceedings of the 2017 25th Iranian Conference on Electrical Engineering (ICEE’17), 2017, pp. 1532–1536.
[24]
S. D. Mahmoodabad, S. Farzi, and D. B. Bakhtiarvand. 2018. Persian rumor detection on Twitter. In Proceedings of the 9th International Symposium on Telecommunications (IST'18). 597--602. DOI:https://doi.org/10.1109/ISTEL.2018.8661007
[25]
M. Zarharan, S. Ahangar, F. S. Rezvaninejad, M. L. Bidhendi, S. S. Jalali, S. Eetemadi, M. T. Pilehvar, and B. Minaei-Bidgoli, Persian Stance Classification Dataset, (n.d.).
[26]
Z. Jahanbakhsh-Nagadeh, M.-R. Feizi-Derakhshi, and A. Sharifi. 2020. A speech act classifier for Persian texts and its application in identifying rumors. J. Soft Comput. Inf. Technol. (JSCIT) Vol. 9 (2020).
[27]
Z. Jahanbakhsh-Nagadeh, M.-R. Feizi-Derakhshi, M. Ramezani, T. Rahkar-Farshi, M. Asgari-Chenaghlu, N. Nikzad-Khasmakhi, A.-R. Feizi-Derakhshi, M. Ranjbar-Khadivi, E. Zafarani-Moattar, and M.-A. Balafar. 2020. A model to measure the spread power of rumors. ArXiv Prepr. ArXiv2002.07563. (2020).
[28]
C. Yuan, Q. Ma, W. Zhou, J. Han, and S. Hu. 2019. Jointly embedding the local and global relations of heterogeneous graph for rumor detection. In Proceedings of the 2019 IEEE International Conference on Data Mining. IEEE, 2019, pp. 796–805.
[29]
V. Qazvinian, E. Rosengren, D. R. Radev, and Q. Mei. 2011. Rumor has it: Identifying misinformation in microblogs. In Proceedings of the Conference on Empirical Methods, Natural Language Processes. Association for Computational Linguistics, 2011, pp. 1589–1599.
[30]
S. Vosoughi. 2015. Automatic detection and verification of rumors on Twitter, (2015). https://dspace.mit.edu/handle/1721.1/98553.
[31]
S. Zamani, M. Asadpour, and D. Moazzami. 2017. Rumor detection for Persian Tweets. In Proceedings of the 2017 Iranian Conference on Electrical Engineering. IEEE, 2017, pp. 1532–1536.
[32]
S. D. Mahmoodabad, S. Farzi, and D. B. Bakhtiarvand. 2018. Persian rumor detection on Twitter. In Proceedings of the 2018 9th International Symposium on Telecommunications. IEEE, 2018, pp. 597–602.
[33]
J. Ma, W. Gao, P. Mitra, S. Kwon, B. J. Jansen, K.-F. Wong, and M. Cha. 2016. Detecting rumors from microblogs with recurrent neural networks. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI'16). AAAI Press, 3818--3824.
[34]
F. Yu, Q. Liu, S. Wu, L. Wang, and T. Tan. 2017. A convolutional approach for misinformation identification. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 3901--3907. https://doi.org/10.24963/ijcai.2017/545
[35]
L. Poddar, W. Hsu, M. L. Lee, and S. Subramaniyam. 2018. Predicting stances in Twitter conversations for detecting veracity of rumors: A neural approach. In Proceedings of the 2018 IEEE 30th International Conference Tools with Artificial Intelligence. IEEE, 2018, pp. 65–72.
[36]
L. Li, G. Cai, and N. Chen. 2018. A rumor events detection method based on deep bidirectional GRU neural network. In Proceedings of the 2018 3rd IEEE International Conference on Image and Visual Computing (ICIVC’18). IEEE, 2018, pp. 755–759.
[37]
T. Chen, X. Li, H. Yin, and J. Zhang. 2018. Call attention to rumors: Deep attention based recurrent neural networks for early rumor detection. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining Springer, 2018, pp. 40–52.
[38]
F. Qian, C. Gong, K. Sharma, and Y. Liu. 2018. Neural user response generator: Fake news detection with collective user intelligence. In Proceedings of IJCAI, 2018, pp. 3834–3840.
[39]
B. Ma, D. Lin, and D. Cao. 2017. Content representation for microblog rumor detection. In Adv. Intell. Syst. Comput. Springer-Verlag, 2017, pp. 245–251.
[40]
Y. Geng, Z. Lin, P. Fu, and W. Wang. 2019. Rumor detection on social media: A multi-view model using self-attention mechanism. In Lecture Notes in Computer Science (Including the Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. Springer-Verlag, 2019, pp. 339–352.
[41]
S. Singhal, R. R. Shah, T. Chakraborty, P. Kumaraguru, and S. Satoh. 2019. SpotFake: A multi-modal framework for fake news detection. In Proceedings of the 2019 IEEE 5th International Conference on Multimedia and Big Data (BigMM’19). IEEE, 2019, pp. 39–47.
[42]
K. Zhou, C. Shu, B. Li, and J. H. Lau. 2019. Early rumour detection. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL'19). 1614–1623.
[43]
S. Lotfi, M. Mirzarezaee, M. Hosseinzadeh, and V. Seydi. 2021. Detection of rumor conversations in Twitter using graph convolutional networks. Appl. Intell. (2021), 1–14.
[44]
M. H. Goldani, S. Momtazi, and R. Safabakhsh. 2021. Detecting fake news with capsule neural networks. Appl. Soft Comput. 101 (2021), 106991.
[45]
S. Sridhar and S. Sanagavarapu. 2021. Fake news detection and analysis using multitask learning with BiLSTM CapsNet model. In Proceedings of the 2021 11th International Conference on Cloud Computing and Data Science and Engineering. IEEE, 2021, pp. 905–911.
[46]
Z. Jahanbakhsh-Nagadeh, M.-R. Feizi-Derakhshi, and A. Sharifi. 2020. A semi-supervised model for Persian rumor verification based on content information. Multimed. Tools Appl. (2020), 1–29.
[47]
W. Li, P. Liu, Q. Zhang, and W. Liu. 2019. Future Internet: An improved approach for text sentiment classification based on a deep neural network via a sentiment attention mechanism. Futur. Internet. 11 (2019).
[48]
Y. Liu and Y.-F. B. Wu. 2018. Early detection of fake news on social media through propagation path classification with recurrent and convolutional networks. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018.
[49]
S. Kumar, R. Asthana, S. Upadhyay, N. Upreti, and M. Akbar. 2019. Fake news detection using deep learning models: A novel approach. Trans. Emerg. Telecommun. Technol. (2019), e3767.
[50]
S. Volkova, K. Shaffer, J. Y. Jang, and N. Hodas. 2017. Separating facts from fiction: Linguistic models to classify suspicious and trusted news posts on Twitter. In Proceedings of the 55th Annual Meeting of the Association of Computational Linguistics (Volume 2: Short Papers). 2017, pp. 647–653.
[51]
Y.-C. Chen, Z.-Y. Liu, and H.-Y. Kao. 2017. IKM at SemEval-2017 Task 8: Convolutional neural networks for stance detection and rumor verification. In Proceedings of the 11th International Workshop on Semantics Evaluation. Association for Computational Linguistics, Stroudsburg, PA, 2017, pp. 465–469.
[52]
G. Zhao and P. Yang. 2020. Pretrained embeddings for stance detection with hierarchical capsule network on social media. ACM Trans. Inf. Syst. 39 (2020), 1–32.
[53]
A. R. Feizi-Derakhshi, M. R. Feizi-Derakhshi, M. Ranjbar-Khadivi, N. Nikzad-Khasmakhi, M. Ramezani, T. Rahkar-Farshi, E. Zafarani-Moattar, M. Asgari-Changhlu, and Z. Jahan-Bakhsh. 2019. Sepehr_RumTel01, 1 (2019).
[54]
M. Farahani, M. Gharachorloo, M. Farahani, and M. Manthouri. 2020. ParsBERT: Transformer-based model for Persian language understanding. ArXiv Prepr. ArXiv2005.12515. (2020).
[55]
J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL'19). 4171--4186.
[56]
S. Sabour, N. Frosst, and G. E. Hinton. 2017. Dynamic routing between capsules. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS'17). Curran Associates Inc., Red Hook, NY, USA, 3859--3869.
[57]
H. W. Fentaw and T.-H. Kim. 2019. Design and investigation of capsule networks for sentence classification. Appl. Sci. 9 (2019), 2200.
[58]
M. Yang, W. Zhao, J. Ye, Z. Lei, Z. Zhao, and S. Zhang. 2018. Investigating capsule networks with dynamic routing for text classification. In Proceedings of the 2018 Conference on Empirical Methods of Natural Language Processes. Association for Computational Linguistics, 3110–3119.
[59]
J. Kim, S. Jang, E. Park, and S. Choi. 2020. Text classification using capsules. Neurocomputing. 376 (2020), 214–221.
[60]
K.-W. Liang, Y.-H. Tseng, and P.-C. Chang. 2019. Parallel capsule neural networks for sound event detection. In Proceedings of the 2019 Asia-Pacific Signal Information Processing Association Summit Conference (APSIPA ASC). IEEE, 2019, pp. 1933–1936.
[61]
J. L. Austin. 1975. How to Do Things with Words. Oxford University Press, 1975.
[62]
A. Koller and J. R. Searle. 1970. Speech acts: An essay in the philosophy of language. Language (Baltim). 46 (1970), 217.
[63]
J. R. Searle. 1979. A taxonomy of illocutionary acts. In Expression and Meaning: Studies in the Theory of Speech Acts. Cambridge University Press, 1--29. DOI:https://doi.org/10.1017/CBO9780511609213.003
[64]
M. Homayounpour and P. Soltani. 2010. Speech acts classification of Persian language texts using three machine learning methods. International Journal of Information and Communication Technology Research 2, 1 (2010), 65--71. https://www.sid.ir/en/journal/ViewPaper.aspx?id=208218.
[65]
L. Zhou, J. K. Burgoon, J. F. Nunamaker, and D. Twitchell. 2004. Automating linguistics-based cues for detecting deception in text-based asynchronous computer-mediated communications. Gr. Decis. Negot. 13 (2004), 81–106.
[66]
X. Zhou, A. Jain, V. V. Phoha, and R. Zafarani. 2020. Fake news early detection: A theory-driven model. Digit. Threat. Res. Pract. 1 (2020), 1–25.
[67]
G. A. Afrooz. 2011. Ravanshenasy-e shayee [Psychological Bases of Rumor], Islamic Culture Publishing Office, Tehran, 2011.
[68]
A. Bijad. 2018. Ravanshenasi-e shayee [Rumor psychology]. Vania, 2018.
[69]
O. Kashefi, M. Nasri, and K. Kanani. 2010. Automatic spell checking in Persian language. Supreme Council Inf. Commun. Technol. (SCICT), Tehran, Iran (2010).
[70]
H. Moradi, F. Ahmadi, and M.-R. Feizi-Derakhshi. 2017. A hybrid approach for Persian-named entity recognition. Iran. J. Sci. Technol. Trans. A Sci. 41 (2017), 215–222.
[71]
F. Oroumchian, S. Tasharofi, H. Amiri, H. Hojjat, and F. Raja. 2006. Creating a Feasible Corpus for Persian POS Tagging. Department of Electrical Computer Engineering. University of Tehran. (2006), 1–9. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.103.2154&rep=rep1&type=pdf.
[72]
A. Golfam, A. Afrashi, and G. Moghadam. 2014. Conceptualization of the Persian simple verbs of motion: A cognitive approach. J. Lang. West. Iran. Dialects. 1 (2014), 103–122. http://jlw.razi.ac.ir/article_275_en.html.
[73]
S. M. Mohammad and P. D. Turney. 2013. Crowdsourcing: A word-emotion association lexicon. In Computer Intelligence 2013, pp. 436–465.
[74]
Q. Jiang, L. Chen, R. Xu, X. Ao, and M. Yang. 2019. A challenge dataset and effective models for aspect-based sentiment analysis. In Proceedings of the 2019 Conference on Empirical Methods Natural Language Processing and 9th International Joint Conference on Natural Processing. 2019, pp. 6281–6286.

Cited By

View all
  • (2024)Elevating Fake News Detection Through Deep Neural Networks, Encoding Fused Multi-Modal FeaturesIEEE Access10.1109/ACCESS.2024.341192612(82146-82155)Online publication date: 2024
  • (2024)ERDeR: The Combination of Statistical Shrinkage Methods and Ensemble Approaches to Improve the Performance of Deep RegressionIEEE Access10.1109/ACCESS.2024.336806712(33361-33383)Online publication date: 2024
  • (2024)Improving stance detection accuracy in low-resource languages: a deep learning framework with ParsBERTInternational Journal of Data Science and Analytics10.1007/s41060-024-00630-wOnline publication date: 10-Sep-2024
  • Show More Cited By

Index Terms

  1. A Deep Content-Based Model for Persian Rumor Verification

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian and Low-Resource Language Information Processing
    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 1
    January 2022
    442 pages
    ISSN:2375-4699
    EISSN:2375-4702
    DOI:10.1145/3494068
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 November 2021
    Accepted: 01 September 2021
    Revised: 01 July 2021
    Received: 01 October 2020
    Published in TALLIP Volume 21, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Rumor verification
    2. ParsBERT
    3. speech act
    4. Persian rumor classification
    5. contextual features
    6. neural language model

    Qualifiers

    • Research-article
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)94
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 22 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Elevating Fake News Detection Through Deep Neural Networks, Encoding Fused Multi-Modal FeaturesIEEE Access10.1109/ACCESS.2024.341192612(82146-82155)Online publication date: 2024
    • (2024)ERDeR: The Combination of Statistical Shrinkage Methods and Ensemble Approaches to Improve the Performance of Deep RegressionIEEE Access10.1109/ACCESS.2024.336806712(33361-33383)Online publication date: 2024
    • (2024)Improving stance detection accuracy in low-resource languages: a deep learning framework with ParsBERTInternational Journal of Data Science and Analytics10.1007/s41060-024-00630-wOnline publication date: 10-Sep-2024
    • (2023)Causal diffused graph-transformer network with stacked early classification loss for efficient stream classification of rumoursKnowledge-Based Systems10.1016/j.knosys.2023.110807277:COnline publication date: 9-Oct-2023
    • (2022)ADVANCED TURKISH FAKE NEWS PREDICTION WITH BIDIRECTIONAL ENCODER REPRESENTATIONS FROM TRANSFORMERSÇift Yönlü Transformatör Kodlayıcı Temsilleriyle Gelişmiş Türkçe Sahte Haber TahminiKonya Journal of Engineering Sciences10.36306/konjes.99506010:3(750-761)Online publication date: 1-Sep-2022

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media