[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3366423.3380019acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Twitter User Location Inference Based on Representation Learning and Label Propagation

Published: 19 April 2020 Publication History

Abstract

Social network user location inference technology has been widely used in various geospatial applications like public health monitoring and local advertising recommendation. Due to insufficient consideration of relationships between users and location indicative words, most of existing inference methods estimate label propagation probabilities solely based on statistical features, resulting in large location inference error. In this paper, a Twitter user location inference method based on representation learning and label propagation is proposed. Firstly, the heterogeneous connection relation graph is constructed based on relationships between Twitter users and relationships between users and location indicative words, and relationships unrelated to geographic attributes are filtered. Then, vector representations of users are learnt from the connection relation graph. Finally, label propagation probabilities between adjacent users are calculated based on vector representations, and the locations of unknown users are predicted through iterative label propagation. Experiments on two representative Twitter datasets - GeoText and TwUs, show that the proposed method can accurately calculate label propagation probabilities based on vector representations and improve the accuracy of location inference. Compared with existing typical Twitter user location inference methods - GCN and MLP-TXT+NET, the median error distance of the proposed method is reduced by 18% and 16%, respectively.

References

[1]
Amr Ahmed, Liangjie Hong, and Alexander J. Smola. 2013. Hierarchical Geographical Modeling of User Locations from Social Media Posts. In Proceedings of the 22Nd International Conference on World Wide Web(WWW ’13). ACM, New York, NY, USA, 25–36. https://doi.org/10.1145/2488388.2488392
[2]
Lars Backstrom, Eric Sun, and Cameron Marlow. 2010. Find Me if You Can: Improving Geographical Prediction with Social and Spatial Proximity. In Proceedings of the 19th International Conference on World Wide Web(WWW ’10). ACM, New York, NY, USA, 61–70. https://doi.org/10.1145/1772690.1772698
[3]
Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2010. You Are Where You Tweet: A Content-based Approach to Geo-locating Twitter Users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management(CIKM ’10). ACM, New York, NY, USA, 759–768. https://doi.org/10.1145/1871437.1871535
[4]
Ryan Compton, David Jurgens, and David Allen. 2014. Geotagging one hundred million Twitter accounts with total variation minimization. In 2014 IEEE International Conference on Big Data (Big Data). 393–401. https://doi.org/10.1109/BigData.2014.7004256
[5]
Clodoveu Davis Jr, Gisele Pappa, Diogo Rennó Rocha de Oliveira, and Filipe Arcanjo. 2011. Inferring the Location of Twitter Messages Based on User Relationships. Transactions on GIS 15 (12 2011), 735–751. Issue 6. https://doi.org/10.1111/j.1467-9671.2011.01297.x
[6]
Mohammad Ebrahimi, Elaheh ShafieiBavani, Raymond Wong, and Fang Chen. 2018. Twitter user geolocation by filtering of highly mentioned users. Journal of the Association for Information Science and Technology 69 (02 2018), 879–889. Issue 7. https://doi.org/10.1002/asi.24011
[7]
Jacob Eisenstein, Brendan O’Connor, Noah A. Smith, and Eric P. Xing. 2010. A Latent Variable Model for Geographic Lexical Variation. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing(EMNLP ’10). Association for Computational Linguistics, Stroudsburg, PA, USA, 1277–1287. https://doi.org/10.5555/1870658.1870782
[8]
Bo Han, Paul Cook, and Timothy Baldwin. 2012. Geolocation Prediction in Social Media Data by Finding Location Indicative Words. 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers, 1045–1062.
[9]
David Jurgens. 2013. That’s What Friends Are For: Inferring Location in Online Social Media Platforms Based on Social Relationships. https://doi.org/6067
[10]
Jeffrey McGee, James Caverlee, and Zhiyuan Cheng. 2013. Location Prediction in Social Media Based on Tie Strength. In Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management(CIKM ’13). ACM, New York, NY, USA, 459–468. https://doi.org/10.1145/2505515.2505544
[11]
Tomas Mikolov, G.s Corrado, Kai Chen, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In Proceedings of the International Conference on Learning Representations(ICLR’13). 1–12.
[12]
Afshin Rahimi, Trevor Cohn, and Timothy Baldwin. 2015. Twitter User Geolocation Using a Unified Text and Network Prediction Model. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics - 7th International Joint Conference on Natural Language Processing(NAACL-HLT ’15). 630–636. https://doi.org/10.3115/v1/P15-2104
[13]
Afshin Rahimi, Trevor Cohn, and Timothy Baldwin. 2017. A Neural Model for User Geolocation and Lexical Dialectology. In Proceedings of ACL-2017 (short papers) preprint. Association for Computational Linguistics, Vancouver, Canada.
[14]
Afshin Rahimi, Trevor Cohn, and Timothy Baldwin. 2018. Semi-supervised User Geolocation via Graph Convolutional Networks. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, 2009–2019. https://doi.org/10.18653/v1/P18-1187
[15]
Afshin Rahimi, Duy Vu, Trevor Cohn, and Timothy Baldwin. 2015. Exploiting Text and Network Context for Geolocation of Social Media Users. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Denver, Colorado, 1362–1367. https://doi.org/10.3115/v1/N15-1153
[16]
Stephen Roller, Michael Speriosu, Sarat Rallapalli, Benjamin Wing, and Jason Baldridge. 2012. Supervised Text-based Geolocation Using Language Models on an Adaptive Grid. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning(EMNLP-CoNLL ’12). Association for Computational Linguistics, Stroudsburg, PA, USA, 1500–1510. https://doi.org/10.5555/2390948.2391120
[17]
KyoungMin Ryoo and Sue Moon. 2014. Inferring Twitter User Locations with 10 Km Accuracy. In Proceedings of the 23rd International Conference on World Wide Web(WWW ’14 Companion). ACM, New York, NY, USA, 643–648. https://doi.org/10.1145/2567948.2579236
[18]
Partha Pratim Talukdar and Koby Crammer. 2009. New Regularized Algorithms for Transductive Learning. In Machine Learning and Knowledge Discovery in Databases, Wray Buntine, Marko Grobelnik, Dunja Mladenić, and John Shawe-Taylor (Eds.). Springer, Berlin, Heidelberg, Berlin, Heidelberg, 442–457. https://doi.org/10.1007/978-3-642-04174-7_29
[19]
Haina Tang, Xiangpeng Zhao, and Yongmao Ren. 2019. A multilayer recognition model for twitter user geolocation. Wireless Networks (09 01 2019). https://doi.org/10.1007/s11276-018-01897-1
[20]
Benjamin Wing and Jason Baldridge. 2014. Hierarchical Discriminative Classification for Text-Based Geolocation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, 336–348. https://doi.org/10.3115/v1/D14-1039
[21]
Xin Zheng, Jialong Han, and Aixin Sun. 2018. A Survey of Location Prediction on Twitter. IEEE Transactions on Knowledge and Data Engineering 30 (09 2018), 1652–1671. Issue 9. https://doi.org/10.1109/TKDE.2018.2807840
[22]
Xiaojin Zhu and Zoubin Ghahramani. 2003. Learning from Labeled and Unlabeled Data with Label Propagation. (07 2003). https://doi.org/10.1007/978-3-540-28649-3_29

Cited By

View all
  • (2024)Effectiveness of the Geographical Specificity Index in Document Geolocation文書ジオロケーション課題における地理的特定性指標の有効性評価Journal of Natural Language Processing10.5715/jnlp.31.166531:4(1665-1690)Online publication date: 2024
  • (2024)A Privacy-Preserving Computation Framework for Multisource Label Propagation ServicesIEEE Transactions on Services Computing10.1109/TSC.2024.3486196(1-14)Online publication date: 2024
  • (2024)Online Social Network User Home Location Inference Based on Heterogeneous NetworksIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2024.337637221:6(5509-5525)Online publication date: Nov-2024
  • Show More Cited By

Index Terms

  1. Twitter User Location Inference Based on Representation Learning and Label Propagation
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          WWW '20: Proceedings of The Web Conference 2020
          April 2020
          3143 pages
          ISBN:9781450370233
          DOI:10.1145/3366423
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 19 April 2020

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Label Propagation
          2. Location Inference
          3. Representation Learning
          4. Social Network Analysis

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          WWW '20
          Sponsor:
          WWW '20: The Web Conference 2020
          April 20 - 24, 2020
          Taipei, Taiwan

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)19
          • Downloads (Last 6 weeks)4
          Reflects downloads up to 06 Jan 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Effectiveness of the Geographical Specificity Index in Document Geolocation文書ジオロケーション課題における地理的特定性指標の有効性評価Journal of Natural Language Processing10.5715/jnlp.31.166531:4(1665-1690)Online publication date: 2024
          • (2024)A Privacy-Preserving Computation Framework for Multisource Label Propagation ServicesIEEE Transactions on Services Computing10.1109/TSC.2024.3486196(1-14)Online publication date: 2024
          • (2024)Online Social Network User Home Location Inference Based on Heterogeneous NetworksIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2024.337637221:6(5509-5525)Online publication date: Nov-2024
          • (2024)Breaking Agenda Setting Boundaries: A Multi-Dimensional Approach to Understanding Salience of Gun Control in the Polarized Public SphereMass Communication and Society10.1080/15205436.2024.2333972(1-24)Online publication date: 16-Apr-2024
          • (2024)Enhancing geospatial prediction models with feature engineering from road networks: a graph-driven approachInternational Journal of Geographical Information Science10.1080/13658816.2024.234874038:8(1611-1632)Online publication date: 6-May-2024
          • (2024)Social Media User Geolocation Based on Large Language ModelsData Security and Privacy Protection10.1007/978-981-97-8540-7_19(304-312)Online publication date: 18-Oct-2024
          • (2023)Image Geo-Site Estimation Using Convolutional Auto-Encoder and Multi-Label Support Vector MachineInformation10.3390/info1401002914:1(29)Online publication date: 3-Jan-2023
          • (2023)Self-Supervised Representation Learning for Geographical Data—A Systematic Literature ReviewISPRS International Journal of Geo-Information10.3390/ijgi1202006412:2(64)Online publication date: 12-Feb-2023
          • (2023)UGCC: Social Media User Geolocation via Cyclic CouplingIEEE Transactions on Big Data10.1109/TBDATA.2023.32429619:4(1128-1141)Online publication date: 1-Aug-2023
          • (2023)Twitter user geolocation based on heterogeneous relationship modeling and representation learningInformation Sciences10.1016/j.ins.2023.119427647(119427)Online publication date: Nov-2023
          • Show More Cited By

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media