[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2442810.2442816acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Mining future spatiotemporal events and their sentiment from online news articles for location-aware recommendation system

Published: 06 November 2012 Publication History

Abstract

The future-related information mining task for online web resources such as news articles and blogs has been getting more attention due to its potential usefulness in supporting individual's decision making in a world where massive new data are generated daily. Instead of building a data-driven model to predict the future, one extracts future events from these massive data with high probability that they occur at a future time and a specific geographic location. Such spatiotemporal future events can be utilized by a recommender system on a location-aware device to provide localized future event suggestions.
In this paper, we describe a systematic approach for mining future spatiotemporal events from web; in particular, news articles. In our application context, a valid event is defined both spatially and temporally. The mining procedure consists of two main steps: recognition and matching. For the recognition step, we identify and resolve toponyms (geographic location) and future temporal patterns. In the matching step, we perform spatiotemporal disambiguation, de-duplication, and pairing. To provide more useful future event guidance, we attach to each event a sentiment linguistic variable: positive, negative, or neutral, so that one may use these extracted event information for recommendation purposes in the form of "avoid Event A" or "avoid geographic location L at time T" or "attend Event B" based on the event sentiment. The identified future event consists of its geographic location, temporal pattern, sentiment variable, news title, key phrase, and news article URL. Experimental results on 3652 news articles from 21 online new sources collected over a 2-week period in the Greater Washington area are used to illustrate some of the critical steps in our mining procedure.

References

[1]
E. Amitay, N. Har'El, R. Sivan, and A. Soffer. Web-a-where: geotagging web content. In SIGIR, pages 273--280. ACM, 2004.
[2]
W. G. Aref and H. Samet. Efficient processing of window queries in the pyramid data structure. In Proceedings of the 9th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), pages 265--272, Nashville, TN, 1990.
[3]
R. Baeza-Yates. Challenges in the interaction of natural language processing and information retrieval. In CICLING 2004, volume 2945 of LNCS, pages 445--456. Springer, 2004.
[4]
R. Baeza-Yates. Searching the future. In ACM SIGIR Workshop MF/IR, 2005.
[5]
D. M. Blei and J. D. McAuliffe. Supervised topic models. In NIPS. MIT Press, 2007.
[6]
T. Brants, F. Chen, and A. Farahat. A system for new event detection. In SIGIR, pages 330--337. ACM, 2003.
[7]
H. Cui, V. O. Mittal, and M. Datar. Comparative experiments on sentiment classification for online product reviews. In AAAI. AAAI Press, 2006.
[8]
X. Ding, B. Liu, and P. S. Yu. A holistic lexicon-based approach to opinion mining. In WSDM, pages 231--240, 2008.
[9]
J. Eisenstein, B. O'Connor, N. A. Smith, and E. P. Xing. A latent variable model for geographic lexical variation. In EMNLP, pages 1277--1287. ACL, 2010.
[10]
M. Gao, X.-S. Hua, and R. Jain. Wonderwhat: real-time event determination from photos. In WWW (Companion Volume), pages 37--38, 2011.
[11]
A. Jatowt, K. Kanazawa, S. Oyama, and K. Tanaka. Supporting analysis of future-related information in news archives and the web. In JCDL, pages 115--124. ACM, 2009.
[12]
A. Jatowt, H. Kawai, K. Kanazawa, K. Tanaka, K. Kunieda, and K. Yamada. Analyzing collective view of future, time-referenced events on the web. In WWW, pages 1123--1124. ACM, 2010.
[13]
J. J. Levandoski, M. Sarwat, A. Eldawy, and M. F. Mokbel. Lars: A location-aware recommender system. In A. Kementsietsidis and M. A. V. Salles, editors, ICDE, pages 450--461. IEEE Computer Society, 2012.
[14]
M. D. Lieberman and H. Samet. Multifaceted toponym recognition for streaming news. In Proceedings of the 34th International Conference on Research and Development in Information Retrieval (SIGIR'11), pages 843--852, 2011.
[15]
M. D. Lieberman and H. Samet. Adaptive context features for toponym resolution in streaming news. In Proceedings of the 35th International Conference on Research and Development in Information Retrieval (SIGIR'12), pages 731--740, 2012.
[16]
M. D. Lieberman, H. Samet, and J. Sankaranarayanan. Geotagging: Using proximity, sibling, and prominence clues to understand comma groups. In Proceedings of 6th Workshop on Geographic Information Retrieval, 2010.
[17]
M. D. Lieberman, H. Samet, and J. Sankaranarayanan. Geotagging with local lexicons to build indexes for textually-specified spatial data. In ICDE, pages 201--212. IEEE, 2010.
[18]
X. Ling and D. S. Weld. Temporal information extraction. In AAAI. AAAI Press, 2010.
[19]
I. Mani and D. G. Wilson. Robust temporal processing of news. In ACL, 2000.
[20]
Q. Mei and C. Zhai. A mixture model for contextual text mining. In KDD, pages 649--655, 2006.
[21]
E. Minkov, B. Charrow, J. Ledlie, S. J. Teller, and T. Jaakkola. Collaborative future event recommendation. In CIKM, pages 819--828. ACM, 2010.
[22]
D. Newman, C. Chemudugunta, and P. Smyth. Statistical entity-topic models. In KDD, pages 680--686, 2006.
[23]
A. Pak and P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. In LREC. European Language Resources Association, 2010.
[24]
B. Pang and L. Lee. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1-2):1--135, 2007.
[25]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? sentiment classification using machine learning techniques. In EMNLP, 2002.
[26]
A.-M. Popescu, M. Pennacchiotti, and D. Paranjpe. Extracting events and event descriptions from twitter. In WWW (Companion Volume), pages 105--106, 2011.
[27]
M. Porter. An algorithm for suffix stripping. Program, 14(3):130--137, 1980.
[28]
G. Quercini, H. Samet, J. Sankaranarayanan, and M. D. Lieberman. Determining the spatial reader scopes of news sources using local lexicons. In Proceedings of the 18th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pages 43--52, 2010.
[29]
H. Samet, M. D. Adelfio, B. C. Fruin, M. D. Lieberman, and B. E. Teitler. Porting a web-based mapping application to a smartphone app. In Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pages 525--528, 2011.
[30]
H. Samet, B. E. Teitler, M. D. Adelfio, and M. D. Lieberman. Adapting a map query interface for a gesturing touch screen interface. In Proceedings of the Twentieth International Word Wide Web Conference (Companion Volume), pages 257--260, 2011.
[31]
J. Sankaranarayanan, H. Samet, B. E. Teitler, M. D. Lieberman, and J. Sperling. Twitterstand: news in tweets. In GIS, pages 42--51. ACM, 2009.
[32]
Y. Takeuchi and M. Sugimoto. Cityvoyager: An outdoor recommendation system based on user location history. In UIC, volume 4159 of Lecture Notes in Computer Science, pages 625--636. Springer, 2006.
[33]
B. E. Teitler, M. D. Lieberman, D. Panozzo, J. Sankaranarayanan, H. Samet, and J. Sperling. Newsstand: a new view on news. In GIS, page 18. ACM, 2008.
[34]
V. N. Vapnik. The nature of statistical learning theory. Springer, 2nd edition, 2000.
[35]
P. Venetis, H. Gonzalez, C. S. Jensen, and A. Y. Halevy. Hyper-local, directions-based ranking of places. PVLDB, 4(5):290--301, 2011.
[36]
M. Verhagen, I. Mani, R. Sauri, J. Littman, R. Knippen, S. B. Jang, A. Rumshisky, J. Phillips, and J. Pustejovsky. Automating temporal annotation with tarsqi. In ACL. The Association for Computer Linguistics, 2005.
[37]
C. Wang, J. Wang, X. Xie, and W.-Y. Ma. Mining geographic knowledge using location aware topic model. In GIR, pages 65--70. ACM, 2007.
[38]
Y. Wang, B. Yang, S. Zoupanos, M. Spaniol, and G. Weikum. Scalable spatio-temporal knowledge harvesting. In WWW (Companion Volume), pages 143--144, 2011.
[39]
R. Yan, L. Kong, Y. Li, Y. Zhang, and X. Li. A fine-grained digestion of news webpages through event snippet extraction. In WWW (Companion Volume), pages 157--158, 2011.
[40]
M. Ye, R. Xiao, W.-C. Lee, and X. Xie. Location relevance classification for travelogue digests. In WWW (Companion Volume), pages 163--164, 2011.
[41]
M. Ye, P. Yin, and W.-C. Lee. Location recommendation for location-based social networks. In GIS, pages 458--461. ACM, 2010.

Cited By

View all
  • (2023)A Survey of Event Detection Techniques in Intelligent IoT System2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT56998.2023.10307841(1-6)Online publication date: 6-Jul-2023
  • (2023)A survey on event detection approaches for sensor based IoTInternet of Things10.1016/j.iot.2023.10072022(100720)Online publication date: Jul-2023
  • (2023)Data Science, Machine learning and big data in Digital Journalism: A survey of state-of-the-art, challenges and opportunitiesExpert Systems with Applications10.1016/j.eswa.2023.119795221(119795)Online publication date: Jul-2023
  • Show More Cited By

Index Terms

  1. Mining future spatiotemporal events and their sentiment from online news articles for location-aware recommendation system

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        MobiGIS '12: Proceedings of the First ACM SIGSPATIAL International Workshop on Mobile Geographic Information Systems
        November 2012
        112 pages
        ISBN:9781450316996
        DOI:10.1145/2442810
        • Conference Chair:
        • Chi-Yin Chow
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        In-Cooperation

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 06 November 2012

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. event extraction
        2. sentiment classification
        3. supervised latent dirichlet allocation
        4. support vector machine
        5. temporal pattern
        6. toponym recognition and resolution

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        SIGSPATIAL'12
        Sponsor:

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)6
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 01 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)A Survey of Event Detection Techniques in Intelligent IoT System2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT56998.2023.10307841(1-6)Online publication date: 6-Jul-2023
        • (2023)A survey on event detection approaches for sensor based IoTInternet of Things10.1016/j.iot.2023.10072022(100720)Online publication date: Jul-2023
        • (2023)Data Science, Machine learning and big data in Digital Journalism: A survey of state-of-the-art, challenges and opportunitiesExpert Systems with Applications10.1016/j.eswa.2023.119795221(119795)Online publication date: Jul-2023
        • (2020)Spatiotemporal event detection: a reviewInternational Journal of Digital Earth10.1080/17538947.2020.173856913:12(1339-1365)Online publication date: 9-Mar-2020
        • (2019)Using Opinion Mining in Context-Aware Recommender Systems: A Systematic ReviewInformation10.3390/info1002004210:2(42)Online publication date: 28-Jan-2019
        • (2018)Recommendations based on a heterogeneous spatio-temporal social networkWorld Wide Web10.1007/s11280-017-0454-021:2(345-371)Online publication date: 1-Mar-2018
        • (2017)Smart cities, urban sensing, and big data: mining geo-location in social networksBig Data and Smart Service Systems10.1016/B978-0-12-812013-2.00005-8(59-84)Online publication date: 2017
        • (2016)Sorting in spaceSIGGRAPH ASIA 2016 Courses10.1145/2988458.2988503(1-42)Online publication date: 28-Nov-2016
        • (2016)A Graph-Based Taxonomy of Recommendation Algorithms and Systems in LBSNsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2015.249634428:3(604-622)Online publication date: 1-Mar-2016
        • (2016)Contrasting Public Opinion Dynamics and Emotional Response During CrisisSocial Informatics10.1007/978-3-319-47880-7_19(312-329)Online publication date: 23-Oct-2016
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media