[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3366423.3380298acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Mining Points-of-Interest for Explaining Urban Phenomena: A Scalable Variational Inference Approach

Published: 20 April 2020 Publication History

Abstract

Points-of-interest (POIs; i.e., restaurants, bars, landmarks, and other entities) are common in web-mined data: they greatly explain the spatial distributions of urban phenomena. The conventional modeling approach relies upon feature engineering, yet it ignores the spatial structure among POIs. In order to overcome this shortcoming, the present paper proposes a novel spatial model for explaining spatial distributions based on web-mined POIs. Our key contributions are: (1) We present a rigorous yet highly interpretable formalization in order to model the influence of POIs on a given outcome variable. Specifically, we accommodate the spatial distributions of both the outcome and POIs. In our case, this modeled by the sum of latent Gaussian processes. (2) In contrast to previous literature, our model infers the influence of POIs without feature engineering, instead we model the influence of POIs via distance-weighted kernel functions with fully learnable parameterizations. (3) We propose a scalable learning algorithm based on sparse variational approximation. For this purpose, we derive a tailored evidence lower bound (ELBO) and, for appropriate likelihoods, we even show that an analytical expression can be obtained. This allows fast and accurate computation of the ELBO. Finally, the value of our approach for web mining is demonstrated in two real-world case studies. Our findings provide substantial improvements over state-of-the-art baselines with regard to both predictive and, in particular, explanatory performance. Altogether, this yields a novel spatial model for leveraging web-mined POIs. Within the context of location-based social networks, it promises an extensive range of new insights and use cases.

References

[1]
Sudipto Banerjee, Bradley P. Carlin, and Alan E. Gelfand. 2014. Hierarchical Modeling and Analysis for Spatial Data. Chapman and Hall/CRC.
[2]
Chris Brunsdon, A. Stewart Fotheringham, and Martin E. Charlton. 2010. Geographically weighted regression: A method for exploring spatial nonstationarity. Geographical Analysis 28, 4 (2010), 281–298. https://doi.org/10.1111/j.1538-4632.1996.tb00936.x
[3]
Hsiu-Min Chuang and Chia-Hui Chang. 2015. Verification of POI and location pairs via weakly labeled web data. In International Conference on World Wide Web (WWW).
[4]
Justin Cranshaw, Raz Schwartz, Jason Hong, and Norman Sadeh. 2012. The livehoods project: Utilizing social media to understand the dynamics of a city. In International AAAI Conference on Web and Social Media (ICWSM).
[5]
Henry Crosby, Paul Davis, Theo Damoulas, and Stephen A. Jarvis. 2016. A spatio-temporal, Gaussian process regression, real-estate price predictor. In SIGSPATIAL.
[6]
Mark R. T. Dale and Marie-Josée Fortin. 2014. Spatial Analysis: A Guide for Ecologists(2nd ed.). Cambridge University Press, Cambridge. https://doi.org/10.1017/CBO9780511978913
[7]
David Darmofal. 2015. Spatial analysis for the social sciences. Cambridge University Press, New York. http://dx.doi.org/10.1017/CBO9781139051293
[8]
A. E. Gelfand. 2003. Proper multivariate conditional autoregressive models for spatial data analysis. Biostatistics 4(2003), 11–25. https://doi.org/10.1093/biostatistics/4.1.11
[9]
Andrew Gelman, John B. Carlin, Hal S. Stern, David B. Dunson, Aki Vehtari, and Donald B. Rubin. 2013. Bayesian Data Analysis. Chapman and Hall/CRC.
[10]
Jean-Benoît Griesner, Talel Abdessalem, and Hubert Naacke. 2015. POI recommendation: Towards fused matrix factorization with geographical and temporal influences. In ACM Conference on Recommender Systems (RecSys).
[11]
Mengyue Hang, Ian Pytlarz, and Jennifer Neville. 2018. Exploring student check-in behavior for improved point-of-interest prediction. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).
[12]
Jing He, Xin Li, Lejian Liao, Dandan Song, and William K. Cheung. 2016. Inferring a personalized next point-of-interest recommendation model with latent behavior patterns. In AAAI Conerence on Artifical Intelligence (AAAI).
[13]
James Hensman, Nicolas Durrande, and Arno Solin. 2017. Variational Fourier features for Gaussian processes. Journal of Machine Learning Research 18, 151 (2017), 1–52.
[14]
James Hensman, Nicoló Fusi, and Neil D. Lawrence. 2013. Gaussian processes for big data. In Conference on Uncertainty in Artificial Intelligence (UAI). 282–290.
[15]
James Hensman, Matthews, Alexander G de G, and Zoubin Ghahramani. 2015. Scalable variational Gaussian process classification. In International Conference on Artificial Intelligence and Statistics (AISTATS).
[16]
Dave Higdon. 2002. Space and space-time modeling using process convolutions. In Quantitative Methods for Current Environmental Issues. Springer, London, 37–56. https://doi.org/10.1007/978-1-4471-0657-9_2
[17]
Desislava Hristova, Matthew J. Williams, Mirco Musolesi, Pietro Panzarasa, and Cecilia Mascolo. 2016. Measuring urban social diversity using interconnected geo-social networks. In International Conference on World Wide Web (WWW).
[18]
Xu Jiao, Yingyuan Xiao, Wenguang Zheng, Hongya Wang, and Youzhi Jin. 2019. R2SIGTP: A novel real-time recommendation system with integration of geography and temporal preference for next point-of-interest. In The Web Conference (WWW).
[19]
Cristina Kadar, Stefan Feuerriegel, Anastasios Noulas, and Cecilia Mascolo. 2020. Leveraging mobility flows from location-based services to test crime pattern theory in large cities. In International AAAI Conference on Web and Social Media (ICWSM).
[20]
Cristina Kadar, Rudolf Maculan, and Stefan Feuerriegel. 2019. Public decision support for low population density areas: An imbalance-aware hyper-ensemble for spatio-temporal crime prediction. Decision Support Systems 119 (2019), 107–117.
[21]
Cristina Kadar and Irena Pletikosa. 2018. Mining large-scale human mobility data for long-term crime prediction. EPJ Data Science 7, 1 (2018), 26. https://doi.org/10.1140/epjds/s13688-018-0150-z
[22]
Dmytro Karamshuk, Anastasios Noulas, Salvatore Scellato, Vincenzo Nicosia, and Cecilia Mascolo. 2013. Geo-spotting: Mining online location-based services for optimal retail store placement. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).
[23]
Michael Kuntz and Marco Helbich. 2014. Geostatistical mapping of real estate prices: An empirical comparison of kriging and cokriging. International Journal of Geographical Information Science 28, 9(2014), 1904–1921. https://doi.org/10.1080/13658816.2014.906041
[24]
Becky P. Y. Loo and Tessa Kate Anderson. 2016. Spatial Analysis Methods of Road Traffic Crashes. CRC Press Taylor & Francis Group, Boca Raton, New York.
[25]
De G. Matthews, G. Alexander, Mark van der Wilk, Tom Nickson, Keisuke Fujii, Alexis Boukouvalas, Pablo León-Villagrá, Zoubin Ghahramani, and James Hensman. 2017. GPflow: A Gaussian process library using TensorFlow. Journal of Machine Learning Research 18, 1 (2017), 1299–1304.
[26]
Grant McKenzie, Krzysztof Janowicz, and Benjamin Adams. 2014. A weighted multi-attribute method for matching user-generated points of interest. Cartography and Geographic Information Science 41, 2 (2014), 125–137. https://doi.org/10.1080/15230406.2014.880327
[27]
Jonathan Milot, Patrick Munroe, Eric Beaudry, Francois Grondin, and Guillaume Bourdeau. 2016. Lookupia: An intelligent real estate search engine for finding houses optimally geolocated to reach points of interest. In International Conference on World Wide Web (WWW).
[28]
Budiman Minasny and Alex B. McBratney. 2007. Spatial prediction of soil properties using EBLUP with the matérn covariance function. Geoderma 140, 4 (2007), 324–336. https://doi.org/10.1016/j.geoderma.2007.04.028
[29]
Raul Montoliu, Jan Blom, and Daniel Gatica-Perez. 2013. Discovering places of interest in everyday life from smartphone data. Multimedia Tools and Applications 62, 1 (2013), 179–207. https://doi.org/10.1007/s11042-011-0982-z
[30]
Anastasios Noulas, Salvatore Scellato, Renaud Lambiotte, Massimiliano Pontil, and Cecilia Mascolo. 2012. A tale of many cities: Universal patterns in human urban mobility. PLOS ONE 7, 5 (2012), 1–10. https://doi.org/10.1371/journal.pone.0037027
[31]
Margaret A. Oliver and Richard Webster. 2015. Basic Steps in Geostatistics: The Variogram and Kriging. Springer, Cham. https://doi.org/10.1007/978-3-319-15865-5
[32]
Joaquin Quiñonero-Candela and Carl Edward Rasmussen. 2005. A unifying view of sparse approximate Gaussian process regression. Journal of Machine Learning Research 6 (2005), 1939–1959. http://dl.acm.org/citation.cfm?id=1046920.1194909
[33]
Adam Rae, Vanessa Murdock, Adrian Popescu, and Hugues Bouchard. 2012. Mining the web for points of interest. In International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).
[34]
Carl Edward Rasmussen and Christopher K. I. Williams. 2006. Gaussian Processes for Machine Learning. MIT Press, Cambridge, MA.
[35]
Edward Snelson and Zoubin Ghahramani. 2006. Sparse Gaussian processes using pseudo-inputs. In Conference and Workshop on Neural Information Processing Systems (NIPS). 1257–1264.
[36]
Kendall Taylor, Kwan Hui Lim, and Jeffrey Chan. 2018. Travel itinerary recommendations with must-see points-of-interest. In The Web Conference (WWW).
[37]
Sebastian Wagner, Tobias Brandt, and Dirk Neumann. 2016. In free float: Developing business analytics support for carsharing providers. Omega 59(2016), 4–14. https://doi.org/10.1016/j.omega.2015.02.011
[38]
Hongjian Wang, Daniel Kifer, Corina Graif, and Zhenhui Li. 2016. Crime rate inference with big data. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).
[39]
Hao Wang, Huawei Shen, Wentao Ouyang, and Xueqi Cheng. 2018. Exploiting POI-specific geographical influence for point-of-interest recommendation. In International Joint Conferences on Artificial Intelligence (ICJAI).
[40]
Lei Wang, Ram Gopal, Ramesh Shankar, and Joseph Pancras. 2015. On the brink: Predicting business failure with mobile location-based checkins. Decision Support Systems 76, C (2015), 3–13. https://doi.org/10.1016/j.dss.2015.04.010
[41]
Suhang Wang, Yilin Wang, Jiliang Tang, Kai Shu, Suhas Ranganath, and Huan Liu. 2017. What your images reveal: Exploiting visual contents for point-of-interest recommendation. In International Conference on World Wide Web (WWW).
[42]
Yingzi Wang, Xiao Zhou, Anastasios Noulas, Cecilia Mascolo, Xing Xie, and Enhong Chen. 2018. Predicting the spatio-temporal evolution of chronic diseases in population with human mobility data. In IJCAI. 3578–3584. https://doi.org/10.24963/ijcai.2018/497
[43]
Christoph Willing, Konstantin Klemmer, Tobias Brandt, and Dirk Neumann. 2017. Moving in time and space: Location intelligence for carsharing decision support. Decision Support Systems 99 (2017), 75–85.
[44]
Yang Xiao. 2017. Hedonic Housing Price Theory Review. In Urban Morphology and Housing Market. Springer, Singapore, 11–40. https://doi.org/10.1007/978-981-10-2762-8
[45]
Carl Yang, Lanxiao Bai, Chao Zhang, Quan Yuan, and Jiawei Han. 2017. Bridging collaborative filtering and semi-supervised learning. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).
[46]
Yuling Yao, Aki Vehtari, Daniel Simpson, and Andrew Gelman. 2018. Yes, but did it work?: Evaluating variational inference. In International Conference on Machine Learning (ICML). https://doi.org/10.1175/2009JCLI3025.1
[47]
Zijun Yao. 2018. Exploiting human mobility patterns for point-of-interest recommendation. In ACM International Conference on Web Search and Data Mining (WSDM). https://doi.org/10.1145/3159652.3170459
[48]
Mao Ye, Peifeng Yin, Wang-Chien Lee, and Dik-Lun Lee. 2011. Exploiting geographical influence for collaborative point-of-interest recommendation. In International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). https://doi.org/10.1145/2009916.2009962
[49]
Hongzhi Yin, Weiqing Wang, Hao Wang, Ling Chen, and Xiaofang Zhou. 2017. Spatial-aware hierarchical collaborative deep learning for POI recommendation. IEEE Transactions on Knowledge and Data Engineering 29, 11(2017), 2537–2551. https://doi.org/10.1109/TKDE.2017.2741484
[50]
Yonghong Yu and Xingguo Chen. 2015. A survey of point-of-interest recommendation in location-based social networks. In Workshops at AAAI.
[51]
Jing Yuan, Yu Zheng, and Xing Xie. 2012. Discovering regions of different functions in a city using human mobility and POIs. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). 186–194. https://doi.org/10.1145/2339530.2339561
[52]
Jia-Dong Zhang and Chi-Yin Chow. 2015. Spatiotemporal sequential influence modeling for location recommendations. ACM Transactions on Intelligent Systems and Technology 7, 1(2015), 1–25. https://doi.org/10.1145/2786761
[53]
Shenglin Zhao, Tong Zhao, Irwin King, and Michael R. Lyu. 2017. Geo-Teaser: Geo-temporal sequential embedding rank for point-of-interest recommendation. In International Conference on World Wide Web (WWW).
[54]
Ning Zheng, Xiaoming Jin, and Lianghao Li. 2013. Cross-region collaborative filtering for new point-of-interest recommendation. In International Conference on World Wide Web (WWW).
[55]
Fan Zhou, Ruiyang Yin, Kunpeng Zhang, Goce Trajcevski, Ting Zhong, and Jin Wu. 2019. Adversarial point-of-interest recommendation. In The Web Conference (WWW).

Cited By

View all
  • (2024)Housing prices and points of interest in three Polish citiesJournal of Housing and the Built Environment10.1007/s10901-024-10124-739:3(1509-1540)Online publication date: 15-May-2024
  • (2022)Web Mining to Inform Locations of Charging Stations for Electric VehiclesCompanion Proceedings of the Web Conference 202210.1145/3487553.3524264(166-170)Online publication date: 25-Apr-2022
  • (2021)A Latent Customer Flow Model for Interpretable Predictions of Check-In Counts2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671946(529-539)Online publication date: 15-Dec-2021

Index Terms

  1. Mining Points-of-Interest for Explaining Urban Phenomena: A Scalable Variational Inference Approach
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        WWW '20: Proceedings of The Web Conference 2020
        April 2020
        3143 pages
        ISBN:9781450370233
        DOI:10.1145/3366423
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 20 April 2020

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Gaussian Process
        2. Point-of-Interest
        3. Spatial Analytics
        4. Variational Inference
        5. Web-Mined Location Data

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        WWW '20
        Sponsor:
        WWW '20: The Web Conference 2020
        April 20 - 24, 2020
        Taipei, Taiwan

        Acceptance Rates

        Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)10
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 26 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Housing prices and points of interest in three Polish citiesJournal of Housing and the Built Environment10.1007/s10901-024-10124-739:3(1509-1540)Online publication date: 15-May-2024
        • (2022)Web Mining to Inform Locations of Charging Stations for Electric VehiclesCompanion Proceedings of the Web Conference 202210.1145/3487553.3524264(166-170)Online publication date: 25-Apr-2022
        • (2021)A Latent Customer Flow Model for Interpretable Predictions of Check-In Counts2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671946(529-539)Online publication date: 15-Dec-2021

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media