[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1341012.1341044acmotherconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Robust location search from text queries

Published: 07 November 2007 Publication History

Abstract

Robust, global, address geocoding is challenging because there is no single address format that applies to all geographies, and in any case, users may not restrict themselves to well-formed addresses. Particularly in online mapping systems, users frequently enter queries with missing or conflicting information, misspellings, address transpositions, and other such variations.
We present a novel system which handles these difficulties by using a combination of textual similarity and spatial coherence to guide a depth-first search over the large space of possible interpretations of a text query. The system robustly matches text subsequences of a query with text attributes (i.e., any text labels associated with the entity) in a spatial-entity database. Each matched attribute is associated with the pre-computed spatial union of all entities that have that attribute. Candidate results are formed by incremental spatial intersections of these unions.
Experimental results demonstrate that our system is capable of supporting regions with widely differing address formats, without region-specific customization or training. Furthermore, we show that our system significantly outperforms commercial systems for unstructured location queries and queries containing errors.

References

[1]
Bakshi R., Knoblock C. A, and Thakkar S. Exploiting Online Sources to Accurately Geocode Addresses. Proceedings of the 12th ACM International Symposium on Advances in Geographic Information Systems, Washington DC, USA, November, 2004, 194--203.
[2]
Cayo, M. R. and Talbot, T. O. Positional error in automated geocoding of residential addresses. International Journal of Health Geographics 2003, 2:10.
[3]
Chaudhary S., Ganjam, K., Ganti V. and Motwani R. Robust and Efficient Fuzzy Match for Online Data Cleaning ACM SIGMOD International Conference on Management of Data 2003
[4]
Chen, Y. Y., Suel, T. and Markowetz, A. Efficient Query Processing in Geographic Web Search Engines. Proceedings of the 2006 ACM SIGMOD International Conference on Management of data, Chicago IL USA 2006.
[5]
Christen, P., Churches, T. and Willmore, A. A Probabilistic Geocoding System based on a National Address File. Proceedings of the 3rd Australasian Data Mining Conference, Cairns, December 2004.
[6]
Gargantini I., An Effective Way to Represent Quadtrees. Communications of the ACM 1982
[7]
Goldberg, D. W., Wilson, J. P. and Knoblock, C. A. From Text To Geographic Coordinates: The Current State of Geocoding. Urban and Regional Information Systems Association Journal 2006
[8]
Jacox, E. H. and Samet, H. Spatial Join Techniques ACM Transactions on Database Systems, Vol. 32, No. 1, Article 7 2007.
[9]
Kimler M. Geo-Coding: Recognition of geographical references in unstructured text and their visualization. Diplomarbeit, Fachhochschule Hof, 2004
[10]
Krieger, N., Waterman, P., Lemieux, K., Zierler, S. and Hogan J. W. On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research American Journal of Public Health, Vol 91, Issue 7 2001
[11]
Leidner J. L. Toponym Resolution in Text: "Which Sheffield is it?" Proceedings of the 27th annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2004
[12]
Nicoara, G. Exploring the Geocoding Process: A Municipal Case Study using Crime Data. Masters thesis, The University of Texas at Dallas, Dallas, TX, USA 2005
[13]
Pouliquen, B., R. Steinberger, C. Ignat, and T. De Groeve Geographical Information Recognition and Visualisation in Texts Written in Various Languages. In Proceedings of the 19th Annual ACM Symposium on Applied Computing 2004.
[14]
Rajagopalan S., Spatial Data in Telematics: An Indian Experience Conference cum Exposition on Telematics in Transportation, Chennai September 2004
[15]
Ratcliffe, J. H., On the accuracy of TIGER-type geocoded address data in relation to cadastral and census areal units. International Journal of Geographic Information Sciences 15 (5) 2001
[16]
Rhind, G. R. Global Sourcebook of Address Data Management A Guide to Address Formats and Data in 193 Countries. Gower Publishing Ltd, 2005
[17]
Trillium Software System ®, Harte-Hanks Trillium Software, Billerica, MA 01821. http://www.trilliumsoftware.com
[18]
Viola, P. and Narasimhan, M. Learning to Extract Information from Semistructured Text using a Discriminative Context Free Grammar. In Proc. of the ACM SIGIR, pages 330--337, 2005.
[19]
Zhou, Y., Xie, X., Wang, C., Gong, Y. and Ma, W. Y. Hybrid index structures for location based web search. In CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management, New York, NY, USA, ACM Press (2005)

Cited By

View all
  • (2021)Conceptualizing Hyperlocal Information Systems for Developing CountriesProceedings of the ACM on Human-Computer Interaction10.1145/34795095:CSCW2(1-26)Online publication date: 18-Oct-2021
  • (2021)Fast Attention-based Learning-To-Rank Model for Structured Map SearchProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3462904(942-951)Online publication date: 11-Jul-2021
  • (2021)Web Object Ranking for Location-Based Web Object SearchAdvances in Smart Communication and Imaging Systems10.1007/978-981-15-9938-5_16(151-165)Online publication date: 14-Apr-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
GIS '07: Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems
November 2007
439 pages
ISBN:9781595939142
DOI:10.1145/1341012
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • ESRI
  • Google Inc.
  • Oak Ridge National Laboratory
  • Microsoft: Microsoft

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ambiguous spatial queries
  2. geocoding
  3. location search

Qualifiers

  • Research-article

Conference

GIS07
Sponsor:
  • Microsoft

Acceptance Rates

Overall Acceptance Rate 257 of 1,238 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)2
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Conceptualizing Hyperlocal Information Systems for Developing CountriesProceedings of the ACM on Human-Computer Interaction10.1145/34795095:CSCW2(1-26)Online publication date: 18-Oct-2021
  • (2021)Fast Attention-based Learning-To-Rank Model for Structured Map SearchProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3462904(942-951)Online publication date: 11-Jul-2021
  • (2021)Web Object Ranking for Location-Based Web Object SearchAdvances in Smart Communication and Imaging Systems10.1007/978-981-15-9938-5_16(151-165)Online publication date: 14-Apr-2021
  • (2019)Probabilistic classification techniques to perform geographical labeling of web objectsCluster Computing10.1007/s10586-018-1822-y22:1(277-285)Online publication date: 1-Jan-2019
  • (2019)Indexing Spelling Variants for Accurate Address SearchGeographical Information Systems Theory, Applications and Management10.1007/978-3-030-29948-4_4(73-87)Online publication date: 22-Aug-2019
  • (2017)Geographical labeling of web objects through density estimator model2017 International Conference on Computing Methodologies and Communication (ICCMC)10.1109/ICCMC.2017.8282649(1130-1135)Online publication date: Jul-2017
  • (2015)A new approach to geocodingProceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/2820783.2820827(1-10)Online publication date: 3-Nov-2015
  • (2013)Map search via a factor graph modelProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2505674(69-78)Online publication date: 27-Oct-2013
  • (2012)Location Extraction from Social Networks with Commodity Software and Online DataProceedings of the 2012 IEEE 12th International Conference on Data Mining Workshops10.1109/ICDMW.2012.128(827-834)Online publication date: 10-Dec-2012
  • (2011)Ranking Spatial Data by Quality PreferencesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.11923:3(433-446)Online publication date: 1-Mar-2011
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media