[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1076034.1076107acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Detecting dominant locations from search queries

Published: 15 August 2005 Publication History

Abstract

Accurately and effectively detecting the locations where search queries are truly about has huge potential impact on increasing search relevance. In this paper, we define a search query's dominant location (QDL) and propose a solution to correctly detect it. QDL is geographical location(s) associated with a query in collective human knowledge, i.e., one or few prominent locations agreed by majority of people who know the answer to the query. QDL is a subjective and collective attribute of search queries and we are able to detect QDLs from both queries containing geographical location names and queries not containing them. The key challenges to QDL detection include false positive suppression (not all contained location names in queries mean geographical locations), and detecting implied locations by the context of the query. In our solution, a query is recursively broken into atomic tokens according to its most popular web usage for reducing false positives. If we do not find a dominant location in this step, we mine the top search results and/or query logs (with different approaches discussed in this paper) to discover implicit query locations. Our large-scale experiments on recent MSN Search queries show that our query location detection solution has consistent high accuracy for all query frequency ranges.

References

[1]
Amitay, E., Har'El, N., Sivan R., and Soffer, A. Web-a-where: geotagging web content. Proc. 27th Annual International Conference on Research and Development in Information Retrieval (SIGIR'04), Jul. 2004, Sheffield, UK, 273--280.
[2]
Banko, M., Brill, E., Dumais S., and Lin J. AskMSR: Questing answering using the worldwide web. Proc. 2002 AAAI Spring Symposium on Mining Answers from Texts and Knowledge Bases, Mar 2003, Palo Alto, CA, USA, 7--8.
[3]
Bourigault, D. Surface grammatical analysis for extraction of terminological noun phrases. Proc. 14th COLING, 1992, Nantes, France, 977--981.
[4]
Church, K.W. A stochastic parts program and noun phrase parser for unrestricted test. Proc. 2nd Conference on Applied Natural Language Processing, Feb. 1988, Austin, Texas, USA, 136--143.
[5]
Cucerzan, S., and Yarowsky, D. Language independent NER using a unified model of internal and contextual evidence. Proc. 19th COLING, Aug. 2002, Taipei, Taiwan, 171--175.
[6]
Ding, J., Gravano, L., and Shivakumar N. Computing geographical scopes of web resource. Proc. 26th International Conference on Very Large Data Bases (VLDB'00), Sep. 2000, Cairo, Egypt., 545--556.
[7]
Geographic Names Information System (GNIS). http://geonames.usgs.gov/
[8]
Google local search: http://local.google.com
[9]
Google search. http://www.google.com
[10]
Gravano, L., Hatzivassiloglou, V., and Lichenstein, R. Categorizing Web Queries according to Geographical Locality. Proc. 12th Int'l Conference on Information and knowledge management (CIKM'03), Nov. 2003, New Orleans, LA, USA, 325--333.
[11]
Li, H., Srihari, R. K., Niu, C., and Li, W. Location normalization for information extraction. Proc. 19th COLING, Aug. 2002, Taipei, Taiwan.
[12]
Li, H., Srihari, R. K., Niu, C., and Li, W. InfoXtract location normalizations: a hybrid approach to geographical references in information extraction. Workshop on the Analysis of Geographic References, May 2003, Edmonton, Canada.
[13]
MSN Search. http://search.msn.com/
[14]
North American Numbering Plan. http://sd.wareonearth.com/~phil/npanxx
[15]
USPS -- The United States Postal Services. http://www.usps.com/
[16]
Van Rijsbergen, C.J. Information Retrieval. Butterworths, London, Second Edition, 1979.
[17]
Yahoo! local search. http://local.yahoo.com/
[18]
Zhou G., and Su, J. Named entity tagging using an HMM-based chunk tagger. Proc. 40th Annual Meeting of the ACL, July 2002, Philadelphia, PA, USA, 209--219.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
August 2005
708 pages
ISBN:1595930345
DOI:10.1145/1076034
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 August 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. information retrieval
  2. local search
  3. query's dominant location
  4. search
  5. search query location
  6. search relevance

Qualifiers

  • Article

Conference

SIGIR05
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)2
Reflects downloads up to 04 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A systematic review of multidimensional relevance estimation in information retrievalWIREs Data Mining and Knowledge Discovery10.1002/widm.154114:5Online publication date: 7-May-2024
  • (2023)Representing Spatial Data with Graph Contrastive LearningRemote Sensing10.3390/rs1504088015:4(880)Online publication date: 5-Feb-2023
  • (2020)Query ClassificationQuery Understanding for Search Engines10.1007/978-3-030-58334-7_2(15-41)Online publication date: 2-Dec-2020
  • (2019)Place Questions and Human-Generated Answers: A Data Analysis ApproachGeospatial Technologies for Local and Regional Development10.1007/978-3-030-14745-7_1(3-19)Online publication date: 16-Apr-2019
  • (2018)Geo-Targeted Web SearchEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_176(1625-1630)Online publication date: 7-Dec-2018
  • (2017)Large-Scale Location Prediction for Web PagesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.270263129:9(1902-1915)Online publication date: 1-Sep-2017
  • (2017)Retrieval Algorithms, SpatialEncyclopedia of GIS10.1007/978-3-319-17885-1_1125(1767-1774)Online publication date: 12-May-2017
  • (2016)SGT FrameworkProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2914743(1001-1004)Online publication date: 7-Jul-2016
  • (2016)Improving IP Geolocation using Query LogsProceedings of the Ninth ACM International Conference on Web Search and Data Mining10.1145/2835776.2835820(347-356)Online publication date: 8-Feb-2016
  • (2016)From query analysis to user information needs: a study of campus map searchesLibrary Hi Tech10.1108/LHT-12-2014-011034:1(104-129)Online publication date: 21-Mar-2016
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media