[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1096985.1096991acmconferencesArticle/Chapter ViewAbstractPublication PagesgirConference Proceedingsconference-collections
Article

Detecting geographic locations from web resources

Published: 04 November 2005 Publication History

Abstract

The rapid pervasion of the web into users' daily lives has put much importance on capturing location-specific information on the web, due to the fact that most human activities occur locally around where a user is located. This is especially true in the increasingly popular mobile and local search environments. Thus, how to correctly and effectively detect geographic locations from web resources has become a key challenge to location-based web applications. In our previous work, we proposed to explicitly distinguish three types of locations for web resources, namely provider location, content location and serving location. Provider location is the physical location of the provider who owns the web resource; content location is the geographic location described in the web content; while serving location is the geographic scope that a web resource can reach. In this paper, we present a system that comprehensively employs a set of algorithms and different geographic sources by extracting geographic information from the web content, and mining hyperlink structures as well as user logs. As the result, only relevant geographic sources, rather than all of possible ones are used in computation of each category of web location. Finally, experimental results on large samples of web data show that our solution outperforms previous approaches.

References

[1]
Amitay, E., Har'EI, N., Sivan, R., and Soffer, A. Web-a-where: geotagging web content. 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'04), Sheffield, UK, Jul. 2004
[2]
Brin, S. and Page, L. The anatomy of a large-scale hypertextual web search engine. 7th International World Wide Web Conference (WWW7), Brisbane, Australia, Apr. 1998
[3]
Buyukkokten, O., Cho, J., Garcia-Molina, H., Gravano, L., and Shivakumar, N. Exploiting geographic location information of web pages. ACM SIGMOD Workshop on the Web and Databases 1999 (WebDB'99), Philadelphia, USA, Jun. 1999
[4]
Columbia GeoSearch. http://geosearch.cs.columbia.edu
[5]
Ding, J., Gravano, L., and Shivakumar N. Computing geographic scopes of web resource. 26th International Conference on Very Large Data Bases (VLDB'00), Cairo, Egypt, Sep. 2000
[6]
Geotags GeoSearch. http://geotags.com
[7]
Google Local Search. http://www.google.com/local
[8]
Hearst, M.A. Trends and controversies: support vector machines. IEEE Intelligent Systems, 13(4), Jul. 1998, 18-28
[9]
Hill, L.L., Frew, J., and Zheng, Q. Place names: the implementation of a gazetteer in a georeferenced digital library. Digital Library, 5(1), Jan. 1999
[10]
Jones, M., Jain, P., Buchanan, G., Marsden, G. Using a mobile device to vary the pace of search. 5th International Symposium on Human Computer Interaction with Mobile Devices and Services ('Mobile HCI03 <http://hcilab.uniud.it/mobilehci/index.html>), Udine, Italy, Sep. 2003
[11]
Kaasinen, E. User needs for location-aware mobile services. Personal and Ubiquitous Computing 7(1), May 2003, 70--79
[12]
Kan, M.Y. Web page categorization without the web page. 13th International World Wide Web Conference (WWW'04), New York, USA, May 2004
[13]
Larson, R.R. Geographic information retrieval and spatial browsing. Smith, L.C. and Gluck M. (Eds), Geographic Information Systems and Libraries: Patrons, Maps, and Spatial Information, University of Illinois, Urbana, IL, USA, 1996, 81-123
[14]
Li, H., Srihari, R. K., Niu, C., and Li, W. Location normalization for information extraction. Proc. 19th COLING, Aug. 2002, Taipei, Taiwan
[15]
Li, H., Srihari, R. K., Niu, C., and Li, W. InfoXtract location normalizations: a hybrid approach to geographic references in information extraction. Workshop on the Analysis of Geographic References, May 2003, Edmonton, Canada
[16]
Ma, Q., Matsumoto, C., and Tanaka, K. A localness-filter for searched web pages. 5th Asia Pacific Web Conference (APWeb'03), Xi'an, China, Sep. 2003
[17]
Ma, Q. and Tanaka, K. Retrieving regional information from web by contents localness and user location. 1st Asia Information Retrieval Symposium (AIRS'04), Beijing, China, Oct. 2004
[18]
Markowetz, A., Chen, Y., Suel, T., Long, X. and Seeger, B. Design and implementation of a geographic search engine. Technical Report TR-CIS-2005-03, Polytechnic University, Brooklyn, New York, 2005
[19]
McCurley, K. S. Geographic mapping and navigation of the web. 10th International World Wide Web Conference (WWW10), Hong Kong, May 2001
[20]
Microsoft MapPoint. http://mappoint.msn.com
[21]
MSN New York local page. http://local.msn.com/NewYork/
[22]
Place names Information System (GNIS). http://geonames.usgs.gov
[23]
MSN Portal. http://www.msn.com
[24]
North American Numbering Plan. http://sd.wareonearth.com/ phil/npanxx
[25]
USPS - The United States Postal Services. http://www.usps.com
[26]
Wang, C., Xie, X., Wang, L., Lu, Y., and Ma, W.Y. Web resource geographic location classification and detection. In Proceedings of the 14th International World Wide Web Conference (WWW'05), poster, Chiba, Japan, 2005
[27]
Woodruff, A.G. and Plaunt, C. GIPSY: geo-referenced information processing system. Journal of the American Society for Information Science, 45(9), 1994, 645--655
[28]
Yahoo Regional. http://www.yahoo.com/regional
[29]
Yokoji, S., Takahashi, K., and Miura, N. Kokono search: a location based search engine. 10th International World Wide Web Conference (WWW10), Hong Kong, May 2001

Cited By

View all
  • (2021)Implicit, Formal, and Powerful Semantics in GeoinformationISPRS International Journal of Geo-Information10.3390/ijgi1005033010:5(330)Online publication date: 13-May-2021
  • (2018)Geotagging Text Data on the Web—A Geometrical ApproachIEEE Access10.1109/ACCESS.2018.28438146(30086-30099)Online publication date: 2018
  • (2017)Large-Scale Location Prediction for Web PagesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.270263129:9(1902-1915)Online publication date: 1-Sep-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GIR '05: Proceedings of the 2005 workshop on Geographic information retrieval
November 2005
78 pages
ISBN:1595931651
DOI:10.1145/1096985
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 November 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content location
  2. dominant location
  3. location-based web application
  4. provider location
  5. serving location
  6. web location

Qualifiers

  • Article

Conference

CIKM05
Sponsor:

Acceptance Rates

Overall Acceptance Rate 46 of 61 submissions, 75%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Implicit, Formal, and Powerful Semantics in GeoinformationISPRS International Journal of Geo-Information10.3390/ijgi1005033010:5(330)Online publication date: 13-May-2021
  • (2018)Geotagging Text Data on the Web—A Geometrical ApproachIEEE Access10.1109/ACCESS.2018.28438146(30086-30099)Online publication date: 2018
  • (2017)Large-Scale Location Prediction for Web PagesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.270263129:9(1902-1915)Online publication date: 1-Sep-2017
  • (2017)Location detection and disambiguation from twitter messagesJournal of Intelligent Information Systems10.1007/s10844-017-0458-349:2(237-253)Online publication date: 1-Oct-2017
  • (2016)Geotagging Named Entities in News and Online DocumentsProceedings of the 25th ACM International on Conference on Information and Knowledge Management10.1145/2983323.2983795(1321-1330)Online publication date: 24-Oct-2016
  • (2016)Reference data enhancement for geographic information retrieval using linked dataTransactions in GIS10.1111/tgis.1223821:4(683-700)Online publication date: 2-Nov-2016
  • (2016)A survey on the geographic scope of textual documentsComputers & Geosciences10.1016/j.cageo.2016.07.01796:C(23-34)Online publication date: 1-Nov-2016
  • (2015)Choosing ScrapyJournal of Computing Sciences in Colleges10.5555/2831373.283138731:1(83-89)Online publication date: 1-Oct-2015
  • (2015)Reconnecting Digital Publications to the Web using their Spatial InformationProceedings of the 24th International Conference on World Wide Web10.1145/2740908.2741714(749-754)Online publication date: 18-May-2015
  • (2015)When Location Meets Social MultimediaACM Transactions on Intelligent Systems and Technology10.1145/25971816:1(1-18)Online publication date: 26-Mar-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media