[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3400903.3400918acmotherconferencesArticle/Chapter ViewAbstractPublication PagesssdbmConference Proceedingsconference-collections
short-paper

Improving geocoding quality via learning to integrate multiple geocoders

Published: 30 July 2020 Publication History

Abstract

In this paper, we introduce an approach for improving the quality of the geocoding process. Geocoding refers to the procedure of mapping an address of textual form to a pair of accurate spatial coordinates. While there is a variety of available geocoders, both open source and commercial, that curate this mapping in either a semi-automated or fully-automated way, there is no one-size-fits-all system. Depending on the underlying algorithm of each geocoder, its output may be very accurate for some addresses, districts or countries, while failing to properly locate some others. Given that, our setup can be thought of as a meta-geocoding pipeline, built on top of the available geocoders. We propose a machine learning approach, which, given an address and a sequence of coordinate pairs suggested by standalone geocoders, it is able to identify the most accurate one. In order to achieve this, we formulate the task as a multi-class classification problem and introduce a series of domain specific training features, capturing essential information about each coordinate pair suggestion, as well as computing comparative metrics among different suggestions. These features are fed into several classification algorithms and are evaluated on a proprietary address dataset of a geo-marketing company. Furthermore, we present LGM-GC, a QGIS plugin, which provides the functionality of our approach through a user-friendly interface.

References

[1]
ArcGIS. 2020. REST API. https://developers.arcgis.com/rest/geocode/api-reference/geocoding-geocode-addresses.htm. [Online; accessed 9-March-2020].
[2]
GeoData. 2020. Data Cleansing & Geocoding. http://www.eranet.gr/geodata/en/eaddress.html. [Online; accessed 9-March-2020].
[3]
Google. 2020. Geocoding API. https://developers.google.com/maps/documentation/geocoding/start. [Online; accessed 9-March-2020].
[4]
OpenStreetMap. 2020. Nominatim. https://nominatim.org/. [Online; accessed 9-March-2020].
[5]
QGIS Development Team. 2020. QGIS Geographic Information System. Open Source Geospatial Foundation Project. https://qgis.org/en/site/. [Online; accessed 9-March-2020].

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
SSDBM '20: Proceedings of the 32nd International Conference on Scientific and Statistical Database Management
July 2020
241 pages
ISBN:9781450388146
DOI:10.1145/3400903
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 July 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Classification
  2. Feature Extraction
  3. Geocoding
  4. Machine Learning

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

SSDBM 2020

Acceptance Rates

Overall Acceptance Rate 56 of 146 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 104
    Total Downloads
  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)1
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media