[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

You Can Find Me Here: A Study of the Early Adoption of Geofeeds

  • Conference paper
  • First Online:
Passive and Active Measurement (PAM 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14538))

Included in the following conference series:

  • 351 Accesses

Abstract

IP geolocation is a popular mechanism for determining the physical locations of Internet-connected devices. However, despite its widespread use, IP geolocation is known to be inaccurate, especially for devices in less industrialized nations. In 2020, geofeeds were standardized by the IETF, providing a mechanism for owners of IP addresses (i.e., autonomous systems) to self-report the physical locations of IP blocks under their control. Assuming IP address owners accurately report these locations, geofeeds conceptually have the potential to enable “groundtruth” location data. This short paper takes a first look at the roll-out of geofeeds. We examine the opt-in rates of geofeeds by autonomous systems, and surmise the use of geofeed data by two major IP geolocation providers. Over the course of our 14-month data collection efforts (August 2022–October 2023), the number of IP addresses covered by geofeeds has increased tenfold; however, the adoption rate is still low—less than 1% of the IPv4 address space is covered by geofeeds. We find that the rollout is also uneven, with more industrialized nations opting into geofeeds at rates higher than those of less industrialized ones. Moreover, our comparison of geofeed data to locations reported by commercial IP geolocation services suggests that these commercial services may be beginning to incorporate geofeed data into their resolutions. We discuss the implications of our findings, including the potential that uneven adoption rates may further disenfranchise Internet users in less industrialized nations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 87.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 109.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    For ease of exposition, we will often use the shorthand geolocation to refer to IP-based geolocation.

  2. 2.

    A user could always use a means of obscuring their IP address such as Tor [16], a VPN, SmartDNS services [18], or a proxy. However, using these technologies requires technical sophistication and imposes performance and usability bottlenecks [17, 34].

  3. 3.

    For example, while most commercial providers showed over 90% accuracy in identifying routers in the U.S., most providers showed between 20% and 39% accuracy when locating routers in Canada [28].

  4. 4.

    Measurements were initially pulled manually once every two weeks and were later automated to run on the 13th and 28th of each month.

  5. 5.

    To account for locales having numerous names or versions of the same name (e.g., the city name for Đakovo, Croatia could also be spelled Djakovo or Dakovo), we computed the normalized Damerau-Levenshtein distance [15] between the two location names and asserted that to match, the result had to be less than 0.5.

  6. 6.

    To account for locations having multiple names or spellings, we used fuzzy matching with tokenized Levenshtein distance to find many of the named locations.

  7. 7.

    See https://github.com/GUSecLab/geofeed-measurement.

References

  1. Geonames. https://download.geonames.org/export/dump/

  2. Geopy documentation. https://geopy.readthedocs.io/en/latest/

  3. IP geolocation database: Fastly Support. https://support.fastly.com/hc/en-us/community/posts/360078589732-IP-geolocation-database

  4. CDN Comparison - Most Popular CDNs of 2021 (2021). https://www.experte.com/website/cdn

  5. Seclists.org - re: Google IP geolocation (2021). https://seclists.org/nanog/2021/Apr/67

  6. Digital Element (2022). https://www.digitalelement.com

  7. Geoguard (2022). https://www.geocomply.com/products/geoguard/

  8. Central Intelligence Agency: CIA world factbook. https://www.cia.gov/the-world-factbook/field/internet-users/country-comparison/

  9. McDonald, A., et al.: 403 forbidden: a global view of CDN geoblocking. In: Proceedings of the Internet Measurement Conference 2018. IMC 2018. Association for Computing Machinery (2018)

    Google Scholar 

  10. Bano, S., et al.: Scanning the internet for liveness. SIGCOM Comput. Commun. Rev. 48(2), 2–9 (2018)

    Article  Google Scholar 

  11. Wong, B., Stoyanov, I., Sirer, E.G.: Octant: a comprejensive framework for the geolocalization of internet hosts. In: 4th USENIX Symposium on Networked Systems Design & Implementation, NSDI 2007. USENIX Association (2007)

    Google Scholar 

  12. Cloudflare: Configuring Cloudflare IP Geolocation (2021). https://support.cloudflare.com/hc/en-us/articles/200168236-What-does-Cloudflare-IP-Geolocation-do-. Accessed 6 Dec 2021

  13. Iordanou, C., Smaragdakis, G., Poese, I., Laoutaris, N.: Tracing cross border web tracking. In: Proceedings of the Internet Measurement Conference 2018, IMC 2018. Association for Computing Machinery (2018)

    Google Scholar 

  14. Dainotti, A., et al.: Estimating internet address space usage through passive measurements. SIGCOMM Comput. Commun. Rev. 44(1), 42–49 (2014)

    Article  Google Scholar 

  15. Damerau, F.J.: A technique for computer detection and correction of spelling errors. Commun. ACM 7(3), 171–176 (1964). https://dl.acm.org/doi/10.1145/363958.363994

  16. Dingledine, R., Mathewson, N., Syverson, P.: Tor: the second-generation onion router. In: USENIX Security Symposium (USENIX) (2004)

    Google Scholar 

  17. Fainchtein, R.A., Aviv, A.J., Sherr, M.: User perceptions of the privacy and usability of smart DNS. In: Proceedings of the 38th Annual Computer Security Applications Conference, ACSAC 2022, pp. 591–604. Association for Computing Machinery (2022). https://dl.acm.org/doi/10.1145/3564625.3567978

  18. Fainchtein, R.A., Aviv, A.J., Sherr, M., Ribaudo, S., Khullar, A.: Holes in the geofence: privacy vulnerabilities in “smart” DNS services. In: Proceedings on Privacy Enhancing Technologies (PoPETS) (2021)

    Google Scholar 

  19. Gueye, B., Ziviani, A., Crovella, M., Fdida, S.: Constraint based geolocation of internet hosts. In: Proceedings of the 2004 Internet Measurement Conference, IMC 2004. Association for Computing Machinery (2004)

    Google Scholar 

  20. Internet Corporation for Assigned Names and Numbers: About WHOIS (2022). https://whois.icann.org/en/about-whois

  21. IPGeolocation: ip-geolocation-api-jquery-sdk (2022). https://www.jsdelivr.com/package/npm/ip-geolocation-api-jquery-sdk

  22. ipgeolocation.io: IP geolocation API and IP lookup documentation. https://ipgeolocation.io/documentation.html

  23. ipgeolocation.io: IP Geolocation FAQs. https://ipgeolocation.io/faq.html

  24. Karney, C.F.: Algorithms for geodesics. J. Geodesy 87, 43–55 (2013)

    Article  Google Scholar 

  25. Kline, E., Duleba, K., Szamonek, Z., Moser, S., Kumari, W.: A format for self-published IP geolocation feeds. Informational 8805, RFC-Editor (2020)

    Google Scholar 

  26. Kumar, R., Virkud, A., Raman, R.S., Prakash, A., Ensafi, R.: A large-scale investigation into geodifferences in mobile apps. In: 31st USENIX Security Symposium (USENIX Security 2022). USENIX Association (2022). https://www.usenix.org/conference/usenixsecurity22/presentation/kumar

  27. Laki, S., Mátray, P., Hága, P., Sebők, T., Csabai, I., Vattay, G.: Spotter: a model based active geolocation service. In: 2011 Proceedings IEEE INFOCOM (2011)

    Google Scholar 

  28. Gharaibeh, M., Shah, A., Huffaker, B., Zhang, H., Ensafi, R., Papadopoulos, C.: A look at router geolocation in public and commercial databases. In: Proceedings of the Internet Measurement Conference 2017, IMC 2017. Association for Computing Machinery (2017)

    Google Scholar 

  29. Massimo Candela: geofeed-finder (2022). https://github.com/massimocandela/geofeed-finder

  30. Maxmind: Geoip2 and geolite city and country databases. https://dev.maxmind.com/geoip/docs/databases/city-and-country

  31. Maxmind: Geolocation accuracy. https://support.maxmind.com/hc/en-us/articles/4407630607131-Geolocation-Accuracy

  32. Ziv, M., Izhikevich, L., Ruth, K., Izhikevich, K., Durumeric, Z.: ASdb: a system for classifying owners of autonomous systems. In: ACM Internet Measurement Conference (IMC) (2021)

    Google Scholar 

  33. Muir, J.A., Oorschot, P.C.V.: Internet geolocation: evasion and counterevasion. ACM Comput. Surv. 42(1), 1–23 (2009)

    Article  Google Scholar 

  34. Namara, M., Wilkinson, D., Caine, K., Knijnenburg, B.P.: Emotional and practical considerations towards the adoption and abandonment of VPNs as a privacy-enhancing technology. Proc. Priv. Enh. Technol. (PoPETS) 2020(1), 83–102 (2020)

    Google Scholar 

  35. NRO: Delegated-extended-file. https://ftp.ripe.net/pub/stats/ripencc/nro-stats/latest/nro-delegated-stats. Accessed 15 Oct 2023

  36. Richter, P., Smaragdakis, G., Plonka, D., Berger, A.: Beyond counting: new perspectives on the active IPv4 address space. In: Proceedings of the 2016 Internet Measurement Conference, IMC 2016. Association for Computing Machinery (2016)

    Google Scholar 

  37. Poese, I., Uhlig, S., Kaafar, M.A., Donnet, B., Gueye, B.: IP geolocation databases: unreliable? ACM SIGCOMM Comput. Commun. Rev. 41(2), 53–56 (2011)

    Article  Google Scholar 

  38. Kumari, W., Bush, R., Candela, M., Housley, R.: RFC 9092 Finding and Using Geofeed Data. RFC Proposed Standard 9092, Internet Engineering Task Force (IETF) (2021)

    Google Scholar 

  39. Trimble, M.: The future of cybertravel: legal implications of the evasion of geolocation. Fordham Intell. Prop. Media Entertain. Law J. 22, 567 (2012)

    Google Scholar 

  40. Tschantz, M.C., Afroz, S., Sajid, S., Qazi, S.A., Javed, M., Paxson, V.: A bestiary of blocking: the motivations and modes behind website unavailability. In: 8th USENIX Workshop on Free and Open Communications on the Internet (FOCI 2018). USENIX Association (2018). https://www.usenix.org/conference/foci18/presentation/tschantz

  41. Weinberg, Z., Cho, S., Christin, N., Sekar, V., Gill, P.: How to catch when proxies lie: verifying the physical locations of network proxies with active geolocation. In: ACM SIGCOMM Conference on Internet Measurement (IMC) (2018)

    Google Scholar 

Download references

Acknowledgments

We thank the anonymous reviewers and shepherd for their invaluable feedback and suggestions. This work is partially funded by the National Science Foundation through grants 1925497 and 2138078, and by the Callahan Family Chair fund. The opinions and findings expressed in this paper are those of the authors and do not necessarily those of any employer or funding agency.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rahel A. Fainchtein .

Editor information

Editors and Affiliations

Appendices

A Geofeed and Commercial IP Fetch Dates

Table 3 lists the dates of fetches for the geofeeds and the corresponding dates of the commercial IP datasets that were used for comparison.

Table 3. Mapping of pull dates for geofeed results and matched commercial DB pulls. Pairings were selected to minimize the time between the geofeed and commercial pull dates (or vice versa).

B Country’s Representation Within the Geofeed Results

Table 4 presents a breakdown of the top ten most and bottom 20 least represented countries within the geofeed results before normalization.

Figure 9 provides a breakdown of countries’ representation within the geofeeds normalized by their respective number of Internet users [8] and Fig. 10 shows geofeeds normalized by each country’s IPv4 address allocation. Additionally, Fig. 11 provides a country-wise breakdown of the total ASes categorized as ISPs by the ASdb in the November 10, 2023 geofeed results and Fig. 12 denotes the proportion of ISPs amongst each represented country’s ASes in the same geofeed data.

Table 4. Top ten most (top) and bottom 20 least (bottom) represented countries.
Fig. 9.
figure 9

Countries’ IPv4 address representation within the geofeeds normalized by number of Internet users [8].

Fig. 10.
figure 10

Countries’ IPv4 address representation within the geofeeds normalized by their IPv4 address allocations.

Fig. 11.
figure 11

Total number of ISPs in each country within the Geofeed Results for November 10, 2023 as categorized by the Stanford ASdb.

Fig. 12.
figure 12

Proportion of each country’s ASNs that were categorized as ISPs by the Stanford ASdb in the November 10, 2023 Geofeed Results.

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fainchtein, R.A., Sherr, M. (2024). You Can Find Me Here: A Study of the Early Adoption of Geofeeds. In: Richter, P., Bajpai, V., Carisimo, E. (eds) Passive and Active Measurement. PAM 2024. Lecture Notes in Computer Science, vol 14538. Springer, Cham. https://doi.org/10.1007/978-3-031-56252-5_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-56252-5_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-56251-8

  • Online ISBN: 978-3-031-56252-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics