Abstract
The creation of a routing overlay network on the Internet requires the identification of shorter detour paths between end hosts in comparison to the default path available. These detour paths are typically the edges forming a Triangle Inequality Violation (TIV), an artifact of the Internet delay space where the sum of latencies across an intermediate hop is lesser than the direct latency between the pair of end hosts. These violations are caused mainly due to interdomain routing policies between Autonomous Systems (ASes) and AS peering through Internet eXchange Points (IXPs). Identifying detours for a global overlay network requires large amounts of computational capabilities due to the sheer number of possible paths linking source and destination ASes. In this work, we use parallel programming paradigms to exploit the massively parallel capabilities of analyzing the large network measurement datasets made available to the network research community by CAIDA. We study Internet routes traversing IXPs and measure potential TIVs created by these paths. Large scale analysis of the dataset is carried out by implementing an efficient parallel solution on the CPU and then the general purpose graphics processor unit (GPGPU) as well. Both multicore CPU and GPGPU implementations can be carried out with ease on desktop environments with readily available software. We find both parallel solutions yield high improvements in speedup (2-35x) in comparison to the serial methodologies thereby opening up the possibility of harnessing the power of parallel programming with readily available hardware. The large amount of data analyzed and studied helps draw various inferences for the networking research community in building future scalable Internet routing overlays with greater routing efficiencies.
Similar content being viewed by others
Notes
IP to AS mapping is a difficult and inexact problem in its own right, but for consistency here we use the Team Cymru whose service is available at [5].
This number is hardware dependent and with more available memory we can increase the number of traceroute logs loaded.
The server being on the same machine reduced communication delays with every Matlab worker running an instance of the parallel implementation. This helped us isolate and reduce network communication delays which would increase the total job execution times.
Refer to [29] for details about this clustering scheme.
Passive measurement systems do not add traffic to the network unlike in active measurements where packets are specifically introduced into the network and their effects measured instantly.
References
Caida: Archipelago measurement infrastructure. http://www.caida.org/projects/ark/
Nvidia corporation: Nvida cuda compute unified device architecture programming. http://developer.nvidia.com/cuda-toolkit-40
Packet clearing house. http://www.pch.net
Public exchange points search/list. http://www.peeringdb.com/private/exchange_list.php
Team cymru ip to asn mapping. http://www.team-cymru.org/Services/ip-to-asn.html
Triangle inequality violations due to ixps, public page. http://j.mp/mPyLMV
Ahmad MZ, Guha R (2010) Studying the effect of Internet exchange points on Internet link delays. In: Proceedings of the communications and networking symposium (CNS), SCS SpringSim 2010, Orlando, FL
Aho AV, Corasick MJ (1975) Efficient string matching: an aid to bibliographic search. Commun ACM 18:333–340. http://doi.acm.org/10.1145/360825.360855
Augustin B, Krishnamurthy B, Willinger W (2009) IXPs: mapped? In: IMC’09: proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference. ACM, New York, pp 336–349. doi:10.1145/1644893.1644934
Buluç A, Gilbert JR, Budak C (2010) Solving path problems on the gpu. Parallel Comput 36:241–253. http://dx.doi.org/10.1016/j.parco.2009.12.002
Dabek F, Cox R, Kaashoek F, Morris R (2004) Vivaldi: a decentralized network coordinate system. Comput Commun Rev 34:15–26. http://doi.acm.org/10.1145/1030194.1015471
Dhamdhere A, Dovrolis C (2010) The Internet is flat: modeling the transition from a transit hierarchy to a peering mesh. In: Proceedings of the 6th international conference, Co-NEXT’10. ACM, New York, pp 21:1–21:12. http://doi.acm.org/10.1145/1921168.1921196
Gill P, Arlitt M, Li Z, Mahanti A (2008) The flattening Internet topology: natural evolution, unsightly barnacles or contrived collapse? In: PAM’08: proceedings of the 9th international conference on passive and active network measurement. Springer, Berlin, pp 1–10
Harish P, Narayanan PJ (2007) Accelerating large graph algorithms on the gpu using cuda. In: HiPC, pp 197–208
He Y, Siganos G, Faloutsos M, Krishnamurthy S (2009) Lord of the links: a framework for discovering missing links in the Internet topology. IEEE/ACM Trans Netw 17(2):391–404. http://dx.doi.org/10.1109/TNET.2008.926512
Huang NF, Hung HW, Lai SH, Chu YM, Tsai WY (2008) A gpu-based multiple-pattern matching algorithm for network intrusion detection systems. In: Proceedings of the 22nd international conference on advanced information networking and applications—workshops. IEEE Computer Society, Washington, DC, pp 62–67. http://portal.acm.org/citation.cfm?id=1395080.1395492
Katz GJ, Kider JT Jr (2008) All-pairs shortest-paths for large graphs on the gpu. In: Proceedings of the 23rd ACM SIGGRAPH/EUROGRAPHICS symposium on graphics hardware, GH’08. Eurographics Association, Aire-la-Ville, pp 47–55. http://portal.acm.org/citation.cfm?id=1413957.1413966
Lin CH, Tsai SY, Liu CH, Chang SC, Shyu JM (2010) Accelerating string matching using multi-threaded algorithm on gpu. In: GLOBECOM, pp 1–5
Lumezanu C, Baden R, Spring N, Bhattacharjee B (2009) Triangle inequality and routing policy violations in the Internet. In: Proceedings of the 10th international conference on passive and active network measurement, PAM’09. Springer, Berlin, pp 45–54
Lumezanu C, Baden R, Spring N, Bhattacharjee B (2009) Triangle inequality variations in the Internet. In: Internet measurement conference, pp 177–183
Lumezanu C, Levin D, Spring N (2007) Peerwise discovery and negotiation of faster paths. In: ACM Sigcomm workshop on hot topics in networking
Ly C, Hsu CH, Hefeeda M (2010) Improving online gaming quality using detour paths. In: ACM multimedia, pp 55–64
Madhyastha HV, Isdal T, Piatek M, Dixon C, Anderson T, Krishnamurthy A, Venkataramani A (2006) iplane: an information plane for distributed services. In: OSDI’06: proceedings of the 7th symposium on operating systems design and implementation. USENIX Association, Berkeley, pp 367–380
Mahadevan P, Krioukov D, Fomenkov M, Huffaker B, Dimitropoulos X, Claffy K, Vahdat A (2005) Lessons from three views of the Internet topology. http://arxiv.org/abs/cs.NI/0508033
Oliveira RV, Pei D, Willinger W, Zhang B, Zhang L (2008) In search of the elusive ground truth: the Internet’s as-level connectivity structure. ACM SIGMETRICS Perform Eval Rev 36(1):217–228. http://doi.acm.org/10.1145/1384529.1375482
Savage S, Collins A, Hoffman E, Snell J, Anderson T (1999) The end-to-end effects of Internet path selection. Comput Commun Rev 29(4):289–299. doi:10.1145/316194.316233
Smith R, Goyal N, Ormont J, Sankaralingam K, Estan C (2009) Evaluating gpus for network packet signature matching. In: IEEE international symposium on performance analysis of systems and software, ISPASS 2009, pp 175–184
Wang G, Zhang B, Ng TSE (2007) Towards network triangle inequality violation aware distributed systems. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, IMC’07. ACM, New York, pp 175–188. http://doi.acm.org/10.1145/1298306.1298331
Zhang B, Ng TSE, Nandi A, Riedi R, Druschel P, Wang G (2006) Measurement based analysis, modeling, and synthesis of the Internet delay space. In: Proceedings of the 6th ACM SIGCOMM conference on Internet measurement, IMC’06. ACM, New York, pp 85–98. http://doi.acm.org/10.1145/1177080.1177091
Zheng H, Lua EK, Pias M, Griffin TG (2005) Internet routing policies and round-trip-times. In: PAM, pp 236–250
Zhu Y, Chen Y, Zhang Z, Fu X, Li D, Deng B, Li X (2010) Taming the triangle inequality violations with network coordinate system on real Internet. In: Proceedings of the re-architecting the Internet workshop, ReARCH’10. ACM, New York, pp 7:1–7:6. http://doi.acm.org/10.1145/1921233.1921242
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ahmad, M.Z., Guha, R. Analysis of large scale traceroute datasets in Internet routing overlays by parallel computation. J Supercomput 62, 1425–1450 (2012). https://doi.org/10.1007/s11227-012-0811-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-012-0811-9