Locality-aware request distribution in cluster-based network servers

Published: 01 October 1998

Abstract

We consider cluster-based network servers in which a front-end directs incoming requests to one of a number of back-ends. Specifically, we consider content-based request distribution: the front-end uses the content requested, in addition to information about the load on the back-end nodes, to choose which back-end will handle this request. Content-based request distribution can improve locality in the back-ends' main memory caches, increase secondary storage scalability by partitioning the server's database, and provide the ability to employ back-end nodes that are specialized for certain types of requests.

As a specific policy for content-based request distribution, we introduce a simple, practical strategy for locality-aware request distribution (LARD). With LARD, the front-end distributes incoming requests in a manner that achieves high locality in the back-ends' main memory caches as well as load balancing. Locality is increased by dynamically subdividing the server's working set over the back-ends. Trace-based simulation results and measurements on a prototype implementation demonstrate substantial performance improvements over state-of-the-art approaches that use only load information to distribute requests. On workloads with working sets that do not fit in a single server node's main memory cache, the achieved throughput exceeds that of the state-of-the-art approach by a factor of two to four.

With content-based distribution, incoming requests must be handed off to a back-end in a manner transparent to the client, after the front-end has inspected the content of the request. To this end, we introduce an efficient TCP handoff protocol that can hand off an established TCP connection in a client-transparent manner.
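The abstract describes LARD only at a high level, so the following Python sketch is an illustration under stated assumptions rather than the authors' implementation. It shows one way a front-end could combine locality (keep sending a given target to the back-end that already serves it) with load balancing (reassign the target when that back-end is overloaded). The class names, the use of active connections as the load metric, and the threshold values `t_low` and `t_high` are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class BackEnd:
    name: str
    load: int = 0  # assumed load metric: number of active connections


@dataclass
class LardFrontEnd:
    backends: list
    t_low: int = 25    # assumed: a node below this load has spare capacity
    t_high: int = 65   # assumed: a node above this load is considered busy
    assignment: dict = field(default_factory=dict)  # target -> BackEnd

    def least_loaded(self):
        return min(self.backends, key=lambda b: b.load)

    def dispatch(self, target):
        """Pick a back-end for `target`, preferring the node that already serves it."""
        node = self.assignment.get(target)
        if node is None:
            # First request for this target: start it on the least loaded node.
            node = self.least_loaded()
        else:
            # Give up locality only when the imbalance is significant.
            spare_exists = any(b.load < self.t_low for b in self.backends)
            if (node.load > self.t_high and spare_exists) or node.load >= 2 * self.t_high:
                node = self.least_loaded()
        self.assignment[target] = node
        node.load += 1
        return node

    def complete(self, node):
        """Call when a request finishes so the load estimate stays current."""
        node.load -= 1
```

Repeated requests for the same target land on the same back-end, so that node's main memory cache stays warm for it, while different targets spread across the cluster. A distributor that uses only load information would instead pick the least loaded node on every request, which tends to pull the whole working set through every node's cache.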

Published In

ACM SIGOPS Operating Systems Review, Volume 32, Issue 5
Dec. 1998
309 pages
ISSN:0163-5980
DOI:10.1145/384265
  • ASPLOS VIII: Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
    October 1998
    326 pages
    ISBN:1581131070
    DOI:10.1145/291069
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 1998
Published in SIGOPS Volume 32, Issue 5

Qualifiers

  • Article

Cited By

  • (2020) Two-level fuzzy-neural load distribution strategy in cloud-based web system. Journal of Cloud Computing: Advances, Systems and Applications 9:1. DOI: 10.1186/s13677-020-00179-6. Online publication date: 11-Jun-2020
  • (2019) Beyond Load Balancing: Package-Aware Scheduling for Serverless Platforms. 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), pp. 282-291. DOI: 10.1109/CCGRID.2019.00042. Online publication date: May-2019
  • (2019) Cooperation of Neuro-Fuzzy and Standard Cloud Web Brokers. Information Systems Architecture and Technology: Proceedings of 40th Anniversary International Conference on Information Systems Architecture and Technology – ISAT 2019, pp. 243-254. DOI: 10.1007/978-3-030-30440-9_23. Online publication date: 5-Sep-2019
  • (2018) A SPEC RG Cloud Group's Vision on the Performance Challenges of FaaS Cloud Architectures. Companion of the 2018 ACM/SPEC International Conference on Performance Engineering, pp. 21-24. DOI: 10.1145/3185768.3186308. Online publication date: 2-Apr-2018
  • (2018) Package-Aware Scheduling of FaaS Functions. Companion of the 2018 ACM/SPEC International Conference on Performance Engineering, pp. 101-106. DOI: 10.1145/3185768.3186294. Online publication date: 2-Apr-2018
  • (2018) Block Placement in Distributed File Systems Based on Block Access Frequency. IEEE Access 6, pp. 38411-38420. DOI: 10.1109/ACCESS.2018.2851571. Online publication date: 2018
  • (2016) Operating System Enhancement for Supporting Massively Multiplayer Online Games in a Server Cluster. International Journal of Parallel Programming 44:1, pp. 46-67. DOI: 10.1007/s10766-014-0341-6. Online publication date: 1-Feb-2016
  • (2015) Context-sensitive text mining with fitness leveling Genetic Algorithm. 2015 IEEE 2nd International Conference on Cybernetics (CYBCONF), pp. 342-347. DOI: 10.1109/CYBConf.2015.7175957. Online publication date: Jun-2015
  • (2014) Predictive and content-aware load balancing algorithm for peer-service area based IPTV networks. Multimedia Tools and Applications 70:3, pp. 1987-2010. DOI: 10.1007/s11042-012-1209-7. Online publication date: 1-Jun-2014
  • (2013) A simple, adaptable and efficient heterogeneous multi-tenant database architecture for ad hoc cloud. Journal of Cloud Computing: Advances, Systems and Applications 2:1, article 5. DOI: 10.1186/2192-113X-2-5. Online publication date: 2013
