[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2254756.2254763acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
research-article

Minimizing slowdown in heterogeneous size-aware dispatching systems

Published: 11 June 2012 Publication History

Abstract

We consider a system of parallel queues where tasks are assigned (dispatched) to one of the available servers upon arrival. The dispatching decision is based on the full state information, i.e., on the sizes of the new and existing jobs. We are interested in minimizing the so-called mean slowdown criterion corresponding to the mean of the sojourn time divided by the processing time. Assuming no new jobs arrive, the shortest-processing-time-product (SPTP) schedule is known to minimize the slowdown of the existing jobs. The main contribution of this paper is three-fold: 1) To show the optimality of SPTP with respect to slowdown in a single server queue under Poisson arrivals; 2) to derive the so-called size-aware value functions for M/G/1-FIFO/LIFO/SPTP with general holding costs of which the slowdown criterion is a special case; and 3) to utilize the value functions to derive efficient dispatching policies so as to minimize the mean slowdown in a heterogeneous server system. The derived policies offer a significantly better performance than e.g., the size-aware-task-assignment with equal load (SITA-E) and least-work-left (LWL) policies.

References

[1]
S. Aalto, U. Ayesta, S. Borst, V. Misra, and R. Núnez-Queija. Beyond processor sharing. SIGMETRICS Perform. Eval. Rev., 34:36--43, 2007.
[2]
S. Aalto, U. Ayesta, and R. Righter. On the Gittins index in the M/G/1 queue. Queueing Systems, 63(1--4):437--458, 2009.
[3]
S. Aalto and J. Virtamo. Basic packet routing problem. In NTS-13, Trondheim, Norway, Aug. 1996.
[4]
E. Bachmat and H. Sarfati. Analysis of SITA policies. Performance Evaluation, 67(2):102--120, 2010.
[5]
C. E. Bell and J. Shaler Stidham. Individual versus social optimization in the allocation of customers to alternative servers. Management Science, 29(7):831--839, July 1983.
[6]
M. E. Crovella, M. Harchol-Balter, and C. D. Murta. Task assignment in a distributed system: Improving performance by unbalancing load. In ACM SIGMETRICS, pages 268--269, June 1998.
[7]
A. Downey. A parallel workload model and its implications for processor allocation. In IEEE HPDC'97, pages 112--123, Aug. 1997.
[8]
A. Ephremides, P. Varaiya, and J. Walrand. A simple dynamic routing problem. IEEE Transactions on Automatic Control, 25(4):690--693, Aug. 1980.
[9]
H. Feng and V. Misra. Mixed scheduling disciplines for network flows. SIGMETRICS Perform. Eval. Rev., 31:36--39, Sept. 2003.
[10]
H. Feng, V. Misra, and D. Rubenstein. Optimal state-free, size-aware dispatching for heterogeneous M/G/-type systems. Perform. Eval., 62(1--4), 2005.
[11]
J. Gittins. Multi-armed Bandit Allocation Indices. Wiley, 1989.
[12]
V. Gupta, M. Harchol-Balter, K. Sigman, and W. Whitt. Analysis of join-the-shortest-queue routing for web server farms. Performance Evaluation, 64(9--12):1062--1081, Oct. 2007.
[13]
M. Harchol-Balter. Task assignment with unknown duration. J. ACM, 49(2):260--288, Mar. 2002.
[14]
M. Harchol-Balter, M. E. Crovella, and C. D. Murta. On choosing a task assignment policy for a distributed server system. Journal of Parallel and Distributed Computing, 59:204--228, 1999.
[15]
M. Harchol-Balter and A. B. Downey. Exploiting process lifetime distributions for dynamic load balancing. ACM Transactions on Computer Systems, 15(3):253--285, Aug. 1997.
[16]
M. Harchol-Balter, A. Scheller-Wolf, and A. R. Young. Surprising results on task assignment in server farms with high-variability workloads. In ACM SIGMETRICS, pages 287--298, 2009.
[17]
M. Harchol-Balter, K. Sigman, and A. Wierman. Asymptotic convergence of scheduling policies with respect to slowdown. Perform. Eval., 49(1--4):241--256, Sept. 2002.
[18]
M. Haviv and T. Roughgarden. The price of anarchy in an exponential multi-server. Oper. Res. Lett., 35(4):421--426, 2007.
[19]
E. Hyytia, S. Aalto, and A. Penttinen. Minimizing slowdown in heterogeneous size-aware dispatching systems (full version). Technical report, Mar. 2012. arXiv:1203.5040.
[20]
E. Hyytia, A. Penttinen, and S. Aalto. Size- and state-aware dispatching problem with queue-specific job sizes. European Journal of Operational Research, 217(2):357--370, Mar. 2012.
[21]
E. Hyytia, A. Penttinen, S. Aalto, and J. Virtamo. Dispatching problem with fixed size jobs and processor sharing discipline. In ITC'23, SFO, USA, Sept. 2011.
[22]
E. Hyytia, J. Virtamo, S. Aalto, and A. Penttinen. M/M/1-PS queue and size-aware task assignment. Performance Evaluation, 68(11):1136--1148, Nov. 2011.
[23]
K. R. Krishnan. Joining the right queue: a state-dependent decision rule. IEEE Transactions on Automatic Control, 35(1):104--108, Jan. 1990.
[24]
Z. Liu and R. Righter. Optimal load balancing on distributed homogeneous unreliable processors. Operations Research, 46(4):563--573, 1998.
[25]
Z. Liu and D. Towsley. Optimality of the round-robin routing policy. Journal of Applied Probability, 31(2):466--475, June 1994.
[26]
L. Schrage. A proof of the optimality of the shortest remaining processing time discipline. Operations Research, 16(3), 1968.
[27]
W. Whitt. Deciding which queue to join: Some counterexamples. Oper. Res., 34(1):55--62, 1986.
[28]
A. Wierman. Fairness and scheduling in single server queues. Surveys in Operations Research and Management Science, 16(1):39--48, 2011.
[29]
A. Wierman, M. Harchol-Balter, and T. Osogami. Nearly insensitive bounds on SMART scheduling. In ACM SIGMETRICS, pages 205--216, 2005.
[30]
W. Winston. Optimality of the shortest line discipline. Journal of Applied Probability, 14:181--189, 1977.
[31]
S. Yang and G. de Veciana. Size-based adaptive bandwidth allocation: optimizing the average QoS for elastic flows. In IEEE INFOCOM, 2002.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMETRICS '12: Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
June 2012
450 pages
ISBN:9781450310970
DOI:10.1145/2254756
  • cover image ACM SIGMETRICS Performance Evaluation Review
    ACM SIGMETRICS Performance Evaluation Review  Volume 40, Issue 1
    Performance evaluation review
    June 2012
    433 pages
    ISSN:0163-5999
    DOI:10.1145/2318857
    Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. M/G/1
  2. MDP
  3. policy improvement
  4. task assignment

Qualifiers

  • Research-article

Conference

SIGMETRICS '12
Sponsor:

Acceptance Rates

Overall Acceptance Rate 459 of 2,691 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Decentralized elastic electricity demand scheduling2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)10.1109/CSDE53843.2021.9718482(1-6)Online publication date: 8-Dec-2021
  • (2021)Open problems in queueing theory inspired by datacenter computingQueueing Systems10.1007/s11134-020-09684-6Online publication date: 27-Jan-2021
  • (2019)SOAPACM SIGMETRICS Performance Evaluation Review10.1145/3308809.330882946:1(36-38)Online publication date: 17-Jan-2019
  • (2018)SOAPProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/31794192:1(1-30)Online publication date: 3-Apr-2018
  • (2018)Distributed Cost-Optimized Placement for Latency-Critical Applications in Heterogeneous Environments2018 IEEE International Conference on Autonomic Computing (ICAC)10.1109/ICAC.2018.00022(121-130)Online publication date: Sep-2018
  • (2018)SRPT for multiserver systemsPerformance Evaluation10.1016/j.peva.2018.10.001Online publication date: Oct-2018
  • (2017)Value (Generating) Functions for the MX/G/1 Queue2017 29th International Teletraffic Congress (ITC 29)10.23919/ITC.2017.8064360(232-240)Online publication date: Sep-2017
  • (2016)Online Packet Dispatching for Delay Optimal Concurrent Transmissions in Heterogeneous Multi-RAT NetworksIEEE Transactions on Wireless Communications10.1109/TWC.2016.2552517(1-1)Online publication date: 2016
  • (2015)Online Packet Dispatching for Concurrent Transmissions in Heterogeneous Multi-RAT Networks2015 IEEE Global Communications Conference (GLOBECOM)10.1109/GLOCOM.2015.7417838(1-6)Online publication date: Dec-2015
  • (2014)Online Packet Dispatching for Concurrent Transmissions in Heterogeneous Multi-RAT Networks2015 IEEE Global Communications Conference (GLOBECOM)10.1109/GLOCOM.2014.7417838(1-6)Online publication date: Dec-2014
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media