[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2745844.2745859acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
research-article

Computable Bounds in Fork-Join Queueing Systems

Published: 15 June 2015 Publication History

Abstract

In a Fork-Join (FJ) queueing system an upstream fork station splits incoming jobs into N tasks to be further processed by N parallel servers, each with its own queue; the response time of one job is determined, at a downstream join station, by the maximum of the corresponding tasks' response times. This queueing system is useful to the modelling of multi-service systems subject to synchronization constraints, such as MapReduce clusters or multipath routing. Despite their apparent simplicity, FJ systems are hard to analyze.
This paper provides the first computable stochastic bounds on the waiting and response time distributions in FJ systems. We consider four practical scenarios by combining 1a) renewal and 1b) non-renewal arrivals, and 2a) non-blocking and 2b) blocking servers. In the case of non blocking servers we prove that delays scale as O(logN), a law which is known for first moments under renewal input only. In the case of blocking servers, we prove that the same factor of log N dictates the stability region of the system. Simulation results indicate that our bounds are tight, especially at high utilizations, in all four scenarios. A remarkable insight gained from our results is that, at moderate to high utilizations, multipath routing 'makes sense' from a queueing perspective for two paths only, i.e., response times drop the most when N = 2; the technical explanation is that the resequencing (delay) price starts to quickly dominate the tempting gain due to multipath transmissions.

References

[1]
Amazon Elastic Compute Cloud EC2.texttthttp://aws.amazon.com/ec2.
[2]
S. Babu. Towards automatic optimization of MapReduce programs. In Proc. of ACM SoCC, pages 137--142, 2010.
[3]
F. Baccelli, E. Gelenbe, and B. Plateau. An end-to-end approach to the resequencing problem. J. ACM, 31(3):474--485, June 1984.
[4]
F. Baccelli, A. M. Makowski, and A. Shwartz. The Fork-Join queue and related systems with synchronization constraints: Stochastic ordering and computable bounds. Adv. in Appl. Probab., 21(3):629--660, Sept. 1989.
[5]
S. Balsamo, L. Donatiello, and N. M. Van Dijk. Bound performance models of heterogeneous parallel processing systems. IEEE Trans. Parallel Distrib. Syst., 9(10):1041--1056, Oct. 1998.
[6]
P. Billingsley. Probability and Measure. Wiley, 3rd edition, 1995.
[7]
O. Boxma, G. Koole, and Z. Liu. Queueing-theoretic solution methods for models of parallel and distributed systems. In Proc. of Performance Evaluation of Parallel and Distributed Systems. CWI Tract 105, pages 1--24, 1994.
[8]
E. Buffet and N. G. Duffield. Exponential upper bounds via martingales for multiplexers with Markovian arrivals. J. Appl. Probab., 31(4):1049--1060, Dec. 1994.
[9]
C. Chang. Performance Guarantees in Communication Networks. Springer, 2000.
[10]
Y. Chen, S. Alspaugh, and R. Katz. Interactive analytical processing in big data systems: A cross-industry study of mapreduce workloads. Proc. VLDB Endow., 5(12):1802--1813, Aug. 2012.
[11]
F. Ciucu, F. Poloczek, and J. Schmitt. Sharp per-flow delay bounds for bursty arrivals: The case of FIFO, SP, and EDF scheduling. In Proc. of IEEE INFOCOM, pages 1896--1904, April 2014.
[12]
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. Commun. ACM, 51(1):107--113, Jan. 2008.
[13]
N. Duffield. Exponential bounds for queues with Markovian arrivals. Queueing Syst., 17(3--4):413--430, Sept. 1994.
[14]
L. Flatto and S. Hahn. Two parallel queues created by arrivals with two demands I. SIAM J. Appl. Math., 44(5):1041--1053, Oct. 1984.
[15]
R. J. Gibbens. Traffic characterisation and effective bandwidths for broadband network traces. J. R. Stat. Soc. Ser. B. Stat. Methodol., 1996.
[16]
Y. Han and A. Makowski. Resequencing delays under multipath routing - Asymptotics in a simple queueing model. In Proc. of IEEE INFOCOM, pages 1--12, April 2006.
[17]
G. Harrus and B. Plateau. Queueing analysis of a reordering issue. IEEE Trans. Softw. Eng., 8(2):113--123, Mar. 1982.
[18]
Y. Jiang and Y. Liu. Stochastic Network Calculus. Springer, 2008.
[19]
S. Kandula, S. Sengupta, A. Greenberg, P. Patel, and R. Chaiken. The nature of data center traffic: Measurements & analysis. In Proc. of ACM IMC, pages 202--208, 2009.
[20]
S. Kavulya, J. Tan, R. Gandhi, and P. Narasimhan. An analysis of traces from a production MapReduce cluster. In Proc. of IEEE/ACM CCGRID, pages 94--103, May 2010.
[21]
B. Kemper and M. Mandjes. Mean sojourn times in two-queue Fork-Join systems: Bounds and approximations. OR Spectr., 34(3):723--742, July 2012.
[22]
G. Kesidis, B. Urgaonkar, Y. Shan, S. Kamarava, and J. Liebeherr. Network calculus for parallel processing. CoRR, abs/1409.0820, 2014.
[23]
J. F. C. Kingman. Inequalities in the theory of queues. J. R. Stat. Soc. Ser. B. Stat. Methodol., 32(1):102--110, 1970.
[24]
S.-S. Ko and R. F. Serfozo. Sojourn times in G/M/1 Fork-Join networks. Naval Res. Logist., 55(5):432--443, May 2008.
[25]
A. S. Lebrecht and W. J. Knottenbelt. Response time approximations in Fork-Join queues. In Proc. of UKPEW, July 2007.
[26]
R. Nelson and A. Tantawi. Approximate analysis of Fork/Join synchronization in parallel queues. IEEE Trans. Computers, 37(6):739--743, June 1988.
[27]
R. Pike, S. Dorward, R. Griesemer, and S. Quinlan. Interpreting the data: Parallel analysis with Sawzall. Sci. Program., 13(4):277--298, Oct. 2005.
[28]
I. Polato, R. Ré, A. Goldman, and F. Kon. A comprehensive view of Hadoop research - a systematic literature review. J. Netw. Comput. Appl., 46(0):1 -- 25, Nov. 2014.
[29]
F. Poloczek and F. Ciucu. Scheduling analysis with martingales. Perform. Evaluation, 79:56--72, Sept. 2014.
[30]
C. Raiciu, S. Barre, C. Pluntke, A. Greenhalgh, D. Wischik, and M. Handley. Improving datacenter performance and robustness with multipath TCP. SIGCOMM Comput. Commun. Rev., 41(4):266--277, Aug. 2011.
[31]
A. Rényi. On the theory of order statistics. Acta Mathematica Academiae Scientiarum Hungarica, 4(3--4):191--231, 1953.
[32]
J. Tan, X. Meng, and L. Zhang. Delay tails in MapReduce scheduling. SIGMETRICS Perform. Eval. Rev., 40(1):5--16, June 2012.
[33]
J. Tan, Y. Wang, W. Yu, and L. Zhang. Non-work-conserving effects in MapReduce: Diffusion limit and criticality. SIGMETRICS Perform. Eval. Rev., 42(1):181--192, June 2014.
[34]
E. Varki. Mean value technique for closed Fork-Join networks. SIGMETRICS Perform. Eval. Rev., 27(1):103--112, May 1999.
[35]
S. Varma and A. M. Makowski. Interpolation approximations for symmetric Fork-Join queues. Perform. Eval., 20(1--3):245--265, May 1994.
[36]
E. Vianna, G. Comarela, T. Pontes, J. Almeida, V. Almeida, K. Wilkinson, H. Kuno, and U. Dayal. Analytical performance models for MapReduce workloads. Int. J. Parallel Prog., 41(4):495--525, Aug. 2013.
[37]
T. White. Hadoop: The Definitive Guide. O'Reilly Media, Inc., 1st edition, 2009.
[38]
Y. Xia and D. Tse. On the large deviation of resequencing queue size: 2-M/M/1 case. IEEE Trans. Inf. Theory, 54(9):4107--4118, Sept. 2008.
[39]
M. Zaharia, A. Konwinski, A. D. Joseph, R. Katz, and I. Stoica. Improving MapReduce performance in heterogeneous environments. In Proc. of USENIX OSDI, pages 29--42, Dec. 2008.

Cited By

View all
  • (2023)Tail Prediction for Heterogeneous Data Center ClustersProcesses10.3390/pr1102040711:2(407)Online publication date: 30-Jan-2023
  • (2023)Optimal Collaborative Uploading in Crowdsensing with Graph LearningICC 2023 - IEEE International Conference on Communications10.1109/ICC45041.2023.10279250(1792-1797)Online publication date: 28-May-2023
  • (2021)Optimizing the Response Time of Memcached Systems via Model and Quantitative AnalysisIEEE Transactions on Computers10.1109/TC.2020.301161970:9(1458-1471)Online publication date: 1-Sep-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMETRICS '15: Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
June 2015
488 pages
ISBN:9781450334860
DOI:10.1145/2745844
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 June 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. fork-join queue
  2. mapreduce
  3. multipath
  4. parallel systems
  5. performance evaluation

Qualifiers

  • Research-article

Funding Sources

  • DFG

Conference

SIGMETRICS '15
Sponsor:

Acceptance Rates

SIGMETRICS '15 Paper Acceptance Rate 32 of 239 submissions, 13%;
Overall Acceptance Rate 459 of 2,691 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Tail Prediction for Heterogeneous Data Center ClustersProcesses10.3390/pr1102040711:2(407)Online publication date: 30-Jan-2023
  • (2023)Optimal Collaborative Uploading in Crowdsensing with Graph LearningICC 2023 - IEEE International Conference on Communications10.1109/ICC45041.2023.10279250(1792-1797)Online publication date: 28-May-2023
  • (2021)Optimizing the Response Time of Memcached Systems via Model and Quantitative AnalysisIEEE Transactions on Computers10.1109/TC.2020.301161970:9(1458-1471)Online publication date: 1-Sep-2021
  • (2020)A Black-Box Fork-Join Latency Prediction Model for Data-Intensive ApplicationsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2020.298213731:9(1983-2000)Online publication date: 1-Sep-2020
  • (2019)Overcome Heterogeneity Impact in Modeled Fork-Join Queuing Networks for Tail Prediction2019 International Conference on Computing, Networking and Communications (ICNC)10.1109/ICCNC.2019.8685575(270-275)Online publication date: Feb-2019
  • (2018)Improving Output Bounds in the Stochastic Network Calculus Using Lyapunov’s Inequality2018 IFIP Networking Conference (IFIP Networking) and Workshops10.23919/IFIPNetworking.2018.8696709(1-9)Online publication date: May-2018
  • (2018)ForkTailProceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing10.1145/3208040.3208058(206-217)Online publication date: 11-Jun-2018
  • (2018)Approximations and bounds for (n, k) fork-join queuesProceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing10.1109/CCGRID.2018.00069(422-431)Online publication date: 1-May-2018
  • (2018)Toward an analytical method for SLA validationSoftware and Systems Modeling (SoSyM)10.1007/s10270-017-0604-y17:2(527-545)Online publication date: 1-May-2018
  • (2018)Biased Processor Sharing in Fork-Join QueuesQuantitative Evaluation of Systems10.1007/978-3-319-99154-2_17(273-288)Online publication date: 15-Aug-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media