Abstract
The dynamic nature of large-size Network Computing Systems (NCSs) and the varying monitoring demands from the end-users pose serious challenges for monitoring systems (MSs). A statically configured MS initially adjusted to perform optimally may end performing poorly. A reconfiguration mechanism for a distributed MS is proposed. It enables the MS to react to changes in the available resources, operating conditions, and monitoring requirements, while maintaining high performance and low monitoring overheads. A localized decision process involving two adjacent intermediate-level managers (ILMs) and values of a local node performance parameter called temperature together determine transformations (merge, split, migrate) for each ILM. The reconfiguration mechanisms are derived reusing SNMP primitives. Interactions between MS and NCS are studied by defining a queuing model, and by evaluating different configuration schemes using simulation. Results for the static and reconfigurable schemes indicate that reconfiguration improves performance in terms of lower processing delays at the ILMs.
This work has been done with the support of the Ministerio de Ciencia y Tecnologia, Spain, under contract MCYT TIC2001-0591-C02-02.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Foster, I., Kesselman, C., Tuecke, S.: The Anatomy of the Grid: Enabling Scalable Virtual Organizations . Intl. J. Supercomputer Applications (2001)
Subramanyan, R., Miguel-Alonso, J., Fortes, J.A.B.: A Scalable SNMP-based Distributed Monitoring System for Heterogeneous Network Computing. In: Supercomputing (November 2000)
Abdu, H., Lutfiyya, H., Bauer, M.A.: A Testbed for Optimizing the Monitoring of Distributed Systems. In: Proceeding of PDCS 1998 (1998)
Liotta, A., Pavlou, G., Knight, G.: A Self-adaptable Agent System for Efficient Information Gathering. In: MATA (2001)
Hollingsworth, J.K., Miller, B.P.: An Adaptive Cost System for Parallel Program Instrumentation. In: Euro-Par1996 (August 1996)
Subramanyan, R.: Scalable SNMP-Based Monitoring Systems for Network Computing PhD thesis, Purdue University (August 2002)
Kapadia, N.H., Fortes, J.A.B.: PUNCH: An Architecture for Web-Enabled Wide-Area Network-Computing, Cluster Computing (September 1999)
Ptolemy 0.7 User’s Manual, UC Berkeley, http://ptolemy.eecs.berkeley.edu
Wolski, R.: Dynamically Forecasting Network Performance using the Network Weather Service, Cluster Computing (1998)
Lee, C.A., Stepanek, J., Wolski, R., Kesselman, C., Foster, I.: A Network Performance Tool for Grid Environments .In:HPDC 1998 (1998)
Tierney, B., Johnston, W., Crowley, B.: The Netlogger Methodology for High Performance Distributed Systems Performance Analysis. In: HPDC 1998 (1998)
Miller, N., Steenkiste, P.: Collecting Network Status Information for Network-Aware Applications. In: Proceeding of Infocom 2000 (2000)
Liotta, A., Knight, G., Pavlou, G.: On the Performance and Scalability of Decentralized Monitoring using Mobile Agents. In:DSOM (1999)
Black, U.: Network Management Standards. McGraw-Hill, New York (1995)
Malony, A.D., Reed, D.A., Wijshoff, H.A.G.: Performance Measurement Intrusion and Perturbation Analysis. In: IEEE Transactions on Parallel and Distributed Systems (July 1992)
DARPA, Quorum Project, http://www.darpa.mil/ito/research/quorum/index.html
Goldszmidt, G.: Distributed Management by Delegation, PhD thesis, Columbia University (December 1995)
Waheed, A., Rover, D.T., Mutka, M.W., Smith, H., Bakic, A.: Modeling, Evaluation, and Adaptive Control of an Instrumentation System. In: Proc. Real- Time Technology and Applications Symposium (RTAS 1997) (June 1997)
Siegl, M.: Design and Realization of a Mid-Level Management System, PhD thesis, TUW-HDNM, Vienna University of Technology (November 1996)
Pras, A.: Network Management Architectures, PhD thesis, Centre for Telematics and Information Technology, University of Twente (April 1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Subramanyan, R., Miguel-Alonso, J., Fortes, J.A.B. (2003). A Reconfigurable Monitoring System for Large-Scale Network Computing. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds) Euro-Par 2003 Parallel Processing. Euro-Par 2003. Lecture Notes in Computer Science, vol 2790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45209-6_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-45209-6_17
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40788-1
Online ISBN: 978-3-540-45209-6
eBook Packages: Springer Book Archive