Abstract
Small organisations can now have access to high raw processing power using networks of workstations (NOW) as parallel computing platforms. Software Distributed Shared Memory (Software DSM) packages have been developed to facilitate the programming of such systems. However, because of the high interprocess latencies in a NOW, the performance of a software DSM application is more susceptible to the partitioning of the problem than what might be expected.
This paper presents an approach for a tool to visualise the execution of a program in a way that highlights performance bottlenecks. The tool associates identified bottlenecks with the corresponding source code lines in order to determine what piece of code is the cause of poor performance. The visualisation technique is demonstrated in two case studies. They clearly show that the visualisation is indeed useful and provides an effective way to acquire an understanding of what characterises an applications sharing behaviour.
Similar content being viewed by others
References
C. Amza, A. L. Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, W. Yu, and W. Zwaenepoel. TreadMarks: Shared Memory Computing on Networks of Workstations, IEEE Computer, vol. 29(2)q, pp. 18-28, February 1996.
M. Brorsson. “SM-prof: A tool to visualise and find cache coherence bottlenecks in multiprocessor programs,” in Proceedings of the 1995 ACM SIGMETRICS International Conference on Measurement & Modeling of Computer Systems, Ottawa, Canada, pp. 178-187, May 1995.
M. Brorsson. “Performance tuning of small scale shared memory multiprocessor applications using visualisation, ” in Proceedings of the 10th International Conference on Parallel and Distributed Computing Systems, New Orleans, October 1997.
M. T. Heath, A. D. Malony, and D. T. Rover. The Visual Display of Parallel Performance Data, IEEE Computer, pp. 21-28, November 1995.
P. Keleher, S. Dwarkadas, A. L. & Cox, and W. Zwaenepoel. “TreadMarks: Distributed shared memory on standard workstations and operating systems,” in Proceedings of the Winter 94 Usenix Conference, pp. 115-131, January 1994.
M. Kral. Programmer's Aid for Parallel Programming, MSc thesis, Department of Information Technology, Lund University, P.O. Box 118, SE-221 00 Lund, Sweden.
B. P. Miller, M. D. Callaghan, J. M. Cargille, J. K. Hollingswirth, R. B. Irvin, K. L. Karavanic, K. Kunchithapadam, and T. Newhall. The Paradyn Performance Tools. IEEE Computer, vol. 28(11), November 1995.
E. W. Parsons, M. Brorsson, and K. C. Sevcik. Predicting the Performance of Distributed Virtual Shared Memory Applications, IBM Systems Journal, vol. 36(4)q.
R. Rajamony and A. L. Cox. “Performance debugging shared memory parallel programs using run-time dependence analysis,” in Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, Seattle, WA, June 1997
I. Schoinas, B. Falsafi, A. R. Lebeck, S. K. Reinhardt, J. R. Larus, and D. A. Wood. “Fine-grain access control for distributed shared memory,” in Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS VI), pp. 297-307, October 1994.
J. P. Singh, W.-D. Weber, and A. Gupta. SPLASH: Stanford Parallel Applications for Shared-Memory. Computer Architecture News, vol. 20(1) pp. 5-44, March 1992.
Z. Xu, J. R. Larus, and B. P. Miller. “Shared-memory performance profiling,” in Proceedings of the 6th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'97), Las Vegas, Nevada, 1997.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Brorsson, M., Kral, M. Performance Tuning Software DSM Applications using Visualisation. The Journal of Supercomputing 13, 249–265 (1999). https://doi.org/10.1023/A:1008005003054
Issue Date:
DOI: https://doi.org/10.1023/A:1008005003054