Using MPI (2nd ed.): portable parallel programming with the message-passing interface
December 1999
Publisher: MIT Press, 55 Hayward St., Cambridge, MA, United States
ISBN: 978-0-262-57132-6
Published: 01 December 1999
Pages: 371
Abstract

No abstract available.

Cited By

  1. Chen Y, Wen Z and Biros G A Scalable Algorithm for Active Learning Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, (1-17)
  2. Suggs E, Olivier S, Ciesko J and Skjellum A View-aware Message Passing Through the Integration of Kokkos and ExaMPI Proceedings of the 30th European MPI Users' Group Meeting, (1-10)
  3. Cowan M, Maleki S, Musuvathi M, Saarikivi O and Xiong Y MSCCLang: Microsoft Collective Communication Language Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, (502-514)
  4. Kang H, Zhao Y, Blelloch G, Dhulipala L, Gu Y, McGuffey C and Gibbons P (2022). PIM-Tree, Proceedings of the VLDB Endowment, 16:4, (946-958), Online publication date: 1-Dec-2022.
  5. Lawson M, Gropp W and Lofstead J (2022). EMPRESS: Accelerating Scientific Discovery through Descriptive Metadata Management, ACM Transactions on Storage, 18:4, (1-49), Online publication date: 30-Nov-2022.
  6. Moses W, Narayanan S, Paehler L, Churavy V, Schanen M, Hückelheim J, Doerfert J and Hovland P Scalable automatic differentiation of multiple parallel paradigms through compiler augmentation Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (1-18)
  7. Shang H, Shen L, Fan Y, Xu Z, Guo C, Liu J, Zhou W, Ma H, Lin R, Yang Y, Li F, Wang Z, Zhang Y and Li Z Large-scale simulation of quantum computational chemistry on a new Sunway supercomputer Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (1-14)
  8. Bhardwaj R, Tumanov A, Wang S, Liaw R, Moritz P, Nishihara R and Stoica I ESCHER Proceedings of the 13th Symposium on Cloud Computing, (47-62)
  9. Singh N, Zhang Z, Wu X, Zhang N, Zhang S and Solomonik E (2022). Distributed-memory tensor completion for generalized loss functions in Python using new sparse tensor kernels, Journal of Parallel and Distributed Computing, 169:C, (269-285), Online publication date: 1-Nov-2022.
  10. Yang F, Chandra A, Zhang Y, Tendulkar S, Nastasia R, Oberai A, Shephard M and Sahni O (2022). A parallel interface tracking approach for evolving geometry problems, Engineering with Computers, 38:5, (4289-4305), Online publication date: 1-Oct-2022.
  11. Wald I and Parker S (2022). Data Parallel Path Tracing with Object Hierarchies, Proceedings of the ACM on Computer Graphics and Interactive Techniques, 5:3, (1-16), Online publication date: 25-Jul-2022.
  12. Kim J, Kwon H, Kang J, Park J, Lee S and Lee J SnuHPL Proceedings of the 36th ACM International Conference on Supercomputing, (1-12)
  13. Fan K, Gilray T, Pascucci V, Huang X, Micinski K and Kumar S Optimizing the Bruck Algorithm for Non-uniform All-to-all Communication Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing, (172-184)
  14. He J, Zhai J, Antunes T, Wang H, Luo F, Shi S and Li Q FasterMoE Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (120-134)
  15. Riaño‐Briceño G, Sela L and Hodges B (2022). Distributed and vectorized method of characteristics for fast transient simulations in water distribution systems, Computer-Aided Civil and Infrastructure Engineering, 37:2, (163-184), Online publication date: 4-Jan-2022.
  16. Zheng Q, Cranor C, Ganger G, Gibson G, Amvrosiadis G, Settlemyer B and Grider G DeltaFS Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-15)
  17. Milan P, Hickey J, Wang X and Yang V (2022). Deep-learning accelerated calculation of real-fluid properties in numerical simulation of complex flowfields, Journal of Computational Physics, 444:C, Online publication date: 1-Nov-2021.
  18. Deng C, Zhang Y and Bailey R (2021). Evaluating crop-soil-water dynamics in waterlogged areas using a coupled groundwater-agronomic model, Environmental Modelling & Software, 143:C, Online publication date: 1-Sep-2021.
  19. Elshazly H, Lordan F, Ejarque J and Badia R (2021). Accelerated execution via eager-release of dependencies in task-based workflows, International Journal of High Performance Computing Applications, 35:4, (325-343), Online publication date: 1-Jul-2021.
  20. Chen Z, Xu C, Soto J, Markl V, Qian W and Zhou A Hybrid Evaluation for Distributed Iterative Matrix Computation Proceedings of the 2021 International Conference on Management of Data, (300-312)
  21. Verbraeken J, Wolting M, Katzy J, Kloppenburg J, Verbelen T and Rellermeyer J (2020). A Survey on Distributed Machine Learning, ACM Computing Surveys, 53:2, (1-33), Online publication date: 31-Mar-2021.
  22. Lanihun O, Tiddeman B, Shaw P and Tuci E (2021). Evolutionary active vision system, Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems, 29:1, (3-24), Online publication date: 1-Feb-2021.
  23. Lyerly R, Min C, Rossbach C and Ravindran B An OpenMP Runtime for Transparent Work Sharing Across Cache-Incoherent Heterogeneous Nodes Proceedings of the 21st International Middleware Conference, (415-429)
  24. Pedretti K, Younge A, Hammond S, Laros J, Curry M, Aguilar M, Hoekstra R and Brightwell R Chronicles of Astra Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-14)
  25. Verma K, McCabe C, Peng C and Wille R (2020). A PCISPH implementation using distributed multi-GPU acceleration for simulating industrial engineering applications, International Journal of High Performance Computing Applications, 34:4, (450-464), Online publication date: 1-Jul-2020.
  26. Junttila T, Karppa M, Kaski P and Kohonen J (2020). An adaptive prefix-assignment technique for symmetry reduction, Journal of Symbolic Computation, 99:C, (21-49), Online publication date: 1-Jul-2020.
  27. Yu H, Chen Z, Fu X, Wang J, Su Z, Sun J, Huang C and Dong W Symbolic verification of message passing interface programs Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, (1248-1260)
  28. Chuang H, Lyerly R, Lankes S and Ravindran B Scaling Shared Memory Multiprocessing Applications in Non-cache-coherent Domains Proceedings of the 13th ACM International Systems and Storage Conference, (13-24)
  29. Chen J, Zhou W, Dong Y, Wang Z, Cui C, Wu F, Zhou E and Tang Y (2019). Analyzing time-dimension communication characterizations for representative scientific applications on supercomputer systems, Frontiers of Computer Science: Selected Publications from Chinese Universities, 13:6, (1228-1242), Online publication date: 1-Dec-2019.
  30. Barijough K, Zhao Z and Gerstlauer A (2019). Quality/Latency-Aware Real-time Scheduling of Distributed Streaming IoT Applications, ACM Transactions on Embedded Computing Systems, 18:5s, (1-23), Online publication date: 31-Oct-2019.
  31. Park H, Park T and Lee Y (2019). Partially collapsed Gibbs sampling for latent Dirichlet allocation, Expert Systems with Applications: An International Journal, 131:C, (208-218), Online publication date: 1-Oct-2019.
  32. Fang Y, Chen Q and Xiong N (2022). A multi-factor monitoring fault tolerance model based on a GPU cluster for big data processing, Information Sciences: an International Journal, 496:C, (300-316), Online publication date: 1-Sep-2019.
  33. Cao S, Yang X, Chen C, Zhou J, Li X and Qi Y (2019). TitAnt, Proceedings of the VLDB Endowment, 12:12, (2082-2093), Online publication date: 1-Aug-2019.
  34. Matheou G, Soteriou V and Evripidou P (2022). Toward data-driven architectural support in improving the performance of future HPC architectures, Parallel Computing, 86:C, (82-106), Online publication date: 1-Aug-2019.
  35. Nie J, Zhang C, Zou D, Xia F, Lu L, Wang X and Zhao F Adaptive Sparse Matrix-Vector Multiplication on CPU-GPU Heterogeneous Architecture Proceedings of the 2019 3rd High Performance Computing and Cluster Technologies Conference, (6-10)
  36. Macià S, Martínez-Ferrer P, Mateo S, Beltran V and Ayguadé E Assembling a High-Productivity DSL for Computational Fluid Dynamics Proceedings of the Platform for Advanced Scientific Computing Conference, (1-11)
  37. Zhu C, Seo J and Mittal R (2019). A graph-partitioned sharp-interface immersed boundary solver for efficient solution of internal flows, Journal of Computational Physics, 386:C, (37-46), Online publication date: 1-Jun-2019.
  38. López-Gómez J, Fernández Muñoz J, del Rio Astorga D, Dolz M and Garcia J (2019). Exploring stream parallel patterns in distributed MPI environments, Parallel Computing, 84:C, (24-36), Online publication date: 1-May-2019.
  39. Witt C, Bux M, Gusew W and Leser U (2019). Predictive performance modeling for distributed batch processing using black box monitoring and machine learning, Information Systems, 82:C, (33-52), Online publication date: 1-May-2019.
  40. Llanos D and Vigo-Aguiar J (2019). Computational and mathematical models meet heterogeneous computing, The Journal of Supercomputing, 75:3, (999-1000), Online publication date: 1-Mar-2019.
  41. Jeong E, Cho S, Yu G, Jeong J, Shin D and Chun B JANUS Proceedings of the 16th USENIX Conference on Networked Systems Design and Implementation, (453-467)
  42. Lyerly R, Kim S and Ravindran B libMPNode Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores, (81-90)
  43. Fresno J, Barba D, Gonzalez-Escribano A and Llanos D (2019). HitFlow, International Journal of Parallel Programming, 47:1, (3-23), Online publication date: 1-Feb-2019.
  44. Netto M, Calheiros R, Rodrigues E, Cunha R and Buyya R (2018). HPC Cloud for Scientific and Business Applications, ACM Computing Surveys, 51:1, (1-29), Online publication date: 31-Jan-2019.
  45. Peng Z, Powell A, Wu B, Bicer T and Ren B Graphphi Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, (1-14)
  46. García J, Hammond K and Schubert L (2018). Reengineering for parallelism in heterogeneous parallel platforms, The Journal of Supercomputing, 74:11, (5625-5627), Online publication date: 1-Nov-2018.
  47. Bordage C and Jeannot E Process affinity, metrics and impact on performance Proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, (523-532)
  48. Zhou B and Buyya R (2018). Augmentation Techniques for Mobile Cloud Computing, ACM Computing Surveys, 51:1, (1-38), Online publication date: 14-Apr-2018.
  49. Chafik S, Larach A and Daoui C (2018). Parallel Hierarchical Pre-Gauss-Seidel Value Iteration Algorithm, International Journal of Decision Support System Technology, 10:2, (1-22), Online publication date: 1-Apr-2018.
  50. Macià S, Mateo S, Martínez-Ferrer P, Beltran V, Mira D and Ayguadé E Saiph Proceedings of the Real World Domain Specific Languages Workshop 2018, (1-10)
  51. Mahafzah B and Al-Zoubi I (2018). Broadcast communication operations for hyper hexa-cell interconnection network, Telecommunications Systems, 67:1, (73-93), Online publication date: 1-Jan-2018.
  52. Matheou G and Evripidou P (2017). Data-Driven Concurrency for High Performance Computing, ACM Transactions on Architecture and Code Optimization, 14:4, (1-26), Online publication date: 20-Dec-2017.
  53. Petruzza S, Venkat A, Gyulassy A, Scorzelli G, Federer F, Angelucci A, Pascucci V and Bremer P ISAVS SIGGRAPH Asia 2017 Symposium on Visualization, (1-8)
  54. Roth P, Shan H, Riegner D, Antolin N, Sreepathi S, Oliker L, Williams S, Moore S and Windl W Performance analysis and optimization of the RAMPAGE metal alloy potential generation software Proceedings of the 4th ACM SIGPLAN International Workshop on Software Engineering for Parallel Systems, (11-20)
  55. Luo Z, Zheng M and Siegel S Verification of MPI programs using CIVL Proceedings of the 24th European MPI Users' Group Meeting, (1-11)
  56. Kalim U, Gardner M and Feng W A Non-Invasive Approach for Realizing Resilience in MPI Proceedings of the 2017 Workshop on Fault-Tolerance for HPC at Extreme Scale, (1-8)
  57. González-Álvarez D, Vega-Rodríguez M and Rubio-Largo Á (2017). A hybrid MPI/OpenMP parallel implementation of NSGA-II for finding patterns in protein sequences, The Journal of Supercomputing, 73:6, (2285-2312), Online publication date: 1-Jun-2017.
  58. Samarah M and Fatmi R Dynamic Self-assembling Petaflop Scale Clusters Proceedings of the International Conference on High Performance Compilation, Computing and Communications, (1-5)
  59. Maillo J, Ramírez S, Triguero I and Herrera F (2017). kNN-IS, Knowledge-Based Systems, 117:C, (3-15), Online publication date: 1-Feb-2017.
  60. Gao Z and Jermaine C (2016). Distributed Algorithms for Computing Very Large Thresholded Covariance Matrices, ACM Transactions on Knowledge Discovery from Data, 11:2, (1-25), Online publication date: 26-Dec-2016.
  61. Kongmanee J, Thanapattheerakul T and Chan J Parallel Affinity Propagation Clustering in Identifying Sub-Network Biomarker Genes of Lung Cancer Proceedings of the 7th International Conference on Computational Systems-Biology and Bioinformatics, (19-23)
  62. Palmer B Application of PGAS programming to power grid simulation Proceedings of the First Workshop on PGAS Applications, (33-40)
  63. Guan S, De Grande R and Boukerche A An HLA-based Cloud Simulator for Mobile Cloud Environments Proceedings of the 20th International Symposium on Distributed Simulation and Real-Time Applications, (128-135)
  64. Salles T and Hardiman L (2016). Badlands, Computers & Geosciences, 91:C, (77-89), Online publication date: 1-Jun-2016.
  65. Aleksandrov L, Chapuis G and Djidjev H Parallel Shortest-Path Queries in Planar Graphs Proceedings of the ACM Workshop on High Performance Graph Processing, (9-16)
  66. Palmer B, Perkins W, Chen Y, Jin S, Callahan D, Glass K, Diao R, Rice M, Elbert S, Vallem M and Huang Z (2016). GridPACK™, International Journal of High Performance Computing Applications, 30:2, (223-240), Online publication date: 1-May-2016.
  67. Koukos K, Ros A, Hagersten E and Kaxiras S (2016). Building Heterogeneous Unified Virtual Memories (UVMs) without the Overhead, ACM Transactions on Architecture and Code Optimization, 13:1, (1-22), Online publication date: 5-Apr-2016.
  68. Yang Q, Du Z, Cao Z, Tao J and Bader D (2016). A New Parallel Method for Binary Black Hole Simulations, Scientific Programming, 2016, (6), Online publication date: 1-Mar-2016.
  69. López H, Marques E, Martins F, Ng N, Santos C, Vasconcelos V and Yoshida N (2015). Protocol-based verification of message-passing parallel programs, ACM SIGPLAN Notices, 50:10, (280-298), Online publication date: 18-Dec-2015.
  70. Thébault L, Petit E and Dinh Q (2015). Scalable and efficient implementation of 3d unstructured meshes computation: a case study on matrix assembly, ACM SIGPLAN Notices, 50:8, (120-129), Online publication date: 18-Dec-2015.
  71. Götz M, Bodenstein C and Riedel M HPDBSCAN Proceedings of the Workshop on Machine Learning in High-Performance Computing Environments, (1-10)
  72. Hayashi A, Zhao J, Ferguson M and Sarkar V LLVM-based communication optimizations for PGAS programs Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, (1-11)
  73. López H, Marques E, Martins F, Ng N, Santos C, Vasconcelos V and Yoshida N Protocol-based verification of message-passing parallel programs Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, (280-298)
  74. D'Azevedo E and Imam N Graph 500 in OpenSHMEM Revised Selected Papers of the Second Workshop on OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies - Volume 9397, (154-163)
  75. Arora R, Chen K, Gupta M, Clark S and Song C Leveraging DiaGrid hub for interactively generating and running parallel programs Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, (1-8)
  76. Fathollahzadeh S, Karimi R, Sharifi M, Teymourian K, Hasan A and Paschke A Parallel event processing on unbound streams with multi-step windowing Proceedings of the 9th ACM International Conference on Distributed Event-Based Systems, (328-329)
  77. Roth P, Meredith J and Vetter J Automated Characterization of Parallel Application Communication Patterns Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, (73-84)
  78. Kaeli D, Mistry P, Schaa D and Zhang D (2015). Heterogeneous Computing with OpenCL 2.0, 10.5555/2815521, Online publication date: 1-Jun-2015.
  79. Hammouda A, Siegel A and Siegel S (2015). Noise-Tolerant Explicit Stencil Computations for Nonuniform Process Execution Rates, ACM Transactions on Parallel Computing, 2:1, (1-33), Online publication date: 21-May-2015.
  80. Fiore S, Mancini M, Elia D, Nassisi P, Brasileiro F and Blanquer I Big data analytics for climate change and biodiversity in the EUBrazilCC federated cloud infrastructure Proceedings of the 12th ACM International Conference on Computing Frontiers, (1-8)
  81. Sura Z, Jacob A, Chen T, Rosenburg B, Sallenave O, Bertolli C, Antao S, Brunheroto J, Park Y, O'Brien K and Nair R Data access optimization in a processing-in-memory system Proceedings of the 12th ACM International Conference on Computing Frontiers, (1-8)
  82. Acı Ç and Akay M (2015). A hybrid congestion control algorithm for broadcast-based architectures with multiple input queues, The Journal of Supercomputing, 71:5, (1907-1931), Online publication date: 1-May-2015.
  83. Mathew B, Rai N, Gupta A and Harode A Exploiting computing power of Xeon and Intel Xeon Phi for a molecular dynamics application Proceedings of the Symposium on High Performance Computing, (9-16)
  84. Santander-Jiménez S and Vega-Rodríguez M (2015). A hybrid approach to parallelize a fast non-dominated sorting genetic algorithm for phylogenetic inference, Concurrency and Computation: Practice & Experience, 27:3, (702-734), Online publication date: 10-Mar-2015.
  85. Thébault L, Petit E and Dinh Q Scalable and efficient implementation of 3d unstructured meshes computation: a case study on matrix assembly Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (120-129)
  86. Takizawa H, Hirasawa S, Sugawara M, Gelado I, Kobayashi H and Hwu W (2016). Optimized data transfers based on the OpenCL event management mechanism, Scientific Programming, 2015, (2-2), Online publication date: 1-Jan-2015.
  87. Chapuis G, Boudic-Jamin M, Andonov R, Djidjev H and Lavenier D (2015). Parallel seed-based approach to multiple protein structure similarities detection, Scientific Programming, 2015, (2-2), Online publication date: 1-Jan-2015.
  88. Palmer B, Perkins W, Chen Y, Jin S, Callahan D, Glass K, Diao R, Rice M, Elbert S, Vallem M and Huang Z GridPACK™ Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, (68-77)
  89. Cassidy A, Alvarez-Icaza R, Akopyan F, Sawada J, Arthur J, Merolla P, Datta P, Tallada M, Taba B, Andreopoulos A, Amir A, Esser S, Kusnitz J, Appuswamy R, Haymes C, Brezzo B, Moussalli R, Bellofatto R, Baks C, Mastro M, Schleupen K, Cox C, Inoue K, Millman S, Imam N, McQuinn E, Nakamura Y, Vo I, Guo C, Nguyen D, Lekuch S, Asaad S, Friedman D, Jackson B, Flickner M, Risk W, Manohar R and Modha D Real-time scalable cortical computing at 46 giga-synaptic OPS/watt with ~100× speedup in time-to-solution and ~100,000× reduction in energy-to-solution Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (27-38)
  90. Chen J and Edelman A Parallel prefix polymorphism permits parallelization, presentation & proof Proceedings of the 1st Workshop for High Performance Technical Computing in Dynamic Languages, (47-56)
  91. Fredriksson O, Ghica D and Wheen B Towards native higher-order remote procedure calls Proceedings of the 26th 2014 International Symposium on Implementation and Application of Functional Languages, (1-12)
  92. Alborzi S, Maduranga D, Fan R, Rajapakse J and Zheng J CUDAGRN Proceedings of the 9th IAPR International Conference on Pattern Recognition in Bioinformatics - Volume 8626, (85-97)
  93. Arora R, Olaya J and Gupta M A Tool for Interactive Parallelization Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment, (1-8)
  94. Arnold G, Gajbe M, Koric S and Urbanic J XSEDE OpenACC workshop enables Blue Waters Researchers to Accelerate Key Algorithms Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment, (1-6)
  95. Radenski A Big data, high-performance computing, and MapReduce Proceedings of the 15th International Conference on Computer Systems and Technologies, (13-24)
  96. Roth P and Meredith J Value influence analysis for message passing applications Proceedings of the 28th ACM international conference on Supercomputing, (145-154)
  97. Wang Z, Shi X, Jin H, Wu S and Chen Y Iteration based collective I/O strategy for parallel I/O systems Proceedings of the 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, (287-294)
  98. Valero M, Moreto M, Casas M, Ayguade E and Labarta J (2014). Runtime-Aware Architectures, Supercomputing Frontiers and Innovations: an International Journal, 1:1, (29-44), Online publication date: 6-Apr-2014.
  99. Hower D, Hechtman B, Beckmann B, Gaster B, Hill M, Reinhardt S and Wood D (2014). Heterogeneous-race-free memory models, ACM SIGARCH Computer Architecture News, 42:1, (427-440), Online publication date: 5-Apr-2014.
  100. Hower D, Hechtman B, Beckmann B, Gaster B, Hill M, Reinhardt S and Wood D (2014). Heterogeneous-race-free memory models, ACM SIGPLAN Notices, 49:4, (427-440), Online publication date: 5-Apr-2014.
  101. Hower D, Hechtman B, Beckmann B, Gaster B, Hill M, Reinhardt S and Wood D Heterogeneous-race-free memory models Proceedings of the 19th international conference on Architectural support for programming languages and operating systems, (427-440)
  102. Guo Z, Mi J, Xiong S and Grant P (2014). Phase field study of the tip operating state of a freely growing dendrite against convection using a novel parallel multigrid approach, Journal of Computational Physics, 257:PA, (278-297), Online publication date: 15-Jan-2014.
  103. Chong N, Donaldson A and Ketema J (2014). A sound and complete abstraction for reasoning about parallel prefix sums, ACM SIGPLAN Notices, 49:1, (397-409), Online publication date: 13-Jan-2014.
  104. Chong N, Donaldson A and Ketema J A sound and complete abstraction for reasoning about parallel prefix sums Proceedings of the 41st ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, (397-409)
  105. Yue S and Gray J (2014). SPOT, Advances in Software Engineering, 2014, (9-9), Online publication date: 1-Jan-2014.
  106. Li P, Wu J and Shang L Fast Approximate Attribute Reduction with MapReduce Proceedings of the 8th International Conference on Rough Sets and Knowledge Technology - Volume 8171, (271-278)
  107. Huang L, Wang Z, Xiao N, Wang Y and Dou Q (2013). Adaptive communication mechanism for accelerating MPI functions in NoC-based multicore processors, ACM Transactions on Architecture and Code Optimization, 10:3, (1-25), Online publication date: 16-Sep-2013.
  108. Wilke J, Sargsyan K, Kenny J, Debusschere B, Najm H and Hendry G Validation and uncertainty assessment of extreme-scale HPC simulation through Bayesian inference Proceedings of the 19th international conference on Parallel Processing, (41-52)
  109. Arora R, Capetillo E, Bangalore P and Mernik M A high-level framework for parallelizing legacy applications for multiple platforms Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery, (1-8)
  110. Ghoting A, Gunnels J, Kambadur P, Pednault E and Squillante M (2013). Trends and outlook for the massive-scale analytics stack, IBM Journal of Research and Development, 57:3-4, (2-2), Online publication date: 1-May-2013.
  111. Hewitt M, Nemhauser G, Savelsbergh M and Song J (2013). A branch-and-price guided search approach to maritime inventory routing, Computers and Operations Research, 40:5, (1410-1419), Online publication date: 1-May-2013.
  112. Cano R, Kunigami G, De Souza C and De Rezende P (2013). A hybrid GRASP heuristic to construct effective drawings of proportional symbol maps, Computers and Operations Research, 40:5, (1435-1447), Online publication date: 1-May-2013.
  113. Hewitt M, Nemhauser G and Savelsbergh M (2013). Branch-and-Price Guided Search for Integer Programs with an Application to the Multicommodity Fixed-Charge Network Flow Problem, INFORMS Journal on Computing, 25:2, (302-316), Online publication date: 1-Apr-2013.
  114. Ben Romdhanne B, Nikaein N and Bonnet C Coordinator-master-worker model for efficient large scale network simulation Proceedings of the 6th International ICST Conference on Simulation Tools and Techniques, (119-128)
  115. Zhao G, Bryan B, King D, Luo Z, Wang E, Bende-Michl U, Song X and Yu Q (2013). Large-scale, high-resolution agricultural systems modeling using a hybrid approach combining grid computing and parallel processing, Environmental Modelling & Software, 41, (231-238), Online publication date: 1-Mar-2013.
  116. Prakash A, Chaudhury A and Ramachandran R (2013). Parallel simulation of population balance model-based particulate processes using multicore CPUs and GPUs, Modelling and Simulation in Engineering, 2013, (2-2), Online publication date: 1-Jan-2013.
  117. Hawe G, Coates G, Wilson D and Crouch R (2012). Agent-based simulation for large-scale emergency response, ACM Computing Surveys, 45:1, (1-51), Online publication date: 1-Nov-2012.
  118. Cerise R, da Graça F, Magoulès F and Callet P Natural lighting and medieval glass --- scientific data acquisition, methodology and physically based rendering Proceedings of the 4th international conference on Progress in Cultural Heritage Preservation, (636-643)
  119. Pellegrini S, Hoefler T and Fahringer T Exact dependence analysis for increased communication overlap Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface, (89-99)
  120. Cheng G, Liu L, Jing N, Chen L and Xiong W (2012). General-purpose optimization methods for parallelization of digital terrain analysis based on cellular automata, Computers & Geosciences, 45, (57-67), Online publication date: 1-Aug-2012.
  121. Chen Q, Guo M and Huang Z CATS Proceedings of the 26th ACM international conference on Supercomputing, (163-172)
  122. Ng N, Yoshida N and Honda K Multiparty session C Proceedings of the 50th international conference on Objects, Models, Components, Patterns, (202-218)
  123. Chirigati F, Silva V, Ogasawara E, de Oliveira D, Dias J, Porto F, Valduriez P and Mattoso M Evaluating parameter sweep workflows in high performance computing Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, (1-10)
  124. Yin Y, Byna S, Song H, Sun X and Thakur R Boosting Application-Specific Parallel I/O Optimization Using IOSIG Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), (196-203)
  125. D'Angelo G, Ferretti S and Marzolla M Time warp on the go Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques, (242-248)
  126. Meredith J, Sisneros R, Pugmire D and Ahern S A distributed data-parallel framework for analysis and visualization algorithm development Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, (11-19)
  127. Cai Y, Li G, Wang H, Zheng G and Lin S (2012). Development of parallel explicit finite element sheet forming simulation system based on GPU architecture, Advances in Engineering Software, 45:1, (370-379), Online publication date: 1-Mar-2012.
  128. Epstein J, Black A and Peyton-Jones S (2011). Towards Haskell in the cloud, ACM SIGPLAN Notices, 46:12, (118-129), Online publication date: 12-Jan-2012.
  129. Ivanov L (2012). The right balance, Journal of Computing Sciences in Colleges, 27:3, (115-121), Online publication date: 1-Jan-2012.
  130. Gopalakrishnan G, Kirby R, Siegel S, Thakur R, Gropp W, Lusk E, De Supinski B, Schulz M and Bronevetsky G (2011). Formal analysis of MPI-based parallel programs, Communications of the ACM, 54:12, (82-91), Online publication date: 1-Dec-2011.
  131. Ben-Hafaiedh I, Graf S and Mazouz N Distributed implementation of systems with multiparty interactions and priorities Proceedings of the 9th international conference on Software engineering and formal methods, (38-57)
  132. Crawl D, Wang J and Altintas I Provenance for MapReduce-based data-intensive workflows Proceedings of the 6th workshop on Workflows in support of large-scale science, (21-30)
  133. Daily J and Lewis R Poster Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion, (43-44)
  134. Dias J, Ogasawara E, de Oliveira D and Mattoso M Poster Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion, (27-28)
  135. Nicolae B and Cappello F BlobCR Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  136. Stuart J, Balaji P and Owens J Extending MPI to accelerators Proceedings of the 1st Workshop on Architectures and Systems for Big Data, (19-23)
  137. Epstein J, Black A and Peyton-Jones S Towards Haskell in the cloud Proceedings of the 4th ACM symposium on Haskell, (118-129)
  138. Wagle R, Andrade H, Hildrum K, Venkatramani C and Spicer M Distributed middleware reliability and fault tolerance support in system S Proceedings of the 5th ACM international conference on Distributed event-based system, (335-346)
  139. Ng N, Yoshida N, Pernet O, Hu R and Kryftis Y Safe parallel programming with session Java Proceedings of the 13th international conference on Coordination models and languages, (110-126)
  140. Leonardi C, Holmes D, Williams J and Tilke P A multi-core numerical framework for characterizing flow in oil reservoirs Proceedings of the 19th High Performance Computing Symposia, (166-174)
  141. Shams R and Sadeghi P (2011). On optimization of finite-difference time-domain (FDTD) computation on heterogeneous and GPU clusters, Journal of Parallel and Distributed Computing, 71:4, (584-593), Online publication date: 1-Apr-2011.
  142. Kang P, Tilevich E, Varadarajan S and Ramakrishnan N Maintainable and reusable scientific software adaptation Proceedings of the tenth international conference on Aspect-oriented software development, (165-176)
  143. Gross T Breadth in depth Proceedings of the 42nd ACM technical symposium on Computer science education, (435-440)
  144. Sadowski C, Ball T, Bishop J, Burckhardt S, Gopalakrishnan G, Mayo J, Musuvathi M, Qadeer S and Toub S Practical parallel and concurrent programming Proceedings of the 42nd ACM technical symposium on Computer science education, (189-194)
  145. van der Wijngaart R, Mattson T and Haas W (2011). Light-weight communications on Intel's single-chip cloud computer processor, ACM SIGOPS Operating Systems Review, 45:1, (73-83), Online publication date: 18-Feb-2011.
  146. Siegel S and Gopalakrishnan G Formal analysis of message passing Proceedings of the 12th international conference on Verification, model checking, and abstract interpretation, (2-18)
  147. Weerapurage D, Eblen J, Rogers G and Langston M Parallel vertex cover Proceedings of the Ninth Australasian Symposium on Parallel and Distributed Computing - Volume 118, (25-32)
  148. ACM
    Rolfe T (2010). A specimen of parallel programming, ACM Inroads, 1:4, (72-79), Online publication date: 1-Dec-2010.
  149. Bejleri A Practical parameterised session types Proceedings of the 12th international conference on Formal engineering methods and software engineering, (270-286)
  150. ACM
    Shang W, Adams B and Hassan A An experience report on scaling tools for mining software repositories using MapReduce Proceedings of the 25th IEEE/ACM International Conference on Automated Software Engineering, (275-284)
  151. Emoto K, Hu Z, Kakehi K, Matsuzaki K and Takeichi M Generators-of-generators library with optimization capabilities in fortress Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II, (26-37)
  152. ACM
    Jardak C, Riihijärvi J, Oldewurtel F and Mähönen P Parallel processing of data from very large-scale wireless sensor networks Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, (787-794)
  153. ACM
    Kamal H, Mirtaheri S and Wagner A Scalability of communicators and groups in MPI Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, (264-275)
  154. ACM
    Reid A, Gain J and Kuttel M Dynamic load balancing of Lattice Boltzmann free-surface fluid animations Proceedings of the 7th International Conference on Computer Graphics, Virtual Reality, Visualisation and Interaction in Africa, (91-100)
  155. ACM
    Cieslewicz J, Ross K, Satsumi K and Ye Y Automatic contention detection and amelioration for data-intensive operations Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, (483-494)
  156. ACM
    Grogono P and Jafroodi N A fair protocol for non-deterministic message passing Proceedings of the Third C* Conference on Computer Science and Software Engineering, (53-58)
  157. Dell'aquila C, Di Tria F, Lefons E and Tangorra F (2010). A parallel algorithm to compute data synopsis, WSEAS Transactions on Information Science and Applications, 7:5, (691-701), Online publication date: 1-May-2010.
  158. Dell'Aquila C, Di Tria F, Lefons E and Tangorra F Parallel computing for data reduction Proceedings of the 9th WSEAS international conference on Artificial intelligence, knowledge engineering and data bases, (280-285)
  159. Pflaum C and Rahimi Z (2009). Automatic parallelisation of staggered grid codes with expression templates, International Journal of Computational Science and Engineering, 4:4, (306-313), Online publication date: 1-Nov-2009.
  160. Chao J, Haselbacher A and Balachandar S (2009). A massively parallel multi-block hybrid compact-WENO scheme for compressible flows, Journal of Computational Physics, 228:19, (7473-7491), Online publication date: 30-Oct-2009.
  161. Wolffe G and Trefftz C (2009). Teaching parallel computing, Journal of Computing Sciences in Colleges, 25:1, (21-28), Online publication date: 1-Oct-2009.
  162. Arbenz P, Bryner J and Tobler C Parallelized transient elastic wave propagation in orthotropic structures Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II, (310-319)
  163. Iglberger K and Rüde U A Parallel Rigid Body Dynamics Algorithm Proceedings of the 15th International Euro-Par Conference on Parallel Processing, (760-771)
  164. Setia A, Swarup V, Kumar S and Singh L A novel adaptive fuzzy load balancer for heterogeneous LAM/MPI clusters applied to evolutionary learning in neuro-fuzzy systems Proceedings of the 18th international conference on Fuzzy Systems, (68-73)
  165. Pllana S, Benkner S, Xhafa F and Barolli L (2009). A novel approach for hybrid performance modelling and prediction of large-scale computing systems, International Journal of Grid and Utility Computing, 1:4, (316-327), Online publication date: 1-Aug-2009.
  166. ACM
    Mueller C, Baumgartner B, Ofenbeck G, Schrader B and Sbalzarini I pCMALib Proceedings of the 11th Annual conference on Genetic and evolutionary computation, (1411-1418)
  167. ACM
    Kraus J and Kestler H Multi-core parallelization in Clojure Proceedings of the 6th European Lisp Workshop, (8-17)
  168. ACM
    Zima H, Hall M, Chen C and Chame J Model-guided autotuning of high-productivity languages for petascale computing Proceedings of the 18th ACM international symposium on High performance distributed computing, (151-166)
  169. Santhanaraman G, Balaji P, Gopalakrishnan K, Thakur R, Gropp W and Panda D Natively Supporting True One-Sided Communication in MPI on Multi-core Systems with InfiniBand Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, (380-387)
  170. von Laszewski G, Younge A, He X, Mahinthakumar K and Wang L Experiment and Workflow Management Using Cyberaide Shell Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, (568-573)
  171. D'Auriol B (2009). The systems edge of the Parameterized Linear Array with a Reconfigurable Pipelined Bus System (LARPBS(p)) optical bus parallel computing model, The Journal of Supercomputing, 48:2, (183-209), Online publication date: 1-May-2009.
  172. Hentschel B, Wolter M, Renze P, Schröder W, Bischof C and Kuhlen T Hybrid parallelization for multi-view visualization of time-dependent simulation data Proceedings of the 9th Eurographics conference on Parallel Graphics and Visualization, (79-86)
  173. Teresco J, Faik J and Flaherty J (2009). Resource-Aware Scientific Computation on a Heterogeneous Cluster, Computing in Science and Engineering, 7:2, (40-50), Online publication date: 1-Mar-2009.
  174. Gobbert M (2009). Configuration and Performance of a Beowulf Cluster for Large-Scale Scientific Simulations, Computing in Science and Engineering, 7:2, (14-26), Online publication date: 1-Mar-2009.
  175. Ababei C (2009). Speeding up FPGA placement via partitioning and multithreading, International Journal of Reconfigurable Computing, 2009, (1-1), Online publication date: 1-Jan-2009.
  176. Levesque M, Ichikawa K, Date S and Haga J Bringing flexibility to virtual screening for enzymatic inhibitors on the grid Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing, (201-208)
  177. Jimenez E and Cabrera F Parallel method of moments analysis of microstrip structures using the MPI library Proceedings of the 2nd conference on European computing conference, (252-256)
  178. Santos G, Duarte A, Rexachs D and Luque E Providing Non-stop Service for Message-Passing Based Parallel Applications with RADIC Proceedings of the 14th international Euro-Par conference on Parallel Processing, (58-67)
  179. ACM
    Grossman R and Gu Y Data mining using high performance data clouds Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, (920-927)
  180. ACM
    Siegel S, Mironova A, Avrunin G and Clarke L (2008). Combining symbolic execution with model checking to verify parallel numerical programs, ACM Transactions on Software Engineering and Methodology, 17:2, (1-34), Online publication date: 1-Apr-2008.
  181. ACM
    Guo J, Bikshandi G, Fraguela B, Garzaran M and Padua D Programming with tiles Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, (111-122)
  182. Santhanaraman G, Narravula S, Mamidala A and Panda D MPI-2 one-sided usage and implementation for read modify write operations Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (251-259)
  183. Guarracino M, Cuciniello S and Feminiano D A parallel classification and feature reduction method for biomedical applications Proceedings of the 7th international conference on Parallel processing and applied mathematics, (1210-1219)
  184. Kengue J, Valtchev P and Djamegni C Parallel computation of closed itemsets and implication rule bases Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications, (359-370)
  185. ACM
    Correa W, Klosowski J, Morris C and Jackmann T SPVN ACM SIGGRAPH 2007 courses, (6-es)
  186. Yu K, Wang H and Wu X A parallel algorithm for learning Bayesian networks Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining, (1055-1063)
  187. Ben Youssef B, Gang Cheng, Zygourakis K and Markenscoff P (2007). Regular Paper, International Journal of High Performance Computing Applications, 21:2, (196-209), Online publication date: 1-May-2007.
  188. ACM
    Pheatt C (2007). An easy to use distributed computing framework, ACM SIGCSE Bulletin, 39:1, (571-575), Online publication date: 7-Mar-2007.
  189. ACM
    Pheatt C An easy to use distributed computing framework Proceedings of the 38th SIGCSE technical symposium on Computer science education, (571-575)
  190. Bikshandi G, Guo J, Praun C, Tanase G, Fraguela B, Garzarán M, Padua D and Rauchwerger L Design and use of htalib Proceedings of the 19th international conference on Languages and compilers for parallel computing, (17-32)
  191. Chen N, Glazier J and Alber M A parallel implementation of the cellular potts model for simulation of cell-based morphogenesis Proceedings of the 7th international conference on Cellular Automata for Research and Industry, (58-67)
  192. Liao S Parallelizing user-defined and implicit reductions globally on multiprocessors Proceedings of the 11th Asia-Pacific conference on Advances in Computer Systems Architecture, (189-202)
  193. ACM
    Siegel S, Mironova A, Avrunin G and Clarke L Using model checking with symbolic execution to verify parallel numerical programs Proceedings of the 2006 international symposium on Software testing and analysis, (157-168)
  194. Silva R, De Aguiar M, De Rose C and Dimuro G Extending the HPC-ICTM geographical categorization model for grid computing Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (850-859)
  195. ACM
    Woollard D and Medvidovic N An architectural style for high-performance asymmetrical parallel computations Proceedings of the 28th international conference on Software engineering, (857-860)
  196. Fernández J, Anguita M, Ros E and Bernier J SCE toolboxes for the development of high-level parallel applications Proceedings of the 6th international conference on Computational Science - Volume Part II, (518-525)
  197. Wolter M, Hentschel B, Schirski M, Gerndt A and Kuhlen T Time step prioritising in parallel feature extraction on unsteady simulation data Proceedings of the 6th Eurographics conference on Parallel Graphics and Visualization, (91-98)
  198. Perrin D, Ruskin H, Burns J and Crane M An agent-based approach to immune modelling Proceedings of the 6th international conference on Computational Science and Its Applications - Volume Part I, (612-621)
  199. Mo Z, Zhang A and Cao X Towards a parallel framework of grid-based numerical algorithms on DAGs Proceedings of the 20th international conference on Parallel and distributed processing, (310-310)
  200. Guo J, Bikshandi G, Hoeflinger D, Almasi G, Fraguela B, Garzarán M, Padua D and von Praun C Hierarchically tiled arrays for parallelism and locality Proceedings of the 20th international conference on Parallel and distributed processing, (281-281)
  201. Melatti I, Palmer R, Sawaya G, Yang Y, Kirby R and Gopalakrishnan G Parallel and distributed model checking in eddy Proceedings of the 13th international conference on Model Checking Software, (108-125)
  202. ACM
    Sharapov I, Kroeger R, Delamarter G, Cheveresan R and Ramsay M A case study in top-down performance estimation for a large-scale parallel application Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, (81-89)
  203. ACM
    Bikshandi G, Guo J, Hoeflinger D, Almasi G, Fraguela B, Garzarán M, Padua D and von Praun C Programming for parallelism and locality with hierarchically tiled arrays Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, (48-57)
  204. Chen H, Decker J and Bierbaum N Future networking for scalable I/O Proceedings of the 24th IASTED international conference on Parallel and distributed computing and networks, (128-135)
  205. Yavuz S A neural network based methodology for performance evaluation of parallel systems Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence, (297-307)
  206. Zuberek W and Perera T Performance analysis of distributed iterative linear solvers Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering, (194-199)
  207. Castro F, Pollo C, Meuli R, Maeder P, Cuadra M, Cuisenaire O, Villemure J and Thiran J Cross validation of experts versus registration methods for target localization in deep brain stimulation Proceedings of the 8th international conference on Medical Image Computing and Computer-Assisted Intervention - Volume Part I, (417-424)
  208. ACM
    Charles P, Grothoff C, Saraswat V, Donawa C, Kielstra A, Ebcioglu K, von Praun C and Sarkar V X10 Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, (519-538)
  209. ACM
    Charles P, Grothoff C, Saraswat V, Donawa C, Kielstra A, Ebcioglu K, von Praun C and Sarkar V (2005). X10, ACM SIGPLAN Notices, 40:10, (519-538), Online publication date: 12-Oct-2005.
  210. Ghose D, Kim H and Kim T (2005). Adaptive Divisible Load Scheduling Strategies for Workstation Clusters with Unknown Network Resources, IEEE Transactions on Parallel and Distributed Systems, 16:10, (897-907), Online publication date: 1-Oct-2005.
  211. Baraglia R, Ferrini R and Ritrovato P HMM Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics, (731-740)
  212. Aridor Y, Domany T, Goldshmidt O, Kliteynik Y, Moreira J and Shmueli E Open job management architecture for the blue gene/l supercomputer Proceedings of the 11th international conference on Job Scheduling Strategies for Parallel Processing, (91-107)
  213. Brown R and Sharapov I Performance and programmability comparison between OpenMP and MPI implementations of a molecular modeling application Proceedings of the 2005 and 2006 international conference on OpenMP shared memory parallel programming, (349-360)
  214. Butz T and Thiran J (2005). From error probability to information theoretic (multi-modal) signal processing, Signal Processing, 85:5, (875-902), Online publication date: 1-May-2005.
  215. Fournier A, Bunge H, Hollerbach R and Vilotte J (2005). A Fourier-spectral element algorithm for thermal convection in rotating axisymmetric containers, Journal of Computational Physics, 204:2, (462-489), Online publication date: 10-Apr-2005.
  216. Huang W, Santhanaraman G, Jin H and Panda D Scheduling of MPI-2 One Sided Operations over InfiniBand Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9 - Volume 10
  217. ACM
    Holliday M, Wilkinson B, House J, Daoud S and Ferner C A geographically-distributed, assignment-structured undergraduate grid computing course Proceedings of the 36th SIGCSE technical symposium on Computer science education, (206-210)
  218. ACM
    Holliday M, Wilkinson B, House J, Daoud S and Ferner C (2005). A geographically-distributed, assignment-structured undergraduate grid computing course, ACM SIGCSE Bulletin, 37:1, (206-210), Online publication date: 23-Feb-2005.
  219. Siegel S Efficient verification of halting properties for MPI programs with wildcard receives Proceedings of the 6th international conference on Verification, Model Checking, and Abstract Interpretation, (413-429)
  220. Liu J, Mamidala A, Vishnu A and Panda D (2005). Evaluating InfiniBand Performance with PCI Express, IEEE Micro, 25:1, (20-29), Online publication date: 1-Jan-2005.
  221. Sundararajan E, Premaratne M, Karunasekera S and Harwood A Algorithmic-Parameter optimization of a parallelized split-step fourier transform using a modified BSP cost model Proceedings of the Second international conference on Parallel and Distributed Processing and Applications, (233-244)
  222. Dean J and Ghemawat S MapReduce Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation - Volume 6, (10-10)
  223. ACM
    Grossman R (2004). Data mining standards, services, and platforms 2004 (DM-SSP 2004), ACM SIGKDD Explorations Newsletter, 6:2, (157-158), Online publication date: 1-Dec-2004.
  224. Gerndt A, Hentschel B, Wolter M, Kuhlen T and Bischof C VIRACOCHA Proceedings of the 2004 ACM/IEEE conference on Supercomputing
  225. Mohror K and Karavanic K Performance Tool Support for MPI-2 on Linux Proceedings of the 2004 ACM/IEEE conference on Supercomputing
  226. Liu J, Vishnu A and Panda D Building Multirail InfiniBand Clusters Proceedings of the 2004 ACM/IEEE conference on Supercomputing
  227. Xu S and Zhang J (2004). A parallel hybrid web document clustering algorithm and its performance study, The Journal of Supercomputing, 30:2, (117-131), Online publication date: 1-Nov-2004.
  228. ACM
    Fraguela B, Guo J, Bikshandi G, Garzarán M, Almási G, Moreira J and Padua D The Hierarchically Tiled Arrays programming approach Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems, (1-12)
  229. Zeyao M and Lianxiang F (2004). Parallel Flux Sweep Algorithm for Neutron Transport on Unstructured Grid, The Journal of Supercomputing, 30:1, (5-17), Online publication date: 1-Oct-2004.
  230. Bikshandi G, Fraguela B, Guo J, Garzarán M, Almási G, Moreira J and Padua D Implementation of parallel numerical algorithms using hierarchically tiled arrays Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing, (87-101)
  231. Heath M and Jiao X Parallel simulation of multicomponent systems Proceedings of the 6th international conference on High Performance Computing for Computational Science, (496-513)
  232. de Aguiar M, Dimuro G, Costa F, Silva R, De Rose C, Costa A and Kreinovich V HPC-ICTM Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (83-92)
  233. ACM
    Gerndt A, Asbach M, Kuhlen T, Bischof C, Lankes S and Bemmerl T Conceptual design and implementation of a pipeline-based VR-system parallelized by CORBA, and comparison with existing approaches Proceedings of the 2004 ACM SIGGRAPH international conference on Virtual Reality continuum and its applications in industry, (368-374)
  234. Andrade H, Kurc T, Sussman A and Saltz J (2004). Optimizing the Execution of Multiple Data Analysis Queries on Parallel and Distributed Environments, IEEE Transactions on Parallel and Distributed Systems, 15:6, (520-532), Online publication date: 1-Jun-2004.
  235. McGuire T (2004). A simplified message-passing library, Journal of Computing Sciences in Colleges, 19:4, (252-256), Online publication date: 1-Apr-2004.
  236. Zakharian A, Moloney J and Mansuripur M (2003). Simulating Near-Field Effects in High-Density Optical-Disk Data Storage, Computing in Science and Engineering, 5:6, (15-21), Online publication date: 1-Nov-2003.
  237. Vetter J and Mueller F (2003). Communication characteristics of large-scale scientific applications for contemporary cluster architectures, Journal of Parallel and Distributed Computing, 63:9, (853-865), Online publication date: 1-Sep-2003.
  238. Saber M and Mirenkov N Multimedia parallel programming tool for cellular automata systems Proceedings of the 2003 international conference on Parallel and distributed processing and applications, (437-448)
  239. ACM
    Schirski M, Gerndt A, van Reimersdahl T, Kuhlen T, Adomeit P, Lang O, Pischinger S and Bischof C ViSTA FlowLib - framework for interactive visualization and exploration of unsteady flows in virtual environments Proceedings of the workshop on Virtual environments 2003, (77-85)
  240. Allen K and Gobbert M Coarse-grained parallel matrix-free solution of a three-dimensional elliptic prototype problem Proceedings of the 2003 international conference on Computational science and its applications: PartII, (290-299)
  241. Huang S, Aubanel E and Bhavsar V Mesh partitioners for computational grids Proceedings of the 2003 international conference on Computational science and its applications: PartIII, (60-68)
  242. Sultanov R and Guster D Parallel computing for semiquantal few-body systems in atomic physics Proceedings of the 2003 international conference on Computational science and its applications: PartI, (568-576)
  243. D'Apuzzo M and Marino M (2003). Parallel computational issues of an interior point method for solving large bound-constrained quadratic programming problems, Parallel Computing, 29:4, (467-483), Online publication date: 1-Apr-2003.
  244. ACM
    Lamberti F, Zunino C, Sanna A, Fiume A and Maniezzo M An accelerated remote graphics architecture for PDAS Proceedings of the eighth international conference on 3D Web technology, (55-ff)
  245. Dongarra J, Foster I, Fox G, Gropp W, Kennedy K, Torczon L and White A References Sourcebook of parallel computing, (729-789)
  246. Vetter J and Yoo A An empirical performance evaluation of scalable scientific applications Proceedings of the 2002 ACM/IEEE conference on Supercomputing, (1-18)
  247. Wang P and Wu X (2002). OpenMP programming for a global inverse model, Scientific Programming, 10:3, (253-261), Online publication date: 1-Aug-2002.
  248. ACM
    Kruengkrai C and Jaruskulchai C A parallel learning algorithm for text classification Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, (201-206)
  249. ACM
    Vetter J Dynamic statistical profiling of communication activity in distributed applications Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (240-250)
  250. Lampoudi S and Beazley D SWILL Proceedings of the FREENIX Track: 2002 USENIX Annual Technical Conference, (19-27)
  251. ACM
    Vetter J (2002). Dynamic statistical profiling of communication activity in distributed applications, ACM SIGMETRICS Performance Evaluation Review, 30:1, (240-250), Online publication date: 1-Jun-2002.
  252. Liu P and Li K Programming the Bi-CGSTAB Matrix Solver for HPC and Benchmarking IBM SP3 and Alpha ES40 Proceedings of the 16th International Parallel and Distributed Processing Symposium
  253. Vetter J and Mueller F Communication Characteristics of Large-Scale Scientific Applications for Contemporary Cluster Architectures Proceedings of the 16th International Parallel and Distributed Processing Symposium
  254. Swann C (2002). Maximum Likelihood Estimation Using Parallel Computing, Computational Economics, 19:2, (145-178), Online publication date: 1-Apr-2002.
  255. Evans D and Saman M Cluster computing with message-passing interface Highly parallel computations, (3-42)
  256. ACM
    Vetter J and McCracken M (2001). Statistical scalability analysis of communication operations in distributed applications, ACM SIGPLAN Notices, 36:7, (123-132), Online publication date: 1-Jul-2001.
  257. ACM
    Vetter J and McCracken M Statistical scalability analysis of communication operations in distributed applications Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, (123-132)
  258. Vetter J and de Supinski B Dynamic software testing of MPI applications with umpire Proceedings of the 2000 ACM/IEEE conference on Supercomputing, (51-es)
  259. Roy A, Foster I, Gropp W, Toonen B, Karonis N and Sander V MPICH-GQ Proceedings of the 2000 ACM/IEEE conference on Supercomputing, (19-es)
  260. Seitzer J, Buckley J and Pan Y (2000). INDED, IEEE Intelligent Systems, 15:5, (38-46), Online publication date: 1-Sep-2000.
  261. ACM
    Dickens P and Thakur R An evaluation of Java's I/O capabilities for high-performance computing Proceedings of the ACM 2000 conference on Java Grande, (26-35)
  262. ACM
    Vetter J Performance analysis of distributed applications using automatic classification of communication inefficiencies Proceedings of the 14th international conference on Supercomputing, (245-254)
Contributors
  • University of Illinois Urbana-Champaign
  • Argonne National Laboratory
  • Tennessee Technological University

Reviews

Lorie M. Liebrock

MPI combines many of the best features of numerous message-passing systems, while showing that parallel processing need not compromise efficiency, portability, and functionality. Along with standard message-passing capabilities, MPI extends the notion of a message tag to define a context that is allocated at runtime. Contexts allow user programs and libraries to maintain separate “message spaces.” A communicator is defined by the system-allocated context and the group of processes. Communicators define precisely which processes are involved in a communication. Furthermore, with intercommunicators, controlled group-to-group communication is supported. Just as the pattern of communication, or application topology, is an integral part of an application, the definition of the virtual topology is an integral part of MPI. Cartesian topologies are predefined with special operations such as shift. Other MPI features include support for debugging, profiling, blocking and non-blocking communication, libraries, and heterogeneous networks of processors.

Beginning with a very small, but sufficient, set of routines, MPI is introduced in terms of what is needed to solve particular problems. This approach is used in the early chapters of the book. More advanced features are introduced with the focus on the feature instead of the application, but examples are provided. The authors alternate between the use of Fortran, C, and C++ in the examples. Only occasionally is this mixing of languages confusing or distracting.

The summary of MPI routines has three parts, one each for C, Fortran, and C++. The summary is somewhat unclear as it “contains bindings for the MPI-1 routines” with “added to this list the MPI-2 functions that replace deprecated MPI-1 functions.” However, for example, the (deprecated) MPI-1 function MPI_Copy_function does not appear at all. Some of the new MPI-2 functions, such as MPI_Finalized, are also omitted. It would be helpful for the index to indicate where each function appears in the summary of routines.

Throughout the book, MPI implementors' choices are often discussed to warn users of where implementations can vary. In a number of places, the authors make statements like “few implementations currently exploit the opportunities offered to an MPI implementation, though at least one does.” If the point was worth making in the first place, I would have liked to know which implementations were exceptional.

The body of the text is divided into ten chapters:

  • Background
  • Introduction to MPI
  • Using MPI in Simple Programs
  • Intermediate MPI
  • Advanced Message Passing in MPI
  • Parallel Libraries
  • Other Features of MPI
  • Understanding how MPI Implementations Work
  • Comparing MPI with Other Systems for Interprocess Communication
  • Beyond Message Passing

Five appendices present a “Summary of MPI Routines and Their Arguments,” “The MPICH Implementation of MPI,” “The MPE Multiprocessing Environment Functions,” “MPI Resources on the World Wide Web,” and “Language Details.” The authors also provide a glossary, references, and subject and function term indexes.

Overall, this book is easy to read and follow, but there are a few problems. In a number of places, the authors refer readers to reference [64] to see something “described in a more tutorial approach.” Unfortunately, reference [64] is this book! In addition, it would be helpful to give a brief introduction to advanced topics before referring readers to another source.
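
The Cartesian shift operation mentioned in the review above can be illustrated without an MPI installation. What follows is a minimal pure-Python sketch, not MPI code and not taken from the book. It assumes the row-major rank ordering that the MPI standard specifies for Cartesian topologies. The names `rank_to_coords`, `coords_to_rank`, `cart_shift`, and `PROC_NULL` are hypothetical stand-ins; a real program would call `MPI_Cart_create` and `MPI_Cart_shift`, and the actual value of `MPI_PROC_NULL` is implementation-defined.

```python
# Stand-in for MPI_PROC_NULL (assumption: the real value is implementation-defined).
PROC_NULL = -1


def rank_to_coords(rank, dims):
    """Convert a row-major rank to grid coordinates (last dimension varies fastest)."""
    coords = []
    for extent in reversed(dims):
        coords.append(rank % extent)
        rank //= extent
    return list(reversed(coords))


def coords_to_rank(coords, dims):
    """Convert grid coordinates back to a row-major rank."""
    rank = 0
    for c, extent in zip(coords, dims):
        rank = rank * extent + c
    return rank


def cart_shift(rank, dims, periods, direction, disp):
    """Return (source, dest) ranks for a shift of `disp` along `direction`.

    `source` is the rank this process would receive from and `dest` the rank
    it would send to. Off-grid neighbours in a non-periodic dimension are
    reported as PROC_NULL.
    """
    coords = rank_to_coords(rank, dims)

    def neighbour(offset):
        c = coords[direction] + offset
        if periods[direction]:
            c %= dims[direction]          # wrap around in a periodic dimension
        elif not 0 <= c < dims[direction]:
            return PROC_NULL              # fell off the edge of the grid
        shifted = list(coords)
        shifted[direction] = c
        return coords_to_rank(shifted, dims)

    return neighbour(-disp), neighbour(disp)


if __name__ == "__main__":
    dims, periods = [2, 3], [False, True]  # 2 x 3 grid, periodic in dimension 1 only
    for r in range(dims[0] * dims[1]):
        print(r, rank_to_coords(r, dims), cart_shift(r, dims, periods, 1, 1))
```

In this sketch, each process on a 2 x 3 grid learns which neighbour it would receive from and send to; only the second dimension wraps around, so a shift along the first dimension can yield `PROC_NULL`.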

Len Freeman

Published in 1999 as part of the MIT Press series on scientific and engineering computation, this is the second edition of a book first published in 1994, and has been updated to reflect the revisions incorporated in the message-passing interface (MPI) 1.2 specification. The stated aim of the book is to be a companion to the MPI Standard; its intention is to demonstrate the use of MPI for the development of practical parallel applications. The book adopts a tutorial approach, with lots of examples to motivate the increasing complexity of MPI functionality. The examples are presented in Fortran, C, and C++ throughout. There are some references to the MPI 2.0 standard and its differences from the MPI 1.2 standard, though more complete details of the MPI 2.0 standard are to be found in the companion volume [1].

The book contains ten chapters. Chapter 1 provides an introduction to and motivation for parallel computing. It also presents a review of parallel programming models and, unsurprisingly, the advantages of the message-passing model. Chapters 2 through 5 introduce much of the functionality of the MPI library. The basic structure of the presentation is to use examples to motivate the need for additional MPI functionality; MPI listings are presented, and the purpose of each of the MPI operations is carefully described. Chapter 2 introduces the core send and receive functionality. Chapter 3 uses numerical quadrature, based on a repeated mid-point rule, and matrix-vector multiplication to introduce a variety of communication routines. Also included in this chapter is an introduction to both the theoretical and experimental study of the performance of parallel programs; performance analysis is returned to in subsequent chapters. More complex examples, and the attendant requirements for more complex functionality, are found in chapters 4 and 5. The full richness of the MPI communications functionality is progressively introduced. Chapters 3, 4, and 5 include very useful “Common Errors and Misunderstandings” sections. Chapter 6 introduces parallel libraries and describes the difficulties that arise in their development. This introduction is used to motivate a review of the features of MPI that support the development of “safe” parallel libraries. The final major chapter, chapter 7, is taken as an opportunity to describe those features of MPI that have yet to be introduced; this includes further communication operations, intercommunicators, and error handlers. Chapters 8, 9, and 10 are relatively short chapters that deal respectively with the way MPI implementations work, a comparison of MPI with other communication systems, and future alternatives to message passing.

The book concludes with a useful glossary of terms and five appendices, the first of which is a list of the bindings for the MPI-1 routines in C, Fortran, and C++. There is both a regular subject index and an index of MPI routines and terms.

I enjoyed reading this book a great deal. One might expect a book on such a specific technical subject as MPI to be rather dry, but that was not the case with this book. The full functionality of MPI is revealed over the first seven chapters of the book, with convincing and realistic examples used to motivate each additional feature of the library. The book is extremely well written and well presented. An occasional typographical error did escape the proofreading net, but, given the very technical nature of the material, the rarity of these slips is admirable. I do not see the book as a textbook from which to teach parallel computing. It is more of a reference book to which those already experienced in parallel computing will be able to turn for answers to their MPI queries. It will certainly find a prominent place on my bookshelf.

Online Computing Reviews Service
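
The repeated mid-point rule quadrature that the review attributes to chapter 3 can be sketched as follows. This is a serial Python simulation under stated assumptions, not the book's listing: the integrand 4/(1+x²) on [0, 1] (whose integral is π), the cyclic assignment of subintervals to processes, and the names `partial_midpoint_sum` and `simulated_parallel_quadrature` are all illustrative choices. The final `sum` over partial results stands in for the reduction step that an MPI program would perform with a collective such as `MPI_Reduce`.

```python
import math


def partial_midpoint_sum(f, a, b, n, rank, size):
    """Midpoint-rule contribution of one simulated process.

    The n subintervals of [a, b] are dealt out cyclically: process `rank`
    handles subintervals rank, rank + size, rank + 2*size, ...
    """
    h = (b - a) / n
    total = 0.0
    for i in range(rank, n, size):
        total += f(a + (i + 0.5) * h)   # evaluate at the midpoint of subinterval i
    return h * total


def simulated_parallel_quadrature(f, a, b, n, size):
    """Combine the partial sums of `size` simulated processes.

    The built-in sum() plays the role of the reduction to a single process.
    """
    partials = [partial_midpoint_sum(f, a, b, n, r, size) for r in range(size)]
    return sum(partials)


if __name__ == "__main__":
    approx = simulated_parallel_quadrature(lambda x: 4.0 / (1.0 + x * x), 0.0, 1.0, 1000, 4)
    print(approx, abs(approx - math.pi))
```

Because each simulated process touches a disjoint set of subintervals, the combined result agrees with the single-process result up to floating-point rounding, which is the property that makes this example a convenient first parallel program.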



Recommendations