Rojek et al., 2017 - Google Patents
Systematic adaptation of stencil‐based 3D MPDATA to GPU architectures
- Document ID
- 4338260488128069327
- Author
- Rojek K
- Wyrzykowski R
- Kuczynski L
- Publication year
- 2017
- Publication venue
- Concurrency and Computation: Practice and Experience
Snippet
In this work, we focus on a systematic adaptation of the stencil‐based multidimensional positive definite advection transport algorithm (MPDATA) to different graphics processing unit (GPU)‐based computing platforms. Another objective of this work is to compare the …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
Similar Documents
Publication | Title |
---|---|
Fournier et al. | Optimizing Code_Saturne computations on Petascale systems |
Rojek et al. | Systematic adaptation of stencil‐based 3D MPDATA to GPU architectures |
Szustak et al. | Adaptation of MPDATA heterogeneous stencil computation to Intel Xeon Phi coprocessor |
George et al. | Multifrontal factorization of sparse SPD matrices on GPUs |
Dongarra et al. | HPC Programming on Intel Many‐Integrated‐Core Hardware with MAGMA Port to Xeon Phi |
Szustak et al. | Using Intel Xeon Phi coprocessor to accelerate computations in MPDATA algorithm |
Huang et al. | Strassen’s algorithm reloaded on GPUs |
Rojek et al. | Adaptation of fluid model EULAG to graphics processing unit architecture |
Chen et al. | clmf: A fine-grained and portable alternating least squares algorithm for parallel matrix factorization |
Playne et al. | Comparison of GPU architectures for asynchronous communication with finite‐differencing applications |
Huang et al. | Implementing Strassen's algorithm with CUTLASS on NVIDIA Volta GPUs |
Dong et al. | Implementing molecular dynamics simulation on the Sunway TaihuLight system with heterogeneous many‐core processors |
Wyrzykowski et al. | Towards efficient decomposition and parallelization of MPDATA on hybrid CPU-GPU cluster |
Gan et al. | Solving the global atmospheric equations through heterogeneous reconfigurable platforms |
Rojek | Machine learning method for energy reduction by utilizing dynamic mixed precision on GPU‐based supercomputers |
Oyarzun et al. | Portable implementation model for CFD simulations. Application to hybrid CPU/GPU supercomputers |
Ibrahim et al. | Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms |
Abdelfattah et al. | Performance optimization of Sparse Matrix‐Vector Multiplication for multi‐component PDE‐based applications using GPUs |
Malenza et al. | Toward HPC application portability via C++ PSTL: the Gaia AVU-GSR code assessment |
Rocha et al. | TOAST: Automatic tiling for iterative stencil computations on GPUs |
Zaza et al. | A CUDA based parallel multi-phase oil reservoir simulator |
Ciznicki et al. | Elliptic solver performance evaluation on modern hardware architectures |
Waidyasooriya et al. | OpenCL‐Based FPGA Accelerator for 3D FDTD with Periodic and Absorbing Boundary Conditions |
Brus et al. | Performance and scalability improvements for discontinuous Galerkin solutions to conservation laws on unstructured grids |
Cecilia et al. | Stencil computations on heterogeneous platforms for the Jacobi method: GPUs versus Cell BE |