Bender et al., 2017
Two-level main memory co-design: Multi-threaded algorithmic primitives, analysis, and simulation
- Document ID
- 5428722960912638177
- Author
- Bender M
- Berry J
- Hammond S
- Hemmert K
- McCauley S
- Moore B
- Moseley B
- Phillips C
- Resnick D
- Rodrigues A
- Publication year
- 2017
- Publication venue
- Journal of Parallel and Distributed Computing
Snippet
A challenge in computer architecture is that processors often cannot be fed data from DRAM as fast as CPUs can consume it. Therefore, many applications are memory-bandwidth bound. With this motivation and the realization that traditional architectures (with all DRAM …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/0223—User address space allocation, e.g. contiguous or non contiguous base addressing
- G06F12/023—Free address space management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/78—Architectures of general purpose stored programme computers comprising a single central processing unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2217/00—Indexing scheme relating to computer aided design [CAD]
- G06F2217/78—Power analysis and optimization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/25—Using a specific main memory architecture
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Singh et al. | Near-memory computing: Past, present, and future | |
Ghose et al. | Processing-in-memory: A workload-driven perspective | |
Suchard et al. | Understanding GPU programming for statistical computation: Studies in massively parallel massive mixtures | |
Bender et al. | Two-level main memory co-design: Multi-threaded algorithmic primitives, analysis, and simulation | |
Seshadri et al. | Simple operations in memory to reduce data movement | |
Bergman et al. | Exascale computing study: Technology challenges in achieving exascale systems | |
Han et al. | A novel ReRAM-based processing-in-memory architecture for graph traversal | |
Ghasemi et al. | GraphA: An efficient ReRAM-based architecture to accelerate large scale graph processing | |
Rubin et al. | Maps: Optimizing massively parallel applications using device-level memory abstraction | |
García-Risueño et al. | A review of High Performance Computing foundations for scientists | |
Hazarika et al. | Survey on memory management techniques in heterogeneous computing systems | |
Oliveira et al. | MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Computing | |
Fang et al. | Aristotle: A performance impact indicator for the OpenCL kernels using local memory | |
Wei et al. | LICOM3-CUDA: a GPU version of LASG/IAP climate system ocean model version 3 based on CUDA | |
Aananthakrishnan et al. | The Intel programmable and integrated unified memory architecture graph analytics processor | |
Mittal | A survey on evaluating and optimizing performance of Intel Xeon Phi | |
Gonthier et al. | Locality-Aware Scheduling of Independent Tasks for Runtime Systems | |
Halbiniak et al. | Exploration of OpenCL heterogeneous programming for porting solidification modeling to CPU‐GPU platforms | |
Sliwinski et al. | Applying parallel computing techniques to analyze terabyte atmospheric boundary layer model outputs | |
Haldeman et al. | Exploring energy-performance-quality tradeoffs for scientific workflows with in-situ data analyses | |
Ciznicki et al. | Energy aware scheduling model and online heuristics for stencil codes on heterogeneous computing architectures | |
Davis et al. | Paradigmatic shifts for exascale supercomputing | |
Cicotti et al. | Data movement in data-intensive high performance computing | |
Carracciuolo et al. | Toward a new linpack‐like benchmark for heterogeneous computing resources | |
Jin et al. | Population count on Intel® CPU, GPU and FPGA | |