Chen et al., 2014 - Google Patents

Dadiannao: A machine-learning supercomputer

Chen et al., 2014

Document ID: 6197973223185259372
Author: Chen Y; Luo T; Liu S; Zhang S; He L; Wang J; Li L; Chen T; Xu Z; Sun N; Temam O
Publication year: 2014
Publication venue: 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture

External Links

Cited by

Snippet

Many companies are deploying services, either for consumers or industry, which are largely based on machine-learning algorithms for sophisticated processing of large amounts of data. The state-of-the-art and most popular such machine-learning algorithms are …

Continue reading at pages.saclay.inria.fr (PDF) (other versions)

238000010801 machine learning 0 title abstract description 30

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/80—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8007—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units
- G06F9/3889—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units controlled by multiple instructions, e.g. MIMD, decoupled access or execute
- G06F9/3891—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units controlled by multiple instructions, e.g. MIMD, decoupled access or execute organised in groups of units sharing resources, e.g. clusters
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/3004—Arrangements for executing specific machine instructions to perform operations on memory
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/78—Architectures of general purpose stored programme computers comprising a single central processing unit
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5068—Physical circuit design, e.g. layout for integrated circuits or printed circuit boards
- G06F17/5072—Floorplanning, e.g. partitioning, placement
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2217/00—Indexing scheme relating to computer aided design [CAD]
- G06F2217/78—Power analysis and optimization
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining

Similar Documents

Publication	Publication Date	Title
Chen et al.	2014	Dadiannao: A machine-learning supercomputer
Luo et al.	2016	DaDianNao: A neural network supercomputer
Chen et al.	2016	DianNao family: energy-efficient hardware accelerators for machine learning
US20230153621A1 (en)	2023-05-18	Arithmetic unit for deep learning acceleration
Shawahna et al.	2018	FPGA-based accelerators of deep learning networks for learning and classification: A review
Samajdar et al.	2020	A systematic methodology for characterizing scalability of dnn accelerators using scale-sim
CN110197276B (en)	2024-03-22	Data volume engraving device for deep learning acceleration
Azarkhish et al.	2017	Neurostream: Scalable and energy efficient deep learning with smart memory cubes
JP6977239B2 (en)	2021-12-08	Matrix multiplier
Du et al.	2015	ShiDianNao: Shifting vision processing closer to the sensor
Kim et al.	2016	Neurocube: A programmable digital neuromorphic architecture with high-density 3D memory
Chen et al.	2014	Diannao: A small-footprint high-throughput accelerator for ubiquitous machine-learning
US20200285950A1 (en)	2020-09-10	Structured Weight Based Sparsity In An Artificial Neural Network Compiler
EP3346425B1 (en)	2023-12-20	Hardware accelerator engine and method
US11551028B2 (en)	2023-01-10	Structured weight based sparsity in an artificial neural network
Kim et al.	2008	A 125 GOPS 583 mW network-on-chip based parallel processor with bio-inspired visual attention engine
CN110968543A (en)	2020-04-07	Computing system and method in memory
Han et al.	2016	CNN-MERP: An FPGA-based memory-efficient reconfigurable processor for forward and backward propagation of convolutional neural networks
EP3346427B1 (en)	2023-12-20	Configurable accelerator framework, system and method
Chang et al.	2019	VWA: Hardware efficient vectorwise accelerator for convolutional neural network
Chen et al.	2020	A NoC-based simulator for design and evaluation of deep neural networks
US20200005127A1 (en)	2020-01-02	System And Method Of Input Alignment For Efficient Vector Operations In An Artificial Neural Network
Huang et al.	2021	IECA: An in-execution configuration CNN accelerator with 30.55 GOPS/mm² area efficiency
Firuzan et al.	2022	Reconfigurable network-on-chip based convolutional neural network accelerator
Yoshida et al.	1991	The approach to multiple instruction execution in the GMICRO/400 processor