
Sasaki et al., 2019 - Google Patents

Post training weight compression with distribution-based filter-wise quantization step


Document ID
6419765911415733700
Author
Sasaki S
Maki A
Miyashita D
Deguchi J
Publication year
2019
Publication venue
2019 IEEE Symposium on Low-Power and High-Speed Chips (COOL CHIPS)

Snippet

Quantization of models with lower bit precision is a promising method to develop lower-power and smaller-area neural network hardware. However, 4-bit or lower quantization usually requires additional retraining with a labeled dataset for backpropagation to improve …
Continue reading at ieeexplore.ieee.org
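The abstract describes post-training (retraining-free) weight quantization with a quantization step chosen per filter from that filter's weight distribution. A minimal sketch of the general idea, assuming a per-output-channel uniform quantizer whose step is derived from each filter's standard deviation (the `sigma_mult` heuristic here is illustrative, not the paper's exact rule):

```python
import numpy as np

def filterwise_quantize(weights, n_bits=4, sigma_mult=3.0):
    """Post-training, filter-wise uniform quantization (sketch).

    For each filter (output channel), a quantization step is derived
    from that filter's weight distribution -- here a multiple of its
    standard deviation -- and weights are rounded to n_bits signed levels.
    No retraining or labeled data is involved.
    """
    q_max = 2 ** (n_bits - 1) - 1            # e.g. 7 for 4-bit signed
    out = np.empty_like(weights, dtype=np.float32)
    for f in range(weights.shape[0]):        # one step per filter
        w = weights[f]
        step = sigma_mult * w.std() / q_max  # distribution-based step
        if step == 0:
            out[f] = 0.0
            continue
        q = np.clip(np.round(w / step), -q_max - 1, q_max)
        out[f] = q * step                    # dequantized weights
    return out

# Example: quantize a small random conv weight tensor (filters-first layout)
rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.05, size=(8, 3, 3, 3)).astype(np.float32)
w_q = filterwise_quantize(w, n_bits=4)
err = np.abs(w - w_q).mean()
```

Choosing the step per filter rather than per layer lets narrow and wide weight distributions each use the full 4-bit range, which is the usual motivation for filter-wise steps.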

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/04 Architectures, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/08 Learning methods
    • G06N3/082 Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/06 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons

Similar Documents

Publication / Title
Li et al. Q-ViT: Accurate and fully quantized low-bit vision transformer
CN111079781B (en) Lightweight convolutional neural network image recognition method based on low rank and sparse decomposition
CN113159173B (en) Convolutional neural network model compression method combining pruning and knowledge distillation
Khashman et al. Image compression using neural networks and Haar wavelet
CN111147862B (en) End-to-end image compression method based on target coding
Laha et al. Design of vector quantizer for image compression using self-organizing feature map and surface fitting
CN112016674A (en) Knowledge distillation-based convolutional neural network quantification method
CN114970853B (en) Cross-range quantized convolutional neural network compression method
Sasaki et al. Post training weight compression with distribution-based filter-wise quantization step
CN112702600B (en) Image coding and decoding neural network layered fixed-point method
Chen et al. DNN gradient lossless compression: Can GenNorm be the answer?
CN116976428A (en) Model training method, device, equipment and storage medium
Ando et al. Dither NN: An accurate neural network with dithering for low bit-precision hardware
CN108805844B (en) Lightweight regression network construction method based on prior filtering
Yan et al. Qnet: an adaptive quantization table generator based on convolutional neural network
Wu et al. FedComp: A Federated Learning Compression Framework for Resource-Constrained Edge Computing Devices
CN114943335A (en) Layer-by-layer optimization method of ternary neural network
CN117151178A (en) FPGA-oriented CNN customized network quantification acceleration method
Liu et al. Vector quantization in DCT domain using fuzzy possibilistic c-means based on penalized and compensated constraints
Gafour et al. Genetic fractal image compression
CN110378466A (en) Quantization method and system based on neural network difference
US20230196095A1 (en) Pure integer quantization method for lightweight neural network (lnn)
CN114372565B (en) Target detection network compression method for edge equipment
Huang et al. Accelerating convolutional neural network via structured gaussian scale mixture models: a joint grouping and pruning approach
Hirose et al. Quantization error-based regularization for hardware-aware neural network training