Pentecost et al., 2019 - Google Patents

Maxnvm: Maximizing dnn storage density and inference efficiency with sparse encoding and error mitigation

Pentecost et al., 2019

Document ID: 8013618013168750593
Author: Pentecost L; Donato M; Reagen B; Gupta U; Ma S; Wei G; Brooks D
Publication year: 2019
Publication venue: Proceedings of the 52Nd Annual IEEE/ACM International Symposium on Microarchitecture

External Links

Cited by

Snippet

Deeply embedded applications require low-power, low-cost hardware that fits within stringent area constraints. Deep learning has many potential uses in these domains, but introduces significant inefficiencies stemming from off-chip DRAM accesses of model …

Continue reading at www.eecs.tufts.edu (PDF) (other versions)

238000003860 storage 0 title description 61

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G06F11/08—Error detection or correction by redundancy in data representation, e.g. by using checking codes
- G06F11/10—Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
- G06F11/1008—Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/0223—User address space allocation, e.g. contiguous or non contiguous base addressing
- G06F12/023—Free address space management
- G06F12/0238—Free address space management in non-volatile memory
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11C—STATIC STORES
- G11C11/00—Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor
- G11C11/56—Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using storage elements with more than two stable states represented by steps, e.g. of voltage, current, phase, frequency
- G11C11/5621—Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using storage elements with more than two stable states represented by steps, e.g. of voltage, current, phase, frequency using charge storage in a floating gate
- G11C11/5642—Sensing or reading circuits; Data output circuits
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11C—STATIC STORES
- G11C2211/00—Indexing scheme relating to digital stores characterized by the use of particular electric or magnetic storage elements; Storage elements therefor
- G11C2211/56—Indexing scheme relating to G11C11/56 and sub-groups for features not covered by these groups
- G11C2211/564—Miscellaneous aspects
- G11C2211/5641—Multilevel memory having cells with different number of storage levels
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

Similar Documents

Publication	Publication Date	Title
Pentecost et al.	2019	Maxnvm: Maximizing dnn storage density and inference efficiency with sparse encoding and error mitigation
CN103348330B (en)	2017-05-24	dynamic higher-level redundancy mode management with independent silicon elements
KR101981355B1 (en)	2019-05-22	Soft information generation for memory systems
Mittal	2017	A survey of soft-error mitigation techniques for non-volatile memories
US20140229655A1 (en)	2014-08-14	Storing Error Correction Code (ECC) Data In a Multi-Tier Memory Structure
Wen et al.	2014	State-restrict MLC STT-RAM designs for high-reliable high-performance memory system
WO2014004184A1 (en)	2014-01-03	Fast tracking for flash channels
TW201203269A (en)	2012-01-16	LDPC erasure decoding for flash memories
CN109582493A (en)	2019-04-05	Method and apparatus for reducing the silence data mistake in Nonvolatile memory system
CN109799950A (en)	2019-05-24	The adaptive management of intermediate storage
US20200364118A1 (en)	2020-11-19	Optimized neural network data organization
Liu et al.	2014	Improving read performance of NAND flash SSDs by exploiting error locality
Liang et al.	2019	An empirical study of quad-level cell (qlc) nand flash ssds for big data applications
US10042565B2 (en)	2018-08-07	All-flash-array primary storage and caching appliances implementing triple-level cell (TLC)-NAND semiconductor microchips
CN117693758A (en)	2024-03-12	Hybrid memory management for non-volatile memory (NVM) devices for use with recurrent neural networks
Seyedzadeh et al.	2018	Enabling fine-grain restricted coset coding through word-level compression for pcm
Guo et al.	2016	Flexlevel NAND flash storage system design to reduce LDPC latency
Mohseni et al.	2023	A survey on techniques for improving Phase Change Memory (PCM) lifetime
Jasemi et al.	2020	Reliable and energy efficient MLC STT-RAM buffer for CNN accelerators
Nguyen et al.	2021	ZEM: zero-cycle bit-masking module for deep learning refresh-less DRAM
Wan et al.	2022	Accuracy and resiliency of analog compute-in-memory inference engines
Wu et al.	2024	Mitigating Write Disturbance in Non-Volatile Memory via Coupling Machine Learning with Out-of-Place Updates
Zhao et al.	2022	A Low-Latency and High-Endurance MLC STT-MRAM-Based Cache System
Pozidis et al.	2021	Circuit and system-level aspects of phase change memory
Chen et al.	2020	Optimizing Lifetime Capacity and Read Performance of Bit-Alterable 3-D NAND Flash