[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3299874.3319450acmconferencesArticle/Chapter ViewAbstractPublication PagesglsvlsiConference Proceedingsconference-collections
research-article

Ferroelectric FET Based In-Memory Computing for Few-Shot Learning

Published: 13 May 2019 Publication History

Abstract

As CMOS technology advances, the performance gap between the CPU and main memory has not improved. Furthermore, the hardware deployed for Internet of Things (IoT) applications need to process ever growing volumes of data, which can further exacerbate the "memory wall". Computing-in-memory (CiM) architectures, where logic and arithmetic operations are performed in memory, can significantly reduce energy and latency overheads associated with data transfer, and potentially alleviate processor-memory bottlenecks. In this paper, we consider the utility of ternary content addressable memory (TCAM) arrays and CiM arrays based on ferroelectric field effect transistors (FeFETs) to support emerging machine learning models that can learn new classes of data with significantly less training overhead - highly desirable in IoT applications. Architecturally, we use TCAM and CiM arrays to implement the external memory module in a memory enhanced neural network (MENN) - which can be used to minimize catastrophic forgetting - a major problem in applications such as lifelong and few-shot learning. As a representative example, we achieve 95.14% accuracy for a few-shot learning task with the Omniglot data set by using a combined L∞ infinity and L1 distance metric computed via a TCAM-CiM cascaded architecture (as opposed to 99.06% accuracy assuming a GPU backed by DRAM). While there is a slight drop in accuracy, the TCAM-CiM approach is 4.34X faster and 4.18X more energy efficient than a CMOS implementation for the same task. The ability of an FeFET to serve as both a compact logic and storage element helps to enable dense CiM and TCAM structures that drive the aforementioned improvements to application-level figures of merit (FOMs).

References

[1]
A. Laguna, et al. Design of hardware-friendly memory enhanced neural networks. In Design, Automation Test in Europe Conference Exhibition (DATE), 2017, March 2019.
[2]
O. Vinyals, et al. Matching networks for one shot learning. CoRR, abs/1606.04080, 2016.
[3]
A. Santoro, et al. One-shot learning with memory-augmented neural networks. CoRR, abs/1605.06065, 2016.
[4]
L. Kaiser, et al. Learning to remember rare events. CoRR, abs/1703.03129, 2017.
[5]
M. Pharr et al. GPU Gems 2: Programming Techniques for High-Performance Graphics and General-Purpose Computation (Gpu Gems). Addison-Wesley Professional, 2005.
[6]
Z. Lai, et al. Revisiting multi-pass scatter and gather on gpus. In Proceedings of the 47th International Conference on Parallel Processing, ICPP 2018, pages 25:1--25:11, New York, NY, USA, 2018. ACM.
[7]
A. Bremler-Barr, et al. Encoding short ranges in tcam without expansion: Efficient algorithm and applications. IEEE/ACM Transactions on Networking, 26(2):835--850, April 2018.
[8]
D. Reis, et al. Computing in Memory with FeFETs. In ISLPED, pages 24:1--24:6, New York, NY, USA, 2018. ACM.
[9]
S. Jeloka, et al. A 28 nm configurable memory (tcam/bcam/sram) using push-rule 6t bit cell enabling logic-in-memory. IEEE Journal of Solid-State Circuits, 51(4):1009--1021, April 2016.
[10]
Q. Dong, et al. A 0.3v vddmin 4+2t sram for searching and in-memory computing using 55nm ddc technology. In 2017 Symposium on VLSI Circuits, pages C160--C161, June 2017.
[11]
S. Jeloka, et al. A 28 nm configurable memory (tcam/bcam/sram) using push-rule 6t bit cell enabling logic-in-memory. JSSC, 51(4):1009--1021, 2016.
[12]
J. Zhang, et al. In-memory computation of a machine learning classifier in a standard 6t sram array. JSSC, 52(4):915--924, 2017.
[13]
K. Nii, et al. 13.6 a 28nm 400mhz 4-parallel 1.6gsearch/s 80mb ternary cam. In 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), pages 240--241, Feb 2014.
[14]
W. Kang, et al. In-memory processing paradigm for bitwise logic operations in stt-mram. TMAG, 53(11):1--4, 2017.
[15]
D. Fujiki, et al. In-memory data parallel processor. In Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '18, pages 1--14, New York, NY, USA, 2018. ACM.
[16]
A. Aziz, et al. Computing with ferroelectric fets: Devices, models, systems, and applications. In 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE), pages 1289--1298. IEEE, 2018.
[17]
X. Yin, et al. Design and benchmarking of ferroelectric fet based tcam. In Proceedings of the Conference on Design, Automation & Test in Europe, pages 1448--1453. European Design and Automation Association, 2017.
[18]
S. Hochreiter, et al. Learning to learn using gradient descent. In Artificial Neural Networks - ICANN 2001, International Conference Vienna, Austria, August 21-25, 2001 Proceedings, pages 87--94, 2001.
[19]
S. Hochreiter et al. Long short-term memory. Neural Computation, 9(8):1735--1780, 1997.
[20]
A. Pritzel, et al. Neural episodic control. In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, pages 2827--2836, 2017.
[21]
J. Weston, et al. Memory networks. CoRR, abs/1410.3916, 2014.
[22]
M. Zhu, et al. Performance evaluation and optimization of hbm-enabled gpu for data-intensive applications. In Design, Automation Test in Europe Conference Exhibition (DATE), 2017, pages 1245--1248, March 2017.
[23]
R. Shinde, et al. Similarity search and locality sensitive hashing using tcams. CoRR, abs/1006.3514, 2010.
[24]
M.-F. Chang, et al. A 3t1r nonvolatile tcam using mlc reram with sub-1ns search time. In Solid-State Circuits Conference-(ISSCC), 2015 IEEE International, pages 1--3. IEEE, 2015.
[25]
M. Imani, et al. Exploring hyperdimensional associative memory. In International Symposium on High-Performance Computer Architecture, pages 445--456. IEEE, 2017.
[26]
N. Talati, et al. Logic design within memristive memories using memristor-aided logic (magic). TNANO, 15(4):635--650, 2016.
[27]
D. Reis, et al. Computing in memory with fefets. In Proceedings of the International Symposium on Low Power Electronics and Design, ISLPED '18, pages 24:1--24:6, New York, NY, USA, 2018. ACM.
[28]
S. Aga, et al. Compute caches. In 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA), pages 481--492, Feb 2017.
[29]
M. H. Lee, et al. Prospects for ferroelectric HfZrOx FETs with experimentally CET=0.98nm, SSfor=42mv/dec, SSrev=28mv/dec, switch-off <0.2v, and hysteresis-free strategies. In 2015 IEEE International Electron Devices Meeting (IEDM), pages 22.5.1--22.5.4, Dec 2015.
[30]
K. Ni, et al. Critical role of interlayer in hf 0.5 zr 0.5 o 2 ferroelectric fet nonvolatile memory performance. IEEE Transactions on Electron Devices, 65(6):2461--2469, 2018.
[31]
R. Vattikonda, et al. Modeling and minimization of pmos nbti effect for robust nanometer design. In DAC, 2006.
[32]
V. Sze, et al. Efficient processing of deep neural networks: A tutorial and survey. Proceedings of the IEEE, 105(12):2295--2329, 2017.
[33]
P. Chen, et al. Neurosim+: An integrated device-to-algorithm framework for benchmarking synaptic devices and array architectures. In 2017 IEEE International Electron Devices Meeting (IEDM), pages 6.1.1--6.1.4, Dec 2017.

Cited By

View all
  • (2024)SAL: Optimizing the Dataflow of Spin-based Architectures for Lightweight Neural NetworksACM Transactions on Architecture and Code Optimization10.1145/367365421:3(1-27)Online publication date: 14-Jun-2024
  • (2024)SAC: An Ultra-Efficient Spin-based Architecture for Compressed DNNsACM Transactions on Architecture and Code Optimization10.1145/363295721:1(1-26)Online publication date: 19-Jan-2024
  • (2024)PointCIM: A Computing-in-Memory Architecture for Accelerating Deep Point Cloud Analytics2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO)10.1109/MICRO61859.2024.00097(1309-1322)Online publication date: 2-Nov-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GLSVLSI '19: Proceedings of the 2019 Great Lakes Symposium on VLSI
May 2019
562 pages
ISBN:9781450362528
DOI:10.1145/3299874
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. compute-in-memory
  2. fefet
  3. few-shot learning
  4. lifelong learning
  5. mann
  6. neural networks
  7. tcam

Qualifiers

  • Research-article

Funding Sources

  • Semiconductor Research Corporation (SRC)

Conference

GLSVLSI '19
Sponsor:
GLSVLSI '19: Great Lakes Symposium on VLSI 2019
May 9 - 11, 2019
VA, Tysons Corner, USA

Acceptance Rates

Overall Acceptance Rate 312 of 1,156 submissions, 27%

Upcoming Conference

GLSVLSI '25
Great Lakes Symposium on VLSI 2025
June 30 - July 2, 2025
New Orleans , LA , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)131
  • Downloads (Last 6 weeks)10
Reflects downloads up to 17 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)SAL: Optimizing the Dataflow of Spin-based Architectures for Lightweight Neural NetworksACM Transactions on Architecture and Code Optimization10.1145/367365421:3(1-27)Online publication date: 14-Jun-2024
  • (2024)SAC: An Ultra-Efficient Spin-based Architecture for Compressed DNNsACM Transactions on Architecture and Code Optimization10.1145/363295721:1(1-26)Online publication date: 19-Jan-2024
  • (2024)PointCIM: A Computing-in-Memory Architecture for Accelerating Deep Point Cloud Analytics2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO)10.1109/MICRO61859.2024.00097(1309-1322)Online publication date: 2-Nov-2024
  • (2023)FeFET-Based Logic-in-Memory Supporting SA-Free Write-Back and Fully Dynamic Access With Reduced Bitline Charging Activity and Recycled Bitline ChargeIEEE Transactions on Circuits and Systems I: Regular Papers10.1109/TCSI.2023.325196170:6(2398-2411)Online publication date: Jun-2023
  • (2023)FeCrypto: Instruction Set Architecture for Cryptographic Algorithms Based on FeFET-Based In-Memory ComputingIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2022.323373642:9(2889-2902)Online publication date: 1-Sep-2023
  • (2023)ISAC: In-Switch Approximate Cache for IoT Object Detection and RecognitionIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10229067(1-10)Online publication date: 17-May-2023
  • (2023)Invited Paper: Algorithm/Hardware Co-Design for Few-Shot Learning at the Edge2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD)10.1109/ICCAD57390.2023.10323933(1-9)Online publication date: 28-Oct-2023
  • (2023)Ferroelectricity of hafnium oxide-based materials: Current status and future prospects from physical mechanisms to device applicationsJournal of Semiconductors10.1088/1674-4926/44/5/05310144:5(053101)Online publication date: 1-May-2023
  • (2022)Computing-in-memory circuits and cross-layer integrated design and optimization: from SRAM to FeFETSCIENTIA SINICA Informationis10.1360/SSI-2021-042052:4(612)Online publication date: 29-Mar-2022
  • (2022)Re-FeMAT: A Reconfigurable Multifunctional FeFET-Based Memory ArchitectureIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2021.314019441:11(5071-5084)Online publication date: 1-Nov-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media