
Hardware-Software Codesign of DNN Accelerators Using Approximate Posit Multipliers

Published: 31 January 2023
DOI: 10.1145/3566097.3567866

Abstract

Emerging data-intensive AI/ML workloads run into the memory and power walls when executed on general-purpose compute cores. This has led to a myriad of techniques for handling such workloads, among which DNN accelerator architectures have found a prominent place. In this work, we propose a hardware-software co-design approach to achieve system-level benefits. We propose a quantized, data-aware posit number representation that leads to a highly optimized DNN accelerator. We demonstrate this work on the state-of-the-art (SOTA) SIMBA architecture, and the approach is extendable to any other accelerator. Our proposal reduces the buffer/storage requirements within the architecture and lowers the data-transfer cost between the main memory and the DNN accelerator. We have investigated the impact of using integer, IEEE floating-point, and posit multipliers for the LeNet, ResNet, and VGG networks trained and tested on the MNIST, CIFAR10, and ImageNet datasets, respectively. Our system-level analysis shows that the proposed approximate fixed-posit multiplier, when implemented on the SIMBA architecture, achieves on average a ~2.2× speedup, ~3.1× lower energy consumption, and a ~3.2× smaller area than the baseline SOTA architecture, without loss of accuracy (~±1%).
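
To make the arithmetic concrete, below is a minimal Python sketch of the general idea behind an approximate fixed-posit multiply: each operand is split into a sign, a fixed-width regime, an exponent, and a fraction; the power-of-two scales are added; and the low bits of the fraction product are truncated, which is what shrinks the hardware multiplier. The field widths (8-bit words, 2-bit regime, 1-bit exponent), the signed-regime encoding, and all identifiers are illustrative assumptions, not the configuration evaluated in the paper.

    # Illustrative sketch only: all field widths and names are assumptions,
    # not the paper's design.
    N, RS, ES = 8, 2, 1            # word width, fixed regime width, exponent width
    FS = N - 1 - RS - ES           # remaining bits hold the fraction

    def decode(bits):
        """Split an N-bit fixed-posit word into (sign, scale, mantissa)."""
        sign = (bits >> (N - 1)) & 1
        regime = (bits >> (N - 1 - RS)) & ((1 << RS) - 1)
        exp = (bits >> FS) & ((1 << ES) - 1)
        frac = bits & ((1 << FS) - 1)
        k = regime - (1 << (RS - 1))   # regime as a signed offset (assumed encoding)
        scale = (k << ES) + exp        # combined power-of-two scale
        mant = (1 << FS) | frac        # restore the hidden leading 1
        return sign, scale, mant

    def approx_mul(a, b, drop=3):
        """Approximate product: add scales, then truncate `drop` low bits of
        the mantissa product (the approximation that shrinks the multiplier)."""
        sa, ka, ma = decode(a)
        sb, kb, mb = decode(b)
        prod = (ma * mb) >> drop       # truncation instead of a full-width multiply
        value = prod * 2.0 ** (ka + kb + drop - 2 * FS)
        return -value if (sa ^ sb) else value

    # Example: 3.0 * 1.25 under this toy encoding
    print(approx_mul(0b0_10_1_1000, 0b0_10_0_0100))   # ~3.75

Truncating `drop` fraction bits narrows the partial-product array, which is where a design like this saves area and energy; in the paper, such a multiplier is paired with the quantized, data-aware encoding so that buffers and off-chip traffic shrink as well.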


Cited By

  • Analysis of Quantization Across DNN Accelerator Architecture Paradigms. In 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1--2. DOI: 10.23919/DATE56975.2023.10136899. Online publication date: April 2023.

Published In

ASPDAC '23: Proceedings of the 28th Asia and South Pacific Design Automation Conference
January 2023, 807 pages
ISBN: 9781450397834
DOI: 10.1145/3566097

In-Cooperation

  • IPSJ
  • IEEE CAS
  • IEEE CEDA
  • IEICE

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

  1. DNN accelerators
  2. co-design
  3. neural networks

Qualifiers

  • Research-article

Conference

ASPDAC '23

Acceptance Rates

ASPDAC '23 paper acceptance rate: 102 of 328 submissions (31%).
Overall acceptance rate: 466 of 1,454 submissions (32%).
