Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJune 2021
MAHASIM: Machine-Learning Hardware Acceleration Using a Software-Defined Intelligent Memory System
Journal of Signal Processing Systems (JSPS), Volume 93, Issue 6Pages 659–675https://doi.org/10.1007/s11265-019-01505-1AbstractAs computations in machine-learning applications are increasing simultaneously along the size of datasets, the energy and performance costs of data movement dominate that of compute. This issue is more pronounced in embedded systems with limited ...
- research-articleApril 2021
Quantifying the design-space tradeoffs in autonomous drones
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsPages 661–673https://doi.org/10.1145/3445814.3446721With fully autonomous flight capabilities coupled with user-specific applications, drones, in particular quadcopter drones, are becoming prevalent solutions in myriad commercial and research contexts. However, autonomous drones must operate within ...
- research-articleNovember 2020
PISCES: power-aware implementation of SLAM by customizing efficient sparse algebra
DAC '20: Proceedings of the 57th ACM/EDAC/IEEE Design Automation ConferenceArticle No.: 79, Pages 1–6A key real-time task in autonomous systems is simultaneous localization and mapping (SLAM). Although prior work has proposed hardware accelerators to process SLAM in real time, they paid less attention to power consumption. To be more power-efficient, ...
- research-articleJune 2020
ASCELLA: accelerating sparse computation by enabling stream accesses to memory
Sparse computations dominate a wide range of applications from scientific problems to graph analytics. The main characterization of sparse computations, indirect memory accesses, prevents them from effectively achieving high performance on general-...
- research-articleJune 2019
LODESTAR: Creating Locally-Dense CNNs for Efficient Inference on Systolic Arrays
DAC '19: Proceedings of the 56th Annual Design Automation Conference 2019Article No.: 233, Pages 1–2https://doi.org/10.1145/3316781.3322472The performance of sparse problems suffers from lack of spatial locality and low memory bandwidth utilization. However, the distribution of non-zero values in the data structures of a class of sparse problems, such as matrix operations in neural ...