Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2024
Towards High Performance QNNs via Distribution-Based CNOT Gate Reduction
ACM Transactions on Architecture and Code Optimization (TACO), Volume 21, Issue 4Article No.: 87, Pages 1–22https://doi.org/10.1145/3695872Quantum Neural Networks (QNNs) are one of the most promising applications that can be implemented on NISQ-era quantum computers. In this study, we observe that QNNs often suffer from gate redundancy, which hugely declines the performance and accuracy of ...
- research-articleOctober 2024
Efficient one-shot Neural Architecture Search with progressive choice freezing evolutionary search
AbstractNeural Architecture Search (NAS) is a fast-developing research field to promote automatic machine learning. Among the recently popular NAS methods, one-shot NAS has attracted significant attention since it greatly reduces the training cost ...
- short-paperJune 2024
A New Routing Strategy to Improve Success Rates of Quantum Computers
GLSVLSI '24: Proceedings of the Great Lakes Symposium on VLSI 2024Pages 546–550https://doi.org/10.1145/3649476.3658790In the current noisy intermediate-scale quantum (NISQ) Era, Quantum Computing faces significant challenges due to noise, which severely restricts the application of computing complex algorithms. Superconducting quantum chips, one of the pioneer quantum ...
- research-articleMay 2024
Enhancing Neural Network Reliability: Insights From Hardware/Software Collaboration With Neuron Vulnerability Quantization
IEEE Transactions on Computers (ITCO), Volume 73, Issue 8Pages 1953–1966https://doi.org/10.1109/TC.2024.3398492Ensuring the reliability of deep neural networks (DNNs) is paramount in safety-critical applications. Although introducing supplementary fault-tolerant mechanisms can augment the reliability of DNNs, an efficiency tradeoff may be introduced. This study ...
- research-articleJune 2024
HSAS: Efficient task scheduling for large scale heterogeneous systolic array accelerator cluster
Future Generation Computer Systems (FGCS), Volume 154, Issue CPages 440–450https://doi.org/10.1016/j.future.2024.01.023AbstractTo efficiently process a large amount of deep neural network models can be challenging, due to significant differences among models and even layers. Nowadays, systolic array has become a common architecture for processing neural networks. With ...
Highlights- We introduce well validated systolic array performance and energy models.
- We propose a scheduling strategy for heterogeneous systolic array accelerators.
- We propose a task decomposition algorithm for CNN tasks.
-
- research-articleMay 2024
Design and application of small-diameter closed thoracic drainage tube fixation following lung wedge resection: A randomised prospective study
Technology and Health Care (TAHC), Volume 32, Issue 3Pages 1503–1513https://doi.org/10.3233/THC-230558BACKGROUND:The use of indwelling closed thoracic drainage tubes in the wedge resection of the lungs is of great significance to postoperative recovery. However, there are potential risks.
OBJECTIVE:To explore the design feasibility and application ...
- research-articleDecember 2023
NAS-SE: Designing A Highly-Efficient In-Situ Neural Architecture Search Engine for Large-Scale Deployment
MICRO '23: Proceedings of the 56th Annual IEEE/ACM International Symposium on MicroarchitecturePages 756–768https://doi.org/10.1145/3613424.3614265The emergence of Neural Architecture Search (NAS) enables an automated neural network development process that potentially replaces manually-enabled machine learning expertise. A state-of-the-art NAS method, namely One-Shot NAS, has been proposed to ...
- research-articleOctober 2023
Saca-FI: A microarchitecture-level fault injection framework for reliability analysis of systolic array based CNN accelerator
Future Generation Computer Systems (FGCS), Volume 147, Issue CPages 251–264https://doi.org/10.1016/j.future.2023.05.009AbstractAs convolutional neural network CNN accelerators are being adopted in emerging safety-critical areas, their reliability becomes prominent. The systolic array is widely used as the major processing structure in CNN accelerators, so its reliability ...
Highlights- We propose the first microarchitecture level fault injection framework saca-FI to evaluate the resiliency characteristics of CNN accelerator’s systolic array.
- We comprehensively analyze the resiliency of three commonly used CNN models.
- research-articleJuly 2023
Accelerating Convolutional Neural Network by Exploiting Sparsity on GPUs
ACM Transactions on Architecture and Code Optimization (TACO), Volume 20, Issue 3Article No.: 36, Pages 1–26https://doi.org/10.1145/3600092The convolutional neural network (CNN) is an important deep learning method, which is widely used in many fields. However, it is very time consuming to implement the CNN where convolution usually takes most of the time. There are many zero values in ...
- surveyJuly 2023
A Survey of AI-enabled Dynamic Manufacturing Scheduling: From Directed Heuristics to Autonomous Learning
ACM Computing Surveys (CSUR), Volume 55, Issue 14sArticle No.: 307, Pages 1–36https://doi.org/10.1145/3590163As one of the most complex parts in manufacturing systems, scheduling plays an important role in the efficient allocation of resources to meet individual customization requirements. However, due to the uncertain disruptions (e.g., task arrival time, ...
- research-articleJune 2023
Enabling High-Efficient ReRAM-Based CNN Training Via Exploiting Crossbar-Level Insignificant Writing Elimination
IEEE Transactions on Computers (ITCO), Volume 72, Issue 11Pages 3218–3230https://doi.org/10.1109/TC.2023.3288763Convolutional neural networks (CNNs) have been widely adopted in many deep learning applications. However, training a deep CNN requests intensive data transfer, which is both time and energy consuming. Using resistive random-access memory (ReRAM) to ...
- research-articleJune 2023
EEFL: High-Speed Wireless Communications Inspired Energy Efficient Federated Learning over Mobile Devices
MobiSys '23: Proceedings of the 21st Annual International Conference on Mobile Systems, Applications and ServicesPages 544–556https://doi.org/10.1145/3581791.3596865Energy efficiency is essential for federated learning (FL) over mobile devices and its potential prosperous applications. Different from existing communication efficient FL research efforts, which regard communication energy consumption as the bottleneck,...
- research-articleMay 2023
Accelerating Reinforcement Learning-Based CCSL Specification Synthesis Using Curiosity-Driven Exploration
IEEE Transactions on Computers (ITCO), Volume 72, Issue 5Pages 1431–1446https://doi.org/10.1109/TC.2022.3197956The Clock Constraint Specification Language (CCSL) has been widely acknowledged as a promising system-level specification for the modeling and analysis of timing behaviors of real-time and embedded systems. However, along with the increasing complexity of ...
- research-articleNovember 2022
Enabling PIM-based AES encryption for online video streaming
Journal of Systems Architecture: the EUROMICRO Journal (JOSA), Volume 132, Issue Chttps://doi.org/10.1016/j.sysarc.2022.102734AbstractEncryption of streaming video is becoming critical to the success of commercial enterprise and to consumers alike. To meet copyright and privacy requirements, encrypting video data on-the-fly during transmission is necessary. In this ...
- research-articleNovember 2022
PervasiveFL: Pervasive Federated Learning for Heterogeneous IoT Systems
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCADICS), Volume 41, Issue 11Pages 4100–4111https://doi.org/10.1109/TCAD.2022.3197491Federated learning (FL) has been recognized as a promising collaborative on-device machine learning method in the design of Internet of Things (IoT) systems. However, most existing FL methods fail to deal with IoT applications that contain a variety of ...
- ArticleOctober 2022
Multi-intent Compatible Transformer Network for Recommendation
AbstractThe core of recommendation systems is to explore users’ preferences from users’ historical records and accordingly recommend items to meet users’ interests. Previous works explore interaction graph to capture multi-order collaborative signals and ...
- research-articleOctober 2022
DynamAP: Architectural Support for Dynamic Graph Traversal on the Automata Processor
ACM Transactions on Architecture and Code Optimization (TACO), Volume 19, Issue 4Article No.: 60, Pages 1–26https://doi.org/10.1145/3556976Dynamic graph traversals (DGTs) currently are widely used in many important application domains, especially in this big-data era that urgently demands high-performance graph processing and analysis. Unlike static graph traversals, DGTs in real-world ...
- research-articleSeptember 2022
Siamese Graph-Based Dynamic Matching for Collaborative Filtering
Information Sciences: an International Journal (ISCI), Volume 611, Issue CPages 185–198https://doi.org/10.1016/j.ins.2022.08.062AbstractBehaviorally similar neighbors in the interaction graph have been actively explored to facilitate the collaboration between users and items and address the interaction sparsity issue. We investigate homogenous neighbors between users ...
- ArticleAugust 2022
Design of Enveloping Underwater Soft Gripper Based on the Bionic Structure
AbstractThe ocean has been an important research site for scientists in recent years. Many marine creatures with soft bodies such as sea cucumbers are fragile and easily deformed, so it is difficult when grasping these kinds of targets. In this regard, ...
- research-articleFebruary 2022
An Effective Image Enhancement Method for Color Fundus Images
ACAI '21: Proceedings of the 2021 4th International Conference on Algorithms, Computing and Artificial IntelligenceArticle No.: 43, Pages 1–5https://doi.org/10.1145/3508546.3508589A method is proposed for adaptive fundus image enhancement so as to restore the color images that are with extremely low or nonuniform brightness. Our proposed method is capable of increasing the brightness, contrast of images without losing original ...