Bateni et al., 2018 - Google Patents
Predjoule: A timing-predictable energy optimization framework for deep neural networksBateni et al., 2018
View PDF- Document ID
- 9947927613312248584
- Author
- Bateni S
- Zhou H
- Zhu Y
- Liu C
- Publication year
- Publication venue
- 2018 IEEE Real-Time Systems Symposium (RTSS)
External Links
Snippet
The revolution of deep neural networks (DNNs) is enabling dramatically better autonomy in autonomous driving. However, it is not straightforward to simultaneously achieve both timing predictability (ie, meeting job latency requirements) and energy efficiency that are essential …
- 230000001537 neural 0 title abstract description 52
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power Management, i.e. event-based initiation of power-saving mode
- G06F1/3234—Action, measure or step performed to reduce power consumption
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bateni et al. | Predjoule: A timing-predictable energy optimization framework for deep neural networks | |
CN110070181A (en) | A kind of optimization method of the deep learning for edge calculations equipment | |
WO2021057720A1 (en) | Neural network model processing method and apparatus, computer device, and storage medium | |
CN111738434B (en) | Method for executing deep neural network on heterogeneous processing unit | |
CN109257429A (en) | A kind of calculating unloading dispatching method based on deeply study | |
Chen et al. | Accurate workload prediction for edge data centers: Savitzky-Golay filter, CNN and BiLSTM with attention mechanism | |
CN111427758A (en) | Task calculation amount determining method and device and electronic equipment | |
Dublish et al. | Poise: Balancing thread-level parallelism and memory system performance in GPUs using machine learning | |
CN116909378A (en) | Deep reinforcement learning-based GPU dynamic energy efficiency optimization runtime method and system | |
Lin et al. | A workload-aware dvfs robust to concurrent tasks for mobile devices | |
Spantidi et al. | Frequency-based power efficiency improvement of CNNs on heterogeneous IoT computing systems | |
Samie et al. | Fast operation mode selection for highly efficient iot edge devices | |
CN117873690B (en) | Method for managing power consumption of arithmetic unit chip, computing subsystem and intelligent computing platform | |
US12093836B2 (en) | Automatic multi-objective hardware optimization for processing of deep learning networks | |
CN116820730B (en) | Task scheduling method, device and storage medium of multi-engine computing system | |
CN112990461B (en) | Method, device, computer equipment and storage medium for constructing neural network model | |
Zhou et al. | CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning | |
Kunas et al. | Neuropar, a neural network-driven EDP optimization strategy for parallel workloads | |
CN114217688B (en) | NPU power consumption optimization system and method based on neural network structure | |
Wu et al. | MOC: Multi-Objective Mobile CPU-GPU Co-Optimization for Power-Efficient DNN Inference | |
CN117435451A (en) | Method for establishing power consumption and performance model of virtual computing unit in mobile edge computing | |
CN113485848B (en) | Deep neural network deployment method and device, computer equipment and storage medium | |
KR20220099487A (en) | Method for exploration via manipulating curiosity by anticipating others and prioritization of experiences for multi-agent reinforcement learning | |
Lin et al. | On-Demand Accelerating Deep Neural Network Inference via Edge Computing | |
Lin et al. | A multi-agent reinforcement learning-based method for server energy efficiency optimization combining DVFS and dynamic fan control |