[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Bateni et al., 2018 - Google Patents

Predjoule: A timing-predictable energy optimization framework for deep neural networks

Bateni et al., 2018

View PDF
Document ID
9947927613312248584
Author
Bateni S
Zhou H
Zhu Y
Liu C
Publication year
Publication venue
2018 IEEE Real-Time Systems Symposium (RTSS)

External Links

Snippet

The revolution of deep neural networks (DNNs) is enabling dramatically better autonomy in autonomous driving. However, it is not straightforward to simultaneously achieve both timing predictability (ie, meeting job latency requirements) and energy efficiency that are essential …
Continue reading at pdfs.semanticscholar.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Programme initiating; Programme switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F1/00Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power Management, i.e. event-based initiation of power-saving mode
    • G06F1/3234Action, measure or step performed to reduce power consumption
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints

Similar Documents

Publication Publication Date Title
Bateni et al. Predjoule: A timing-predictable energy optimization framework for deep neural networks
CN110070181A (en) A kind of optimization method of the deep learning for edge calculations equipment
WO2021057720A1 (en) Neural network model processing method and apparatus, computer device, and storage medium
CN111738434B (en) Method for executing deep neural network on heterogeneous processing unit
CN109257429A (en) A kind of calculating unloading dispatching method based on deeply study
Chen et al. Accurate workload prediction for edge data centers: Savitzky-Golay filter, CNN and BiLSTM with attention mechanism
CN111427758A (en) Task calculation amount determining method and device and electronic equipment
Dublish et al. Poise: Balancing thread-level parallelism and memory system performance in GPUs using machine learning
CN116909378A (en) Deep reinforcement learning-based GPU dynamic energy efficiency optimization runtime method and system
Lin et al. A workload-aware dvfs robust to concurrent tasks for mobile devices
Spantidi et al. Frequency-based power efficiency improvement of CNNs on heterogeneous IoT computing systems
Samie et al. Fast operation mode selection for highly efficient iot edge devices
CN117873690B (en) Method for managing power consumption of arithmetic unit chip, computing subsystem and intelligent computing platform
US12093836B2 (en) Automatic multi-objective hardware optimization for processing of deep learning networks
CN116820730B (en) Task scheduling method, device and storage medium of multi-engine computing system
CN112990461B (en) Method, device, computer equipment and storage medium for constructing neural network model
Zhou et al. CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning
Kunas et al. Neuropar, a neural network-driven EDP optimization strategy for parallel workloads
CN114217688B (en) NPU power consumption optimization system and method based on neural network structure
Wu et al. MOC: Multi-Objective Mobile CPU-GPU Co-Optimization for Power-Efficient DNN Inference
CN117435451A (en) Method for establishing power consumption and performance model of virtual computing unit in mobile edge computing
CN113485848B (en) Deep neural network deployment method and device, computer equipment and storage medium
KR20220099487A (en) Method for exploration via manipulating curiosity by anticipating others and prioritization of experiences for multi-agent reinforcement learning
Lin et al. On-Demand Accelerating Deep Neural Network Inference via Edge Computing
Lin et al. A multi-agent reinforcement learning-based method for server energy efficiency optimization combining DVFS and dynamic fan control