Huang et al., 2022 - Google Patents
Enabling latency-sensitive DNN inference via joint optimization of model surgery and resource allocation in heterogeneous edgeHuang et al., 2022
- Document ID
- 17611342634659294349
- Author
- Huang Z
- Dong F
- Shen D
- Wang H
- Guo X
- Fu S
- Publication year
- Publication venue
- Proceedings of the 51st International Conference on Parallel Processing
External Links
Snippet
Nowadays, edge computing is widely adopted to resolve the emerging deep neural networks (DNNs)-driven intelligence scenarios with the requirement of low-latency and high- accuracy, which includes heterogeneous end devices and DNNs. In such scenarios, the …
- 238000001356 surgical procedure 0 title abstract description 55
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5066—Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/5044—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering hardware capabilities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30943—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
- G06F17/30946—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2217/00—Indexing scheme relating to computer aided design [CAD]
- G06F2217/78—Power analysis and optimization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Abed-Alguni et al. | Distributed Grey Wolf Optimizer for scheduling of workflow applications in cloud environments | |
Liu et al. | Adaptive asynchronous federated learning in resource-constrained edge computing | |
Mohan et al. | Edge-Fog cloud: A distributed cloud for Internet of Things computations | |
Xu et al. | Dynamic deployment of virtual machines in cloud computing using multi-objective optimization | |
Driscoll et al. | A communication-optimal n-body algorithm for direct interactions | |
Amarjeet et al. | TA-ABC: two-archive artificial bee colony for multi-objective software module clustering problem | |
Yi et al. | Optimizing distributed training deployment in heterogeneous GPU clusters | |
Samani et al. | Multilayer resource-aware partitioning for fog application placement | |
Huang et al. | Enabling latency-sensitive DNN inference via joint optimization of model surgery and resource allocation in heterogeneous edge | |
Yadav et al. | An opposition-based hybrid evolutionary approach for task scheduling in fog computing network | |
Zhou et al. | Deep reinforcement learning-based algorithms selectors for the resource scheduling in hierarchical cloud computing | |
Czarnul et al. | Optimization of execution time under power consumption constraints in a heterogeneous parallel system with gpus and cpus | |
Aliyu et al. | Dynamic partial computation offloading for the metaverse in in-network computing | |
Samikwa et al. | Disnet: Distributed micro-split deep learning in heterogeneous dynamic iot | |
El Gaily et al. | Constrained quantum optimization for resource distribution management | |
Anwar et al. | Recommender system for optimal distributed deep learning in cloud datacenters | |
Klimenko et al. | The comparative estimation of workload relocation approaches in the fog-and edge-computing environments | |
Zhang et al. | Resource and delay aware fine-grained service offloading in collaborative edge computing | |
Pacut et al. | Brief announcement: Deterministic lower bound for dynamic balanced graph partitioning | |
Su et al. | Using grasshopper optimization algorithm to solve 0-1 knapsack computation resources allocation problem in mobile edge computing | |
Kontos et al. | Cloud-Native Applications' Workload Placement over the Edge-Cloud Continuum. | |
Chen et al. | Deep reinforcement learning based container cluster placement strategy in edge computing environment | |
Chen et al. | A scheduling algorithm for heterogeneous computing systems by edge cover queue | |
Satouf et al. | Grey Wolf Optimizer-based Task Scheduling for IoT-based Applications in the Edge Computing | |
Qi et al. | A task unloading strategy of IoT devices using deep reinforcement learning based on mobile cloud computing environment |