More Web Proxy on the site http://driver.im/

research-article

Public Access

Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices

Authors:

Matthew L. Merck,

Arthur Siqueira,

Abhijeet Saraha,

Hyesoon KimAuthors Info & Claims

PEARC '19: Practice and Experience in Advanced Research Computing 2019: Rise of the Machines (learning)

Article No.: 65, Pages 1 - 6

https://doi.org/10.1145/3332186.3333049

Published: 28 July 2019 Publication History

Abstract

Edge devices and robots have access to an abundance of raw data that needs to be processed on the edge. Deep neural networks (DNNs) can help these devices understand and learn from this complex data; however, executing DNNs while achieving high performance is a challenge for edge devices. This is because of the high computational demands of DNN execution in real-time. This paper describes and implements a method to enable edge devices to execute DNNs collaboratively. This is possible and useful because in many environments, several on-edge devices are already integrated in their surroundings, but are usually idle and can provide additional computing power to a distributed system. We implement this method on two iRobots, each of which has been equipped with a Raspberry Pi 3. Then, we characterize the execution performance, communication latency, energy consumption, and thermal behavior of our system while it is executing AlexNet.

References

[1]

Martín Abadi et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https://www.tensorflow.org/ Software available from tensorflow.org.

[2]

Bahar Asgari, Ramyad Hadidi, Hyesoon Kim, and Sudhakar Yalamanchili. 2019. LODESTAR: Creating Locally-Dense CNNs for Efficient Inference on Systolic Arrays. ACM/IEE Design Automation Conference (DAC) - Late Breaking Results, Las Vegas, NV (2019).

Digital Library

[3]

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In ICLR'15). ACM.

[4]

F Biscotti, J Skorupa, R Contu, et al. 2014. The Impact of the Internet of Things on Data Centers. Gartner Research 18 (2014).

[5]

François Chollet et al. 2015. Keras. https://github.com/fchollet/keras.

[6]

Ronan Collobert and Jason Weston. 2008. A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning. In ICML'8. ACM, 160--167.

Digital Library

[7]

Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. 2014. Training Deep Neural Networks with Low Precision Multiplication. arXiv preprint arXiv:1412.7024 (2014).

[8]

Raspberry PI Foundation. 2017. Raspberry Pi 3B+. www.raspber-rypi.org/products/raspberry-pi-3-model-b/. {Online; accessed 04/01/19}.

[9]

Alessandro Giusti, Jérôme Guzzi, Dan C Cireşan, Fang-Lin He, Juan P Rodríguez, Flavio Fontana, Matthias Faessler, Christian Forster, Jürgen Schmidhuber, Gianni Di Caro, et al. 2016. A machine learning approach to visual perception of forest trails for mobile robots. IEEE Robotics and Automation Letters 1, 2 (2016), 661--667.

[10]

Yunchao Gong, Liu Liu, Ming Yang, and Lubomir Bourdev. 2014. Compressing Deep Convolutional Networks Using Vector Quantization. arXiv preprint arXiv:1412.6115 (2014).

[11]

Binita Gupta. 2015. Discovering cloud-based services for iot devices in an iot network associated with a user. US Patent App. 14/550,595.

[12]

Ramyad Hadidi, Jiashen Cao, Michael Ryoo, and Hyesoon Kim. 2018. Collaborative Execution of Deep Neural Networks on Internet of Things Device. arXiv preprint (2018).

[13]

Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, and Hyesoon Kim. 2018. Distributed Perception by Collaborative Robots. IEEE Robotics and Automation Letters (RA-L), and International Conference on Intelligent Robots and Systems 2018 (IROS) 3, 4 (Oct 2018), 3709--3716.

[14]

Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, and Hyesoon Kim. 2019. Robustly Executing DNNs in IoT Systems Using Coded Distributed Computing. ACM/IEE Design Automation Conference (DAC)-Late Breaking Results, Las Vegas, NV (2019).

Digital Library

[15]

Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael Ryoo, and Hyesoon Kim. 2018. Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices. arXiv preprint arXiv:1802.02138 (2018).

[16]

Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael S. Ryoo, and Hyesoon Kim. 2018. Real-Time Image Recognition Using Collaborative IoT Devices. In Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament on Co-designing Pareto-efficient Deep Learning (ReQuEST '18). ACM, New York, NY, USA, Article 4.

Digital Library

[17]

Ramyad Hadidi, Jiashen Cao, Fei Wu, Tushar Kirshna, Michael S. Ryoo, and Hyesoon Kim. 2019. An Edge-Centric Scalable Intelligent Framework To Collaboratively Execute DNN. Demo for SysML Conference, Palo Alto, CA (2019).

[18]

Song Han, Huizi Mao, and William J Dally. 2016. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding. In 4th International Conference on Learning Representations. ACM.

[19]

iRobot Inc. 2019. iRobot Create 2 Open Interface. www.cdn-shop.adafruit.com/datasheetscreate_2_Open_Interface_Spec.pdf. {Online; accessed 15/03/19}.

[20]

iRobot Inc. 2019. iRobot Create 2 Programmable Robot. www.irobot.com/about-irobot/stem/create-2. {Online; accessed 15/03/19}.

[21]

Rafiullah Khan, Sarmad Ullah Khan, Rifaqat Zaheer, and Shahid Khan. 2012. Future Internet: The Internet of Things Architecture, Possible Applications and Key Challenges. In FIT'12. IEEE, 257--260.

Digital Library

[22]

Urs Köster, Tristan Webb, Xin Wang, Marcel Nassar, Arjun K Bansal, William Constable, Oguz Elibol, Scott Gray, Stewart Hall, Luke Hornof, et al. 2017. Flexpoint: An adaptive numerical format for efficient training of deep neural networks. In Advances in Neural Information Processing Systems (NIPS). 1742--1752.

[23]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet Classification With Deep Convolutional Neural Networks. In 26th Annual Conference on Neural Information Processing Systems (NIPS). ACM, 1097--1105.

Digital Library

[24]

Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. nature 521, 7553 (2015), 436.

[25]

In Lee and Kyoochun Lee. 2015. The Internet of Things (IoT): Applications, Investments, and Challenges for Enterprises. Business Horizons 58, 4 (2015), 431--440.

[26]

Hui Li and Xiaojiang Xing. 2015. Internet of things service architecture and method for realizing internet of things service. US Patent 8,984,113.

[27]

Shancang Li, Li Da Xu, and Shanshan Zhao. 2015. The internet of things: a survey. Information Systems Frontiers 17, 2 (2015), 243--259.

Digital Library

[28]

Ji Lin, Yongming Rao, Jiwen Lu, and Jie Zhou. 2017. Run-time neural pruning. In Advances in Neural Information Processing Systems (NIPS). 2181--2191.

Digital Library

[29]

Huimin Lu, Yujie Li, Shenglin Mu, Dong Wang, Hyoungseop Kim, and Seiichi Serikawa. 2018. Motor anomaly detection for unmanned aerial vehicles using reinforcement learning. IEEE internet of things journal 5, 4 (2018), 2315--2322.

[30]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529.

[31]

Mark Pfeiffer, Michael Schaeuble, Juan Nieto, Roland Siegwart, and Cesar Cadena. 2017. From perception to decision: A data-driven approach to end-to-end motion planning for autonomous ground robots. In 2017 ieee international conference on robotics and automation (icra). IEEE, 1527--1533.

[32]

Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 779--788.

[33]

Omer Berat Sezer, Erdogan Dogdu, and Ahmet Murat Ozbayoglu. 2018. Context-aware computing, learning, and big data in internet of things: a survey. IEEE Internet of Things Journal 5, 1 (2018), 1--27.

[34]

David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484.

[35]

Karen Simonyan and Andrew Zisserman. 2014. Two-Stream Convolutional Networks for Action Recognition in Videos. In NIPS'14. ACM, 568--576.

Digital Library

[36]

Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations. ACM.

[37]

Arti Singh, Baskar Ganapathysubramanian, Asheesh Kumar Singh, and Soumik Sarkar. 2016. Machine learning for high-throughput stress phenotyping in plants. Trends in plant science 21, 2 (2016), 110--124.

[38]

Vincent Vanhoucke, Andrew Senior, and Mark Z Mao. 2011. Improving the Speed of Neural Networks on CPUs. In Proceeding Deep Learning and Unsupervised Feature Learning NIPS Workshop, Vol. 1. ACM, 4.

[39]

Wei Wen, Chunpeng Wu, Yandan Wang, Yiran Chen, and Hai Li. 2016. Learning structured sparsity in deep neural networks. In Advances in neural information processing systems. 2074--2082.

Digital Library

[40]

Jiecao Yu, Andrew Lukefahr, David Palframan, Ganesh Dasika, Reetuparna Das, and Scott Mahlke. 2017. Scalpel: Customizing DNN Pruning to the Underlying Hardware Parallelism. In 44th International Symposium on Computer Architecture (ISCA). IEEE, 548--560.

Digital Library

Cited By

Siwach GLi C(2024)Unveiling the Potential of Natural Language Processing in Collaborative Robots (Cobots): A Comprehensive Survey2024 IEEE International Conference on Consumer Electronics (ICCE)10.1109/ICCE59016.2024.10444393(1-6)Online publication date: 6-Jan-2024
https://doi.org/10.1109/ICCE59016.2024.10444393
Liu WYu BGan YLiu QTang JLiu SZhu Y(2021)Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic LocalizationMICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3466752.3480077(479-493)Online publication date: 18-Oct-2021
https://dl.acm.org/doi/10.1145/3466752.3480077
Gan YBo YTian BXu LHu WLiu SLiu QZhang YTang JZhu Y(2021)Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines Industry Track Paper2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA51647.2021.00074(827-840)Online publication date: Feb-2021
https://doi.org/10.1109/HPCA51647.2021.00074
Show More Cited By

Index Terms

Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices
1. Computer systems organization
  1. Embedded and cyber-physical systems
2. Computing methodologies
  1. Distributed computing methodologies
  2. Machine learning

Recommendations

Adaptive parallel execution of deep neural networks on heterogeneous edge devices
SEC '19: Proceedings of the 4th ACM/IEEE Symposium on Edge Computing

New applications such as smart homes, smart cities, and autonomous vehicles are driving an increased interest in deploying machine learning on edge devices. Unfortunately, deploying deep neural networks (DNNs) on resource-constrained devices presents ...
Edge-preserving image denoising using a deep convolutional neural network
Highlights
- This paper makes use of a deep CNN for image denoising.
- The network is trained ...
Abstract
This paper introduces a novel denoising approach making use of a deep convolutional neural network to preserve image edges. The network is trained by using the edge map obtained from the well-known Canny algorithm and aims at ...
Distributing deep learning inference on edge devices
CoNEXT '20: Proceedings of the 16th International Conference on emerging Networking EXperiments and Technologies

Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs) are widely used in IoT related applications. However, inferencing pre-trained large DNNs and CNNs consumes a significant amount of time, memory and computational resources. This makes ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

PEARC '19: Practice and Experience in Advanced Research Computing 2019: Rise of the Machines (learning)

July 2019

775 pages

ISBN:9781450372275

DOI:10.1145/3332186

General Chair:
Tom Furlani
Roswell Park Comprehensive Cancer Center

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 July 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Science Foundation

Conference

PEARC '19

PEARC '19: Practice and Experience in Advanced Research Computing

July 28 - August 1, 2019

IL, Chicago, USA

Acceptance Rates

Overall Acceptance Rate 133 of 202 submissions, 66%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
412
Total Downloads

Downloads (Last 12 months)77
Downloads (Last 6 weeks)11

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Siwach GLi C(2024)Unveiling the Potential of Natural Language Processing in Collaborative Robots (Cobots): A Comprehensive Survey2024 IEEE International Conference on Consumer Electronics (ICCE)10.1109/ICCE59016.2024.10444393(1-6)Online publication date: 6-Jan-2024
https://doi.org/10.1109/ICCE59016.2024.10444393
Liu WYu BGan YLiu QTang JLiu SZhu Y(2021)Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic LocalizationMICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3466752.3480077(479-493)Online publication date: 18-Oct-2021
https://dl.acm.org/doi/10.1145/3466752.3480077
Gan YBo YTian BXu LHu WLiu SLiu QZhang YTang JZhu Y(2021)Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines Industry Track Paper2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA51647.2021.00074(827-840)Online publication date: Feb-2021
https://doi.org/10.1109/HPCA51647.2021.00074
Hadidi RCao JRyoo MKim H(2020)Toward Collaborative Inferencing of Deep Neural Networks on Internet-of-Things DevicesIEEE Internet of Things Journal10.1109/JIOT.2020.29720007:6(4950-4960)Online publication date: Jun-2020
https://doi.org/10.1109/JIOT.2020.2972000
Abbas MNarayan JBanerjee SDwivedy S(2020)AlexNet based Real-Time Detection and Segregation of Household Objects using Scorbot2020 4th International Conference on Computational Intelligence and Networks (CINE)10.1109/CINE48825.2020.234392(1-6)Online publication date: Feb-2020
https://doi.org/10.1109/CINE48825.2020.234392
Hadidi RCao JXie YAsgari BKrishna TKim H(2019)Characterizing the Deployment of Deep Neural Networks on Commercial Edge Devices2019 IEEE International Symposium on Workload Characterization (IISWC)10.1109/IISWC47752.2019.9041955(35-48)Online publication date: Nov-2019
https://doi.org/10.1109/IISWC47752.2019.9041955

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten