[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3332186.3333049acmotherconferencesArticle/Chapter ViewAbstractPublication PagespearcConference Proceedingsconference-collections
research-article
Public Access

Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices

Published: 28 July 2019 Publication History

Abstract

Edge devices and robots have access to an abundance of raw data that needs to be processed on the edge. Deep neural networks (DNNs) can help these devices understand and learn from this complex data; however, executing DNNs while achieving high performance is a challenge for edge devices. This is because of the high computational demands of DNN execution in real-time. This paper describes and implements a method to enable edge devices to execute DNNs collaboratively. This is possible and useful because in many environments, several on-edge devices are already integrated in their surroundings, but are usually idle and can provide additional computing power to a distributed system. We implement this method on two iRobots, each of which has been equipped with a Raspberry Pi 3. Then, we characterize the execution performance, communication latency, energy consumption, and thermal behavior of our system while it is executing AlexNet.

References

[1]
Martín Abadi et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https://www.tensorflow.org/ Software available from tensorflow.org.
[2]
Bahar Asgari, Ramyad Hadidi, Hyesoon Kim, and Sudhakar Yalamanchili. 2019. LODESTAR: Creating Locally-Dense CNNs for Efficient Inference on Systolic Arrays. ACM/IEE Design Automation Conference (DAC) - Late Breaking Results, Las Vegas, NV (2019).
[3]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In ICLR'15). ACM.
[4]
F Biscotti, J Skorupa, R Contu, et al. 2014. The Impact of the Internet of Things on Data Centers. Gartner Research 18 (2014).
[5]
François Chollet et al. 2015. Keras. https://github.com/fchollet/keras.
[6]
Ronan Collobert and Jason Weston. 2008. A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning. In ICML'8. ACM, 160--167.
[7]
Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. 2014. Training Deep Neural Networks with Low Precision Multiplication. arXiv preprint arXiv:1412.7024 (2014).
[8]
Raspberry PI Foundation. 2017. Raspberry Pi 3B+. www.raspber-rypi.org/products/raspberry-pi-3-model-b/. {Online; accessed 04/01/19}.
[9]
Alessandro Giusti, Jérôme Guzzi, Dan C Cireşan, Fang-Lin He, Juan P Rodríguez, Flavio Fontana, Matthias Faessler, Christian Forster, Jürgen Schmidhuber, Gianni Di Caro, et al. 2016. A machine learning approach to visual perception of forest trails for mobile robots. IEEE Robotics and Automation Letters 1, 2 (2016), 661--667.
[10]
Yunchao Gong, Liu Liu, Ming Yang, and Lubomir Bourdev. 2014. Compressing Deep Convolutional Networks Using Vector Quantization. arXiv preprint arXiv:1412.6115 (2014).
[11]
Binita Gupta. 2015. Discovering cloud-based services for iot devices in an iot network associated with a user. US Patent App. 14/550,595.
[12]
Ramyad Hadidi, Jiashen Cao, Michael Ryoo, and Hyesoon Kim. 2018. Collaborative Execution of Deep Neural Networks on Internet of Things Device. arXiv preprint (2018).
[13]
Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, and Hyesoon Kim. 2018. Distributed Perception by Collaborative Robots. IEEE Robotics and Automation Letters (RA-L), and International Conference on Intelligent Robots and Systems 2018 (IROS) 3, 4 (Oct 2018), 3709--3716.
[14]
Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, and Hyesoon Kim. 2019. Robustly Executing DNNs in IoT Systems Using Coded Distributed Computing. ACM/IEE Design Automation Conference (DAC)-Late Breaking Results, Las Vegas, NV (2019).
[15]
Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael Ryoo, and Hyesoon Kim. 2018. Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices. arXiv preprint arXiv:1802.02138 (2018).
[16]
Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael S. Ryoo, and Hyesoon Kim. 2018. Real-Time Image Recognition Using Collaborative IoT Devices. In Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament on Co-designing Pareto-efficient Deep Learning (ReQuEST '18). ACM, New York, NY, USA, Article 4.
[17]
Ramyad Hadidi, Jiashen Cao, Fei Wu, Tushar Kirshna, Michael S. Ryoo, and Hyesoon Kim. 2019. An Edge-Centric Scalable Intelligent Framework To Collaboratively Execute DNN. Demo for SysML Conference, Palo Alto, CA (2019).
[18]
Song Han, Huizi Mao, and William J Dally. 2016. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding. In 4th International Conference on Learning Representations. ACM.
[19]
iRobot Inc. 2019. iRobot Create 2 Open Interface. www.cdn-shop.adafruit.com/datasheetscreate_2_Open_Interface_Spec.pdf. {Online; accessed 15/03/19}.
[20]
iRobot Inc. 2019. iRobot Create 2 Programmable Robot. www.irobot.com/about-irobot/stem/create-2. {Online; accessed 15/03/19}.
[21]
Rafiullah Khan, Sarmad Ullah Khan, Rifaqat Zaheer, and Shahid Khan. 2012. Future Internet: The Internet of Things Architecture, Possible Applications and Key Challenges. In FIT'12. IEEE, 257--260.
[22]
Urs Köster, Tristan Webb, Xin Wang, Marcel Nassar, Arjun K Bansal, William Constable, Oguz Elibol, Scott Gray, Stewart Hall, Luke Hornof, et al. 2017. Flexpoint: An adaptive numerical format for efficient training of deep neural networks. In Advances in Neural Information Processing Systems (NIPS). 1742--1752.
[23]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet Classification With Deep Convolutional Neural Networks. In 26th Annual Conference on Neural Information Processing Systems (NIPS). ACM, 1097--1105.
[24]
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. nature 521, 7553 (2015), 436.
[25]
In Lee and Kyoochun Lee. 2015. The Internet of Things (IoT): Applications, Investments, and Challenges for Enterprises. Business Horizons 58, 4 (2015), 431--440.
[26]
Hui Li and Xiaojiang Xing. 2015. Internet of things service architecture and method for realizing internet of things service. US Patent 8,984,113.
[27]
Shancang Li, Li Da Xu, and Shanshan Zhao. 2015. The internet of things: a survey. Information Systems Frontiers 17, 2 (2015), 243--259.
[28]
Ji Lin, Yongming Rao, Jiwen Lu, and Jie Zhou. 2017. Run-time neural pruning. In Advances in Neural Information Processing Systems (NIPS). 2181--2191.
[29]
Huimin Lu, Yujie Li, Shenglin Mu, Dong Wang, Hyoungseop Kim, and Seiichi Serikawa. 2018. Motor anomaly detection for unmanned aerial vehicles using reinforcement learning. IEEE internet of things journal 5, 4 (2018), 2315--2322.
[30]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529.
[31]
Mark Pfeiffer, Michael Schaeuble, Juan Nieto, Roland Siegwart, and Cesar Cadena. 2017. From perception to decision: A data-driven approach to end-to-end motion planning for autonomous ground robots. In 2017 ieee international conference on robotics and automation (icra). IEEE, 1527--1533.
[32]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 779--788.
[33]
Omer Berat Sezer, Erdogan Dogdu, and Ahmet Murat Ozbayoglu. 2018. Context-aware computing, learning, and big data in internet of things: a survey. IEEE Internet of Things Journal 5, 1 (2018), 1--27.
[34]
David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484.
[35]
Karen Simonyan and Andrew Zisserman. 2014. Two-Stream Convolutional Networks for Action Recognition in Videos. In NIPS'14. ACM, 568--576.
[36]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations. ACM.
[37]
Arti Singh, Baskar Ganapathysubramanian, Asheesh Kumar Singh, and Soumik Sarkar. 2016. Machine learning for high-throughput stress phenotyping in plants. Trends in plant science 21, 2 (2016), 110--124.
[38]
Vincent Vanhoucke, Andrew Senior, and Mark Z Mao. 2011. Improving the Speed of Neural Networks on CPUs. In Proceeding Deep Learning and Unsupervised Feature Learning NIPS Workshop, Vol. 1. ACM, 4.
[39]
Wei Wen, Chunpeng Wu, Yandan Wang, Yiran Chen, and Hai Li. 2016. Learning structured sparsity in deep neural networks. In Advances in neural information processing systems. 2074--2082.
[40]
Jiecao Yu, Andrew Lukefahr, David Palframan, Ganesh Dasika, Reetuparna Das, and Scott Mahlke. 2017. Scalpel: Customizing DNN Pruning to the Underlying Hardware Parallelism. In 44th International Symposium on Computer Architecture (ISCA). IEEE, 548--560.

Cited By

View all
  • (2024)Unveiling the Potential of Natural Language Processing in Collaborative Robots (Cobots): A Comprehensive Survey2024 IEEE International Conference on Consumer Electronics (ICCE)10.1109/ICCE59016.2024.10444393(1-6)Online publication date: 6-Jan-2024
  • (2021)Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic LocalizationMICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3466752.3480077(479-493)Online publication date: 18-Oct-2021
  • (2021)Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines Industry Track Paper2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA51647.2021.00074(827-840)Online publication date: Feb-2021
  • Show More Cited By

Index Terms

  1. Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        PEARC '19: Practice and Experience in Advanced Research Computing 2019: Rise of the Machines (learning)
        July 2019
        775 pages
        ISBN:9781450372275
        DOI:10.1145/3332186
        • General Chair:
        • Tom Furlani
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 28 July 2019

        Permissions

        Request permissions for this article.

        Check for updates

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Funding Sources

        Conference

        PEARC '19

        Acceptance Rates

        Overall Acceptance Rate 133 of 202 submissions, 66%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)69
        • Downloads (Last 6 weeks)9
        Reflects downloads up to 11 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Unveiling the Potential of Natural Language Processing in Collaborative Robots (Cobots): A Comprehensive Survey2024 IEEE International Conference on Consumer Electronics (ICCE)10.1109/ICCE59016.2024.10444393(1-6)Online publication date: 6-Jan-2024
        • (2021)Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic LocalizationMICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3466752.3480077(479-493)Online publication date: 18-Oct-2021
        • (2021)Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines Industry Track Paper2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA51647.2021.00074(827-840)Online publication date: Feb-2021
        • (2020)Toward Collaborative Inferencing of Deep Neural Networks on Internet-of-Things DevicesIEEE Internet of Things Journal10.1109/JIOT.2020.29720007:6(4950-4960)Online publication date: Jun-2020
        • (2020)AlexNet based Real-Time Detection and Segregation of Household Objects using Scorbot2020 4th International Conference on Computational Intelligence and Networks (CINE)10.1109/CINE48825.2020.234392(1-6)Online publication date: Feb-2020
        • (2019)Characterizing the Deployment of Deep Neural Networks on Commercial Edge Devices2019 IEEE International Symposium on Workload Characterization (IISWC)10.1109/IISWC47752.2019.9041955(35-48)Online publication date: Nov-2019

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media