[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3358528.3358551acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbdtConference Proceedingsconference-collections
research-article

Optimization Methods for Computing System in Mobile CPS

Published: 28 August 2019 Publication History

Abstract

In Mobile Cyber-physical system (CPS) is a popular research field in recent years. It aims to control and monitor mobile devices in complex and real-time scenes, and provide people with convenience and economy by using intelligent applications. The scenes in mobile CPS have close relationships with everyone's life and it has pervasive effect on people's life anytime and anywhere. In the paper, we focus on the intelligent transportation systems (ITS) in mobile CPS. Specifically, we analyse the bottlenecks in automated and autonomous vehicles and find the on-car computer is the key component for low latency and low power self-driving vehicles. We show detailed research developments on the most advanced CPU-GPU platform including four aspects, mainly putting focus on the methods to reduce latency and energy in the system. The four aspects cover reducing data transferring overhead, reducing power consumption with power-gating, optimizing warp scheduling scheme and optimizing cache performance in coprocessors. Finally, we give the prospects and development trends in the computing platform of self-driving on-car system.

References

[1]
S. K. Khaitan and J. D. McCalley, "Design techniques and applications of cyberphysical systems: A survey," IEEE Systems Journal, vol. 9, no. 2, pp. 350--365, 2015.
[2]
C.-R. Rad, O. Hancu, I.-A. Takacs, and G. Olteanu, "Smart monitoring of potato crop: a cyber-physical system architecture model in the field of precision agriculture," Agriculture and Agricultural Science Procedia, vol. 6, pp. 73--79, 2015.
[3]
O. Hancu, V. Maties, R. Balan, and S. Stan, "Mechatronic approach for design and control of a hydraulic 3-dof parallel robot," Annals of DAAAM & Proceedings, pp. 321--323, 2007.
[4]
E. A. Lee and S. A. Seshia, Introduction to embedded systems: A cyber-physical systems approach. Lee & Seshia, 2011.
[5]
S. C. Suh, U. J. Tanik, J. N. Carbone, and A. Eroglu, "Applied cyber-physical systems," Springer, vol. 2, p. 27, 2014.
[6]
"https://en.wikipedia.org/wiki/intelligent transportation system."
[7]
K.-D. Kim and P. R. Kumar, "Cyber--physical systems: A perspective at the centennial," Proceedings of the IEEE, vol. 100, no. Special Centennial Issue, pp. 1287--1308, 2012.
[8]
M. Garc'ia-Valls, T. Cucinotta, and C. Lu, "Challenges in real-time virtualization and predictable cloud computing," Journal of Systems Architecture, vol. 60, no. 9, pp. 726--740, 2014.
[9]
"http://www.nvidia.com/object/drive-px.html."
[10]
"http://nvidianews.nvidia.com/news/nvidia-boosts-iq-of-self-driving-cars-with-world-s-first-in-car-artificial-intelligence-supercomputer."
[11]
"http://www.nvidia.com/object/pascal-architecture-whitepaper.html."
[12]
"Nvidia, "nvidia cuda c programming guide v7.5," 2015. available: http://developer.nvidia.com/nvidia-gpu-computing-documentation."
[13]
B. Ren, N. Ravi, Y. Yang, M. Feng, G. Agrawal, and S. Chakradhar, Automatic and Efficient Data Host-Device Communication for Many-Core Coprocessors. Cham: Springer International Publishing, 2016, pp. 173--190. [Online]. Available: http://dx.doi.org/10.1007/978-3-319-29778-1 11
[14]
Y. Fujii, T. Azumi, N. Nishio, S. Kato, and M. Edahiro, "Data transfer matters for gpu computing," in International Conference on Parallel and Distributed Systems, 2013, pp. 275--282.
[15]
S. Kato, J. Aumiller, and S. Brandt, "Zero-copy i/o processing for low-latency gpu computing," in ACM/IEEE International Conferenceon Cyber-Physical Systems, 2013, pp. 170--178.
[16]
C. J. Rossbach, J. Currey, M. Silberstein, B. Ray, and E. Witchel, "Ptask:operating system abstractions to manage gpus as compute devices," in ACM Symposium on Operating Systems Principles 2011, SOSP 2011, Cascais, Portugal, October, 2011, pp. 233--248.
[17]
T. B. Jablin, P. Prabhu, J. A. Jablin, N. P. Johnson, S. R. Beard, and D. I. August, "Automatic cpu-gpu communication management and optimization." Acm Sigplan Notices, vol. 46, no. 6, pp. 142--151, 2011.
[18]
C. Basaran and K. D. Kang, "Supporting preemptive task executions and memory copies in gpgpus," in Euromicro Conference on Real-Time Systems, 2012, pp. 287--296.
[19]
S. Kato, K. Lakshmanan, A. Kumar, and M. Kelkar, "Rgem: A responsive gpgpu execution model for runtime engines," in IEEE Real-Time Systems Symposium, RTSS 2011, Vienna, Austria, November 29 - December, 2011, pp. 57--66.
[20]
Q. Xu and M. Annavaram, "Pats: pattern aware scheduling and power gating for gpgpus," in International Conference on Parallel Architectures and Compilation, 2014, pp. 225--236.
[21]
M. Namaki-Shoushtari, A. Rahimi, N. Dutt, and P. Gupta, "Argo: Aging-aware gpgpu register file allocation," in International Conference on Hardware/software Codesign and System Synthesis, 2013, pp. 1--9.
[22]
H. Aghilinasab, M. Sadrosadati, M. H. Samavatian, and H. Sarbazi-Azad, "Reducing power consumption of gpgpus through instruction reordering," in International Symposium, 2016, pp. 356--361.
[23]
Y. Wang, S. Roy, and N. Ranganathan, "Run-time power-gating in caches of gpus for leakage energy savings," in Design, Automation & Test in Europe Conference & Exhibition, 2012, pp. 300--303.
[24]
M. Abdel-Majeed, D. Wong, and M. Annavaram, "Warped gates:gating aware scheduling and power gating for gpgpus," in Ieee/acm International Symposium on Microarchitecture, 2013, pp. 111--122.
[25]
M. Gebhart, D. R. Johnson, D. Tarjan, S. W. Keckler, W. J. Dally, E. Lindholm, and K. Skadron, "Energy-efficient mechanisms for managing thread context in throughput processors," in ACM SIGARCH Computer Architecture News, vol. 39, no. 3. ACM, 2011, pp. 235--246.
[26]
V. Narasiman, M. Shebanow, C. J. Lee, R. Miftakhutdinov, O. Mutlu, and Y. N. Patt, "Improving gpu performance via large warps and two-level warp scheduling," in Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture. ACM, 2011, pp. 308--317.
[27]
S. Y. Lee and C. J. Wu, "Caws: criticality-aware warp scheduling for gpgpu workloads," in International Conference on Parallel Architectures and Compilation, 2014, pp. 175--186.
[28]
Y. Huangfu and W. Zhang, "Warp-based load/store reordering to improve gpu data cache time predictability and performance," in IEEE International Symposium on Real-Time Distributed Computing, 2016, pp. 166--173.
[29]
N. Brunie, S. Collange, and G. Diamos, "Simultaneous branch and warp interweaving for sustained gpu performance," in International Symposium on Computer Architecture, 2012, pp. 49--60.
[30]
B. Wang, Y. Zhu, and W. Yu, "Oaws: Memory occlusion aware warp scheduling," in International Conference on Parallel Architectures and Compilation, 2016, pp. 45--55.
[31]
W. W. Fung, I. Sham, G. Yuan, and T. M. Aamodt, "Dynamic warp formation and scheduling for efficient gpu control flow," in Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2007, pp. 407--420.
[32]
J. J. K. Park, Y. Park, and S. Mahlke, "Elf: maximizing memory-level parallelism for gpus with coordinated warp and fetch scheduling," in International Conference for High PERFORMANCE Computing, Networking, Storage and Analysis, 2015, p. 18.
[33]
Y. Zhang, Z. Xing, L. Zhou, and C. Zhu, "Locality protected dynamic cache allocation scheme on gpus," in 2016 IEEE Trustcom/BigDataSE/ISPA, Aug 2016, pp. 1524--1530.
[34]
T. G. Rogers, M. O'Connor, and T. M. Aamodt, "Cache-conscious wavefront scheduling," in Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2012, pp. 72--83.
[35]
T. G. Rogers, M. O. Connor, and T. M. Aamodt, "Divergence-aware warp scheduling," in Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture. ACM, 2013, pp. 99--110.
[36]
W. Jia, K. A. Shaw, and M. Martonosi, "Mrpb: Memory request prioritization for massively parallel processors," in High Perfor-mance Computer Architecture (HPCA), 2014 IEEE 20th International Symposium on. IEEE, 2014, pp. 272--283.
[37]
X. Chen, L.-W. Chang, C. I. Rodrigues, J. Lv, Z. Wang, and W.-M.Hwu, "Adaptive cache management for energy-efficient gpu computing," in Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2014, pp. 343--355.
[38]
Y. Tian, S. Puthoor, J. L. Greathouse, and B. M. Beckmann, "Adaptive gpu cache bypassing," pp. 25--35, 2015.
[39]
C. Li, S. L. Song, H. Dai, A. Sidelnik, S. K. S. Hari, and H. Zhou, "Locality-driven dynamic gpu cache bypassing," in The ACM on International Conference on Supercomputing, 2015, pp. 67--77.
[40]
B. Wang, W. Yu, X. H. Sun, and X. Wang, "Dacache: Memory divergence-aware gpu cache management," in ACM on International Conference on Supercomputing, 2015, pp. 89--98.

Cited By

View all
  • (2020)A Real-Time Data Delivery for Mobile Sinks Group on Mobile Cyber-Physical SystemsApplied Sciences10.3390/app1017595010:17(5950)Online publication date: 27-Aug-2020

Index Terms

  1. Optimization Methods for Computing System in Mobile CPS

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICBDT '19: Proceedings of the 2nd International Conference on Big Data Technologies
    August 2019
    382 pages
    ISBN:9781450371926
    DOI:10.1145/3358528
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Shandong Univ.: Shandong University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 August 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. CPS
    2. CPU-GPU platform
    3. Intelligent Transportation Systems

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICBDT2019

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)A Real-Time Data Delivery for Mobile Sinks Group on Mobile Cyber-Physical SystemsApplied Sciences10.3390/app1017595010:17(5950)Online publication date: 27-Aug-2020

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media