[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3319499.3328231acmconferencesArticle/Chapter ViewAbstractPublication PageseicsConference Proceedingsconference-collections
poster

A system for real-time interactive analysis of deep learning training

Published: 18 June 2019 Publication History

Abstract

Performing diagnosis or exploratory analysis during the training of deep learning models is challenging but often necessary for making a sequence of decisions guided by the incremental observations. Currently available systems for this purpose are limited to monitoring only the logged data that must be specified before the training process starts. Each time a new information is desired, a cycle of stop-change-restart is required in the training process. These limitations make interactive exploration and diagnosis tasks difficult, imposing long tedious iterations during the model development. We present a new system that enables users to perform interactive queries on live processes generating real-time information that can be rendered in multiple formats on multiple surfaces in the form of several desired visualizations simultaneously. To achieve this, we model various exploratory inspection and diagnostic tasks for deep learning training processes as specifications for streams using a map-reduce paradigm with which many data scientists are already familiar. Our design achieves generality and extensibility by defining composable primitives which is a fundamentally different approach than is used by currently available systems.

References

[1]
{n. d.}. Visualize your deep learning training and data flawlessly. https://github.com/PaddlePaddle/VisualDL
[2]
Foto N. Afrati, Vinayak Borkar, Michael Carey, Neoklis Polyzotis, and Jeffrey D. Ullman. 2011. Map-reduce Extensions and Recursive Queries. In Proceedings of the 14th International Conference on Extending Database Technology (EDBT/ICDT '11). ACM, New York, NY, USA, 1--8.
[3]
Shivnath Babu and Jennifer Widom. 2001. Continuous Queries over Data Streams. SIGMOD Rec. 30, 3 (Sept. 2001), 109--120.
[4]
James S. Bergstra, Rémi Bardenet, Yoshua Bengio, and Balázs Kégl. 2011. Algorithms for Hyper-Parameter Optimization. In Advances in Neural Information Processing Systems 24, J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger (Eds.). Curran Associates, Inc., 2546--2554. http://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization.pdf
[5]
Bryan Catanzaro, Narayanan Sundaram, and Kurt Keutzer. 2008. A map reduce framework for programming graphics processors.
[6]
Jaegul Choo and Shixia Liu. 2018. Visual Analytics for Explainable Deep Learning. IEEE Computer Graphics and Applications 38, 4 (jul 2018), 84--92.
[7]
Jeffrey Dean and Sanjay Ghemawat. 2008. MapReduce. Commun. ACM 51, 1 (jan 2008), 107.
[8]
Chandrajit Bajaj (Ed.), Chandrajit Bajaj, and C Fl John Wiley. 1998. Data Visualization Techniques.
[9]
Jaliya Ekanayake, Shrideep Pallickara, and Geoffrey Fox. 2008. MapReduce for Data Intensive Scientific Analyses. In 2008 IEEE Fourth International Conference on eScience. IEEE.
[10]
B. Ellis. 2014. Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data. Wiley. https://books.google.com/books?id=DFnOAwAAQBAJ
[11]
Stephen Few. 2006. Information Dashboard Design: The Effective Visual Communication of Data. O'Reilly Media, Inc.
[12]
Alan F. Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan M. Narayanamurthy, Christopher Olston, Benjamin Reed, Santhosh Srinivasan, and Utkarsh Srivastava. 2009. Building a High-level Dataflow System on Top of Map-Reduce: The Pig Experience. Proc. VLDB Endow. 2, 2 (Aug. 2009), 1414--1425.
[13]
Trevor Hastie and Robert Tibshirani. 1986. Generalized Additive Models. Statist. Sci. 1, 3 (aug 1986), 297--310.
[14]
Shixia Liu, Xiting Wang, Mengchen Liu, and Jun Zhu. 2017. Towards better analysis of machine learning models: A visual analytics perspective. Visual Informatics 1, 1 (mar 2017), 48--56.
[15]
Ivan Logre and Anne-Marie Déry-Pinna. 2018. MDE in Support of Visualization Systems Design: A Multi-Staged Approach Tailored for Multiple Roles. Proc. ACM Hum.-Comput. Interact. 2, EICS, Article 14 (June 2018), 17 pages.
[16]
Constantinos Mourlas. 2009. Intelligent user interfaces : adaptation and personalization systems and technologies. Information Science Reference, Hershey, PA.
[17]
Kawa Nazemi. 2016. Adaptive Semantics Visualization. Springer International Publishing.
[18]
Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker. 2009. A Comparison of Approaches to Large-scale Data Analysis. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD '09). ACM, New York, NY, USA, 165--178.
[19]
B. Plale and K. Schwan. 2003. Dynamic querying of streaming data with the dQUOB system. IEEE Transactions on Parallel and Distributed Systems 14, 4 (apr 2003), 422--432.
[20]
Jonathan C. Roberts. 2007. State of the Art: Coordinated & Multiple Views in Exploratory Visualization. In Fifth International Conference on Coordinated and Multiple Views in Exploratory Visualization (CMV 2007). IEEE.
[21]
Guy L. Steele. 1995. Parallelism in Lisp. ACM SIGPLAN Lisp Pointers VIII, 2 (may 1995), 1--14.
[22]
K. Stockinger, J. Shalf, Kesheng Wu, and E.W. Bethel. {n. d.}. Query-Driven Visualization of Large Data Sets. In VIS 05. IEEE Visualization, 2005. IEEE.
[23]
Nako Sung, Minkyu Kim, Hyunwoo Jo, Youngil Yang, Jingwoong Kim, Leonard Lausen, Youngkwan Kim, Gayoung Lee, Dong-Hyun Kwak, Jung-Woo Ha, and Sunghun Kim. 2017. NSML: A Machine Learning Platform That Enables You to Focus on Your Models. CoRR abs/1712.05902 (2017). arXiv:1712.05902 http://arxiv.org/abs/1712.05902
[24]
Jonas Traub, Nikolaas Steenbergen, Philipp Grulich, Tilmann Rabl, and Volker Markl. 2017. IÂCš: Interactive Real-Time Visualization for Streaming Data.
[25]
Kanit Wongsuphasawat, Daniel Smilkov, James Wexler, Jimbo Wilson, Dandelion Mane, Doug Fritz, Dilip Krishnan, Fernanda B. Viegas, and Martin Wattenberg. 2018. Visualizing Dataflow Graphs of Deep Learning Models in TensorFlow. IEEE Transactions on Visualization and Computer Graphics 24, 1 (jan 2018), 1--12.
[26]
Yang You, Zhao Zhang, Cho-Jui Hsieh, James Demmel, and Kurt Keutzer. 2018. ImageNet Training in Minutes. In Proceedings of the 47th International Conference on Parallel Processing (ICPP 2018). ACM, New York, NY, USA, Article 1, 10 pages.

Cited By

View all
  • (2022)Boosting Domain-Specific Debug Through Inter-frame Compression2022 International Conference on Field-Programmable Technology (ICFPT)10.1109/ICFPT56656.2022.9974385(1-10)Online publication date: 5-Dec-2022
  • (2021)Marcelle: Composing Interactive Machine Learning Workflows and InterfacesThe 34th Annual ACM Symposium on User Interface Software and Technology10.1145/3472749.3474734(39-53)Online publication date: 10-Oct-2021
  • (2021)UMLAUT: Debugging Deep Learning Programs using Program Structure and Model BehaviorProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445538(1-16)Online publication date: 6-May-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
EICS '19: Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems
June 2019
141 pages
ISBN:9781450367455
DOI:10.1145/3319499
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 June 2019

Check for updates

Author Tags

  1. debugging
  2. deep learning
  3. diagnostics
  4. exploratory inspection
  5. map-reduce
  6. monitoring
  7. streams
  8. visualization

Qualifiers

  • Poster

Conference

EICS '19
Sponsor:

Acceptance Rates

Overall Acceptance Rate 73 of 299 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Boosting Domain-Specific Debug Through Inter-frame Compression2022 International Conference on Field-Programmable Technology (ICFPT)10.1109/ICFPT56656.2022.9974385(1-10)Online publication date: 5-Dec-2022
  • (2021)Marcelle: Composing Interactive Machine Learning Workflows and InterfacesThe 34th Annual ACM Symposium on User Interface Software and Technology10.1145/3472749.3474734(39-53)Online publication date: 10-Oct-2021
  • (2021)UMLAUT: Debugging Deep Learning Programs using Program Structure and Model BehaviorProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445538(1-16)Online publication date: 6-May-2021
  • (2021)Flexible Instrumentation for Live On-Chip Debug of Machine Learning Training on FPGAs2021 IEEE 29th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)10.1109/FCCM51124.2021.00011(20-28)Online publication date: May-2021
  • (2020)SCRAM: Simple Checks for Realtime Analysis of Model Training for Non-Expert ML ProgrammersExtended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems10.1145/3334480.3382879(1-10)Online publication date: 25-Apr-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media