research-article

Jumping in at the deep end: how to experiment with machine learning in post-production software

Authors:

Johanna Barbier,

Guillaume Gales,

Sebastian Lutz

DigiPro '19: Proceedings of the 2019 Digital Production Symposium

Article No.: 6, Pages 1 - 5

https://doi.org/10.1145/3329715.3338880

Published: 27 July 2019

Abstract

Recent years have seen an explosion in Machine Learning (ML) research. The challenge now is to transfer these new algorithms into the hands of artists and TDs in visual effects and animation studios, so that they can start experimenting with ML within their existing pipelines. This paper presents some of the current challenges to experimentation with and deployment of ML frameworks in the post-production industry. It introduces our open-source "ML-Server" client/server system as a way to enable rapid prototyping, experimentation and development of ML models in post-production software. Data, code and examples for the system can be found on the GitHub repository page:

https://github.com/TheFoundryVisionmongers/nuke-ML-server
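
The client/server split mentioned in the abstract can be illustrated with a minimal sketch. This is not the ML-Server protocol or code; the message framing, the function names (`send_msg`, `recv_msg`, `serve_one_request`, `infer_remote`, `demo`) and the `toy_model` stand-in are all assumptions made for illustration. The idea shown is only that a host-application plugin sends image data to a separate process, which runs the model and returns the result.

```python
import socket
import struct
import threading
from array import array


def send_msg(sock, payload: bytes) -> None:
    # Prefix each message with its length as a 4-byte big-endian integer.
    sock.sendall(struct.pack(">I", len(payload)) + payload)


def recv_exact(sock, n: int) -> bytes:
    # Read exactly n bytes; recv() may return fewer than requested.
    chunks = []
    while n > 0:
        chunk = sock.recv(n)
        if not chunk:
            raise ConnectionError("socket closed mid-message")
        chunks.append(chunk)
        n -= len(chunk)
    return b"".join(chunks)


def recv_msg(sock) -> bytes:
    # Read the length prefix, then the payload it announces.
    (length,) = struct.unpack(">I", recv_exact(sock, 4))
    return recv_exact(sock, length)


def toy_model(pixels):
    # Hypothetical stand-in for ML inference: invert each pixel value.
    return array("f", (1.0 - p for p in pixels))


def serve_one_request(sock) -> None:
    # Server side: decode the pixels, run the "model", send the result back.
    # Raw float32 bytes are used, so both ends must share the same byte order.
    pixels = array("f")
    pixels.frombytes(recv_msg(sock))
    send_msg(sock, toy_model(pixels).tobytes())


def infer_remote(sock, pixels):
    # Client side: roughly what a host-application plugin would do per frame.
    send_msg(sock, array("f", pixels).tobytes())
    result = array("f")
    result.frombytes(recv_msg(sock))
    return list(result)


def demo():
    # A connected socket pair stands in for a real network connection.
    client_sock, server_sock = socket.socketpair()
    worker = threading.Thread(target=serve_one_request, args=(server_sock,))
    worker.start()
    try:
        return infer_remote(client_sock, [0.0, 0.25, 1.0])
    finally:
        client_sock.close()
        worker.join()
        server_sock.close()
```

Calling `demo()` sends three pixel values through the toy server and returns the inverted values. Keeping the model in a separate process is what would let the ML framework and its dependencies change without rebuilding the host application's plugin.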


Cited By

Pal A, Mitra S, Lakshmi D (2024). Illuminating the Path From Script to Screen Using Lights, Camera, and AI. Transforming Cinema with Artificial Intelligence, pages 97-142. Online publication date: 27-Dec-2024.
https://doi.org/10.4018/979-8-3693-3916-9.ch006

Trottnow J, Greenly W, Shaw C, Hudson S, Helzle V, Vera H, Ring D (2020). SAUCE: Asset Libraries of the Future. Proceedings of the 2020 Digital Production Symposium, pages 1-5. Online publication date: 11-Aug-2020.
https://dl.acm.org/doi/10.1145/3403736.3403941

Index Terms

Jumping in at the deep end: how to experiment with machine learning in post-production software
1. Computing methodologies
  1. Machine learning
2. Software and its engineering
  1. Software creation and management
    1. Software verification and validation
      1. Software prototyping

Published In

DigiPro '19: Proceedings of the 2019 Digital Production Symposium

July 2019

52 pages

ISBN: 9781450367998

DOI: 10.1145/3329715

Conference Chairs: Sandy Kao (DreamWorks Animation), Barbara Balents (BB Consulting)

Program Chairs: Trina Roy (Pixar Animation Studios), Rachel Rose (Industrial Light & Magic)

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

DigiPro '19: The Digital Production Symposium

July 27, 2019

Los Angeles, California

Sponsor: SIGGRAPH
