[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3419016.3431492acmconferencesArticle/Chapter ViewAbstractPublication PagessensysConference Proceedingsconference-collections
research-article

A video dataset of a wooden box assembly process: dataset

Published: 16 November 2020 Publication History

Abstract

This paper presents a video dataset of a 9-step wooden box assembly process including 17 subjects. The main strength of this dataset is the design of a standard and uniform workflow and the use of multiple cameras capturing videos from two different viewpoints. 62 video files were collected with the total size of 20 GB and the total duration of 13 hours. Each of the video is complemented with temporal annotations that indicate the starting and ending timestamps of each work step in the assembly process. We also provide statistical descriptive analyses of the recorded processes. Our dataset can be utilized for developing solutions to human activity recognition, process documentation, and many others that involve human-object interaction.

References

[1]
Jean-Baptiste Alayrac, Piotr Bojanowski, Nishant Agrawal, Josef Sivic, Ivan Laptev, and Simon Lacoste-Julien. 2016. Unsupervised learning from narrated instruction videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4575--4583.
[2]
Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, and Simon Lacoste-Julien. 2017. Joint discovery of object states and manipulation actions. In Proceedings of the IEEE International Conference on Computer Vision. 2127--2136.
[3]
Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, and Xiaogang Wang. 2017. Object detection in videos with tubelet proposal networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 727--735.
[4]
Alina Kuznetsova, Sung Ju Hwang, Bodo Rosenhahn, and Leonid Sigal. 2015. Expanding object detector's horizon: Incremental learning framework for object detection in videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 28--36.
[5]
Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, and Kevin Murphy. 2015. What's cookin'? interpreting cooking videos using text, speech and vision. arXiv preprint arXiv:1503.01558 (2015).
[6]
Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, and Josef Sivic. 2019. HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips. CoRR abs/1906.03327 (2019). arXiv:1906.03327 http://arxiv.org/abs/1906.03327
[7]
Alba Roda-Sales, Margarita Vergara, Joaquín L Sancho-Bru, Verónica Gracia-Ibáñez, and Néstor J Jarque-Bou. 2019. Human hand kinematic data during feeding and cooking tasks. Scientific data 6, 1 (2019), 1--10.
[8]
Ozan Sener, Amir R Zamir, Silvio Savarese, and Ashutosh Saxena. 2015. Unsupervised semantic parsing of video collections. In Proceedings of the IEEE International Conference on Computer Vision. 4480--4488.
[9]
Dongsong Zhang, Lina Zhou, Robert O Briggs, and Jay F Nunamaker Jr. 2006. Instructional video in e-learning: Assessing the impact of interactive video on learning effectiveness. Information & management 43, 1 (2006), 15--27.
[10]
Jiayun Zhang, Petr Byvshev, and Yu Xiao. 2020. A video dataset for wooden box assembly. Funded by Business Finland (1660/31/2018) European Union's Horizon 2020 Research and Innovation Programme (grant number: 777222.
[11]
Luowei Zhou, Chenliang Xu, and Jason J Corso. 2018. Towards automatic learning of procedures from web instructional videos. In Proceedings of Thirty-Second AAAI Conference on Artificial Intelligence.
[12]
Dimitri Zhukov, Jean-Baptiste Alayrac, Ramazan Gokberk Cinbis, David Fouhey, Ivan Laptev, and Josef Sivic. 2019. Cross-task weakly supervised learning from instructional videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3537--3545.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DATA '20: Proceedings of the Third Workshop on Data: Acquisition To Analysis
November 2020
42 pages
ISBN:9781450381369
DOI:10.1145/3419016
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 November 2020

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

  • Business Finland

Conference

Acceptance Rates

Overall Acceptance Rate 74 of 167 submissions, 44%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 128
    Total Downloads
  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)2
Reflects downloads up to 11 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media