Research article | Open access

DeepFormableTag: end-to-end generation and recognition of deformable fiducial markers

Published: 19 July 2021

Abstract

Fiducial markers have been broadly used to identify objects or embed messages that can be detected by a camera. Existing detection methods primarily assume that markers are printed on ideally planar surfaces. The size of a message or identification code is limited by the spatial resolution of the binary patterns in a marker, and markers often fail to be recognized due to imaging artifacts such as optical/perspective distortion and motion blur. To overcome these limitations, we propose a novel deformable fiducial marker system that consists of three main parts. First, a fiducial marker generator creates a set of free-form color patterns that encode a significantly larger amount of information in unique visual codes. Second, a differentiable image simulator renders a training dataset of photorealistic scene images containing the deformed markers; the rendered images include realistic shading with specular reflection, optical distortion, defocus and motion blur, color alteration, imaging noise, and shape deformation of the markers. Lastly, a trained marker detector seeks regions of interest and recognizes multiple marker patterns simultaneously via an inverse deformation transformation. The marker generator and detector networks are jointly optimized through the differentiable photorealistic renderer in an end-to-end manner, allowing us to robustly recognize a wide range of deformable markers with high accuracy. Our deformable marker system successfully decodes 36-bit messages at ~29 fps under severe shape deformation, and results validate that it significantly outperforms traditional and data-driven marker methods.
Our learning-based marker system opens up interesting new applications of fiducial markers, including cost-effective motion capture of the human body, active 3D scanning using arrays of our fiducial markers as structured light patterns, and robust augmented reality rendering of virtual objects on dynamic surfaces.
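At a high level, the abstract describes a three-stage pipeline (marker generator, differentiable imaging simulator, detector/decoder) trained end-to-end. The toy sketch below illustrates only that structure, not the paper's method: the block-pattern encoding, the gain-plus-noise imaging model, and all function names are our illustrative assumptions, and nothing here is learned or differentiable.

```python
import numpy as np

rng = np.random.default_rng(0)

def generate_marker(bits, size=6):
    """Map a 36-bit message to a color-free toy pattern.
    (The paper instead learns free-form color patterns.)"""
    pattern = np.repeat(bits, size * size // len(bits)).astype(float)
    return pattern.reshape(size, size)

def simulate_imaging(marker, noise=0.05):
    """Stand-in for the differentiable renderer: apply a random
    shading-like gain and sensor noise (no deformation or blur)."""
    gain = 0.8 + 0.4 * rng.random()
    return gain * marker + noise * rng.standard_normal(marker.shape)

def decode(image, n_bits=36):
    """Stand-in detector: threshold per-block averages back to bits."""
    blocks = image.flatten().reshape(n_bits, -1).mean(axis=1)
    return (blocks > blocks.mean()).astype(int)

bits = rng.integers(0, 2, 36)
recovered = decode(simulate_imaging(generate_marker(bits)))
accuracy = (bits == recovered).mean()
print(accuracy)
```

In the actual system the encode/render/decode steps are all differentiable, so the decoding loss backpropagates through the simulator into the generator; this toy version only shows where those three stages sit relative to each other.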

Supplementary Material

  • VTT File: 3450626.3459762.vtt
  • ZIP File: a67-yaldiz.zip
  • MP4 File: a67-yaldiz.mp4
  • MP4 File: 3450626.3459762.mp4 (presentation)



Published In

ACM Transactions on Graphics, Volume 40, Issue 4 (August 2021), 2170 pages.
ISSN: 0730-0301 | EISSN: 1557-7368 | DOI: 10.1145/3450626
This work is licensed under a Creative Commons Attribution 4.0 International License.

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. deep learning
  2. fiducial marker system
  3. object detection
  4. tracking


Funding Sources

  • Samsung Research Funding Center of Samsung Electronics
  • Korea NRF grant
  • MSRA
  • MSIT/IITP of Korea

Article Metrics

  • Downloads (last 12 months): 171
  • Downloads (last 6 weeks): 27
Reflects downloads up to 01 Mar 2025


Cited By

  • CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants. IEEE Transactions on Visualization and Computer Graphics 30(12), 7486-7499 (Dec 2024). DOI: 10.1109/TVCG.2024.3350901
  • YoloTag: Vision-based Robust UAV Navigation with Fiducial Markers. 2024 33rd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 311-316 (Aug 2024). DOI: 10.1109/RO-MAN60168.2024.10731319
  • Uncovering the Metaverse within Everyday Environments: A Coarse-to-Fine Approach. 2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC), 499-509 (Jul 2024). DOI: 10.1109/COMPSAC61105.2024.00074
  • Fiducial Objects: Custom Design and Evaluation. Sensors 23(24), 9649 (Dec 2023). DOI: 10.3390/s23249649
  • Soft Tissue Monitoring of the Surgical Field: Detection and Tracking of Breast Surface Deformations. IEEE Transactions on Biomedical Engineering 70(7), 2002-2012 (Jul 2023). DOI: 10.1109/TBME.2022.3233909
  • Neural Lens Modeling. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8435-8445 (Jun 2023). DOI: 10.1109/CVPR52729.2023.00815
  • Multiple Projector Camera Calibration by Fiducial Marker Detection. IEEE Access 11, 78945-78955 (2023). DOI: 10.1109/ACCESS.2023.3299857
  • NeuralMarker. ACM Transactions on Graphics 41(6), 1-10 (Nov 2022). DOI: 10.1145/3550454.3555468
  • InfraredTags: Embedding Invisible AR Markers and Barcodes Using Low-Cost, Infrared-Based 3D Printing and Imaging Tools. Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 1-12 (Apr 2022). DOI: 10.1145/3491102.3501951
  • Connecting Everyday Objects with the Metaverse: A Unified Recognition Framework. 2022 IEEE 46th Annual Computers, Software, and Applications Conference (COMPSAC), 401-406 (Jun 2022). DOI: 10.1109/COMPSAC54236.2022.00063
