Learning Omnidirectional Flow in 360° Video via Siamese Representation

  • Conference paper
  • First Online:
Computer Vision – ECCV 2022 (ECCV 2022)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13668)

Abstract

Optical flow estimation in omnidirectional videos faces two significant issues: the lack of benchmark datasets and the challenge of adapting perspective video-based methods to accommodate the omnidirectional nature. This paper proposes the first perceptually natural-synthetic omnidirectional benchmark dataset with a 360° field of view, FLOW360, with 40 different videos and 4,000 video frames. We conduct comprehensive characteristic analysis and comparisons between our dataset and existing optical flow datasets, which demonstrate its perceptual realism, uniqueness, and diversity. To accommodate the omnidirectional nature, we present a novel Siamese representation Learning framework for Omnidirectional Flow (SLOF). We train our network in a contrastive manner with a hybrid loss function that combines contrastive loss and optical flow loss. Extensive experiments verify the proposed framework’s effectiveness and show up to a 40% performance improvement over state-of-the-art approaches. Our FLOW360 dataset and code are available at https://siamlof.github.io/.
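
The training idea described above, a supervised flow objective combined with a contrastive term over Siamese features from two views of the same clip, can be illustrated with a short sketch. The PyTorch snippet below is a minimal, hypothetical rendering of such a hybrid loss: the function name hybrid_loss, the L1 flow penalty, the InfoNCE-style contrastive term, and the weight lambda_contrast are all assumptions chosen for illustration, not the authors' implementation (their released code is at https://siamlof.github.io/).

import torch
import torch.nn.functional as F

def hybrid_loss(flow_pred_a, flow_pred_b, flow_gt_a, flow_gt_b,
                feat_a, feat_b, temperature=0.07, lambda_contrast=0.1):
    # Illustrative sketch only; names and weightings are assumptions,
    # not taken from the paper.
    # Supervised flow loss: L1 error on each augmented view, each
    # compared against its own (correspondingly transformed) ground truth.
    flow_loss = F.l1_loss(flow_pred_a, flow_gt_a) + F.l1_loss(flow_pred_b, flow_gt_b)

    # Contrastive term: globally pooled encoder features of the two views
    # of the same clip should agree (positives), while other clips in the
    # batch act as negatives (InfoNCE over a cosine-similarity matrix).
    za = F.normalize(feat_a.mean(dim=(2, 3)), dim=1)  # (N, D)
    zb = F.normalize(feat_b.mean(dim=(2, 3)), dim=1)  # (N, D)
    logits = za @ zb.t() / temperature                # (N, N) similarities
    targets = torch.arange(za.size(0), device=za.device)
    contrast_loss = F.cross_entropy(logits, targets)

    return flow_loss + lambda_contrast * contrast_loss

A training step would call hybrid_loss on the predictions and features from the two branches of the Siamese network; the relative weighting of the two terms is treated here as a tunable hyperparameter, not a value taken from the paper.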

Acknowledgements

This research was partially supported by NSF CNS-1908658, NeTS-2109982, and a gift donation from Cisco. This article solely reflects the opinions and conclusions of its authors and not those of the funding agencies.

Author information

Corresponding author

Correspondence to Yan Yan.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 11412 KB)

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Bhandari, K., Duan, B., Liu, G., Latapie, H., Zong, Z., Yan, Y. (2022). Learning Omnidirectional Flow in 360° Video via Siamese Representation. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds) Computer Vision – ECCV 2022. ECCV 2022. Lecture Notes in Computer Science, vol 13668. Springer, Cham. https://doi.org/10.1007/978-3-031-20074-8_32

  • DOI: https://doi.org/10.1007/978-3-031-20074-8_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-20073-1

  • Online ISBN: 978-3-031-20074-8

  • eBook Packages: Computer Science, Computer Science (R0)
