[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3664647.3681208acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article
Open access

OmniStitch: Depth-Aware Stitching Framework for Omnidirectional Vision with Multiple Cameras

Published: 28 October 2024 Publication History

Abstract

Omnidirectional vision systems provide a 360-degree panoramic view, enabling full environmental awareness in various fields, such as Advanced Driver Assistance Systems (ADAS) and Virtual Reality (VR). Existing omnidirectional stitching methods rely on a single specialized 360-degree camera. However, due to hardware limitations such as high mounting heights and blind spots, adapting these methods to vehicles of varying sizes and geometries is challenging. These challenges include limited generalizability due to the reliance on predefined stitching regions for fixed camera arrays, performance degradation from distance parallax leading to large depth differences, and the absence of suitable datasets with ground truth for multi-camera omnidirectional systems. To overcome these challenges, we propose a novel omnidirectional stitching framework and a publicly available dataset tailored for varying distance scenarios with multiple cameras. The framework, referred to as OmniStitch, consists of a Stitching Region Maximization (SRM) module for automatic adaptation to different vehicles with multiple cameras and a Depth-Aware Stitching (DAS) module to handle depth differences caused by distance parallax between cameras. In addition, we create and release an omnidirectional stitching dataset, called GV360, which provides ground truth images that maintain the perspective of the 360-degree FOV, designed explicitly for vehicle-agnostic systems. Extensive evaluations of this dataset demonstrate that our framework outperforms state-of-the-art stitching models, especially in handling varying distance parallax. The proposed dataset and code are publicly available in https://github.com/tngh5004/Omnistitch.

References

[1]
Josep Bosch, Nuno Gracias, Rafael Garcia, and Pere Ridao. 2019. Omnidirectional multicamera video stitching using depth maps. IEEE Journal of Oceanic Engineering 45, 4 (2019), 1337--1352.
[2]
Matthew Brown and David G Lowe. 2007. Automatic panoramic image stitching using invariant features. International journal of computer vision 74 (2007), 59--73.
[3]
Che-Han Chang, Yoichi Sato, and Yung-Yu Chuang. 2014. Shape-preserving half-projective warps for image stitching. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3254--3261.
[4]
Pierre Charbonnier, Laure Blanc-Feraud, Gilles Aubert, and Michel Barlaud. 1994. Two deterministic half-quadratic regularization algorithms for computed imaging. In Proceedings of 1st international conference on image processing, Vol. 2. IEEE, 168--172.
[5]
Yu-Sheng Chen and Yung-Yu Chuang. 2016. Natural image stitching with the global similarity prior. In European conference on computer vision. Springer, 186-- 201.
[6]
Haojie Cheng, Chunxiao Xu, JiajunWang, and Lingxiao Zhao. 2022. Quad-fisheye Image Stitching for Monoscopic Panorama Reconstruction. In Computer Graphics Forum, Vol. 41. Wiley Online Library, 94--109.
[7]
Ana De Abreu, Cagri Ozcinar, and Aljosa Smolic. 2017. Look around you: Saliency maps for omnidirectional images in VR applications. In 2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 1--6.
[8]
Jiong Dong, Kaoru Ota, and Mianxiong Dong. 2023. Video frame interpolation: A comprehensive survey. ACM Transactions on Multimedia Computing, Communications and Applications 19, 2s (2023), 1--31.
[9]
Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio Lopez, and Vladlen Koltun. 2017. CARLA: An open urban driving simulator. In Conference on robot learning. PMLR, 1--16.
[10]
Peng Du, Jifeng Ning, Jiguang Cui, Shaoli Huang, Xinchao Wang, and Jiaxin Wang. 2022. Geometric structure preserving warp for natural image stitching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3688--3696.
[11]
Huiyu Duan, Xiongkuo Min, Wei Sun, Yucheng Zhu, Xiao-Ping Zhang, and Guangtao Zhai. 2023. Attentive deep image quality assessment for omnidirectional stitching. IEEE Journal of Selected Topics in Signal Processing (2023).
[12]
Junhong Gao, Seon Joo Kim, and Michael S Brown. 2011. Constructing image panoramas using dual-homography warping. In CVPR 2011. IEEE, 49--56.
[13]
Vitor Guizilini, Igor Vasiljevic, Rares Ambrus, Greg Shakhnarovich, and Adrien Gaidon. 2022. Full surround monodepth from multiple cameras. IEEE Robotics and Automation Letters 7, 2 (2022), 5397--5404.
[14]
Markus Heimberger, Jonathan Horgan, Ciarán Hughes, John McDonald, and Senthil Yogamani. 2017. Computer vision in automated parking systems: Design, implementation and challenges. Image and Vision Computing 68 (2017), 88--101.
[15]
Lionel Heng, Benjamin Choi, Zhaopeng Cui, Marcel Geppert, Sixing Hu, Benson Kuan, Peidong Liu, Rang Nguyen, Ye Chuan Yeo, Andreas Geiger, et al. 2019. Project autovision: Localization and 3d scene perception for an autonomous vehicle with a multi-camera system. In 2019 International Conference on Robotics and Automation (ICRA). IEEE, 4695--4702.
[16]
Tuan Ho and Madhukar Budagavi. 2017. Dual-fisheye lens stitching for 360- degree imaging. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2172--2176.
[17]
Qi Jia, ZhengJun Li, Xin Fan, Haotian Zhao, Shiyu Teng, Xinchen Ye, and Longin Jan Latecki. 2021. Leveraging line-point consistence to preserve structures for wide parallax image stitching. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 12186--12195.
[18]
Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, and Cheul-hee Hahm. 2023. A unified pyramid recurrent network for video frame interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1578--1587.
[19]
Xin Jin, LonghaiWu, Guotao Shen, Youxin Chen, Jie Chen, Jayoon Koo, and Cheulhee Hahm. 2023. Enhanced bi-directional motion estimation for video frame interpolation. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 5049--5057.
[20]
Christian Kinzig, Irene Cortés, Carlos Fernández, and Martin Lauer. 2022. Realtime seamless image stitching in autonomous driving. In 2022 25th International Conference on Information Fusion (FUSION). IEEE, 1--8.
[21]
Hyeokjun Kweon, Hyeonseong Kim, Yoonsu Kang, Youngho Yoon, Wooseong Jeong, and Kuk-Jin Yoon. 2023. Pixel-wise warping for deep image stitching. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 1196--1204.
[22]
Wei-Sheng Lai, Orazio Gallo, Jinwei Gu, Deqing Sun, Ming-Hsuan Yang, and Jan Kautz. 2019. Video stitching for linear camera arrays. arXiv preprint arXiv:1907.13622 (2019).
[23]
Hyunchul Lee, Sungmin Lee, and Okkyung Choi. 2020. Improved method on image stitching based on optical flow algorithm. International Journal of Engineering Business Management 12 (2020), 1847979020980928.
[24]
Kyu-Yul Lee and Jae-Young Sim. 2020. Warping residual based image stitching for large parallax. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 8198--8206.
[25]
Si-Ho Lee, Bong-Ju Kim, and Seon-Bong Lee. 2021. Study on image correction and optimization of mounting positions of dual cameras for vehicle test. Energies 14, 16 (2021), 4857.
[26]
Jing Li, Baosong Deng, Rongfu Tang, Zhengming Wang, and Ye Yan. 2019. Localadaptive image alignment based on triangular facet approximation. IEEE Transactions on Image Processing 29 (2019), 2356--2369.
[27]
Jing Li, Zhengming Wang, Shiming Lai, Yongping Zhai, and Maojun Zhang. 2017. Parallax-tolerant image stitching based on robust elastic warping. IEEE Transactions on multimedia 20, 7 (2017), 1672--1687.
[28]
Jia Li, Kaiwen Yu, Yifan Zhao, Yu Zhang, and Long Xu. 2019. Cross-reference stitching quality assessment for 360 omnidirectional images. In Proceedings of the 27th ACM international conference on multimedia. 2360--2368.
[29]
Jia Li, Yifan Zhao, Weihua Ye, Kaiwen Yu, and Shiming Ge. 2019. Attentive Deep Stitching and Quality Assessment for 360 Omnidirectional Images. IEEE journal of selected topics in signal processing 14, 1 (2019), 209--221.
[30]
Chung-Ching Lin, Sharathchandra U Pankanti, Karthikeyan Natesan Ramamurthy, and Aleksandr Y Aravkin. 2015. Adaptive as-natural-as-possible image stitching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1155--1163.
[31]
I-Chan Lo, Kuang-Tsu Shih, and Homer H Chen. 2021. Efficient and accurate stitching for 360° dual-fisheye images and videos. IEEE Transactions on Image Processing 31 (2021), 251--262.
[32]
Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017).
[33]
Simon Meister, Junhwa Hur, and Stefan Roth. 2018. Unflow: Unsupervised learning of optical flow with a bidirectional census loss. In Proceedings of the AAAI conference on artificial intelligence, Vol. 32.
[34]
Mingyuan Meng and Shaojun Liu. 2020. High-quality panorama stitching based on asymmetric bidirectional optical flow. In 2020 5th International Conference on Computational Intelligence and Applications (ICCIA). IEEE, 118--122.
[35]
A Moussa and Naser El-Sheimy. 2016. A fast approach for stitching of aerial images. The international archives of the photogrammetry, remote sensing and spatial information sciences 41 (2016), 769--774.
[36]
Lang Nie, Chunyu Lin, Kang Liao, Meiqin Liu, and Yao Zhao. 2020. A viewfree image stitching network based on global homography. Journal of Visual Communication and Image Representation 73 (2020), 102950.
[37]
Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, and Yao Zhao. 2021. Unsupervised deep image stitching: Reconstructing stitched features to images. IEEE Transactions on Image Processing 30 (2021), 6184--6197.
[38]
Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, and Yao Zhao. 2022. Deep rectangling for image stitching: a learning baseline. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 5740--5748.
[39]
Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, and Yao Zhao. 2023. Parallaxtolerant unsupervised deep image stitching. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 7399--7408.
[40]
Lang Nie, Chunyu Lin, Kang Liao, and Yao Zhao. 2020. Learning edgepreserved image stitching from large-baseline deep homography. arXiv preprint arXiv:2012.06194 (2020).
[41]
Simon Niklaus and Feng Liu. 2020. Softmax splatting for video frame interpolation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 5437--5446.
[42]
Janice Pan, Vikram Appia, Jesse Villarreal, Lucas Weaver, and Do-Kyoung Kwon. 2017. Rear-stitched view panorama: A low-power embedded implementation for smart rear-view mirrors on vehicles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 20--29.
[43]
Mark W Powell, Jeffrey S Norris, MA Vona, Paul G Backes, and Justin V Wick. 2005. Scientific visualization for the Mars Exploration Rovers. In Proceedings of the 2005 IEEE international conference on robotics and automation. IEEE, 4290--4296.
[44]
Akshay Rangesh and Mohan Manubhai Trivedi. 2019. No blind spots: Fullsurround multi-object tracking for autonomous vehicles using cameras and lidars. IEEE Transactions on Intelligent Vehicles 4, 4 (2019), 588--599.
[45]
John N Sanders-Reed and Phillip L Koon. 2010. Vision systems for manned and robotic ground vehicles. In Unmanned Systems Technology XII, Vol. 7692. SPIE, 181--192.
[46]
Ulrich Schwesinger, Mathias Bürki, Julian Timpner, Stephan Rottmann, LarsWolf, Lina Maria Paz, Hugo Grimmett, Ingmar Posner, Paul Newman, Christian Häne, et al. 2016. Automated valet parking and charging for e-mobility. In 2016 IEEE Intelligent Vehicles Symposium (IV). IEEE, 157--164.
[47]
Ahmed Rida Sekkat, Yohan Dupuis, Varun Ravi Kumar, Hazem Rashed, Senthil Yogamani, Pascal Vasseur, and Paul Honeine. 2022. SynWoodScape: Synthetic surround-view fisheye camera dataset for autonomous driving. IEEE Robotics and Automation Letters 7, 3 (2022), 8502--8509.
[48]
Hyeonjun Sim, Jihyong Oh, and Munchurl Kim. 2021. Xvfi: extreme video frame interpolation. In Proceedings of the IEEE/CVF international conference on computer vision. 14489--14498.
[49]
Karen Simonyan and AndrewZisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[50]
Dae-Young Song, Geonsoo Lee, HeeKyung Lee, Gi-Mun Um, and Donghyeon Cho. 2022. Weakly-supervised stitching network for real-world panoramic image generation. In European Conference on Computer Vision. Springer, 54--71.
[51]
Deqing Sun, Xiaodong Yang, Ming-Yu Liu, and Jan Kautz. 2018. Pwc-net: Cnns for optical flow using pyramid, warping, and cost volume. In Proceedings of the IEEE conference on computer vision and pattern recognition. 8934--8943.
[52]
Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing 13, 4 (2004), 600--612.
[53]
Changhee Won, Jongbin Ryu, and Jongwoo Lim. 2020. End-to-end learning for omnidirectional stereo matching with uncertainty prior. IEEE transactions on pattern analysis and machine intelligence 43, 11 (2020), 3850--3862.
[54]
Tian-Zhu Xiang, Gui-Song Xia, Xiang Bai, and Liangpei Zhang. 2018. Image stitching by line-guided local warping with global similarity constraint. Pattern recognition 83 (2018), 481--497.
[55]
Jianxiong Xiao, Krista A Ehinger, Aude Oliva, and Antonio Torralba. 2012. Recognizing scene viewpoint using panoramic place representation. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2695--2702.
[56]
Chunmei Xie, Xiaoyun Zhang, Hua Yang, Li Chen, and Zhiyong Gao. 2018. Video stitching based on optical flow. In 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB). IEEE, 1--5.
[57]
Weihua Ye, Kaiwen Yu, Yang Yu, and Jia Li. 2018. Logical stitching: A panoramic image stitching method based on color calibration box. In 2018 14th IEEE International Conference on Signal Processing (ICSP). IEEE, 1139--1143.
[58]
Senthil Yogamani, Ciarán Hughes, Jonathan Horgan, Ganesh Sistu, Padraig Varley, Derek O'Dea, Michal Uricár, Stefan Milz, Martin Simon, Karl Amende, et al. 2019. Woodscape: A multi-task, multi-camera fisheye dataset for autonomous driving. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 9308-- 9318.
[59]
Julio Zaragoza, Tat-Jun Chin, Michael S Brown, and David Suter. 2013. Asprojective- as-possible image stitching with moving DLT. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2339--2346.
[60]
Guofeng Zhang, Yi He, Weifeng Chen, Jiaya Jia, and Hujun Bao. 2016. Multiviewpoint panorama construction with wide-baseline images. IEEE Transactions on Image Processing 25, 7 (2016), 3099--3111.
[61]
Haoxian Zhang, Yang Zhao, and Ronggang Wang. 2020. A flexible recurrent residual pyramid network for video frame interpolation. In European conference on computer vision. Springer, 474--491.
[62]
Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition. 586--595.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
October 2024
11719 pages
ISBN:9798400706868
DOI:10.1145/3664647
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Check for updates

Author Tags

  1. image stitching
  2. omnidirectional view dataset
  3. omnidirectional vision

Qualifiers

  • Research-article

Funding Sources

  • NRF
  • MSIT
  • IITP

Conference

MM '24
Sponsor:
MM '24: The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne VIC, Australia

Acceptance Rates

MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 137
    Total Downloads
  • Downloads (Last 12 months)137
  • Downloads (Last 6 weeks)119
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media