Abstract
In three-dimensional video (3DV) coding, both color videos and depth maps need to be coded. 3DV coding systems usually use a two-channel method, which encodes the color videos and the depth maps with two parallel codec implementations. The complexity and hardware requirements are therefore nearly twice those of coding 2D color video. Meanwhile, depth maps in 3DV are usually estimated from color video data, and the estimated depth itself can be very noisy. In this paper, we propose a fast and effective depth coding method that minimizes both the depth coding bitrate and the coding complexity. First, a 3 × 3 bilateral filter pre-processes the depth maps to reduce noise introduced by the depth estimation procedure, so that fewer bits are spent on coding depth noise. Second, because the motion information of color videos and depth maps is highly correlated, coding information from the color video, including motion vectors and the prediction mode, is reused to accelerate the mode decision procedure of depth coding and to reduce the temporal variation of depth maps. Experimental results show that the proposed algorithm can reduce the computational complexity of depth coding by 70% and the depth bitrate by 20%.
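The pre-processing step can be illustrated with a minimal sketch of a generic 3 × 3 bilateral filter applied to a depth map. This is not the authors' implementation: the function name `bilateral_filter_3x3`, the Gaussian form of both weights, the parameter values `sigma_s` and `sigma_r`, and the border handling are all assumptions chosen for illustration, since the abstract specifies only the window size.

```python
import math


def bilateral_filter_3x3(depth, sigma_s=1.0, sigma_r=10.0):
    """Apply a 3x3 bilateral filter to a 2D depth map (list of rows).

    sigma_s (spatial) and sigma_r (range) are assumed values, not taken
    from the paper. At the image border, only neighbours that lie inside
    the image contribute to the weighted average.
    """
    height, width = len(depth), len(depth[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            center = depth[y][x]
            weighted_sum = 0.0
            weight_total = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < height and 0 <= nx < width:
                        value = depth[ny][nx]
                        # Spatial weight: closer pixels count more.
                        w_s = math.exp(-(dx * dx + dy * dy) / (2 * sigma_s ** 2))
                        # Range weight: pixels with similar depth count more,
                        # which keeps sharp depth edges from being blurred.
                        w_r = math.exp(-((value - center) ** 2) / (2 * sigma_r ** 2))
                        weighted_sum += w_s * w_r * value
                        weight_total += w_s * w_r
            # weight_total >= 1 because the centre pixel always has weight 1.
            out[y][x] = weighted_sum / weight_total
    return out


if __name__ == "__main__":
    # Hypothetical depth patch: a flat region with one noisy sample,
    # next to a sharp depth edge (100 -> 200).
    noisy = [
        [100, 100, 100, 200, 200],
        [100, 104, 100, 200, 200],
        [100, 100, 100, 200, 200],
    ]
    for row in bilateral_filter_3x3(noisy):
        print([round(v, 2) for v in row])
```

In this sketch the range weight is what separates the filter from a plain 3 × 3 blur: small deviations inside a flat region are averaged away, while samples across a large depth discontinuity receive a weight close to zero, so object boundaries stay sharp.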
Acknowledgment
This work is sponsored by the Shanghai Rising-Star Program (11QA1402400) and the Innovation Program of Shanghai Municipal Education Commission (No. 13ZZ069), and is supported by the National Natural Science Foundation of China under grants No. 60832003, 60902085, 61171084, and 61171096.
Cite this article
Shen, L., Zhang, Z. Efficient depth coding in 3D video to minimize coding bitrate and complexity. Multimed Tools Appl 72, 1639–1652 (2014). https://doi.org/10.1007/s11042-013-1455-3
DOI: https://doi.org/10.1007/s11042-013-1455-3