short-paper

Novel Motion-Compensated Spatio-Temporal Filtering Scheme for x265 Open-Source Video Encoder

Authors:

Santhoshini Sekar,

Ashok Kumar Mishra,

Alex Giladi,

Dan GroisAuthors Info & Claims

MHV '23: Proceedings of the 2nd Mile-High Video Conference

Pages 126 - 127

https://doi.org/10.1145/3588444.3591024

Published: 16 June 2023 Publication History

Get Access

Abstract

There is a strong demand to decrease the video transmission bitrate without reducing visual quality [1]. The x265 encoder [2]-[4] is a popular open-source encoder, which generates bitstreams compliant with the H.265/MPEG-HEVC video coding standard [5]. Built on top of x264[6], the x265 encoder is integrated into several popular open-source frameworks, such as ffmpeg [7], GStreamer [8], and Handbrake [9]. In addition, the x265 is used by a variety of broadcast and streaming service providers who leverage the benefits of HEVC for streaming live and over-the-top (OTT) content. In addition to implementing nearly all the tools defined in HEVC, it implements many algorithmic optimizations that enable trading off encoder performance for quality [2]-[4]. The performance-critical kernels are implemented with hand-coded assembly kernels that use AVX2 and AVX-512 single instruction, multiple data instructions to improve performance on x86 CPUs. This flexible architecture of x265 makes it a popular choice for HEVC encoding for both on-premises and cloud services.

Recent x265 development efforts have been focused on further improving the coding gains. Specifically, the motion compensated spatio-temporal filtering (MCSTF) employed within the coding loop is especially useful for pictures that contain a high level of noise. It utilizes previously generated motion vectors across different video content resolutions to find the best temporal correspondence for low-pass filtering, while the temporal filtering is applied to the I- and P-frames. Figure 1 schematically illustrates the motion estimation process for temporal filtering in a temporal window, which consists of 5 adjacent pictures: two past, two future and one central picture used for producing a single filtered picture. Motion estimation is applied between the central picture and each future or past picture, thereby generating multiple motion-compensated predictions, which are then combined by using adaptive filtering to produce a final noise-reduced picture. Thus, a hierarchical motion estimation scheme is employed (layers L0, L1 and L2, are illustrated in Figure 2). Subsampled pictures are generated for all reference pictures

and the original picture as well: i.e., L1, while L2 is derived from L1 by using the same subsampling method. First, the motion estimation is done for each 16x16 block in L2. Then, the selected motion vector is used as an initial value for estimating the motion in L1. After that, the same is performed for estimating the motion in L0.

As a final step, the subpixel motion is estimated for each 8x8 block by using an interpolation filter on L0. Particularly, the motion of reference pictures before and after, relative to the original picture, is estimated per the 8x8 picture block. In turn, the motion compensation is applied on the pictures before and after the original picture according to the best matching motion for each block. i.e., such that pixel coordinates of the original picture in each block have the best matching coordinates within the referenced pictures. The filter is then applied to the current pixels, and after that, the filtered picture is encoded. Note that the pixels are processed one by one for the luma and chroma channels. The new sample value, is calculated by using the following equation:

[EQUATION]

where I_o is the original pixel, I_r(i) is the intensity of the corresponding pixel within the motion compensated picture i, and w_r(i, a) is the weight of the motion compensated picture where a is the number of available motion compensated pictures. The conducted extensive experimental results show significant bit-rate savings in terms of BD-BR [10].

References

[1]

"Cisco Visual Networking Index: Forecast and Methodology, 2018--2023", Online: https://www.cisco.com/c/en/us/solutions/collateral/executive-perspectives/annual-internet-report/white-paper-c11-741490.pdf, Cisco Systems Inc., 9 Mar. 2020.

Google Scholar

[2]

P. Ramachandran et al., "Speed-Distortion Optimization: Tradeoffs in Open-Source HEVC Encoding," in SMPTE Motion Imaging Journal, vol. 129, no. 7, pp. 17--25, Aug. 2020.

Crossref

Google Scholar

[3]

Projects from VideoLAN, x265 software library and application, Online: https://www.videolan.org/developers/x265.html.

Google Scholar

[4]

x265 Documentation, Online: https://x265.readthedocs.io/en/stable/index.html

Google Scholar

[5]

ITU-T, Recommendation H.265 (04/13), Series H: Audiovisual and Multimedia Systems, Infrastructure of audiovisual services - Coding of Moving Video, High Efficiency Video Coding.

Google Scholar

[6]

Projects from VideoLAN, x264 software library and application, Online: http://www.videolan.org/developers/x264.html

Google Scholar

[7]

FFmpeg multimedia framework, Online: https://ffmpeg.org/

Google Scholar

[8]

GStreamer multimedia framework, Online: https://gstreamer.freedesktop.org/

Google Scholar

[9]

HandBrake video transcoder, Online: https://handbrake.fr/

Google Scholar

[10]

G. Bjøntegaard, "Calculation of average PSNR differences between RD-curves", ITU-T Q.6/SG16 VCEG 13th Meeting, Document VCEG-M33, Austin, USA, Apr. 2001.

Google Scholar

Index Terms

Novel Motion-Compensated Spatio-Temporal Filtering Scheme for x265 Open-Source Video Encoder
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia streaming

Recommendations

Novel Segment-Based Rate Control Scheme for x265 Open-Source Video Encoder
MHV '24: Proceedings of the 3rd Mile-High Video Conference

There is a rapid development in various multimedia services such as mobile streaming services, video conferencing and live video streaming platforms [1],[2]. To achieve a high quality-of-experience (QoE), video streaming platforms require infrastructure ...
Novel Histogram-Based Scene Change Detection Scheme for x265 Open-Source Video Encoder
MHV '23: Proceedings of the 2nd Mile-High Video Conference

There is an increasing need in bandwidth requirements, particularly, due to the increase in the device resolution [1]. As a result, there is a strong demand to decrease the video transmission bitrate without reducing visual quality. In 2010, a Joint ...
Performance Assessment of AV1, x265 and VVenC Open-Source Encoder Implementations Compared to VVC and HEVC Reference Software Models
MHV '23: Proceedings of the 2nd Mile-High Video Conference

Video applications continue to gain a lot of traction and have an enormous demand. As a result, there is a strong demand to decrease the video transmission bitrate without reducing visual quality [1],[2]. The efforts to achieve bitrate savings, ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

MHV '23: Proceedings of the 2nd Mile-High Video Conference

May 2023

176 pages

ISBN:9798400701603

DOI:10.1145/3588444

Chair:
Ali C. Begen,
General Chairs:
Tamar Shoham
Beamr
,
Alex Giladi
Comcast
,
Program Co-chairs:
Christian Timmerer,
Dan Grois

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 June 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MHV '23

Sponsor:

SIGMM

MHV '23: 2nd Mile-High Video Conference

May 7 - 10, 2023

CO, Denver, USA

Upcoming Conference

MHV '25

Sponsor:
sigmm

Mile-High Video Conference

February 18 - 20, 2025

Denver , CO , USA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
65
Total Downloads

Downloads (Last 12 months)27
Downloads (Last 6 weeks)2

Reflects downloads up to 16 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Index Terms

Recommendations

Novel Segment-Based Rate Control Scheme for x265 Open-Source Video Encoder

Novel Histogram-Based Scene Change Detection Scheme for x265 Open-Source Video Encoder

Performance Assessment of AV1, x265 and VVenC Open-Source Encoder Implementations Compared to VVC and HEVC Reference Software Models