[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3588444.3591024acmconferencesArticle/Chapter ViewAbstractPublication PagesmhvConference Proceedingsconference-collections
short-paper

Novel Motion-Compensated Spatio-Temporal Filtering Scheme for x265 Open-Source Video Encoder

Published: 16 June 2023 Publication History

Abstract

There is a strong demand to decrease the video transmission bitrate without reducing visual quality [1]. The x265 encoder [2]-[4] is a popular open-source encoder, which generates bitstreams compliant with the H.265/MPEG-HEVC video coding standard [5]. Built on top of x264[6], the x265 encoder is integrated into several popular open-source frameworks, such as ffmpeg [7], GStreamer [8], and Handbrake [9]. In addition, the x265 is used by a variety of broadcast and streaming service providers who leverage the benefits of HEVC for streaming live and over-the-top (OTT) content. In addition to implementing nearly all the tools defined in HEVC, it implements many algorithmic optimizations that enable trading off encoder performance for quality [2]-[4]. The performance-critical kernels are implemented with hand-coded assembly kernels that use AVX2 and AVX-512 single instruction, multiple data instructions to improve performance on x86 CPUs. This flexible architecture of x265 makes it a popular choice for HEVC encoding for both on-premises and cloud services.
Recent x265 development efforts have been focused on further improving the coding gains. Specifically, the motion compensated spatio-temporal filtering (MCSTF) employed within the coding loop is especially useful for pictures that contain a high level of noise. It utilizes previously generated motion vectors across different video content resolutions to find the best temporal correspondence for low-pass filtering, while the temporal filtering is applied to the I- and P-frames. Figure 1 schematically illustrates the motion estimation process for temporal filtering in a temporal window, which consists of 5 adjacent pictures: two past, two future and one central picture used for producing a single filtered picture. Motion estimation is applied between the central picture and each future or past picture, thereby generating multiple motion-compensated predictions, which are then combined by using adaptive filtering to produce a final noise-reduced picture. Thus, a hierarchical motion estimation scheme is employed (layers L0, L1 and L2, are illustrated in Figure 2). Subsampled pictures are generated for all reference pictures
and the original picture as well: i.e., L1, while L2 is derived from L1 by using the same subsampling method. First, the motion estimation is done for each 16x16 block in L2. Then, the selected motion vector is used as an initial value for estimating the motion in L1. After that, the same is performed for estimating the motion in L0.
As a final step, the subpixel motion is estimated for each 8x8 block by using an interpolation filter on L0. Particularly, the motion of reference pictures before and after, relative to the original picture, is estimated per the 8x8 picture block. In turn, the motion compensation is applied on the pictures before and after the original picture according to the best matching motion for each block. i.e., such that pixel coordinates of the original picture in each block have the best matching coordinates within the referenced pictures. The filter is then applied to the current pixels, and after that, the filtered picture is encoded. Note that the pixels are processed one by one for the luma and chroma channels. The new sample value, is calculated by using the following equation:
[EQUATION]
where Io is the original pixel, Ir(i) is the intensity of the corresponding pixel within the motion compensated picture i, and wr(i, a) is the weight of the motion compensated picture where a is the number of available motion compensated pictures. The conducted extensive experimental results show significant bit-rate savings in terms of BD-BR [10].

References

[1]
"Cisco Visual Networking Index: Forecast and Methodology, 2018--2023", Online: https://www.cisco.com/c/en/us/solutions/collateral/executive-perspectives/annual-internet-report/white-paper-c11-741490.pdf, Cisco Systems Inc., 9 Mar. 2020.
[2]
P. Ramachandran et al., "Speed-Distortion Optimization: Tradeoffs in Open-Source HEVC Encoding," in SMPTE Motion Imaging Journal, vol. 129, no. 7, pp. 17--25, Aug. 2020.
[3]
Projects from VideoLAN, x265 software library and application, Online: https://www.videolan.org/developers/x265.html.
[4]
x265 Documentation, Online: https://x265.readthedocs.io/en/stable/index.html
[5]
ITU-T, Recommendation H.265 (04/13), Series H: Audiovisual and Multimedia Systems, Infrastructure of audiovisual services - Coding of Moving Video, High Efficiency Video Coding.
[6]
Projects from VideoLAN, x264 software library and application, Online: http://www.videolan.org/developers/x264.html
[7]
FFmpeg multimedia framework, Online: https://ffmpeg.org/
[8]
GStreamer multimedia framework, Online: https://gstreamer.freedesktop.org/
[9]
HandBrake video transcoder, Online: https://handbrake.fr/
[10]
G. Bjøntegaard, "Calculation of average PSNR differences between RD-curves", ITU-T Q.6/SG16 VCEG 13th Meeting, Document VCEG-M33, Austin, USA, Apr. 2001.

Index Terms

  1. Novel Motion-Compensated Spatio-Temporal Filtering Scheme for x265 Open-Source Video Encoder

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MHV '23: Proceedings of the 2nd Mile-High Video Conference
    May 2023
    176 pages
    ISBN:9798400701603
    DOI:10.1145/3588444
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 June 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. x265
    2. HEVC
    3. H.265
    4. MCSTF
    5. open-source
    6. HM
    7. coding gain
    8. computational complexity

    Qualifiers

    • Short-paper

    Conference

    MHV '23
    Sponsor:
    MHV '23: 2nd Mile-High Video Conference
    May 7 - 10, 2023
    CO, Denver, USA

    Upcoming Conference

    MHV '25
    Mile-High Video Conference
    February 18 - 20, 2025
    Denver , CO , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 65
      Total Downloads
    • Downloads (Last 12 months)27
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 16 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media