[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2016165122A1 - Inter prediction offset - Google Patents

Inter prediction offset Download PDF

Info

Publication number
WO2016165122A1
WO2016165122A1 PCT/CN2015/076865 CN2015076865W WO2016165122A1 WO 2016165122 A1 WO2016165122 A1 WO 2016165122A1 CN 2015076865 W CN2015076865 W CN 2015076865W WO 2016165122 A1 WO2016165122 A1 WO 2016165122A1
Authority
WO
WIPO (PCT)
Prior art keywords
block
offset
inter prediction
neighboring
resi
Prior art date
Application number
PCT/CN2015/076865
Other languages
French (fr)
Inventor
Han HUANG
Original Assignee
Mediatek Singapore Pte. Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mediatek Singapore Pte. Ltd. filed Critical Mediatek Singapore Pte. Ltd.
Priority to PCT/CN2015/076865 priority Critical patent/WO2016165122A1/en
Priority to US15/521,499 priority patent/US10785475B2/en
Priority to CN201580060217.4A priority patent/CN107079165B/en
Priority to EP15857286.7A priority patent/EP3202151B1/en
Priority to PCT/CN2015/093763 priority patent/WO2016070808A1/en
Publication of WO2016165122A1 publication Critical patent/WO2016165122A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction

Definitions

  • the invention relates generally to image and video processing.
  • the presented invention relates to image and video coding.
  • Motion compensation prediction is a powerful tool in video coding. Instead of the original signal, the prediction residual is coded together with the side information.
  • the side information usually includes: motion vector (s) and the reference index of the reference picture (s) .
  • merge mode in HEVC [1] for example, the side information of current block is derived from already coded block (s) to reduce the overhead of signaling the side information.
  • the prediction residual signal is usually coded by transform coding.
  • An inter prediction offset is derived from already coded block (s) to improve the efficiency of motion compensation in current block.
  • Fig. 1 is a diagram illustrating an example of neighboring coded block chosen for offset calculation.
  • Fig. 2 is a diagram illustrating an example of Example of using multiple neighboring coded blocks for offset calculation.
  • Fig. 3 is a diagram illustrating an example of spatial merge candidates in HEVC.
  • an inter prediction offset is derived from one or multiple neighboring coded blocks.
  • the offset is then added to the motion compensation prediction block.
  • the offset is used to predict the residual signal.
  • the motion compensation prediction residual signal is subtracted by the inter prediction offset.
  • Orig X and Pred X be the original and motion compensation prediction signal of current block X
  • Offset X be the derived inter prediction offset from the neighboring block.
  • Offset X mean (Resi' Y ) is derived as the mean value of Resi' Y , where Resi' Y is the reconstructed residual signal of the neighboring coded block Y.
  • Offset X mean (Resi' Y ) +Offset Y , where Offset Y is the inter prediction offset applied to the neighboring coded block.
  • the neighboring coded block used for inter prediction offset derivation can be a prediction block, transform block, 4x4 block or other predefined block sizes.
  • the used neighboring coded block is those that have same motion information as current block.
  • the motion vector (s) and reference frame (s) index of the used neighboring coded block should be equal to the motion vector (s) and reference frame (s) index of current block.
  • the neighboring coded block can be chosen from the left or the above.
  • An example is shown in Fig. 1, where A and B are the neighboring coded blocks.
  • MV_curr represent the motion of current block
  • MV_i represent a neighboring coded block i
  • DC_i represents the mean value of the residual of block i.
  • the block i can be a 4x4 block or any other predefined block sizes.
  • the mean value of residual signal can be pre-calculated and stored. For example, when the residual of a block is reconstructed, then the mean value of this block is immediately obtained and stored in a buffer.
  • the neighboring block can be the same as the merge candidate.
  • An example is shown in Fig. 3. If current block is merged to one of the spatial neighboring coded block, then the mean value of residual in that block is used as inter prediction offset.
  • Offset X is set to zero if Offset X is larger than a threshold.
  • Offset X is set to zero if neighboring block is smaller than current block.
  • Offset X is set to zero if the variance of neighboring block is larger than a threshold.
  • the inter prediction offset is applied in all components if the video is RGB, YUV, YCbCr or other format.
  • the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
  • inter prediction offset can be constrained to applied for merge mode only.
  • inter prediction offset can be constrained to applied for merge 2Nx2N mode only.
  • inter prediction offset can be constrained to applied for 2Nx2N partition only.
  • inter prediction offset can be constrained to applied for other predefined partition size.
  • an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein.
  • An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein.
  • DSP Digital Signal Processor
  • the invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) .
  • processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention.
  • the software code or firmware codes may be developed in different programming languages and different format or style.
  • the software code may also be compiled for different target platform.
  • different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Inter prediction offset is proposed. Besides the motion information, an inter prediction offset is also derived from the neighboring coded block to enhance the motion compensation prediction, or to predict the residual signal.

Description

INTER PREDICTION OFFSET
FIELD OF INVENTION
The invention relates generally to image and video processing. In particular, the presented invention relates to image and video coding.
BACKGROUND OF THE INVENTION
Motion compensation prediction (MCP) is a powerful tool in video coding. Instead of the original signal, the prediction residual is coded together with the side information. The side information usually includes: motion vector (s) and the reference index of the reference picture (s) . In some special coding mode, merge mode in HEVC [1] for example, the side information of current block is derived from already coded block (s) to reduce the overhead of signaling the side information. The prediction residual signal is usually coded by transform coding.
SUMMARY OF THE INVENTION
Methods of deriving inter prediction offset are proposed. An inter prediction offset is derived from already coded block (s) to improve the efficiency of motion compensation in current block.
Other aspects and features of the invention will become apparent to those with ordinary skill in the art upon review of the following descriptions of specific embodiments.
BRIEF DESCRIPTION OF THE DRAWINGS
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
Fig. 1 is a diagram illustrating an example of neighboring coded block chosen for offset calculation.
Fig. 2 is a diagram illustrating an example of Example of using multiple neighboring coded blocks for offset calculation.
Fig. 3 is a diagram illustrating an example of spatial merge candidates in HEVC.
DETAILED DESCRIPTION
The following description is of the best-contemplated mode of carrying out the invention.  This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
In the proposed methods, an inter prediction offset is derived from one or multiple neighboring coded blocks.
The offset is then added to the motion compensation prediction block.
In the other aspect, the offset is used to predict the residual signal. In other words, the motion compensation prediction residual signal is subtracted by the inter prediction offset.
Let OrigX and PredX be the original and motion compensation prediction signal of current block X, OffsetX be the derived inter prediction offset from the neighboring block. The final residual signal of current block is ResiX=OrigX-PredX-OffsetX. At the decoder side, the reconstructed signal is RecoX=Resi'X+PredX+OffsetX, where Resi'X is the reconstructed residual signal.
In one embodiment, OffsetX=mean (Resi'Y) is derived as the mean value of Resi'Y, where Resi'Y is the reconstructed residual signal of the neighboring coded block Y.
In another embodiment, if an offset is already applied to the neighboring coded block, thenOffsetX=mean (Resi'Y) +OffsetY, where OffsetY is the inter prediction offset applied to the neighboring coded block.
In still another embodiment, the neighboring coded block used for inter prediction offset derivation can be a prediction block, transform block, 4x4 block or other predefined block sizes.
In still another embodiment, the used neighboring coded block is those that have same motion information as current block. In the other words, the motion vector (s) and reference frame (s) index of the used neighboring coded block should be equal to the motion vector (s) and reference frame (s) index of current block.
In still another embodiment, the neighboring coded block can be chosen from the left or the above. An example is shown in Fig. 1, where A and B are the neighboring coded blocks.
In still another embodiment, if multiple neighboring coded blocks are used, then the offset is calculated as the average. For example, if both A and B in Fig. 1 are used, then OffsetX= (mean (Resi'A) +mean (Resi'B) ) >>1. Another example is shown in Fig. 2, where the neighboring coded blocks can be chosen from A to C to B. The offset calculation can be described as follows:
Figure PCTCN2015076865-appb-000001
Figure PCTCN2015076865-appb-000002
where MV_curr represent the motion of current block, and MV_i represent a neighboring coded block i, and DC_i represents the mean value of the residual of block i. Note that the block i can be a 4x4 block or any other predefined block sizes.
In still another embodiment, the mean value of residual signal can be pre-calculated and stored. For example, when the residual of a block is reconstructed, then the mean value of this block is immediately obtained and stored in a buffer.
In still another embodiment, if current block is merge mode, then the neighboring block can be the same as the merge candidate. An example is shown in Fig. 3. If current block is merged to one of the spatial neighboring coded block, then the mean value of residual in that block is used as inter prediction offset.
In still another embodiment, OffsetX is set to zero if OffsetX is larger than a threshold.
In still another embodiment, OffsetX is set to zero if neighboring block is smaller than current block.
In still another embodiment, OffsetX is set to zero if the variance of neighboring block is larger than a threshold.
In still another embodiment, the inter prediction offset is applied in all components if the video is RGB, YUV, YCbCr or other format.
In still another embodiment, the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
In still another embodiment, inter prediction offset can be constrained to applied for merge mode only.
In still another embodiment, inter prediction offset can be constrained to applied for merge 2Nx2N mode only.
In still another embodiment, inter prediction offset can be constrained to applied for 2Nx2N partition only.
In still another embodiment, inter prediction offset can be constrained to applied for other predefined partition size.
The methods described above can be used in a video encoder as well as in a video decoder. Embodiments of merge with inter prediction offset according to the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) . These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware codes may be developed in different programming languages and different format or style. The software code may also be compiled for different target platform. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art) . Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.

Claims (21)

  1. Amethod of inter prediction offset, comprising:
    the residual signal for current block is calculated as ResiX=OrigX-PredX-OffsetX, where OrigX is the original signal, PredXis the prediction signal, OffsetXis the inter prediction offset, at the decoder side, the reconstructed signal isRecoX=Resi'X+PredX+OffsetX, where Resi'X is the reconstructed residual signal; and
    the inter prediction offset OffsetX is derived from the residual signal of neighboring coded block (s) .
  2. The method as claimed in claim 1, wherein the offset OffsetX=mean (Resi'Y) is derived as mean value of the reconstructed residual signal of a neighboring coded block Y.
  3. The method as claimed in claim 1 and claim 2, wherein if an offset is already applied to the neighboring coded block Y, thenOffsetX=mean (Resi'Y) +OffsetY, where OffsetYis the inter prediction offset applied to the neighboring coded block.
  4. The method as claimed in claim 2 and claim 3, wherein neighboring coded block Y can be a prediction block, transform block, 4x4 block or other predefined block sizes.
  5. The method as claimed in claim 2 and claim 3, wherein Y is chosen from those that have same motion information as current block, in the other words, the motion vector (s) and reference frame (s) index of Y should be equal to the motion vector (s) and reference frame (s) index of current block X.
  6. The method as claimed in claim 2 and claim 3, wherein Y can be chosen from the left or the above of current block X.
  7. The method as claimed in claim 1, wherein multiple neighboring coded blocks are used to derive the offset, if both A and B are used, thenOffsetX= (mean (Resi'A ) +mean (Resi'B)) >>1; the neighboring coded blocks can be chosen from A to C to B, the offset is calculated by averaging over the used blocks:
    Figure PCTCN2015076865-appb-100001
    Figure PCTCN2015076865-appb-100002
    where MV_curr represent the motion of current block, and MV_i represent a neighboring coded block i, and DC_i represents the mean value of the residual of block i, the block i can be a 4x4 block or any other predefined block sizes.
  8. The method as claimed in claim 2 to claim 7, wherein the mean value of residual signal in a block can be pre-calculated and stored, when the residual of a block is reconstructed, then the mean value of this block is immediately obtained and stored in a buffer.
  9. The method as claimed in claim 8, wherein the block used for pre-calculating the mean value can be a prediction unit (PU) , transform unit (TU) , 4x4 block or any predefined block size.
  10. The method as claimed in claim 1, wherein if current block is merge mode, then the neighboring block can be the same as the merge candidate, if current block is merged to one of the spatial neighboring coded block, then the mean value of residual in that block is used as inter prediction offset.
  11. The method as claimed in claim 1, wherein OffsetX is set to zero if OffsetXis larger than a threshold.
  12. The method as claimed in claim 1, wherein OffsetX is set to zero if neighboring block is smaller than current block.
  13. The method as claimed in claim 1, wherein OffsetX is set to zero if the variance of neighboring block is larger than a threshold.
  14. The method as claimed in claim 1, wherein the inter prediction offset is applied in all components if the video is RGB, YUV, YCbCr or other format.
  15. The method as claimed in claim 1, wherein the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
  16. The method as claimed in claim 1, inter prediction offset can be constrained to applied for merge mode only.
  17. The method as claimed in claim 1, inter prediction offset can be constrained to applied for 2Nx2N partition only.
  18. The method as claimed in claim 1, inter prediction offset can be constrained to applied for other predefined partition size.
  19. The method as claimed in claim 1, wherein one or more syntax elements can be used to signal whether merge with inter prediction offset is used, the syntax element can be coded.
  20. The method as claimed in claim 19, wherein the syntax elements can be explicitly  transmitted in the sequence level, view level, picture level, slice level, or other levels, it can be coded in VPS, SPS, PPS, APS, slice header, LCU et al.
  21. The method as claimed in claim 20, wherein the information about whether the merge with inter prediction offset is used can also be derived implicitly at decoder side according to statistics of mode selections.
PCT/CN2015/076865 2014-11-05 2015-04-17 Inter prediction offset WO2016165122A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
PCT/CN2015/076865 WO2016165122A1 (en) 2015-04-17 2015-04-17 Inter prediction offset
US15/521,499 US10785475B2 (en) 2014-11-05 2015-11-04 Method and apparatus of video coding with prediction offset
CN201580060217.4A CN107079165B (en) 2014-11-05 2015-11-04 Video coding method and apparatus using prediction residual
EP15857286.7A EP3202151B1 (en) 2014-11-05 2015-11-04 Method and apparatus of video coding with prediction offset
PCT/CN2015/093763 WO2016070808A1 (en) 2014-11-05 2015-11-04 Method and apparatus of video coding with prediction offset

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/076865 WO2016165122A1 (en) 2015-04-17 2015-04-17 Inter prediction offset

Publications (1)

Publication Number Publication Date
WO2016165122A1 true WO2016165122A1 (en) 2016-10-20

Family

ID=57126343

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/076865 WO2016165122A1 (en) 2014-11-05 2015-04-17 Inter prediction offset

Country Status (1)

Country Link
WO (1) WO2016165122A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009110754A2 (en) * 2008-03-05 2009-09-11 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding image
CN101677400A (en) * 2008-09-19 2010-03-24 华为技术有限公司 Coding and decoding method, coder, decoder and encoder/decoder system
US20130182765A1 (en) * 2012-01-17 2013-07-18 Futurewei Technologies, Inc. In-loop Filtering for Lossless Coding Mode in High Efficiency Video Coding

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009110754A2 (en) * 2008-03-05 2009-09-11 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding image
CN101677400A (en) * 2008-09-19 2010-03-24 华为技术有限公司 Coding and decoding method, coder, decoder and encoder/decoder system
US20130182765A1 (en) * 2012-01-17 2013-07-18 Futurewei Technologies, Inc. In-loop Filtering for Lossless Coding Mode in High Efficiency Video Coding

Similar Documents

Publication Publication Date Title
US9756336B2 (en) Method of background residual prediction for video coding
EP3202151B1 (en) Method and apparatus of video coding with prediction offset
WO2017156669A1 (en) Methods for motion vector storage in video coding
US20190149842A1 (en) Method and apparatus for intra mode coding
WO2016115981A1 (en) Method of video coding for chroma components
US20170310988A1 (en) Method of Motion Vector Predictor or Merge Candidate Derivation in Video Coding
WO2017143467A1 (en) Localized luma mode prediction inheritance for chroma coding
WO2016008157A1 (en) Methods for motion compensation using high order motion model
WO2016165069A1 (en) Advanced temporal motion vector prediction in video coding
WO2016090568A1 (en) Binary tree block partitioning structure
WO2017219342A1 (en) Methods of signaling quantization parameter for quad-tree plus binary tree structure
WO2012092763A1 (en) Method and apparatus of improved intra luma prediction mode coding
US10298951B2 (en) Method and apparatus of motion vector prediction
WO2016119104A1 (en) Motion vector regularization
WO2015192372A1 (en) A simplified method for illumination compensation in multi-view and 3d video coding
WO2016065538A1 (en) Guided cross-component prediction
WO2016115728A1 (en) Improved escape value coding methods
WO2013102299A1 (en) Residue quad tree depth for chroma components
WO2015180166A1 (en) Improved intra prediction mode coding
WO2016115736A1 (en) Additional intra prediction modes using cross-chroma-component prediction
US12081800B2 (en) Method and apparatus for adaptively processing video samples in a video signal frame
WO2016165122A1 (en) Inter prediction offset
WO2012123321A1 (en) Method for reconstructing and coding an image block
WO2017035833A1 (en) Neighboring-derived prediction offset (npo)
WO2016123749A1 (en) Deblocking filtering with adaptive motion vector resolution

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15888827

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15888827

Country of ref document: EP

Kind code of ref document: A1