[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2022062880A1 - 视频解码方法、装置、计算机可读介质及电子设备 - Google Patents

视频解码方法、装置、计算机可读介质及电子设备 Download PDF

Info

Publication number
WO2022062880A1
WO2022062880A1 PCT/CN2021/116451 CN2021116451W WO2022062880A1 WO 2022062880 A1 WO2022062880 A1 WO 2022062880A1 CN 2021116451 W CN2021116451 W CN 2021116451W WO 2022062880 A1 WO2022062880 A1 WO 2022062880A1
Authority
WO
WIPO (PCT)
Prior art keywords
value
block
quantization
encoded data
transformation matrix
Prior art date
Application number
PCT/CN2021/116451
Other languages
English (en)
French (fr)
Inventor
王力强
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Publication of WO2022062880A1 publication Critical patent/WO2022062880A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/109Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/11Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
    • H04N19/82Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/86Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Definitions

  • the present application relates to the field of computer and communication technologies, and in particular, to a video decoding method, an apparatus, a computer-readable medium, and an electronic device.
  • the encoder In the video encoding process, the encoder usually needs to transform, quantize, and entropy the residual data between the original video data and the predicted video data before sending it to the decoder. Due to the diversity of residual data, a single DCT (Discrete Cosine Transform, discrete cosine transform) transform kernel cannot adapt to all residual characteristics, so it may be necessary to select multiple DCT transform kernels or DST (Discrete Sine Transform) for a residual block. , discrete sine transform) transformation kernel is used as a transformation matrix combination. In this case, although the adaptability of the transformation matrix combination to the residual block is improved, it is necessary to encode the transformation for each coding unit (Coding Unit, referred to as CU). The index of the kernel, thus resulting in lower coding efficiency.
  • CU coding Unit
  • Embodiments of the present application provide a video decoding method, apparatus, computer-readable medium, and electronic device.
  • a video decoding method executed by an electronic device, includes: performing entropy decoding processing on an encoded block of a video image frame to obtain a quantized coefficient block of residual data corresponding to the encoded block; The quantization coefficients in the area are counted to obtain a statistical result, and the remainder of the statistical result for the set value is calculated; according to the correspondence between the remainder and the transformation matrix combination, the corresponding transformation matrix combination is selected; based on the selected transformation matrix combination, the corresponding transformation matrix combination is selected. Perform inverse transformation processing on the inverse quantization result of the quantized coefficient block; reconstruct the video image frame according to the processing result of the inverse transformation processing.
  • a video decoding device comprising: a decoding unit configured to perform entropy decoding processing on an encoded block of a video image frame to obtain a quantized coefficient block of residual data corresponding to the encoded block; a statistics unit configured to perform entropy decoding on the quantized coefficient The quantization coefficients in at least one area in the block are counted to obtain a statistical result, and the remainder of the statistical result for the set value is calculated; the selection unit is configured to select the corresponding transformation matrix according to the correspondence between the remainder and the combination of transformation matrices combination; a processing unit configured to perform inverse transform processing on the inverse quantization result of the quantized coefficient block based on the selected transform matrix combination; and a reconstruction unit configured to reconstruct the video image frame according to the processing result of the inverse transform processing.
  • An electronic device comprising a memory and one or more processors, the memory storing computer-readable instructions that, when executed by the one or more processors, cause the one or more processors
  • the processor performs the steps of the video decoding method described above.
  • One or more non-volatile computer-readable storage media having computer-readable instructions stored thereon that, when executed by one or more processors, cause the one or multiple processors to perform the steps of the above-described video decoding method.
  • a computer program product or computer program comprising computer readable instructions stored in a computer readable storage medium from which a processor of a computer device readable storage The computer-readable instructions are read by the medium, and the computer-readable instructions are executed by the processor to cause the computer device to perform the steps of the above-described video decoding method.
  • FIG. 1 shows a schematic diagram of an exemplary system architecture to which the technical solutions of the embodiments of the present application can be applied;
  • FIG. 2 shows a schematic diagram of a placement manner of a video encoding device and a video decoding device in a streaming transmission system
  • Fig. 3 shows the basic flow chart of a video encoder
  • Fig. 4 shows the scanning area marked by SRCC technology
  • Fig. 5 shows the sequence schematic diagram of scanning the marked scanning area
  • FIG. 6 shows a flowchart of a video decoding method according to an embodiment of the present application
  • FIG. 7 shows a schematic diagram of a division manner of a designated area according to an embodiment of the present application.
  • FIG. 8 shows a schematic diagram of a division manner of a designated area according to an embodiment of the present application.
  • FIG. 9 shows a schematic diagram of a division manner of a designated area according to an embodiment of the present application.
  • FIG. 10 shows a block diagram of a video decoding apparatus according to an embodiment of the present application.
  • FIG. 11 shows a schematic structural diagram of a computer system suitable for implementing the electronic device according to the embodiment of the present application.
  • Example embodiments will now be described more fully with reference to the accompanying drawings.
  • Example embodiments can be embodied in various forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this application will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art.
  • FIG. 1 shows a schematic diagram of an exemplary system architecture to which the technical solutions of the embodiments of the present application can be applied.
  • the system architecture 100 includes a plurality of end devices that can communicate with each other through, for example, a network 150 .
  • the system architecture 100 may include a first end device 110 and a second end device 120 interconnected by a network 150 .
  • the first terminal device 110 and the second terminal device 120 perform unidirectional data transmission.
  • the first terminal device 110 may encode video data (eg, a video picture stream captured by the terminal device 110 ) for transmission to the second terminal device 120 through the network 150, and the encoded video data may be encoded in one or more
  • the second terminal device 120 may receive the encoded video data from the network 150, decode the encoded video data to restore the video data, and display video pictures according to the restored video data.
  • the system architecture 100 may include a third end device 130 and a fourth end device 140 that perform bidirectional transmission of encoded video data, such as may occur during a video conference.
  • each of the third end device 130 and the fourth end device 140 may encode video data (eg, a stream of video pictures captured by the end device) for transmission to the third end device over the network 150 130 and the other terminal device of the fourth terminal device 140 .
  • Each of the third terminal device 130 and the fourth terminal device 140 may also receive encoded video data transmitted by the other one of the third terminal device 130 and the fourth terminal device 140, and may The video data is decoded to recover the video data, and a video picture can be displayed on an accessible display device based on the recovered video data.
  • the first terminal device 110 , the second terminal device 120 , the third terminal device 130 and the fourth terminal device 140 may be servers, personal computers and smart phones, but the principles disclosed in this application may not be limited thereto .
  • Embodiments disclosed herein are applicable to laptop computers, tablet computers, media players, and/or dedicated videoconferencing equipment.
  • Network 150 represents any number of networks, including, for example, wired and/or wireless communication networks, that communicate encoded video data between first end device 110, second end device 120, third end device 130, and fourth end device 140.
  • Communication network 150 may exchange data in circuit-switched and/or packet-switched channels.
  • the network may include a telecommunications network, a local area network, a wide area network, and/or the Internet.
  • the architecture and topology of network 150 may be immaterial to the operations disclosed herein.
  • FIG. 2 illustrates the placement of a video encoding device and a video decoding device in a streaming environment.
  • the subject matter disclosed herein is equally applicable to other video-enabled applications including, for example, videoconferencing, digital TV (television), storing compressed video on digital media including CDs, DVDs, memory sticks, and the like.
  • the streaming system may include a capture subsystem 213 , which may include a video source 201 such as a digital camera, and the video source creates an uncompressed video picture stream 202 .
  • the video picture stream 202 includes samples captured by a digital camera.
  • the video picture stream 202 is depicted as a thick line to emphasize the high data volume of the video picture stream, which can be processed by the electronic device 220, and the electronic Device 220 includes video encoding device 203 coupled to video source 201 .
  • Video encoding device 203 may include hardware, software, or a combination of hardware and software to implement or implement various aspects of the disclosed subject matter as described in greater detail below.
  • the encoded video data 204 (or encoded video code stream 204) is depicted as a thin line to emphasize the lower amount of encoded video data 204 (or encoded video code stream 204) 204), which can be stored on the streaming server 205 for future use.
  • One or more streaming client subsystems such as client subsystem 206 and client subsystem 208 in FIG. 2 , may access streaming server 205 to retrieve copies 207 and 209 of encoded video data 204 .
  • Client subsystem 206 may include, for example, video decoding device 210 in electronic device 230 .
  • the video decoding device 210 decodes the incoming copy 207 of the encoded video data and produces an output video picture stream 211 that can be presented on a display 212 (eg, a display screen) or another presentation device.
  • encoded video data 204, video data 207, and video data 209 may be encoded according to certain video encoding/compression standards. Examples of these standards include ITU-T H.265.
  • video coding standard under development is informally referred to as Versatile Video Coding (VVC), and this application may be used in the context of the VVC standard.
  • electronic device 220 and the electronic device 230 may include other components not shown in the figures.
  • electronic device 220 may include a video decoding device
  • electronic device 230 may also include a video encoding device.
  • the international video coding standard HEVC High Efficiency Video Coding, high efficiency video coding
  • VVC Very Video Coding, multifunctional video coding
  • AVS Chinese national video coding standard AVS (Audio Video coding) Standard, source coding standard
  • CTU Coding Tree Unit, coding tree unit
  • LCU Large Coding Unit, largest coding unit
  • the CTU can continue to be further divided into finer divisions to obtain one or more basic coding units CU, and CU is the most basic element in a coding link.
  • Predictive Coding includes intra-frame prediction and inter-frame prediction. After the original video signal is predicted by the selected reconstructed video signal, a residual video signal is obtained. The encoder needs to decide which predictive coding mode to select for the current CU and inform the decoder. Among them, intra-frame prediction means that the predicted signal comes from an area that has been coded and reconstructed in the same image; inter-frame prediction means that the predicted signal comes from another image that has been coded and different from the current image (called a reference image). ).
  • Transform & Quantization After the residual video signal undergoes transform operations such as DFT (Discrete Fourier Transform), DCT, etc., the signal is converted into the transform domain, which is called transform coefficient. The transform coefficient is further subjected to a lossy quantization operation, which loses a certain amount of information, so that the quantized signal is beneficial to the compressed expression. In some video coding standards, there may be more than one transformation mode to choose from, so the encoder also needs to select one of the transformation modes for the current CU and inform the decoder. The fineness of quantization is usually determined by the Quantization Parameter (QP for short). If the value of QP is larger, the coefficients representing a larger value range will be quantized into the same output, which usually brings greater distortion and distortion. A lower code rate; on the contrary, if the QP value is smaller, the coefficients representing a smaller value range will be quantized into the same output, so it usually brings less distortion and corresponds to a higher code rate.
  • QP Quantization Parameter
  • Entropy Coding or Statistical Coding The quantized transform domain signal will undergo statistical compression coding according to the frequency of occurrence of each value, and finally output a binarized (0 or 1) compressed code stream. At the same time, other information generated by encoding, such as the selected encoding mode, motion vector data, etc., also needs to be entropy encoded to reduce the bit rate.
  • Statistical coding is a lossless coding method that can effectively reduce the code rate required to express the same signal. Common statistical coding methods include Variable Length Coding (VLC) or context-based binary arithmetic coding ( Content Adaptive Binary Arithmetic Coding, referred to as CABAC).
  • Loop Filtering The changed and quantized signal will obtain a reconstructed image through the operations of inverse quantization, inverse transformation and prediction compensation. Compared with the original image, the reconstructed image is different from the original image due to the influence of quantization, that is, the reconstructed image will produce distortion (Distortion). Therefore, filtering operations can be performed on the reconstructed image, such as deblocking filter (DB), SAO (Sample Adaptive Offset, adaptive pixel compensation) or ALF (Adaptive Loop Filter, adaptive loop filter) and other filters , which can effectively reduce the degree of distortion caused by quantization. Since these filtered reconstructed images will be used as references for subsequent encoded images to predict future image signals, the above filtering operation is also called in-loop filtering, ie, a filtering operation in an encoding loop.
  • DB deblocking filter
  • SAO Sample Adaptive Offset, adaptive pixel compensation
  • ALF Adaptive Loop Filter, adaptive loop filter
  • FIG. 3 shows a basic flowchart of a video encoder, and intra-frame prediction is used as an example for description in the flowchart.
  • the original image signal sk [x,y] and the predicted image signal Do the difference operation to get the residual signal u k [x,y]
  • the residual signal u k [x,y] is transformed and quantized to obtain quantized coefficients
  • the quantized coefficients are encoded by entropy coding to obtain the encoded bits
  • the reconstructed residual signal u' k [x, y] is obtained through inverse quantization and inverse transformation processing, and the predicted image signal It is superimposed with the reconstructed residual signal u' k [x, y] to generate an image signal image signal
  • it is input to the intra - frame mode decision module and the intra-frame prediction module for intra-frame prediction processing; ] can be used as a reference image for the next frame for motion estimation and motion compensation prediction.
  • the SRCC technology can mark the size of the upper left region of the non-zero coefficients contained in each quantized coefficient block (size W ⁇ H) SRx ⁇ SRy, where SRx is the quantization coefficient The abscissa of the rightmost non-zero coefficient in the coefficient block, SRy is the ordinate of the lowermost non-zero coefficient in the quantized coefficient block, and 1 ⁇ SRx ⁇ W, 1 ⁇ SRy ⁇ H, and the coefficients outside this area are 0.
  • the SRCC technology uses (SRx, SRy) to determine the quantized coefficient area that needs to be scanned in a quantized coefficient block. As shown in Figure 4, only the quantized coefficients in the scanning area marked by (SRx, SRy) need to be coded.
  • the scanning order of the coding is as follows As shown in Figure 5, it can be a reverse zigzag scan from the lower right corner to the upper left corner.
  • the decoding end Based on the above encoding process, for each CU, the decoding end performs entropy decoding to obtain various mode information and quantization coefficients after obtaining the compressed code stream (ie, the bit stream). Then, the quantized coefficients undergo inverse quantization and inverse transformation to obtain residual signals.
  • the predicted signal corresponding to the CU can be obtained, and then the reconstructed signal can be obtained by adding the residual signal and the predicted signal. The reconstructed signal is then subjected to loop filtering and other operations to generate the final output signal.
  • the transform processing of the residual signal makes the energy of the residual signal concentrate on less low-frequency coefficients, that is, most coefficients have smaller values. Then after the subsequent quantization module, the smaller coefficient value will become zero value, which greatly reduces the cost of coding the residual signal.
  • transformation kernels such as DST7 and DCT8 are introduced into the transformation process, and the horizontal transformation and vertical transformation of the residual signal are carried out. Direct transforms can use different transform kernels.
  • the possible transformation combinations for transform processing of a residual signal are as follows: (DCT2, DCT2), (DCT8, DCT8), (DCT8, DST7) ), (DST7, DCT8) and (DST7, DST7).
  • the embodiments of the present application propose to implicitly indicate the transformation matrix combination corresponding to the coding block by using the quantization coefficients in the quantization coefficient block, thereby reducing the bits occupied by the transformation matrix index and effectively improving the video coding efficiency.
  • a statistical result is obtained by performing statistics on the quantized coefficients in at least one area in the quantized coefficient block, and then the remainder of the statistical result for the set value is calculated, so as to select the corresponding and perform inverse transformation processing on the inverse quantization result of the quantization coefficient block based on the selected transformation matrix combination, and reconstruct the video frame according to the processing result of the inverse transformation processing, so that the quantization coefficients in the quantization coefficient block can be used to implicitly indicate
  • the transformation matrix combination corresponding to the encoding block does not require the encoding end to encode the index of the transformation matrix for each coding unit, which reduces the bits occupied by the transformation matrix index, thereby effectively improving the video encoding efficiency.
  • FIG. 6 shows a flowchart of a video decoding method according to an embodiment of the present application.
  • the video decoding method can be executed by a device with a computing processing function, such as an electronic device, where the electronic device is a terminal device or server.
  • the video decoding method at least includes steps S610 to S640, which are described in detail as follows:
  • step S610 entropy decoding processing is performed on the coded block of the video image frame to obtain a quantized coefficient block of residual data corresponding to the coded block.
  • the video image frame sequence includes a series of images, each image can be further divided into slices, and the slices can be further divided into a series of LCUs (or CTUs).
  • the LCUs include There are several CUs.
  • the video image frame is encoded in block units.
  • MB macroblock
  • MB macroblock
  • Prediction Prediction block
  • basic concepts such as coding unit CU, prediction unit (PU) and transform unit (TU) are used to functionally divide a variety of block units, and adopt a new tree-based structure for describe.
  • a CU can be divided into smaller CUs according to a quadtree, and the smaller CUs can be further divided to form a quadtree structure.
  • the coding block in this embodiment of the present application may be a CU, or a block smaller than the CU, such as a smaller block obtained by dividing the CU.
  • step S620 statistics are performed on the quantization coefficients in at least one area in the quantization coefficient block to obtain a statistical result, and the remainder of the statistical result with respect to the set value is calculated.
  • the specified index identifier may include at least one of the following data: the sequence header of the first encoded data corresponding to the video image frame sequence, the image header of the second encoded data corresponding to the video image frame, and a bar of the video image frame.
  • the slice header with the corresponding third encoded data and the LCU header of the fourth encoded data corresponding to one largest coding unit LCU of the video image frame are as follows:
  • whether the target coded block in the first coded data needs to be decoded according to entropy may be indicated by the value of the specified index identifier included in the sequence header of the first coded data corresponding to a sequence of video image frames
  • the corresponding transformation matrix combination is selected for the quantization coefficients in the obtained quantization coefficient block.
  • the value of the specified index identifier in the sequence header of the first encoded data is 1, which means that the target encoding block in the first encoded data needs to be decoded according to entropy.
  • the quantized coefficients in the obtained quantized coefficient block select the corresponding transformation matrix combination.
  • the technical solution of this embodiment enables the indication of all target coding blocks corresponding to the entire video image frame sequence through an index identification in the sequence header of the first encoded data corresponding to the video image frame sequence, thereby effectively reducing the index identification
  • the occupied bits improve the video coding efficiency.
  • the target coding block in the first coded data may be all coding blocks in the first coded data, or may be coding blocks in the first coded data using the intra-frame coding mode, or may also be the first coding block The coded block in the inter-frame coding mode in the data.
  • whether the target coded block in the second coded data needs to be obtained according to entropy decoding may be indicated by the value of the specified index identifier included in the image header of the second coded data corresponding to one video image frame
  • the corresponding transformation matrix combination is selected for the quantization coefficients in the quantized coefficient block of
  • the quantization coefficients in the quantization coefficient block select the corresponding transformation matrix combination.
  • the target coding block in the second coded data may be all coding blocks in the second coded data, or may be coding blocks in the second coded data using the intra-frame coding mode, or may also be the second coding block The coded block in the inter-frame coding mode in the data.
  • whether the target coding block in the third coded data needs to be indicated by the value of the specified index identifier included in the slice header of the third coded data corresponding to one slice of the video image frame can be used.
  • the corresponding transformation matrix combination is selected according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the value of the specified index identifier in the slice header of the third coded data is 1, which means the target coding block in the third coded data.
  • the corresponding transformation matrix combination needs to be selected according to the quantized coefficients in the quantized coefficient block obtained by entropy decoding.
  • the technical solution of this embodiment enables the indication of all target coding blocks corresponding to the entire slice through an index identifier in the slice header of the third encoded data corresponding to one slice of the video image frame, and the index can also be reduced.
  • the bits occupied by the identification improve the video coding efficiency.
  • the target coding blocks in the third coded data may be all coding blocks in the third coded data, or may be coding blocks in the third coded data using the intra-frame coding mode, or may also be the third coding block The coded block in the inter-frame coding mode in the data.
  • whether the target coding block in the fourth coded data needs to be based on entropy can be indicated by the value of the specified index identifier included in the LCU header of the fourth coded data corresponding to one LCU of the video image frame
  • the corresponding transformation matrix combination is selected for the quantization coefficients in the decoded quantization coefficient block.
  • the value of the specified index identifier in the LCU header of the fourth encoded data is 1, which means that the target encoding block in the fourth encoded data needs to be based on entropy.
  • the quantized coefficients in the decoded quantized coefficient block select the corresponding transformation matrix combination.
  • the technical solution of this embodiment makes it possible to realize the indication of all target coding blocks corresponding to the entire LCU through an index mark in the LCU header of the fourth coded data corresponding to one LCU of the video image frame, which can also reduce the occupation of the index mark. bits, which improves the video coding efficiency.
  • the target coding blocks in the fourth coded data may be all coding blocks in the fourth coded data, or may be coding blocks in the fourth coded data using the intra-frame coding mode, or may be the fourth coding block The coded block in the inter-frame coding mode in the data.
  • the size of the coding block is small (for example, smaller than a preset threshold), it can be determined that the coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; If the size of the block is relatively large (eg, larger than a certain preset threshold), it can be determined that the coding block does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the technical solution of this embodiment can implicitly indicate whether it is necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding according to the size of the coding block, without the need for additional bits, so the video coding efficiency can also be improved. .
  • the target encoding in the first encoded data is determined.
  • the block does not need to select the corresponding transformation matrix combination according to the quantized coefficients in the quantized coefficient block obtained by entropy decoding.
  • the index identifier included in the sequence header is the second value (for example, it may be 1), at this time, it can be further determined according to the index identifier included in the image header of the second encoded data corresponding to the video image frame in the sequence of video image frames. make decisions.
  • the target coding block in the first coded data may be all coding blocks in the first coded data, or may be coding blocks in the first coded data using the intra-frame coding mode, or may also be the first coding block The coded block in the inter-frame coding mode in the data.
  • the index identifier included in the aforementioned sequence header is the second value (for example, it may be 1), and the index identifier included in the aforementioned image header is the first value (eg, it may be 0) , it can be determined that the target coding block in the second coded data does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding. If the index identifier contained in the foregoing sequence header and the index identifier included in the foregoing image header are both second values (for example, may be 1), it can be determined that the target encoding block in the second encoded data needs to be obtained by entropy decoding.
  • the quantization coefficients in the quantization coefficient block select the corresponding transformation matrix combination. In this case, the decision can no longer be made through the slice header, the LCU header and the size of the coding block.
  • the target coding block in the second coded data may be all coding blocks in the second coded data, or may be coding blocks in the second coded data using the intra-frame coding mode, or may also be the second coding block The coded block in the inter-frame coding mode in the data.
  • the video image The decision is made according to the index identifier contained in the slice header of the third encoded data corresponding to the slice of the frame.
  • the index identifier included in the preceding sequence header and the index identifier included in the preceding image header are both the second value, but the index identifier included in the slice header information of the third encoded data corresponding to one slice is the first value (for example, it can be 0), then it can be determined that the target coding block in the third coded data does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the index identifier included in the foregoing sequence header, the index identifier included in the foregoing image header, and the index identifier included in the foregoing slice header are all second values (for example, may be 1), it can be determined that the third encoded data contains
  • the target coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding. In this case, the decision can no longer be made according to the size of the LCU header and the coding block.
  • the target coding blocks in the third coded data may be all coding blocks in the third coded data, or may be coding blocks in the third coded data using the intra-frame coding mode, or may also be the third coding block The coded block in the inter-frame coding mode in the data.
  • the decision may be further made according to the index identifier included in the LCU header of the fourth encoded data corresponding to the LCU in the slice.
  • the index identifier included in the preceding sequence header, the index identifier included in the preceding image header, and the index identifier included in the preceding slice header are all second values, but the LCU header information of the fourth encoded data corresponding to one LCU If the included index identifier is the first value (for example, it may be 0), it can be determined that the target encoding block in the fourth encoded data does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the index identifier included in the aforementioned sequence header, the index identifier included in the aforementioned image header, the index identifier included in the aforementioned slice header, and the index identifier included in the aforementioned LCU header information are all second values (for example, it may be 1 ), then it can be determined that the target coding block in the fourth coded data needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding. In this case, the size of the coding block can no longer be determined.
  • the target coding blocks in the fourth coded data may be all coding blocks in the fourth coded data, or may be coding blocks in the fourth coded data using the intra-frame coding mode, or may be the fourth coding block The coded block in the inter-frame coding mode in the data.
  • the index identifier included in the aforementioned sequence header if the index identifier included in the aforementioned image header, the index identifier included in the aforementioned slice header, and the index included in the aforementioned LCU header information If the flags are all second values (for example, it may be 1), then the decision may be further made according to the size of the coding block.
  • the index identifier included in the aforementioned sequence header, the index identifier included in the aforementioned image header, the index identifier included in the aforementioned slice header, and the index identifier included in the aforementioned LCU header information are all second values, but one If the size of the coding block is small (for example, smaller than a certain preset threshold), it can be determined that the coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; on the contrary, if the size of the coding block is If it is larger (for example, greater than a certain preset threshold), it can be determined that the encoding block does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • an indication bit may also be used to indicate whether the quantization coefficient block obtained by entropy decoding is required to be respectively indicated to the coding block using the intra-frame coding mode and the coding block using the inter-frame coding mode in the coded data.
  • the quantization coefficient selects the corresponding transformation matrix combination. in particular:
  • the value of the first indicator bit and the value of the second indicator bit included in the sequence header of the first encoded data corresponding to a sequence of video image frames it is possible to determine, respectively, the value of the first indicator bit and the value of the second indicator bit in the first encoded data. Whether it is necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding for the coding block in the intra-frame coding mode and the coding block in the inter-frame coding mode.
  • the value of the first indicator bit included in the sequence header of the first encoded data when the value of the first indicator bit included in the sequence header of the first encoded data is 0, it indicates that the encoding block in the first encoded data that adopts the intra-frame encoding mode does not need the quantization coefficients in the quantization coefficient block obtained by entropy decoding Select the corresponding transformation matrix combination; when the value of the first indicator bit included in the sequence header of the first encoded data is 1, it indicates that the encoding block in the first encoded data that adopts the intra-frame encoding mode needs to be decoded according to the entropy.
  • the obtained quantization coefficient block The quantization coefficient in selects the corresponding transformation matrix combination; when the value of the second indicator bit included in the sequence header of the first encoded data is 0, it indicates that the encoding block in the first encoded data using the inter-frame encoding mode does not need to be decoded according to entropy
  • the quantization coefficient in the obtained quantization coefficient block is selected by the corresponding transformation matrix combination; when the value of the second indicator bit included in the sequence header of the first coded data is 1, it indicates that the first coded data adopts the coding block of the inter-frame coding mode
  • the corresponding transformation matrix combination needs to be selected according to the quantized coefficients in the quantized coefficient block obtained by entropy decoding.
  • the frame used in the second encoded data can be determined respectively. Whether it is necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding for the coding block in the intra-coding mode and the coding block in the inter-coding mode.
  • the corresponding transformation matrix combination is selected for the quantization coefficients in the obtained quantization coefficient block; when the value of the second indicator bit included in the image header of the second coded data is 1, it indicates that the encoding block in the second encoded data that adopts the intra-frame encoding mode does not need the quantization coefficients in the quantization coefficient block obtained by entropy decoding Select the corresponding transformation matrix combination; when the value of the first indicator bit contained in the image header of the second encoded data is 1, it indicates that the encoding block using the intra-frame encoding mode in the second encoded data needs a quantization coefficient block obtained by entropy decoding The quantization coefficient in selects the corresponding transformation matrix combination; when the value of the second indicator bit included in the image header of the second encoded data is 0, it indicates that the encoding block in the second encoded data using the inter-frame encoding mode does not need to be decoded according to entropy The corresponding transformation matrix combination is selected for the quantization coefficients in the obtained quantization coefficient block; when the value
  • the third encoding is determined respectively according to the value of the first indicator bit and the value of the second indicator bit included in the slice header information of the third encoded data corresponding to one slice of the video image frame. Whether the coding block using the intra-frame coding mode and the coding block using the inter-frame coding mode in the data needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding. For example, when the value of the first indicator bit included in the slice header of the third encoded data is 0, it indicates that the encoding block in the third encoded data that adopts the intra-frame encoding mode does not need the quantization in the quantization coefficient block obtained by entropy decoding.
  • the coefficient selects the corresponding transformation matrix combination; when the value of the first indicator bit included in the slice header of the third encoded data is 1, it indicates that the encoding block in the third encoded data that adopts the intra-frame encoding mode needs to be quantized according to entropy decoding.
  • the quantization coefficient in the coefficient block selects the corresponding transformation matrix combination; when the value of the second indicator bit included in the slice header of the third encoded data is 0, it indicates that the encoding block in the inter-frame encoding mode in the third encoded data does not need to be
  • the corresponding transformation matrix combination is selected according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; when the value of the second indicator bit included in the slice header of the third encoded data is 1, it indicates that inter-frame encoding is used in the third encoded data
  • the encoding block of the mode needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the value of the first indicator bit and the value of the second indicator bit included in the LCU header information of the fourth encoded data corresponding to one LCU of the video image frame determine the value of the fourth encoded data respectively. Whether the coding block using the intra-frame coding mode and the coding block using the inter-frame coding mode needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the value of the first indicator bit included in the LCU header of the fourth encoded data when the value of the first indicator bit included in the LCU header of the fourth encoded data is 0, it indicates that the encoding block in the fourth encoded data that adopts the intra-frame encoding mode does not need the quantization coefficients in the quantization coefficient block obtained by entropy decoding Select the corresponding transformation matrix combination; when the value of the first indicator bit included in the LCU header of the fourth encoded data is 1, it indicates that the encoding block using the intra-frame encoding mode in the fourth encoded data needs a quantization coefficient block obtained by entropy decoding The quantization coefficient in selects the corresponding transformation matrix combination; when the value of the second indicator bit included in the LCU header of the fourth encoded data is 0, it indicates that the encoding block using the inter-frame encoding mode in the fourth encoded data does not need to be decoded according to entropy The quantization coefficients in the obtained quantization coefficient block are selected corresponding to the transformation matrix combination; when the value
  • the coding block using the intra-frame coding mode and the coding block using the inter-frame coding mode in the coded data by indicating the bit pair respectively indicate whether the corresponding transform needs to be selected according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • a scheme similar to that in the previous embodiment can also be adopted, and the decision is made jointly by the indicator bits in the sequence header and the image header, or by the indicator bits in the sequence header, the image header and the slice header. to co-decision, or by the indicator bits in the sequence header, picture header, slice header and LCU header, or by the sequence header, picture header, slice header, LCU header Indicates bits, and the size of the encoding block to make a joint decision.
  • the index identifier in the aforementioned sequence header is 1 (the value is only an example), which means that the target coding block in the first coded data corresponding to the video image frame sequence needs to be selected according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • the corresponding transformation matrix combination is 1 (the value is only an example), which means that the target coding block in the first coded data corresponding to the video image frame sequence needs to be selected according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • the target coding block in the first coded data may be all coding blocks in the first coded data, or may be coding blocks in the first coded data using the intra-frame coding mode, or may also be the first coding block The coded block in the inter-frame coding mode in the data.
  • the coding block using the intra-frame coding mode and the coding block using the inter-frame coding mode in the first coded data corresponding to the video image frame sequence may also be respectively indicated by the indication bit in the sequence header. That is, the sequence header contains two indicator bits, and one indicator bit is used to indicate whether the encoding block using the intra-frame encoding mode in the first encoded data needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding. , and another indication bit is used to indicate whether the coding block in the first coded data using the inter-frame coding mode needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • the index identifier in the image header is 1 (the value is only an example), which means that the target encoding block in the second encoded data corresponding to the video image frame needs to be selected according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding. combination of transformation matrices.
  • the target coding block in the second coded data may be all coding blocks in the second coded data, or may be coding blocks in the second coded data using the intra-frame coding mode, or may also be the second coding block The coded block in the inter-frame coding mode in the data.
  • the indication bits in the image header may be used to separately indicate the coding blocks that use the intra-frame coding mode and the coding blocks that use the inter-frame coding mode in the second coded data corresponding to the video image frame. That is, the image header contains two indicator bits, and one indicator bit is used to indicate whether the encoding block using the intra-frame encoding mode in the second encoded data needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding. , and another indicator bit is used to indicate whether the encoding block in the inter-frame encoding mode in the second encoded data needs to select a corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • the index identifier in the slice header is 1 (the value is only an example), which means that the target encoding block in the third encoded data corresponding to the slice needs to be selected according to the quantization coefficient block in the quantization coefficient block obtained by entropy decoding. combination of transformation matrices.
  • the target coding blocks in the third coded data may be all coding blocks in the third coded data, or may be coding blocks in the third coded data using the intra-frame coding mode, or may also be the third coding block The coded block in the inter-frame coding mode in the data.
  • the indication bits in the slice header may be used to respectively indicate the coding blocks that use the intra-frame coding mode and the coding blocks that use the inter-frame coding mode in the third coded data corresponding to the slice. That is, the slice header contains two indicator bits, and one indicator bit is used to indicate whether the encoding block using the intra-frame encoding mode in the third encoded data needs to select the corresponding transformation matrix according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding. The other indicator bit is used to indicate whether the encoding block in the inter-frame encoding mode in the third encoded data needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • the index identifier in the LCU header is 1 (the value is only an example), which means that the target coding block in the fourth coded data corresponding to the LCU needs to select the corresponding transform according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding matrix combination.
  • the target coding blocks in the fourth coded data may be all coding blocks in the fourth coded data, or may be coding blocks in the fourth coded data using the intra-frame coding mode, or may be the fourth coding block The coded block in the inter-frame coding mode in the data.
  • the indication bits in the LCU header may be used to separately indicate the coding blocks that use the intra-frame coding mode and the coding blocks that use the inter-frame coding mode in the fourth coded data corresponding to the LCU. That is, the LCU header contains two indicator bits, and one indicator bit is used to indicate whether the encoding block using the intra-frame encoding mode in the fourth encoded data needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding. , and another indication bit is used to indicate whether the coding block in the fourth coded data that adopts the inter-coding mode needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding.
  • the size of the coding block is small (for example, smaller than a certain preset threshold), it can be determined that the coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; on the contrary, if If the size of the coding block is relatively large (eg, larger than a certain preset threshold), it can be determined that the coding block does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the index identifier in the aforementioned sequence header and the index identifier in the image header it means that the target encoding block in the second encoded data corresponding to the video image frame needs to be obtained by entropy decoding
  • the quantization coefficients in the quantization coefficient block select the corresponding transformation matrix combination; if the index mark in the sequence header is 1, and the index mark in the image header is 0 (the value is only an example), it means that the video image frame corresponds to The target coding block in the second coded data does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the target coding block in the second coded data may be all coding blocks in the second coded data, or may be coding blocks in the second coded data using the intra-frame coding mode, or may also be the second coding block The coded block in the inter-frame coding mode in the data.
  • the index identifier in the sequence header, the index identifier in the image header, and the index identifier in the slice header are all 1 (the value is only an example), it means that the target in the third encoded data corresponding to the slice is The coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; If it is 0 (the value is only an example), it means that the target coding block in the third coded data corresponding to the slice does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the target coding blocks in the third coded data may be all coding blocks in the third coded data, or may be coding blocks in the third coded data using the intra-frame coding mode, or may also be the third coding block The coded block in the inter-frame coding mode in the data.
  • index identifier in the aforementioned sequence header the index identifier in the image header, the index identifier in the slice header, and the index identifier in the LCU header, and the decision is no longer made based on the size of the coding block.
  • the index identifier in the sequence header, the index identifier in the image header, the index identifier in the slice header, and the index identifier in the LCU header are all 1 (the value is only an example), it means that the LCU corresponds to the
  • the target coding block in the fourth coded data needs to select the corresponding transformation matrix combination according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding; if the index identifier in the sequence header, the index identifier in the image header and the slice header
  • the index identification in the LCU is 1, and the index identification in the LCU header is 0 (the value is only an example), which means that the target encoding block in the fourth encoded data corresponding to the LCU does not need to be decoded according to entropy.
  • the quantization coefficients of select the corresponding transformation matrix combination.
  • the target coding blocks in the fourth coded data may be all coding blocks in the fourth coded data, or may be coding blocks in the fourth coded data using the intra-frame coding mode, or may be the fourth coding block The coded block in the inter-frame coding mode in the data.
  • the decision is made based on the index identifier in the sequence header, the index identifier in the image header, the index identifier in the slice header, the index identifier in the LCU header, and the size of the coding block.
  • the index identifier in the sequence header, the index identifier in the image header, the index identifier in the slice header, and the index identifier in the LCU header are all 1 (the value is only an example), and the size of the encoding block is less than the set threshold, it means that the coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding;
  • the index mark in the section and the index mark in the LCU header are both 1, but the size of the coding block is greater than the set threshold, it means that the coding block does not need to select the corresponding transform according to the quantization coefficient in the quantization coefficient block obtained by entropy decoding matrix combination.
  • the index identifier in the image header and the index identifier in the slice header are both 1 (the value is only an example), it means that the target encoding block in the third encoded data corresponding to the slice needs to be decoded according to entropy.
  • the quantization coefficient in the quantization coefficient block selects the corresponding transformation matrix combination; if the index mark in the image header is 1, and the index mark in the slice header is 0 (the value is only an example), it means that the slice corresponds to the
  • the target coding block in the third coded data does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the target coding blocks in the third coded data may be all coding blocks in the third coded data, or may be coding blocks in the third coded data using the intra-frame coding mode, or may also be the third coding block The coded block in the inter-frame coding mode in the data.
  • the index identifier in the image header, the index identifier in the slice header, and the index identifier in the LCU header are all 1 (the numerical value is only an example), it means that the target encoding in the fourth encoded data corresponding to the LCU is The block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; if the index identification in the image header and the index identification in the slice header are 1, and the index identification in the LCU header is 0 (the value is only an example), it means that the target coding block in the fourth coded data corresponding to the LCU does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the target coding blocks in the fourth coded data may be all coding blocks in the fourth coded data, or may be coding blocks in the fourth coded data using the intra-frame coding mode, or may be the fourth coding block The coded block in the inter-frame coding mode in the data.
  • the decision is made based on the index identifier in the image header, the index identifier in the slice header, the index identifier in the LCU header, and the size of the coding block, and the sequence header is no longer used for decision-making.
  • the index identifier in the image header, the index identifier in the slice header, and the index identifier in the LCU header are all 1 (the value is only an example), and the size of the encoding block is smaller than the set threshold, it means that the The coding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; , but the size of the coding block is larger than the set threshold, it means that the coding block does not need to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the quantization coefficients in at least one area in the quantization coefficient block are counted, the quantization coefficients in the at least one area are counted together, and finally a statistical result is obtained, specifically:
  • the numerical sum of the quantization coefficients in the at least one area can be calculated, and the obtained sum is used as the statistical result; or the sum of the absolute values of the quantized coefficients in the at least one area can be calculated, and the obtained sum is used as the statistical result;
  • the sum of the numerical values of the odd-numbered quantization coefficients in the at least one area may be calculated, and the obtained sum may be used as the statistical result; or the sum of the absolute values of the odd-numbered quantization coefficients in the at least one area may be calculated, and the result will be obtained.
  • the sum value is used as the statistical result; or the sum of the numerical values of the quantization coefficients whose values are even or non-zero even in the at least one area can be calculated, and the obtained sum is used as the statistical result; or the at least one area can be calculated as an even number. Or the sum of the absolute values of non-zero even quantization coefficients, and the obtained sum will be used as the statistical result.
  • the values of the quantization coefficients in the at least one area may be linearly mapped, and then the quantization coefficients in the at least one area may be calculated.
  • the sum of the numerical or absolute values of the quantized coefficients after linear mapping, and the obtained sum is used as the statistical result.
  • the above-mentioned linear mapping may be to convert the numerical value of the quantization coefficient whose value is an odd number in the at least one region into a first value, and convert the numerical value of the quantization coefficient whose value is an even number into a second value, Wherein, one of the first value and the second value is an odd number, and the other is an even number.
  • the value of the quantization coefficient with odd value in the at least one area is converted to 1, and the value of the quantization coefficient with even value is converted to 0; or the value of the quantization coefficient with odd value in the at least one area is converted into 0, convert the value of the quantization coefficient whose value is an even number to 1; or convert the value of the quantization coefficient whose value is an odd number in the at least one area to 3, and convert the value of the quantization coefficient whose value is an even number to 2; or In at least one area, the value of the odd-numbered quantization coefficient is converted to 2, and the value of the even-numbered quantization coefficient is converted to 3.
  • the above-mentioned linear mapping may be to convert the value of the non-zero quantization coefficient in the at least one region into a third value, and convert the value of the quantization coefficient with a value of zero into a fourth value, wherein , one of the third value and the fourth value is odd and the other is even.
  • the value of the non-zero quantization coefficient in the at least one area is converted to 1, and the value of the quantization coefficient of zero value is converted to 0; or the value of the non-zero quantization coefficient in the at least one area is converted to 0, Convert the value of the quantization coefficient with a value of zero to 1; or convert the value of the non-zero quantization coefficient in the at least one area to 3, and convert the value of the quantization coefficient with a value of zero to 2; or the at least one area.
  • the value of non-zero quantization coefficients within is converted to 2, and the value of quantization coefficients with a value of zero is converted to 3.
  • the above-mentioned linear mapping may reduce the value of the quantization coefficient in the at least one region by a fifth value.
  • the values of the quantization coefficients in the at least one area are reduced by 1, or reduced by 2.
  • the above-mentioned linear mapping may be to increase the value of the quantization coefficient in the at least one region by a value.
  • the values of the quantization coefficients in the at least one area are increased by 1, or increased by 2.
  • the above-mentioned linear mapping may be a sixth non-zero value multiplied by the value of the quantization coefficient in the at least one region.
  • the values of the quantization coefficients in the at least one region are all multiplied by 1, or multiplied by 2.
  • the sixth numerical value may be a non-zero even number, such as 2, 4, 6, and so on.
  • the above-mentioned linear mapping may be the value of the quantization coefficients in the at least one region divided by a non-zero value.
  • the values of the quantization coefficients in the at least one area are divided by 1, or divided by 2.
  • the value can be a non-zero even number, such as 2, 4, 6, and so on.
  • the above-mentioned linear mapping may be to convert the numerical value of the quantization coefficient in the at least one region into an opposite number.
  • the set value may be any non- A number of zero, such as 2, 3, 4, etc.
  • the embodiments of the present application may perform statistics on the quantized coefficients in at least one region in the quantized coefficient block in the following manner:
  • the above-mentioned at least one area may be all areas in the quantized coefficient block.
  • the above-mentioned at least one area may be at least one designated position in the quantized coefficient block.
  • the above-mentioned at least one area may be at least one row specified in the quantized coefficient block.
  • the quantized coefficient block is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • one row of the gray area can be used as the at least one area;
  • two lines of the gray area may be used as the at least one area.
  • the at least one row may be an upper row in the block of quantized coefficients.
  • the above-mentioned at least one area may be at least one column specified in the quantized coefficient block.
  • the quantized coefficient block is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • one column of the gray area can be used as the at least one area;
  • the two columns in the gray area may be used as the at least one area.
  • the at least one column may be the left column in the block of quantized coefficients.
  • the above-mentioned at least one area may be at least one designated row and at least one designated column in the quantized coefficient block.
  • the quantized coefficient block is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • the lower row and the right column that is, The gray area in it
  • the lower 2 rows and the right 2 columns can be used as the at least one area; or as shown in the figure
  • the upper row and the left column that is, the gray area
  • the upper 2 rows and the left 2 columns that is, the gray area in it
  • the above-mentioned at least one area may be a position on at least one oblique line in the quantized coefficient block.
  • the quantized coefficient block is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • the position on a diagonal line can be used as The at least one area; or as shown in (c) and (d) in 9, the position on the two oblique lines is used as the at least one area.
  • the above-mentioned at least one region may be an SRCC region in the quantized coefficient block.
  • the SRCC area is the scanning area marked by the SRCC technology.
  • the above-mentioned at least one area may be one location or multiple locations specified in the SRCC area.
  • the at least one position specified in the SRCC area may include: the first N positions in the scanning order or the N positions in the middle in the scanning order, of course, the last N positions in the scanning order, etc., N is a non-zero natural number.
  • the above-mentioned at least one area may be at least one row specified in the SRCC area.
  • the SRCC area is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • one row of the gray area can be used as the at least one area; or
  • two lines of the gray area may be used as the at least one area.
  • the at least one row may be an upper row in the block of quantized coefficients.
  • the above-mentioned at least one area may be at least one column designated in the SRCC area.
  • the SRCC area is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • one column of the gray area can be used as the at least one area; or
  • the two columns of the gray area can be used as the at least one area.
  • the at least one column may be the left column in the block of quantized coefficients.
  • the above-mentioned at least one area may be at least one designated row and at least one designated column in the SRCC area.
  • the SRCC area is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient
  • the lower row and the right column that is, where the gray area
  • the lower 2 rows and the right 2 columns that is, the gray area
  • the upper row and the left column that is, the gray area
  • the upper two rows and the left 2 columns that is, the gray area therein
  • the above-mentioned at least one area may be a position on at least one oblique line in the SRCC area.
  • the SRCC area is a 4 ⁇ 4 coefficient block, and each block represents a quantized coefficient, then as shown in (a) and (b) in 9, the position on a diagonal line can be used as the at least one area; or as shown in (c) and (d) in 9, the position on the two oblique lines is taken as the at least one area.
  • At least one area division manner in the foregoing embodiments may also be combined, so that the combined area is used as the at least one area.
  • step S630 according to the correspondence between the remainder and the transformation matrix combination, the corresponding transformation matrix combination is selected.
  • the set value is set to 3, then the three remainders (ie, 0, 1, and 2) may correspond to different transformation matrix combinations; If the value is set to 5, then the 5 remainders (ie, 0, 1, 2, 3, and 4) can correspond to different transformation matrix combinations, respectively. It can be seen that the technical solutions of the embodiments of the present application can realize the index indication of various transformation matrix combinations, and can effectively reduce the bits occupied by the transformation matrix indexes.
  • the transformation matrix combinations corresponding to different remainders may be different.
  • the transformation matrix combinations corresponding to different remainders may also be the same, but the possible remainder results cannot all correspond to the same transformation matrix combination.
  • the transformation kernel used in the transformation matrix combination may be any combination of known discrete transformation kernels (8 kinds of DCT transformation kernels and 8 kinds of DST transformation kernels).
  • the optional transform kernels may include: DCT2, DCT5, DCT8, DST1, DST7, and so on.
  • the corresponding relationship between the remainder and the combination of transformation matrices may be preset based on the combination of the DCT transform kernel and/or the DST transform kernel according to the value of the remainder.
  • the transformation matrix combination includes any combination of at least one of DCT2, DCT5, DCT8, DST1, and DST7.
  • the following correspondence can be set: when the remainder is 0, the corresponding transformation matrix combination can be (DCT2, DCT2); when the remainder is 1, the corresponding transformation matrix combination Can be (DCT8, DCT8); when the remainder is 2, the corresponding transformation matrix combination can be (DCT8, DST7); when the remainder is 3, the corresponding transformation matrix combination can be (DST7, DCT8); when the remainder is 4, the corresponding transformation matrix The combination can be (DST7, DST7).
  • a certain value of the remainder may also be used to indicate that the coding block skips the transformation process.
  • the value of a certain remainder indicates that the corresponding transform matrix combination is a null value (or TS), and the null value (or TS) indicates that the coding block skips the transform process.
  • the corresponding transformation matrix combination when the remainder is 0 can be (DCT2, DCT2); when the remainder is 1, the corresponding transformation matrix combination can be (DCT8, DCT8); when the remainder is 2
  • the corresponding transformation matrix combination can be (DCT8, DST7); when the remainder is 3, the corresponding transformation matrix combination can be (DST7, DCT8); when the remainder is 4, the corresponding transformation matrix combination can be null (or TS), with Indicates that the encoding block skips the transform process, in which case, the quantized coefficient block can be directly inverse quantized, and then the result of the inverse quantization process is used as the reconstructed residual data, without the need for the result of the inverse quantization process. Perform inverse transformation.
  • the remainder indicating that the coding block skips the transformation process may be set according to actual needs, and 4 in this embodiment is only an example.
  • step S640 inverse transform processing is performed on the inverse quantization result of the quantized coefficient block based on the selected transform matrix combination.
  • inverse transform processing is performed on the inverse quantization result of the quantized coefficient block based on the selected transform matrix combination.
  • step S650 the video image frame is reconstructed according to the processing result of the inverse transformation processing.
  • FIG. 10 shows a block diagram of a video decoding apparatus according to an embodiment of the present application.
  • the video decoding apparatus may be set in a device with a computing processing function, such as a terminal device or a server.
  • a video decoding apparatus 1000 includes: a decoding unit 1002 , a statistics unit 1004 , a selection unit 1006 , a processing unit 1008 and a reconstruction unit 1010 .
  • the decoding unit 1002 is configured to perform entropy decoding processing on the coding block of the video image frame to obtain a quantized coefficient block of residual data corresponding to the coding block;
  • the statistics unit 1004 is configured to perform entropy decoding on at least one region in the quantized coefficient block. The quantization coefficients within are counted to obtain statistical results, and the remainder of the statistical results for the set value is calculated;
  • the selection unit 1006 is configured to select the corresponding transformation matrix combination according to the correspondence between the remainder and the transformation matrix combination;
  • the processing unit 1008 is configured In order to perform inverse transformation processing on the inverse quantization result of the quantized coefficient block based on the selected transformation matrix combination, the reconstruction unit 1010 is configured to reconstruct the video image frame according to the processing result of the inverse transformation processing.
  • the video decoding apparatus 1000 further includes: a first determining unit, configured to determine, according to the value of the specified index identifier, the need for a target coding block in the corresponding coded data
  • the corresponding transformation matrix combination is selected according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding, wherein the specified index identifier is included in at least one of the following: the sequence header of the first encoded data corresponding to the video image frame sequence, the video The image header of the second coded data corresponding to the image frame, the slice header of the third coded data corresponding to a slice of the video image frame, and the fourth coded data corresponding to an LCU (Largest Coding Unit, maximum coding unit) of the video image frame the LCU header.
  • LCU Large Coding Unit, maximum coding unit
  • the first determination unit is configured to, when the value of the specified index identifier is the first value, determine that the target encoding block in the corresponding encoded data needs to be obtained by entropy decoding
  • the quantization coefficients in the quantization coefficient block select the corresponding transformation matrix combination.
  • the first determination unit is configured to be the first value when the specified index identifier included in the sequence header of the first encoded data corresponding to a sequence of video image frames is the first value , determining the target coding block in the first coded data requires selecting a corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the first determination unit is configured to, when the value of the specified index identifier included in the image header of the second encoded data corresponding to one video image frame is the first value, Determining the target encoding block in the second encoded data requires selecting a corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the first determination unit is configured to set the value of the specified index identifier included in the slice header information of the third encoded data corresponding to one slice of the video image frame to the value of
  • determining the target coding block in the third coded data requires selecting a corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the first determining unit is configured to set the value of the specified index identifier included in the LCU header information of the fourth encoded data corresponding to one LCU of the video image frame to the first When the value is determined, it is necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding to determine the target coding block in the fourth coded data.
  • the video decoding apparatus 1000 further includes: a second determining unit, configured to determine the Whether the encoding block needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the video decoding apparatus 1000 further includes: a third determining unit, configured as an index identifier included in a sequence header of the first encoded data corresponding to a sequence of video image frames When it is the first value, it is not necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding to determine the target coding block in the first coded data.
  • a third determining unit configured as an index identifier included in a sequence header of the first encoded data corresponding to a sequence of video image frames
  • the third determining unit is further configured to indicate that the index included in the sequence header of the first encoded data is a second value, and the video image frame sequence is When the index identifier contained in the image header of the second encoded data corresponding to one video image frame is the first value, it is determined that the target encoding block in the second encoded data does not need to be decoded according to entropy.
  • the quantization coefficient selects the corresponding transformation matrix combination.
  • the third determining unit is further configured to include an index identifier included in the sequence header of the first encoded data as a second value, and the second encoded data
  • the index identifier included in the image header is the second value, it is determined that the target coding block in the second coded data needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding.
  • the third determining unit is further configured to include an index in the sequence header of the first encoded data identified as a second value, an image of the second encoded data
  • the index identifier included in the header is the second value
  • the index identifier included in the slice header information of the third encoded data corresponding to one slice of the video image frame is the first value
  • the third determining unit is further configured to include an index in the sequence header of the first encoded data identified as a second value, an image of the second encoded data
  • the index identifier included in the header is the second value
  • the index identifier included in the slice header information of the third encoded data is the second value
  • the third determining unit is further configured to include an index in the sequence header of the first encoded data identified as a second value, an image of the second encoded data
  • the index identifier included in the header is the second value
  • the index identifier included in the slice header information of the third encoded data is the second value
  • the LCU header of the fourth encoded data corresponding to one LCU of the slice is identified as the second value.
  • the third determining unit is further configured to include an index in the sequence header of the first encoded data identified as a second value, an image of the second encoded data
  • the index identifier included in the header is the second value
  • the index identifier included in the slice header information of the third encoded data is the second value
  • the index identifier included in the LCU header information of the fourth encoded data is the first value.
  • the third determining unit is further configured to include an index in the sequence header of the first encoded data identified as a second value, an image of the second encoded data
  • the index identifier included in the header is the second value
  • the index identifier included in the slice header information of the third encoded data is the second value
  • the index identifier included in the LCU header information of the fourth encoded data is the second value.
  • the target coding block is any one of the following: coding blocks in intra-frame coding mode, coding blocks in inter-frame coding mode, and all coding blocks in corresponding coded data.
  • the video decoding apparatus further includes: a fourth determination unit, configured to be configured according to a first indication included in a sequence header of the first encoded data corresponding to a sequence of video image frames
  • the value of the bit and the value of the second indicator bit respectively determine whether the encoding block using the intra-frame encoding mode and the encoding block using the inter-frame encoding mode in the first encoded data need quantization in the quantization coefficient block obtained by entropy decoding.
  • the coefficient selects the corresponding transformation matrix combination; or
  • the encoding block in the second encoded data using the intra-frame encoding mode and the encoding block using the intra-frame encoding mode respectively. Whether the coding block in the inter-coding mode needs to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding; or
  • the third encoded data adopts the intra-frame encoding mode Whether it is necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding and the coding block using the inter-coding mode; or
  • the encoding using the intra-frame encoding mode in the fourth encoded data is determined respectively. Whether it is necessary to select the corresponding transformation matrix combination according to the quantization coefficients in the quantization coefficient block obtained by entropy decoding for the block and the coding block using the inter coding mode.
  • the statistics unit 1004 is configured to: calculate the sum of the numerical values of the quantization coefficients in the at least one region, and use the obtained sum as the statistical result; or
  • the statistics unit 1004 is configured to: perform linear mapping on the numerical values of the quantized coefficients in the at least one area, and calculate the value of the quantized coefficients in the at least one area after the linear mapping.
  • the sum of numerical values or absolute values, and the obtained sum value is used as the statistical result, wherein the linear mapping includes:
  • the at least one region includes at least one of the following regions:
  • the SRCC area is located on at least one diagonal line.
  • the at least one position specified in the SRCC area includes: the first N positions in the scanning order, the middle N positions in the scanning order, or the last N positions in the scanning order position, N is a non-zero natural number.
  • the remainder whose value is a specified value is used to indicate that the coding block skips the transform processing process.
  • the corresponding relationship between the remainder and the combination of transformation matrices is based on the value of the remainder, based on the discrete cosine transform DCT transform kernel and/or the discrete sine transform DST transform kernel The combination between is preset.
  • the transformation matrix combination includes a combination of any at least one of DCT2, DCT5, DCT8, DST1, and DST7.
  • FIG. 11 shows a schematic structural diagram of a computer system suitable for implementing the electronic device according to the embodiment of the present application.
  • the computer system 1100 includes a central processing unit (Central Processing Unit, CPU) 1101, which can be loaded into a random device according to a program stored in a read-only memory (Read-Only Memory, ROM) 1102 or from a storage part 1108
  • the programs in the memory (Random Access Memory, RAM) 1103 are accessed to perform various appropriate actions and processes, for example, the methods described in the above embodiments are performed.
  • RAM Random Access Memory
  • various programs and data required for system operation are also stored.
  • the CPU 1101, the ROM 1102, and the RAM 1103 are connected to each other through a bus 1104.
  • An Input/Output (I/O) interface 1105 is also connected to the bus 1104 .
  • the following components are connected to the I/O interface 1105: an input section 1106 including a keyboard, a mouse, etc.; an output section 1107 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and a speaker, etc. ; a storage part 1108 including a hard disk and the like; and a communication part 1109 including a network interface card such as a LAN (Local Area Network) card, a modem, and the like.
  • the communication section 1109 performs communication processing via a network such as the Internet.
  • Drivers 1110 are also connected to I/O interface 1105 as needed.
  • Removable media 1111 such as magnetic disks, optical disks, magneto-optical disks, semiconductor memories, etc., are mounted on the drive 1110 as needed so that computer-readable instructions read therefrom are mounted into the storage section 1108 as needed.
  • embodiments of the present application include a computer program product comprising computer readable instructions carried on a computer readable medium, the computer readable instructions comprising computer readable instructions for performing the method illustrated in the flowchart.
  • the computer readable instructions may be downloaded and installed from the network via the communication portion 1109 and/or installed from the removable media 1111.
  • CPU central processing unit
  • the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
  • Computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Erasable Programmable Read Only Memory (EPROM), flash memory, optical fiber, portable Compact Disc Read-Only Memory (CD-ROM), optical storage device, magnetic storage device, or any suitable of the above The combination.
  • a computer-readable storage medium can be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable computer-readable instructions thereon.
  • Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • Computer readable instructions embodied on a computer readable medium may be transmitted using any suitable medium, including but not limited to wireless, wired, etc., or any suitable combination of the foregoing.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of code, and the above-mentioned module, program segment, or part of code contains one or more executables for realizing the specified logical function instruction.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • the units involved in the embodiments of the present application may be implemented in software or hardware, and the described units may also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.
  • the present application also provides a computer-readable medium.
  • the computer-readable medium may be included in the electronic device described in the above embodiments; it may also exist alone without being assembled into the electronic device. middle.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by an electronic device, enables the electronic device to implement the methods described in the above-mentioned embodiments.
  • the exemplary embodiments described herein may be implemented by software, or may be implemented by software combined with necessary hardware. Therefore, the technical solutions according to the embodiments of the present application may be embodied in the form of software products, and the software products may be stored in a non-volatile storage medium (which may be CD-ROM, U disk, mobile hard disk, etc.) or on the network , which includes several instructions to cause a computing device (which may be a personal computer, a server, a touch terminal, or a network device, etc.) to execute the method according to the embodiments of the present application.
  • a computing device which may be a personal computer, a server, a touch terminal, or a network device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

本申请的实施例提供了一种视频解码方法、装置、计算机可读介质及电子设备。该视频解码方法包括:对视频图像帧的编码块进行熵解码处理,获得所述编码块对应的残差数据的量化系数块;对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算所述统计结果针对设定值的余数;根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合;基于选择的变换矩阵组合对所述量化系数块的反量化结果进行反变换处理,根据所述反变换处理的处理结果,重建所述视频图像帧。

Description

视频解码方法、装置、计算机可读介质及电子设备
本申请要求于2020年09月27日提交中国专利局,申请号为2020110339900,申请名称为“视频解码方法、装置、计算机可读介质及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及计算机及通信技术领域,具体而言,涉及一种视频解码方法、装置、计算机可读介质及电子设备。
背景技术
在视频编码过程中,编码端通常需要对原始视频数据与预测视频数据之间的残差数据进行变换、量化及熵编码处理之后发送给解码端。由于残差数据的多样性,单一的DCT(Discrete Cosine Transform,离散余弦变换)变换核无法适应所有的残差特性,因此对于一个残差块可能需要选择多个DCT变换核或DST(Discrete Sine Transform,离散正弦变换)变换核作为变换矩阵组合,在这种情况下,虽然提升了变换矩阵组合对残差块的适应性,但是由于需要针对每个编码单元(Coding Unit,简称CU)来编码变换核的索引,因此导致编码效率较低。
发明内容
本申请的实施例提供了一种视频解码方法、装置、计算机可读介质及电子设备。
一种视频解码方法,由电子设备执行,包括:对视频图像帧的编码块进行熵解码处理,获得所述编码块对应的残差数据的量化系数块;对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算所述统计结果针对设定值的余数;根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合;基于选择的变换矩阵组合对所述量化系数块的反量化结果进行反变换处理;根据所述反变换处理的处理结果,重建所述视频图像帧。
一种视频解码装置,包括:解码单元,配置为对视频图像帧的编码块进行熵解码处理,获得所述编码块对应的残差数据的量化系数块;统计单元,配置为对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算所述统计结果针对设定值的余数;选择单元,配置为根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合;处理单元,配置为基于选择的变换矩阵组合对所述量化系数块的反量化结果进行反变换处理;重建单元,配置为根据所述反变换处理的处理结果,重建所述视频图像帧。
一种电子设备,包括存储器和一个或多个处理器,所述存储器存储有计算机可读指令,所述计算机可读指令被所述一个或多个处理器执行时,使得所述一个或多个处理器执行上述视频解码方法的步骤。
一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,其上存储有计算 机可读指令,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行上述视频解码方法的步骤。
一种计算机程序产品或计算机程序,所述计算机程序产品或计算机程序包括计算机可读指令,所述计算机可读指令存储在计算机可读存储介质中,计算机设备的处理器从所述计算机可读存储介质读取所述计算机可读指令,所述处理器执行所述计算机可读指令,使得所述计算机设备执行上述视频解码方法的步骤。
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征、目的和优点将从说明书、附图以及权利要求书变得明显。
附图说明
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本申请的实施例,并与说明书一起用于解释本申请的原理。显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。在附图中:
图1示出了可以应用本申请实施例的技术方案的示例性系统架构的示意图;
图2示出视频编码装置和视频解码装置在流式传输系统中的放置方式示意图;
图3示出了一个视频编码器的基本流程图;
图4示出了通过SRCC技术标记出的扫描区域;
图5示出了对标记出的扫描区域进行扫描的顺序示意图;
图6示出了根据本申请的一个实施例的视频解码方法的流程图;
图7示出了根据本申请的一个实施例的指定区域的划分方式示意图;
图8示出了根据本申请的一个实施例的指定区域的划分方式示意图;
图9示出了根据本申请的一个实施例的指定区域的划分方式示意图;
图10示出了根据本申请的一个实施例的视频解码装置的框图;
图11示出了适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。
具体实施方式
现在将参考附图更全面地描述示例实施方式。然而,示例实施方式能够以多种形式实施,且不应被理解为限于在此阐述的范例;相反,提供这些实施方式使得本申请将更加全面和完整,并将示例实施方式的构思全面地传达给本领域的技术人员。
此外,所描述的特征、结构或特性可以以任何合适的方式结合在一个或更多实施例中。在下面的描述中,提供许多具体细节从而给出对本申请的实施例的充分理解。然而,本领域技术人员将意识到,可以实践本申请的技术方案而没有特定细节中的一个或更多,或者可以采用其它的方法、组元、装置、步骤等。在其它情况下,不详细示出或描述公知方法、装置、实现或者操作以避免模糊本申请的各方面。
附图中所示的方框图仅仅是功能实体,不一定必须与物理上独立的实体相对应。即,可以采用软件形式来实现这些功能实体,或在一个或多个硬件模块或集成电路中实现这些功能实体,或在不同网络和/或处理器装置和/或微控制器装置中实现这些功能实体。
附图中所示的流程图仅是示例性说明,不是必须包括所有的内容和操作/步骤,也不是 必须按所描述的顺序执行。例如,有的操作/步骤还可以分解,而有的操作/步骤可以合并或部分合并,因此实际执行的顺序有可能根据实际情况改变。
需要说明的是:在本文中提及的“多个”是指两个或两个以上。“和/或”描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。
图1示出了可以应用本申请实施例的技术方案的示例性系统架构的示意图。
如图1所示,系统架构100包括多个终端装置,所述终端装置可通过例如网络150彼此通信。举例来说,系统架构100可以包括通过网络150互连的第一终端装置110和第二终端装置120。在图1的实施例中,第一终端装置110和第二终端装置120执行单向数据传输。
举例来说,第一终端装置110可对视频数据(例如由终端装置110采集的视频图片流)进行编码以通过网络150传输到第二终端装置120,已编码的视频数据以一个或多个已编码视频码流形式传输,第二终端装置120可从网络150接收已编码视频数据,对已编码视频数据进行解码以恢复视频数据,并根据恢复的视频数据显示视频图片。
在本申请的一个实施例中,系统架构100可以包括执行已编码视频数据的双向传输的第三终端装置130和第四终端装置140,所述双向传输比如可以发生在视频会议期间。对于双向数据传输,第三终端装置130和第四终端装置140中的每个终端装置可对视频数据(例如由终端装置采集的视频图片流)进行编码,以通过网络150传输到第三终端装置130和第四终端装置140中的另一终端装置。第三终端装置130和第四终端装置140中的每个终端装置还可接收由第三终端装置130和第四终端装置140中的另一终端装置传输的已编码视频数据,且可对已编码视频数据进行解码以恢复视频数据,并可根据恢复的视频数据在可访问的显示装置上显示视频图片。
在图1的实施例中,第一终端装置110、第二终端装置120、第三终端装置130和第四终端装置140可为服务器、个人计算机和智能电话,但本申请公开的原理可不限于此。本申请公开的实施例适用于膝上型计算机、平板电脑、媒体播放器和/或专用视频会议设备。网络150表示在第一终端装置110、第二终端装置120、第三终端装置130和第四终端装置140之间传送已编码视频数据的任何数目的网络,包括例如有线和/或无线通信网络。通信网络150可在电路交换和/或分组交换信道中交换数据。该网络可包括电信网络、局域网、广域网和/或互联网。出于本申请的目的,除非在下文中有所解释,否则网络150的架构和拓扑对于本申请公开的操作来说可能是无关紧要的。
在本申请的一个实施例中,图2示出视频编码装置和视频解码装置在流式传输环境中的放置方式。本申请所公开主题可同等地适用于其它支持视频的应用,包括例如视频会议、数字TV(television,电视机)、在包括CD、DVD、存储棒等的数字介质上存储压缩视频等等。
流式传输系统可包括采集子系统213,采集子系统213可包括数码相机等视频源201,视频源创建未压缩的视频图片流202。在实施例中,视频图片流202包括由数码相机拍摄的样本。相较于已编码的视频数据204(或已编码的视频码流204),视频图片流202被描绘为粗线以强调高数据量的视频图片流,视频图片流202可由电子装置220处理,电子装 置220包括耦接到视频源201的视频编码装置203。视频编码装置203可包括硬件、软件或软硬件组合以实现或实施如下文更详细地描述的所公开主题的各方面。相较于视频图片流202,已编码的视频数据204(或已编码的视频码流204)被描绘为细线以强调较低数据量的已编码的视频数据204(或已编码的视频码流204),其可存储在流式传输服务器205上以供将来使用。一个或多个流式传输客户端子系统,例如图2中的客户端子系统206和客户端子系统208,可访问流式传输服务器205以检索已编码的视频数据204的副本207和副本209。客户端子系统206可包括例如电子装置230中的视频解码装置210。视频解码装置210对已编码的视频数据的传入副本207进行解码,且产生可在显示器212(例如显示屏)或另一呈现装置上呈现的输出视频图片流211。在一些流式传输系统中,可根据某些视频编码/压缩标准对已编码的视频数据204、视频数据207和视频数据209(例如视频码流)进行编码。该些标准的实施例包括ITU-T H.265。在实施例中,正在开发的视频编码标准非正式地称为下一代视频编码(Versatile Video Coding,VVC),本申请可用于VVC标准的上下文中。
应注意,电子装置220和电子装置230可包括图中未示出的其它组件。举例来说,电子装置220可包括视频解码装置,且电子装置230还可包括视频编码装置。
在本申请的一个实施例中,以国际视频编码标准HEVC(High Efficiency Video Coding,高效率视频编码)、VVC(Versatile Video Coding,多功能视频编码),以及中国国家视频编码标准AVS(Audio Video coding Standard,信源编码标准)为例,当输入一个视频帧图像之后,会根据一个块大小,将视频帧图像划分成若干个不重叠的处理单元,每个处理单元将进行类似的压缩操作。这个处理单元被称作CTU(Coding Tree Unit,编码树单元),或者称之为LCU(Largest Coding Unit,最大编码单元)。CTU再往下可以继续进行更加精细的划分,得到一个或多个基本的编码单元CU,CU是一个编码环节中最基本的元素。以下介绍对CU进行编码时的一些概念:
预测编码(Predictive Coding):预测编码包括了帧内预测和帧间预测等方式,原始视频信号经过选定的已重建视频信号的预测后,得到残差视频信号。编码端需要为当前CU决定选择哪一种预测编码模式,并告知解码端。其中,帧内预测是指预测的信号来自于同一图像内已经编码重建过的区域;帧间预测是指预测的信号来自于已经编码过的、不同于当前图像的其它图像(称之为参考图像)。
变换及量化(Transform&Quantization):残差视频信号经过DFT(Discrete Fourier Transform,离散傅里叶变换)、DCT等变换操作后,将信号转换到变换域中,称之为变换系数。变换系数进一步进行有损的量化操作,丢失掉一定的信息,使得量化后的信号有利于压缩表达。在一些视频编码标准中,可能有多于一种变换方式可以选择,因此编码端也需要为当前CU选择其中的一种变换方式,并告知解码端。量化的精细程度通常由量化参数(Quantization Parameter,简称QP)来决定,QP取值较大,表示更大取值范围的系数将被量化为同一个输出,因此通常会带来更大的失真及较低的码率;相反,QP取值较小,表示较小取值范围的系数将被量化为同一个输出,因此通常会带来较小的失真,同时对应较高的码率。
熵编码(Entropy Coding)或统计编码:量化后的变换域信号将根据各个值出现的频率进行统计压缩编码,最后输出二值化(0或者1)的压缩码流。同时,编码产生其他信息, 例如选择的编码模式、运动矢量数据等,也需要进行熵编码以降低码率。统计编码是一种无损的编码方式,可以有效的降低表达同样信号所需要的码率,常见的统计编码方式有变长编码(Variable Length Coding,简称VLC)或者基于上下文的二值化算术编码(Content Adaptive Binary Arithmetic Coding,简称CABAC)。
环路滤波(Loop Filtering):经过变化及量化的信号会通过反量化、反变换及预测补偿的操作获得重建图像。重建图像与原始图像相比由于存在量化的影响,部分信息与原始图像有所不同,即重建图像会产生失真(Distortion)。因此,可以对重建图像进行滤波操作,例如去块效应滤波(Deblocking filter,简称DB)、SAO(Sample Adaptive Offset,自适应像素补偿)或者ALF(Adaptive Loop Filter,自适应环路滤波)等滤波器,可以有效降低量化所产生的失真程度。由于这些经过滤波后的重建图像将作为后续编码图像的参考来对将来的图像信号进行预测,因此上述的滤波操作也被称为环路滤波,即在编码环路内的滤波操作。
在本申请的一个实施例中,图3示出了一个视频编码器的基本流程图,在该流程中以帧内预测为例进行说明。其中,原始图像信号s k[x,y]与预测图像信号
Figure PCTCN2021116451-appb-000001
做差值运算,得到残差信号u k[x,y],残差信号u k[x,y]经过变换及量化处理之后得到量化系数,量化系数一方面通过熵编码得到编码后的比特流,另一方面通过反量化及反变换处理得到重构残差信号u' k[x,y],预测图像信号
Figure PCTCN2021116451-appb-000002
与重构残差信号u' k[x,y]叠加生成图像信号
Figure PCTCN2021116451-appb-000003
图像信号
Figure PCTCN2021116451-appb-000004
一方面输入至帧内模式决策模块和帧内预测模块进行帧内预测处理,另一方面通过环路滤波输出重建图像信号s' k[x,y],重建图像信号s' k[x,y]可以作为下一帧的参考图像进行运动估计及运动补偿预测。然后基于运动补偿预测的结果s' r[x+m x,y+m y]和帧内预测结果
Figure PCTCN2021116451-appb-000005
得到下一帧的预测图像信号,并继续重复上述过程,直至编码完成。
此外,由于残差信号在经过变换和量化处理后的量化系数块中非零系数较大概率会集中在块的左边和上方区域,而块的右边和下方区域往往为0,因此引入了SRCC(Scan Region Coefficient Coding,扫描区域系数编码)技术中,通过SRCC技术可以标记出每个量化系数块(尺寸为W×H)中包含的非零系数的左上区域的大小SRx×SRy,其中SRx是量化系数块中最右面的非零系数的横坐标,SRy是量化系数块中最下面的非零系数的纵坐标,且1≤SRx≤W,1≤SRy≤H,而该区域外的系数均为0。SRCC技术利用(SRx,SRy)来确定一个量化系数块中需要扫描的量化系数区域,如图4所示,只有(SRx,SRy)标记的扫描区域内的量化系数需要编码,编码的扫描顺序如图5所示,可以是从右下角到左上角的反向Z字型扫描。
基于上述的编码过程,在解码端针对每一个CU,在获取到压缩码流(即比特流)之后,进行熵解码获得各种模式信息及量化系数。然后量化系数经过反量化及反变换处理得到残差信号。另一方面,根据已知的编码模式信息,可获得该CU对应的预测信号,然后 将残差信号与预测信号相加之后即可得到重建信号,重建信号再经过环路滤波等操作,产生最终的输出信号。
在上述的编解码过程中,对残差信号的变换处理使得残差信号的能量集中在较少的低频系数,也就是多数系数值较小。然后经过后续的量化模块后,较小系数值将变为零值,极大降低了编码残差信号的代价。但是,由于残差分布的多样性,单一的DCT变换无法适应所有的残差特性,因此,DST7和DCT8这样的变换核被引入到变换处理过程中,并且对残差信号进行的水平变换和竖直变换可以采用不同的变换核。以AMT(Adaptive multiple core transform,自适应多核变换)技术为例,对于一个残差信号进行变换处理可能选择的变换组合如下所示:(DCT2,DCT2)、(DCT8,DCT8)、(DCT8,DST7)、(DST7,DCT8)和(DST7,DST7)。
对于残差信号具体选择哪种变换组合,需要在编码端使用RDO(Rate–Distortion Optimization,率失真优化)进行决策,虽然采用多个变换核提升了变换矩阵组合对残差块的适应性,但是由于需要针对每个编码单元来编码变换核的索引,因此导致编码效率较低。
针对上述问题,本申请的实施例提出了通过量化系数块中的量化系数来隐含指示编码块对应的变换矩阵组合,进而可以降低变换矩阵索引所占用的比特位,有效提高了视频编码效率。具体地,通过对量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,然后计算该统计结果针对设定值的余数,以根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合,并基于选择的变换矩阵组合对量化系数块的反量化结果进行反变换处理,根据反变换处理的处理结果重建视频帧,使得能够通过量化系数块中的量化系数来隐含指示编码块对应的变换矩阵组合,无需编码端针对每个编码单元都编码变换矩阵的索引,降低了变换矩阵索引所占用的比特位,进而可以有效提高视频编码效率。
以下对本申请实施例的技术方案的实现细节进行详细阐述:
图6示出了根据本申请的一个实施例的视频解码方法的流程图,该视频解码方法可以由具有计算处理功能的设备来执行,比如可以由电子设备来执行,该电子设备为终端设备或服务器。参照图6所示,该视频解码方法至少包括步骤S610至步骤S640,详细介绍如下:
在步骤S610中,对视频图像帧的编码块进行熵解码处理,获得编码块对应的残差数据的量化系数块。
在本申请的一个实施例中,视频图像帧序列包括了一系列图像,每张图像可以被进一步划分为条带(Slice),条带又可以划分为一系列的LCU(或CTU),LCU包含有若干CU。视频图像帧在编码时是以块为单位进行编码处理,在一些新的视频编码标准中,比如在H.264标准中有宏块(macroblock,MB),宏块可进一步划分成多个可用于预测编码的预测块(prediction)。在HEVC标准中,采用编码单元CU、预测单元(prediction unit,PU)和变换单元(transform unit,TU)等基本概念,从功能上划分了多种块单元,并采用全新的基于树的结构进行描述。比如CU可以按照四叉树划分为更小的CU,而更小的CU还可以继续划分,从而形成一种四叉树结构。本申请实施例中的编码块可以是CU,或者是比CU更小的块,如对CU进行划分得到的更小的块。
在步骤S620中,对量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算该统计结果针对设定值的余数。
在本申请的一个实施例中,可以根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,其中,指定的索引标识可以包括在以下数据中的至少一种:视频图像帧序列对应的第一编码数据的序列头、视频图像帧对应的第二编码数据的图像头、视频图像帧的一个条带对应的第三编码数据的条带头、视频图像帧的一个最大编码单元LCU对应的第四编码数据的LCU头。详细说明如下:
在本申请的一个实施例中,可以通过一个视频图像帧序列对应的第一编码数据的序列头部中包含的指定索引标识的值来指示第一编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,比如第一编码数据的序列头部中的指定索引标识的值为1,则表示第一编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。该实施例的技术方案使得通过视频图像帧序列对应的第一编码数据的序列头部中的一个索引标识实现了对整个视频图像帧序列对应的所有目标编码块的指示,进而可以有效降低索引标识所占用的比特位,提高了视频编码效率。需要说明的是:第一编码数据中的目标编码块可以是第一编码数据中的所有编码块,或者可以是第一编码数据中采用帧内编码模式的编码块,或者也可以是第一编码数据中采用帧间编码模式的编码块。
在本申请的一个实施例中,可以通过一个视频图像帧对应的第二编码数据的图像头部中包含的指定索引标识的值来指示第二编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,比如第二编码数据的图像头部中的指定索引标识的值为1,则表示第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。该实施例的技术方案使得通过视频图像帧对应的第二编码数据的图像头部中的一个索引标识实现了对整个视频图像帧对应的所有目标编码块的指示,同样可以降低索引标识所占用的比特位,提高了视频编码效率。需要说明的是:第二编码数据中的目标编码块可以是第二编码数据中的所有编码块,或者可以是第二编码数据中采用帧内编码模式的编码块,或者也可以是第二编码数据中采用帧间编码模式的编码块。
在本申请的一个实施例中,可以通过视频图像帧的一个条带对应的第三编码数据的条带头部中包含的指定索引标识的值来指示第三编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,比如第三编码数据的条带头部中的指定索引标识的值为1,则表示第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。该实施例的技术方案使得通过视频图像帧的一个条带对应的第三编码数据的条带头部中的一个索引标识实现了对整个条带对应的所有目标编码块的指示,同样可以降低索引标识所占用的比特位,提高了视频编码效率。需要说明的是:第三编码数据中的目标编码块可以是第三编码数据中的所有编码块,或者可以是第三编码数据中采用帧内编码模式的编码块,或者也可以是第三编码数据中采用帧间编码模式的编码块。
在本申请的一个实施例中,可以通过视频图像帧的一个LCU对应的第四编码数据的LCU头部中包含的指定索引标识的值来指示第四编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,比如第四编码数据的 LCU头部中的指定索引标识的值为1,则表示第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。该实施例的技术方案使得通过视频图像帧的一个LCU对应的第四编码数据的LCU头部中的一个索引标识实现了对整个LCU对应的所有目标编码块的指示,同样可以降低索引标识所占用的比特位,提高了视频编码效率。需要说明的是:第四编码数据中的目标编码块可以是第四编码数据中的所有编码块,或者可以是第四编码数据中采用帧内编码模式的编码块,或者也可以是第四编码数据中采用帧间编码模式的编码块。
在本申请的一个实施例中,也可以根据编码块的尺寸大小和预设阈值范围之间的关系,确定编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。比如,若编码块的尺寸较小(如小于某个预设阈值),则可以确定该编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;相反地,若编码块的尺寸较大(如大于某个预设阈值),则可以确定该编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。该实施例的技术方案可以根据编码块的尺寸来隐含指示是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,无需额外的比特位,因此也可以提高视频编码效率。
在本申请的一个实施例中,如果一个视频图像帧序列对应的第一编码数据的序列头部包含的索引标识为第一值(比如可以为0),则确定第一编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。如果该序列头部包含的索引标识为第二值(比如可以为1),此时可以进一步根据视频图像帧序列中的视频图像帧所对应的第二编码数据的图像头部包含的索引标识来进行决策。需要说明的是:第一编码数据中的目标编码块可以是第一编码数据中的所有编码块,或者可以是第一编码数据中采用帧内编码模式的编码块,或者也可以是第一编码数据中采用帧间编码模式的编码块。
在本申请的一个实施例中,如果前述的序列头部包含的索引标识为第二值(比如可以为1),且前述的图像头部包含的索引标识为第一值(比如可以为0),则可以确定第二编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。如果前述的序列头部包含的索引标识和前述的图像头部包含的索引标识均为第二值(比如可以为1),那么可以确定第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,这种情况下,可以不再通过条带头、LCU头和编码块的大小进行决策。需要说明的是:第二编码数据中的目标编码块可以是第二编码数据中的所有编码块,或者可以是第二编码数据中采用帧内编码模式的编码块,或者也可以是第二编码数据中采用帧间编码模式的编码块。
当然,在本申请的一个实施例中,如果前述的序列头部包含的索引标识和前述的图像头部包含的索引标识均为第二值(比如可以为1),那么也可以进一步根据视频图像帧的条带所对应的第三编码数据的条带头部包含的索引标识来进行决策。比如,若前述序列头部包含的索引标识、前述图像头部包含的索引标识均为第二值,但一个条带对应的第三编码数据的条带头部信息包含的索引标识为第一值(比如可以为0),那么可以确定第三编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。如果前述的序列头部包含的索引标识、前述的图像头部包含的索引标识和前述条带头部包含的索引标识均为第二值(比如可以为1),那么可以确定第三编码数据中的目标 编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,这种情况下,可以不再通过LCU头和编码块的大小进行决策。需要说明的是:第三编码数据中的目标编码块可以是第三编码数据中的所有编码块,或者可以是第三编码数据中采用帧内编码模式的编码块,或者也可以是第三编码数据中采用帧间编码模式的编码块。
当然,在本申请的一个实施例中,如果前述的序列头部包含的索引标识、前述的图像头部包含的索引标识和前述条带头部包含的索引标识均为第二值(比如可以为1),那么也可以进一步根据条带中的LCU所对应的第四编码数据的LCU头部包含的索引标识来进行决策。比如,若前述序列头部包含的索引标识、前述图像头部包含的索引标识和前述条带头部包含的索引标识均为第二值,但一个LCU对应的第四编码数据的LCU头部信息包含的索引标识为第一值(比如可以为0),那么可以确定第四编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。如果前述的序列头部包含的索引标识、前述的图像头部包含的索引标识、前述条带头部包含的索引标识和前述LCU头部信息包含的索引标识均为第二值(比如可以为1),那么可以确定第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,这种情况下,可以不再通过编码块的大小进行决策。需要说明的是:第四编码数据中的目标编码块可以是第四编码数据中的所有编码块,或者可以是第四编码数据中采用帧内编码模式的编码块,或者也可以是第四编码数据中采用帧间编码模式的编码块。
当然,在本申请的一个实施例中,如果前述的序列头部包含的索引标识、前述的图像头部包含的索引标识、前述条带头部包含的索引标识和前述LCU头部信息包含的索引标识均为第二值(比如可以为1),那么也可以进一步根据编码块的大小来进行决策。比如,若前述的序列头部包含的索引标识、前述的图像头部包含的索引标识、前述条带头部包含的索引标识和前述LCU头部信息包含的索引标识均为第二值,但一个编码块的尺寸较小(如小于某个预设阈值),则可以确定该编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;相反地,若编码块的尺寸较大(如大于某个预设阈值),则可以确定该编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一个实施例中,也可以通过指示位来对编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块分别指示是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。具体而言:
在本申请的一个实施例中,可以根据一个视频图像帧序列对应的第一编码数据的序列头部包含的第一指示位的值和第二指示位的值,分别确定第一编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。比如,第一编码数据的序列头部包含的第一指示位的值为0时,表示第一编码数据中采用帧内编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第一编码数据的序列头部包含的第一指示位的值为1时,表示第一编码数据中采用帧内编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第一编码数据的序列头部包含的第二指示位的值为0时,表示第一编码数据中采用帧间编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第一编码数据的序列头部包含的第二 指示位的值为1时,表示第一编码数据中采用帧间编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一个实施例中,可以根据一个视频图像帧对应的第二编码数据的图像头部包含的第一指示位的值和第二指示位的值,分别确定第二编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。比如,第二编码数据的图像头部包含的第一指示位的值为0时,表示第二编码数据中采用帧内编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第二编码数据的图像头部包含的第一指示位的值为1时,表示第二编码数据中采用帧内编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第二编码数据的图像头部包含的第二指示位的值为0时,表示第二编码数据中采用帧间编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第二编码数据的图像头部包含的第二指示位的值为1时,表示第二编码数据中采用帧间编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一个实施例中,根据视频图像帧的一个条带对应的第三编码数据的条带头部信息包含的第一指示位的值和第二指示位的值,分别确定第三编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。比如,第三编码数据的条带头部包含的第一指示位的值为0时,表示第三编码数据中采用帧内编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第三编码数据的条带头部包含的第一指示位的值为1时,表示第三编码数据中采用帧内编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第三编码数据的条带头部包含的第二指示位的值为0时,表示第三编码数据中采用帧间编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第三编码数据的条带头部包含的第二指示位的值为1时,表示第三编码数据中采用帧间编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一个实施例中,根据视频图像帧的一个LCU对应的第四编码数据的LCU头部信息包含的第一指示位的值和第二指示位的值,分别确定第四编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。比如,第四编码数据的LCU头部包含的第一指示位的值为0时,表示第四编码数据中采用帧内编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第四编码数据的LCU头部包含的第一指示位的值为1时,表示第四编码数据中采用帧内编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第四编码数据的LCU头部包含的第二指示位的值为0时,表示第四编码数据中采用帧间编码模式的编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;第四编码数据的LCU头部包含的第二指示位的值为1时,表示第四编码数据中采用帧间编码模式的编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
需要说明的是:在通过指示位对编码数据中采用帧内编码模式的编码块和采用帧间编 码模式的编码块分别指示是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合时,也可以采用类似于前述实施例中的方案,由序列头部和图像头部中的指示位来共同决策,或者由序列头部、图像头部和条带头部中的指示位来共同决策,或者由序列头部、图像头部、条带头部和LCU头部中的指示位来共同决策,或者由序列头部、图像头部、条带头部、LCU头部中的指示位,以及编码块的大小来共同决策。
综上,在确定是否需要根据量化系数块中的量化系数选择编码块对应的变换矩阵组合时,有以下方式:
1、仅通过前述序列头部中的索引标识来指示,不再通过图像头部、条带头部、LCU头部和编码块的大小来进行决策。此时,序列头部中的索引标识为1(数值仅为示例),就说明视频图像帧序列对应的第一编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第一编码数据中的目标编码块可以是第一编码数据中的所有编码块,或者可以是第一编码数据中采用帧内编码模式的编码块,或者也可以是第一编码数据中采用帧间编码模式的编码块。
另外,也可以仅通过序列头部中的指示位来对视频图像帧序列对应的第一编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块分别进行指示。即序列头部包含有两个指示位,一个指示位用于指示第一编码数据中采用帧内编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,另一个指示位用于指示第一编码数据中采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
2、仅通过前述图像头部中的索引标识来指示,不再通过序列头部、条带头部、LCU头部和编码块的大小来进行决策。此时,图像头部中的索引标识为1(数值仅为示例),就说明视频图像帧对应的第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第二编码数据中的目标编码块可以是第二编码数据中的所有编码块,或者可以是第二编码数据中采用帧内编码模式的编码块,或者也可以是第二编码数据中采用帧间编码模式的编码块。
另外,也可以仅通过图像头部中的指示位来对视频图像帧对应的第二编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块分别进行指示。即图像头部包含有两个指示位,一个指示位用于指示第二编码数据中采用帧内编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,另一个指示位用于指示第二编码数据中采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
3、仅通过前述条带头部中的索引标识来指示,不再通过序列头部、图像头部、LCU头部和编码块的大小来进行决策。此时,条带头部中的索引标识为1(数值仅为示例),就说明条带对应的第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第三编码数据中的目标编码块可以是第三编码数据中的所有编码块,或者可以是第三编码数据中采用帧内编码模式的编码块,或者也可以是第三编码数据中采用帧间编码模式的编码块。
另外,也可以仅通过条带头部中的指示位来对条带对应的第三编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块分别进行指示。即条带头部包含有两个指示 位,一个指示位用于指示第三编码数据中采用帧内编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,另一个指示位用于指示第三编码数据中采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
4、仅通过前述LCU头部中的索引标识来指示,不再通过序列头部、图像头部、条带头部和编码块的大小来进行决策。此时,LCU头部中的索引标识为1(数值仅为示例),就说明LCU对应的第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第四编码数据中的目标编码块可以是第四编码数据中的所有编码块,或者可以是第四编码数据中采用帧内编码模式的编码块,或者也可以是第四编码数据中采用帧间编码模式的编码块。
另外,也可以仅通过LCU头部中的指示位来对LCU对应的第四编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块分别进行指示。即LCU头部包含有两个指示位,一个指示位用于指示第四编码数据中采用帧内编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,另一个指示位用于指示第四编码数据中采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
5、仅通过编码块的尺寸大小来隐含指示,不再通过序列头部、图像头部、条带头部和LCU头部来进行决策。此时,若编码块的尺寸较小(如小于某个预设阈值),则可以确定该编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;相反地,若编码块的尺寸较大(如大于某个预设阈值),则可以确定该编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
6、通过前述序列头部中的索引标识和图像头部中的索引标识来指示,不再通过条带头部、LCU头部和编码块的大小来进行决策。此时,若序列头部中的索引标识和图像头部中的索引标识均为1(数值仅为示例),则说明视频图像帧对应的第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若序列头部中的索引标识为1,而图像头部中的索引标识为0(数值仅为示例),则说明该视频图像帧对应的第二编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第二编码数据中的目标编码块可以是第二编码数据中的所有编码块,或者可以是第二编码数据中采用帧内编码模式的编码块,或者也可以是第二编码数据中采用帧间编码模式的编码块。
7、通过前述序列头部中的索引标识、图像头部中的索引标识和条带头部中的索引标识来指示,不再通过LCU头部和编码块的大小来进行决策。此时,若序列头部中的索引标识、图像头部中的索引标识和条带头部的索引标识均为1(数值仅为示例),则说明条带对应的第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若序列头部中的索引标识和图像头部中的索引标识为1,而条带头部中的索引标识为0(数值仅为示例),则说明该条带对应的第三编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第三编码数据中的目标编码块可以是第三编码数据中的所有编码块,或者可以是第三编码数据中采用帧内编码模式的编码块,或者也可以是第三编码数据中采用帧间编码模 式的编码块。
8、通过前述序列头部中的索引标识、图像头部中的索引标识、条带头部中的索引标识和LCU头部中的索引标识来指示,不再通过编码块的大小来进行决策。此时,若序列头部中的索引标识、图像头部中的索引标识、条带头部的索引标识和LCU头部中的索引标识均为1(数值仅为示例),则说明LCU对应的第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若序列头部中的索引标识、图像头部中的索引标识和条带头部中的索引标识为1,而LCU头部中的索引标识为0(数值仅为示例),则说明该LCU对应的第四编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第四编码数据中的目标编码块可以是第四编码数据中的所有编码块,或者可以是第四编码数据中采用帧内编码模式的编码块,或者也可以是第四编码数据中采用帧间编码模式的编码块。
9、通过前述序列头部中的索引标识、图像头部中的索引标识、条带头部中的索引标识、LCU头部中的索引标识和编码块的大小来进行决策。此时,若序列头部中的索引标识、图像头部中的索引标识、条带头部的索引标识、LCU头部中的索引标识均为1(数值仅为示例),且编码块的尺寸小于设定阈值,则说明该编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若序列头部中的索引标识、图像头部中的索引标识、条带头部中的索引标识和LCU头部中的索引标识均为1,但是编码块的尺寸大于设定阈值,则说明该编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
当然,还可以有如下方式:
10、通过前述图像头部中的索引标识和条带头部中的索引标识来指示,不再通过序列头部、LCU头部和编码块的大小来进行决策。此时,若图像头部中的索引标识和条带头部的索引标识均为1(数值仅为示例),则说明条带对应的第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若图像头部中的索引标识为1,而条带头部中的索引标识为0(数值仅为示例),则说明该条带对应的第三编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第三编码数据中的目标编码块可以是第三编码数据中的所有编码块,或者可以是第三编码数据中采用帧内编码模式的编码块,或者也可以是第三编码数据中采用帧间编码模式的编码块。
11、通过前述图像头部中的索引标识、条带头部中的索引标识和LCU头部中的索引标识来指示,不再通过序列头部、编码块的大小来进行决策。此时,若图像头部中的索引标识、条带头部的索引标识和LCU头部中的索引标识均为1(数值仅为示例),则说明LCU对应的第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若图像头部中的索引标识和条带头部中的索引标识为1,而LCU头部中的索引标识为0(数值仅为示例),则说明该LCU对应的第四编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。需要说明的是:第四编码数据中的目标编码块可以是第四编码数据中的所有编码块,或者可以是第四编码数据中采用帧内编码模式的编码块,或者也可以是第四编码数据中采用帧间编码模式的编码块。
12、通过前述图像头部中的索引标识、条带头部中的索引标识、LCU头部中的索引标识和编码块的大小来进行决策,不再通过序列头部来进行决策。此时,若图像头部中的索引标识、条带头部的索引标识、LCU头部中的索引标识均为1(数值仅为示例),且编码块的尺寸小于设定阈值,则说明该编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;若图像头部中的索引标识、条带头部中的索引标识和LCU头部中的索引标识均为1,但是编码块的尺寸大于设定阈值,则说明该编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
此外,在本申请的实施例中,还有其它更多种指示方式,不再详述。
在本申请的一个实施例中,对量化系数块中的至少一个区域内的量化系数进行统计时,是将这至少一个区域内的量化系数在一起进行统计,最后到一个统计结果,具体地:可以计算该至少一个区域内的量化系数的数值之和,将得到的和值作为统计结果;或者可以计算该至少一个区域内的量化系数的绝对值之和,将得到的和值作为统计结果;或者可以计算该至少一个区域内数值为奇数的量化系数的数值之和,将得到的和值作为统计结果;或者可以计算该至少一个区域内数值为奇数的量化系数的绝对值之和,将得到的和值作为统计结果;或者可以计算该至少一个区域内数值为偶数或非零偶数的量化系数的数值之和,将得到的和值作为统计结果;或者可以计算该至少一个区域内数值为偶数或非零偶数的量化系数的绝对值之和,将得到的和值作为统计结果。
在本申请的一个实施例中,对量化系数块中的至少一个区域内的量化系数进行统计时,可以将该至少一个区域内的量化系数的数值进行线性映射,然后计算该至少一个区域内的量化系数在线性映射之后的数值或绝对值之和,将得到的和值作为统计结果。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内数值为奇数的量化系数的数值转换为第一数值,将数值为偶数的量化系数的数值转换为第二数值,其中,该第一数值和该第二数值中的一个为奇数,另一个为偶数。比如,将该至少一个区域内数值为奇数的量化系数的数值转换为1,将数值为偶数的量化系数的数值转换为0;或者将该至少一个区域内数值为奇数的量化系数的数值转换为0,将数值为偶数的量化系数的数值转换为1;或者将该至少一个区域内数值为奇数的量化系数的数值转换为3,将数值为偶数的量化系数的数值转换为2;或者将该至少一个区域内数值为奇数的量化系数的数值转换为2,将数值为偶数的量化系数的数值转换为3。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内的非零量化系数的数值转换为第三数值,将数值为零的量化系数的数值转换为第四数值,其中,该第三数值和该第四数值中的一个为奇数,另一个为偶数。比如,将该至少一个区域内的非零量化系数的数值转换为1,将数值为零的量化系数的数值转换为0;或者将该至少一个区域内的非零量化系数的数值转换为0,将数值为零的量化系数的数值转换为1;或者将该至少一个区域内的非零量化系数的数值转换为3,将数值为零的量化系数的数值转换为2;或者将该至少一个区域内的非零量化系数的数值转换为2,将数值为零的量化系数的数值转换为3。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内的量化系数的数值减少第五数值。比如将该至少一个区域内的量化系数的数值都减少1,或者减少2。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内的量化系数的 数值增加一个数值。比如将该至少一个区域内的量化系数的数值都增加1,或者增加2。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内的量化系数的数值乘以非零的第六数值。比如将该至少一个区域内的量化系数的数值都乘以1,或者乘以2。可选地,该第六数值可以为非零偶数,比如2、4、6等。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内的量化系数的数值除以非零的一个数值。比如将该至少一个区域内的量化系数的数值都除以1,或者除以2。可选地,该数值可以为非零偶数,比如2、4、6等。
在本申请的一个实施例中,上述的线性映射可以是将该至少一个区域内的量化系数的数值转换为相反数。
在本申请的一个实施例中,在对量化系数块中的至少一个区域内的量化系数进行统计得到统计结果之后,计算该统计结果针对设定值的余数时,该设定值可以是任意非零的数,比如可以是2、3、4等。
综上,本申请的实施例在对量化系数块中的至少一个区域内的量化系数进行统计时可以有如下方式:
1、对该至少一个区域内的量化系数的数值直接求和;
2、对该至少一个区域内的量化系数求数值的绝对值之和;
3、对该至少一个区域内数值为奇数的量化系数的数值直接求和;
4、对该至少一个区域内数值为奇数的量化系数求数值的绝对值之和;
5、对该至少一个区域内数值为偶数或非零偶数的量化系数的数值直接求和;
6、对该至少一个区域内数值为偶数或非零偶数的量化系数求数值的绝对值之和;
7、先根据该至少一个区域内的量化系数的奇偶性对奇数和偶数进行数值转换,然后对该至少一个区域内转换后的所有数值求和;
8、先根据该至少一个区域内的量化系数的奇偶性对奇数和偶数进行数值转换,然后对该至少一个区域内转换后的所有数值的绝对值求和;
9、先根据该至少一个区域内的非零量化系数和数值为零的量化系数进行数值转换,然后对该至少一个区域内转换后的所有数值求和;
10、先根据该至少一个区域内的非零量化系数和数值为零的量化系数进行数值转换,然后对该至少一个区域内转换后的所有数值的绝对值求和;
11、先对该至少一个区域内的所有量化系数进行增加、减少、乘以非零倍数、除以非零倍数或者求相反数等数值转换操作,然后对该至少一个区域内转换后的所有数值求和;
12、先对该至少一个区域内的所有量化系数进行增加、减少、乘以非零倍数、除以非零倍数或者求相反数等数值转换操作,然后对该至少一个区域内转换后的所有数值的绝对值求和。
当然,还可以有其它方式,比如先通过前述7至12中的转换方式对该至少一个区域内的量化系数进行数值转换,然后只对转换后的奇数或者偶数进行求和等。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中的全部区域。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中指定的至少一个位置。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中指定的至少一行。 如图7所示,假设量化系数块是4×4的系数块,每个方块表示一个量化系数,那么可以如图7中(a)所示,将灰色区域的1行作为该至少一个区域;或者可以如图7中(b)所示,将灰色区域的2行作为该至少一个区域。可选地,这至少一行可以是量化系数块中靠上边的行。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中指定的至少一列。如图7所示,假设量化系数块是4×4的系数块,每个方块表示一个量化系数,那么可以如图7中(c)所示,将灰色区域的1列作为该至少一个区域;或者可以如图7中(d)所示,将灰色区域的2列作为该至少一个区域。可选地,这至少一列可以是量化系数块中靠左边的列。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中指定的至少一行和指定的至少一列。如图8所示,假设量化系数块是4×4的系数块,每个方块表示一个量化系数,那么可以如图8中(a)所示,将下方的1行和右边的1列(即其中的灰色区域)作为该至少一个区域;或者可以如图8中(b)所示,将下方的2行和右边的2列(即其中的灰色区域)作为该至少一个区域;或者可以如图8中(c)所示,将上方的1行和左边的1列(即其中的灰色区域)作为该至少一个区域;或者可以如图8中(d)所示,将上方的2行和左边的2列(即其中的灰色区域)作为该至少一个区域。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中处于至少一条斜线上的位置。如图9所示,假设量化系数块是4×4的系数块,每个方块表示一个量化系数,那么可以如9中的(a)和(b)所示,将一条斜线上的位置作为该至少一个区域;或者如9中的(c)和(d)所示,将两条斜线上的位置作为该至少一个区域。
在本申请的一个实施例中,上述的至少一个区域可以是量化系数块中的SRCC区域。其中,SRCC区域即为通过SRCC技术标记出的扫描区域。
在本申请的一个实施例中,上述的至少一个区域可以是SRCC区域中指定的一个位置或多个位置。可选地,SRCC区域中指定的至少一个位置可以包括:按照扫描顺序的前N个位置或者按照扫描顺序处于中间的N个位置,当然也可以是按照扫描顺序处于最后的N个位置等,N为非0的自然数。
在本申请的一个实施例中,上述的至少一个区域可以是SRCC区域中指定的至少一行。如图7所示,假设SRCC区域是4×4的系数块,每个方块表示一个量化系数,那么可以如图7中(a)所示,将灰色区域的1行作为该至少一个区域;或者可以如图7中(b)所示,将灰色区域的2行作为该至少一个区域。可选地,这至少一行可以是量化系数块中靠上边的行。
在本申请的一个实施例中,上述的至少一个区域可以是SRCC区域中指定的至少一列。如图7所示,假设SRCC区域是4×4的系数块,每个方块表示一个量化系数,那么可以如图7中(c)所示,将灰色区域的1列作为该至少一个区域;或者可以如图7中(d)所示,将灰色区域的2列作为该至少一个区域。可选地,这至少一列可以是量化系数块中靠左边的列。
在本申请的一个实施例中,上述的至少一个区域可以是SRCC区域中指定的至少一行和指定的至少一列。如图8所示,假设SRCC区域是4×4的系数块,每个方块表示一个量化系数,那么可以如图8中(a)所示,将下方的1行和右边的1列(即其中的灰色区 域)作为该至少一个区域;或者可以如图8中(b)所示,将下方的2行和右边的2列(即其中的灰色区域)作为该至少一个区域;或者可以如图8中(c)所示,将上方的1行和左边的1列(即其中的灰色区域)作为该至少一个区域;或者可以如图8中(d)所示,将上方的2行和左边的2列(即其中的灰色区域)作为该至少一个区域。
在本申请的一个实施例中,上述的至少一个区域可以是SRCC区域中处于至少一条斜线上的位置。如图9所示,假设SRCC区域是4×4的系数块,每个方块表示一个量化系数,那么可以如9中的(a)和(b)所示,将一条斜线上的位置作为该至少一个区域;或者如9中的(c)和(d)所示,将两条斜线上的位置作为该至少一个区域。
在本申请的其它实施例中,也可以将上述实施例中的至少一个区域划分方式进行组合,以将组合后的区域作为该至少一个区域。
继续参照图6所示,在步骤S630中,根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合。
在本申请的一个实施例中,比如在计算余数时,将设定值设置为3,那么3个余数(即0、1、2)可以分别对应于不同的变换矩阵组合;再如将设定值设置为5,那么5个余数(即0、1、2、3、4)可以分别对应于不同的变换矩阵组合。可见,本申请实施例的技术方案能够实现多种变换矩阵组合的索引指示,并且可以有效降低变换矩阵索引所占用的比特位。
需要说明的是,在本申请的实施例中,不同余数对应的变换矩阵组合可以是不同的。当然,不同余数对应的变换矩阵组合也可以是相同的,但是对于可能的求余结果,不能都对应相同的变换矩阵组合。
可选地,变换矩阵组合所采用的变换核可以是已知的离散变换核(8种DCT变换核和8种DST变换核)的任意组合。比如可选的变换核可以包括:DCT2、DCT5、DCT8、DST1、DST7等。
在本申请的一个实施例中,余数与变换矩阵组合之间的对应关系可以是根据余数的取值,基于DCT变换核和/或DST变换核之间的组合进行预先设定的。可选地,变换矩阵组合包括DCT2、DCT5、DCT8、DST1、DST7中任意至少一种的组合。更具体地,比如,若前述的设定值为5,那么可以设定如下的对应关系:余数为0时对应的变换矩阵组合可以是(DCT2,DCT2);余数为1时对应的变换矩阵组合可以是(DCT8,DCT8);余数为2时对应的变换矩阵组合可以是(DCT8,DST7);余数为3时对应的变换矩阵组合可以是(DST7,DCT8);余数为4时对应的变换矩阵组合可以是(DST7,DST7)。
在本申请的一个实施例中,也可以通过余数的某个值来表示编码块跳过了变换处理过程。例如某个余数的值指示对应的变换矩阵组合为空值(或者为TS),该空值(或TS)表示编码块跳过了变换处理过程。比如,若前述的设定值为5,那么余数为0时对应的变换矩阵组合可以是(DCT2,DCT2);余数为1时对应的变换矩阵组合可以是(DCT8,DCT8);余数为2时对应的变换矩阵组合可以是(DCT8,DST7);余数为3时对应的变换矩阵组合可以是(DST7,DCT8);余数为4时对应的变换矩阵组合可以是空值(或者为TS),以指示编码块跳过了变换处理过程,在这种情况下,可以直接对量化系数块进行反量化处理,然后将反量化处理的结果作为重建的残差数据,而不需要对反量化处理的结果进行反变换处理。需要说明的是,指示编码块跳过了变换处理过程的余数可以根据实际需要进行设定, 该实施例中的4仅为示例。
继续参照图6所示,在步骤S640中,基于选择的变换矩阵组合对量化系数块的反量化结果进行反变换处理。该过程可以参照前述实施例中的相关描述,不再赘述。
在步骤S650中,根据反变换处理的处理结果,重建视频图像帧。
该过程可以参照前述实施例中的相关描述,不再赘述。
本申请上述实施例的技术方案使得能够通过量化系数块中的量化系数来隐含指示编码块对应的变换矩阵组合,无需编码端针对每个编码单元都编码变换矩阵的索引,降低了变换矩阵索引所占用的比特位,进而可以有效提高视频编码效率。
以下介绍本申请的装置实施例,可以用于执行本申请上述实施例中的视频解码方法。对于本申请装置实施例中未披露的细节,请参照本申请上述的视频解码方法的实施例。
图10示出了根据本申请的一个实施例的视频解码装置的框图,该视频解码装置可以设置在具有计算处理功能的设备内,比如可以设置在终端设备或服务器内。
参照图10所示,根据本申请的一个实施例的视频解码装置1000,包括:解码单元1002、统计单元1004、选择单元1006、处理单元1008和重建单元1010。
其中,解码单元1002配置为对视频图像帧的编码块进行熵解码处理,获得所述编码块对应的残差数据的量化系数块;统计单元1004配置为对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算所述统计结果针对设定值的余数;选择单元1006配置为根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合;处理单元1008配置为基于选择的变换矩阵组合对所述量化系数块的反量化结果进行反变换处理,重建单元1010配置为根据所述反变换处理的处理结果,重建所述视频图像帧。
在本申请的一些实施例中,基于前述方案,所述的视频解码装置1000还包括:第一确定单元,配置为根据指定的索引标识的取值,确定对应的编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,其中,所述指定的索引标识包括在以下至少之一中:视频图像帧序列对应的第一编码数据的序列头、视频图像帧对应的第二编码数据的图像头、视频图像帧的一个条带对应的第三编码数据的条带头、视频图像帧的一个LCU(Largest Coding Unit,最大编码单元)对应的第四编码数据的LCU头。
在本申请的一些实施例中,基于前述方案,所述第一确定单元配置为当指定的索引标识的取值为第一值时,确定对应的编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第一确定单元配置为在一个视频图像帧序列对应的第一编码数据的序列头部包含的指定的索引标识的值为第一值时,确定所述第一编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第一确定单元配置为在一个视频图像帧对应的第二编码数据的图像头部包含的指定的索引标识的值为第一值时,确定所述第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第一确定单元配置为在视频图像帧的 一个条带对应的第三编码数据的条带头部信息包含的指定的索引标识的值为第一值时,确定所述第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第一确定单元配置为在视频图像帧的一个LCU对应的第四编码数据的LCU头部信息包含的指定的索引标识的值为第一值时,确定所述第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述的视频解码装置1000还包括:第二确定单元,配置为根据所述编码块的尺寸大小和预设阈值范围之间的关系,确定所述编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述的视频解码装置1000还包括:第三确定单元,配置为在一个视频图像帧序列对应的第一编码数据的序列头部包含的索引标识为第一值时,确定所述第一编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值,且所述视频图像帧序列中的一个视频图像帧所对应的第二编码数据的图像头部包含的索引标识为第一值时,确定所述第二编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值,且所述第二编码数据的图像头部包含的索引标识为第二值时,确定所述第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值,且所述视频图像帧的一个条带对应的第三编码数据的条带头部信息包含的索引标识为第一值时,确定所述第三编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值、所述第三编码数据的条带头部信息包含的索引标识为第二值时,确定所述第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值、所述第三编码数据的条带头部信息包含的索引标识为第二值,且所述条带的一个LCU对应的第四编码数据的LCU头部信息包含的索引标识为第一值时,确定所述第四编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值、所述第三编码数据的条带头部信息包含的索引标识为第二值,且所述第四编码数据的LCU头部信息包含的索引标识为第二值时,确定所述第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述第三确定单元还配置为在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值、所述第三编码数据的条带头部信息包含的索引标识为第二值、所述第四编码数据的LCU头部信息包含的索引标识为第二值,则根据所述第四编码数据中的各个编码块的尺寸大小和预设阈值范围之间的关系,确定所述各个编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,所述目标编码块为以下任一:采用帧内编码模式的编码块、采用帧间编码模式的编码块、相应编码数据中的所有编码块。
在本申请的一些实施例中,基于前述方案,所述的视频解码装置还包括:第四确定单元,配置为根据一个视频图像帧序列对应的第一编码数据的序列头部包含的第一指示位的值和第二指示位的值,分别确定所述第一编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;或
根据一个视频图像帧对应的第二编码数据的图像头部包含的第一指示位的值和第二指示位的值,分别确定所述第二编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;或
根据视频图像帧的一个条带对应的第三编码数据的条带头部信息包含的第一指示位的值和第二指示位的值,分别确定所述第三编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;或
根据视频图像帧的一个LCU对应的第四编码数据的LCU头部信息包含的第一指示位的值和第二指示位的值,分别确定所述第四编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
在本申请的一些实施例中,基于前述方案,统计单元1004配置为:计算所述至少一个区域内的量化系数的数值之和,将得到的和值作为所述统计结果;或
计算所述至少一个区域内的量化系数的绝对值之和,将得到的和值作为所述统计结果;或
计算所述至少一个区域内数值为奇数的量化系数的数值之和,将得到的和值作为所述统计结果;或
计算所述至少一个区域内数值为奇数的量化系数的绝对值之和,将得到的和值作为所述统计结果;或
计算所述至少一个区域内数值为偶数或非零偶数的量化系数的数值之和,将得到的和 值作为所述统计结果;或
计算所述至少一个区域内数值为偶数或非零偶数的量化系数的绝对值之和,将得到的和值作为所述统计结果。
在本申请的一些实施例中,基于前述方案,统计单元1004配置为:将所述至少一个区域内的量化系数的数值进行线性映射,计算所述至少一个区域内的量化系数在线性映射之后的数值或绝对值之和,将得到的和值作为所述统计结果,其中,所述线性映射包括:
将所述至少一个区域内数值为奇数的量化系数的数值转换为第一数值,将数值为偶数的量化系数的数值转换为第二数值,其中,所述第一数值和所述第二数值中的一个为奇数,另一个为偶数;或
将所述至少一个区域内的非零量化系数的数值转换为第三数值,将数值为零的量化系数的数值转换为第四数值,其中,所述第三数值和所述第四数值中的一个为奇数,另一个为偶数;或
将所述至少一个区域内的量化系数的数值减少或增加第五数值;或
将所述至少一个区域内的量化系数的数值乘以或除以非零的第六数值;或
将所述至少一个区域内的量化系数的数值乘以或除以非零偶数。
在本申请的一些实施例中,基于前述方案,所述至少一个区域包括以下区域中的至少一个:
所述量化系数块中的全部区域;
所述量化系数块中指定的至少一个位置;
所述量化系数块中指定的至少一行;
所述量化系数块中指定的至少一列;
所述量化系数块中指定的至少一行和指定的至少一列;
所述量化系数块中处于至少一条斜线上的位置;
所述量化系数块中的SRCC区域;
所述SRCC区域中指定的至少一个位置;
所述SRCC区域中指定的至少一行;
所述SRCC区域中指定的至少一列;
所述SRCC区域中指定的至少一行和指定的至少一列;
所述SRCC区域中处于至少一条斜线上的位置。
在本申请的一些实施例中,基于前述方案,所述SRCC区域中指定的至少一个位置包括:按照扫描顺序的前N个位置或按照扫描顺序处于中间的N个位置或按照扫描顺序的后N个位置,N为非0的自然数。
在本申请的一些实施例中,基于前述方案,数值为指定值的余数用于表示所述编码块跳过了变换处理过程。
在本申请的一些实施例中,基于前述方案,所述余数与变换矩阵组合之间的对应关系是根据所述余数的取值,基于离散余弦变换DCT变换核和/或离散正弦变换DST变换核之间的组合进行预先设定的。
在本申请的一些实施例中,基于前述方案,所述变换矩阵组合包括DCT2、DCT5、DCT8、DST1、DST7中任意至少一种的组合。
图11示出了适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。
需要说明的是,图11示出的电子设备的计算机系统1100仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。
如图11所示,计算机系统1100包括中央处理单元(Central Processing Unit,CPU)1101,其可以根据存储在只读存储器(Read-Only Memory,ROM)1102中的程序或者从存储部分1108加载到随机访问存储器(Random Access Memory,RAM)1103中的程序而执行各种适当的动作和处理,例如执行上述实施例中所述的方法。在RAM 1103中,还存储有系统操作所需的各种程序和数据。CPU 1101、ROM 1102以及RAM 1103通过总线1104彼此相连。输入/输出(Input/Output,I/O)接口1105也连接至总线1104。
以下部件连接至I/O接口1105:包括键盘、鼠标等的输入部分1106;包括诸如阴极射线管(Cathode Ray Tube,CRT)、液晶显示器(Liquid Crystal Display,LCD)等以及扬声器等的输出部分1107;包括硬盘等的存储部分1108;以及包括诸如LAN(Local Area Network,局域网)卡、调制解调器等的网络接口卡的通信部分1109。通信部分1109经由诸如因特网的网络执行通信处理。驱动器1110也根据需要连接至I/O接口1105。可拆卸介质1111,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器1110上,以便于从其上读出的计算机可读指令根据需要被安装入存储部分1108。
特别地,根据本申请的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本申请的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机可读指令,该计算机可读指令包含用于执行流程图所示的方法的计算机可读指令。在这样的实施例中,该计算机可读指令可以通过通信部分1109从网络上被下载和安装,和/或从可拆卸介质1111被安装。在该计算机可读指令被中央处理单元(CPU)1101执行时,执行本申请的系统中限定的各种功能。
需要说明的是,本申请实施例所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read Only Memory,EPROM)、闪存、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本申请中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本申请中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的计算机可读指令。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的计算机可读指令可以用任何适当的介质传输,包括但不限于:无线、有线等等,或者上述的任意合适的组合。
附图中的流程图和框图,图示了按照本申请各种实施例的系统、方法和计算机程序产 品的可能实现的体系架构、功能和操作。其中,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
描述于本申请实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现,所描述的单元也可以设置在处理器中。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定。
作为另一方面,本申请还提供了一种计算机可读介质,该计算机可读介质可以是上述实施例中描述的电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被一个该电子设备执行时,使得该电子设备实现上述实施例中所述的方法。
应当注意,尽管在上文详细描述中提及了用于动作执行的设备的若干模块或者单元,但是这种划分并非强制性的。实际上,根据本申请的实施方式,上文描述的两个或更多模块或者单元的特征和功能可以在一个模块或者单元中具体化。反之,上文描述的一个模块或者单元的特征和功能可以进一步划分为由多个模块或者单元来具体化。
通过以上的实施方式的描述,本领域的技术人员易于理解,这里描述的示例实施方式可以通过软件实现,也可以通过软件结合必要的硬件的方式来实现。因此,根据本申请实施方式的技术方案可以以软件产品的形式体现出来,该软件产品可以存储在一个非易失性存储介质(可以是CD-ROM,U盘,移动硬盘等)中或网络上,包括若干指令以使得一台计算设备(可以是个人计算机、服务器、触控终端、或者网络设备等)执行根据本申请实施方式的方法。
本领域技术人员在考虑说明书及实践这里公开的实施方式后,将容易想到本申请的其它实施方案。本申请旨在涵盖本申请的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。
应当理解的是,本申请并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本申请的范围仅由所附的权利要求来限制。

Claims (20)

  1. 一种视频解码方法,其特征在于,由电子设备执行,包括:
    对视频图像帧的编码块进行熵解码处理,获得所述编码块对应的残差数据的量化系数块;
    对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算所述统计结果针对设定值的余数;
    根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合;
    基于选择的变换矩阵组合对所述量化系数块的反量化结果进行反变换处理;
    根据所述反变换处理的处理结果,重建所述视频图像帧。
  2. 根据权利要求1所述的视频解码方法,其特征在于,还包括:
    根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,其中,所述指定的索引标识包括以下数据中的至少一种:
    视频图像帧序列对应的第一编码数据的序列头、视频图像帧对应的第二编码数据的图像头、视频图像帧的一个条带对应的第三编码数据的条带头、视频图像帧的一个最大编码单元LCU对应的第四编码数据的LCU头。
  3. 根据权利要求2所述的视频解码方法,其特征在于,所述目标编码块为以下任意一种:采用帧内编码模式的编码块、采用帧间编码模式的编码块、对应的编码数据中的所有编码块。
  4. 根据权利要求2所述的视频解码方法,其特征在于,所述根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,包括:
    当指定的索引标识的取值为第一值时,确定对应的编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
  5. 根据权利要求2所述的视频解码方法,其特征在于,所述根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,包括:
    在所述第一编码数据的序列头部包含的索引标识为第一值时,确定所述第一编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
  6. 根据权利要求2所述的视频解码方法,其特征在于,所述根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,包括:
    在所述第一编码数据的序列头部包含的索引标识为第二值,且所述第二编码数据的图像头部包含的索引标识为第一值时,确定所述第二编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;
    在所述第一编码数据的序列头部包含的索引标识为第二值,且所述图像头部包含的索引标识为第二值时,确定所述第二编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
  7. 根据权利要求2所述的视频解码方法,其特征在于,其特征在于,所述根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,包括:
    在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值,且所述第三编码数据的条带头部信息包含的索引标识为第一值时,确定所述第三编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;
    在所述序列头部包含的索引标识为第二值、所述图像头部包含的索引标识为第二值、所述条带头部信息包含的索引标识为第二值时,确定所述第三编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
  8. 根据权利要求2所述的视频解码方法,其特征在于,其特征在于,所述根据指定的索引标识的取值,确定对应的编码数据中的目标编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,包括:
    在所述第一编码数据的序列头部包含的索引标识为第二值、所述第二编码数据的图像头部包含的索引标识为第二值、所述第三编码数据的条带头部信息包含的索引标识为第二值,且所述第四编码数据的LCU头部信息包含的索引标识为第一值时,确定所述第四编码数据中的目标编码块不需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;
    在所述序列头部包含的索引标识为第二值、所述图像头部包含的索引标识为第二值、所述条带头部信息包含的索引标识为第二值,且所述LCU头部信息包含的索引标识为第二值时,确定所述第四编码数据中的目标编码块需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合;
    在所述序列头部包含的索引标识为第二值、所述图像头部包含的索引标识为第二值、所述条带头部信息包含的索引标识为第二值、所述LCU头部信息包含的索引标识为第二值,则根据所述第四编码数据中的各个编码块的尺寸大小和预设阈值范围之间的关系,确定所述各个编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
  9. 根据权利要求1所述的视频解码方法,其特征在于,还包括:
    根据所述编码块的尺寸大小和预设阈值范围之间的关系,确定所述编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合。
  10. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,还包括:
    根据第一编码数据的序列头部包含的第一指示位的值和第二指示位的值,分别确定所述第一编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,所述第一编码数据为一个视频图像帧序列对应的编码数据;或
    根据第二编码数据的图像头部包含的第一指示位的值和第二指示位的值,分别确定所述第二编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,所述第二编码数据为一个视频图像帧对应的编码数据;或
    根据第三编码数据的条带头部信息包含的第一指示位的值和第二指示位的值,分别确定所述第三编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,所述第三编码数据为视频图像帧的一个条带对应的编码数据;或
    根据第四编码数据的LCU头部信息包含的第一指示位的值和第二指示位的值,分别确定所述第四编码数据中采用帧内编码模式的编码块和采用帧间编码模式的编码块是否需要根据熵解码得到的量化系数块中的量化系数选择对应的变换矩阵组合,所述第四编码数据为视频图像帧的一个LCU对应的编码数据。
  11. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,所述统计结果的确定方式包括以下方式中的任意一种:
    计算所述至少一个区域内的量化系数的数值之和,将得到的和值作为所述统计结果;
    计算所述至少一个区域内的量化系数的绝对值之和,将得到的和值作为所述统计结果;
    计算所述至少一个区域内数值为奇数的量化系数的数值之和,将得到的和值作为所述统计结果;
    计算所述至少一个区域内数值为奇数的量化系数的绝对值之和,将得到的和值作为所述统计结果;
    计算所述至少一个区域内数值为偶数或非零偶数的量化系数的数值之和,将得到的和值作为所述统计结果;
    计算所述至少一个区域内数值为偶数或非零偶数的量化系数的绝对值之和,将得到的和值作为所述统计结果。
  12. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,包括:将所述至少一个区域内的量化系数的数值进行线性映射,计算所述至少一个区域内的量化系数在线性映射之后的数值或绝对值之和,将得到的和值作为所述统计结果,其中,所述线性映射包括以下方式中的任意一种:
    将所述至少一个区域内数值为奇数的量化系数的数值转换为第一数值,将数值为偶数的量化系数的数值转换为第二数值,其中,所述第一数值和所述第二数值中的一个为奇数,另一个为偶数;
    将所述至少一个区域内的非零量化系数的数值转换为第三数值,将数值为零的量化系数的数值转换为第四数值,其中,所述第三数值和所述第四数值中的一个为奇数,另一个为偶数;
    将所述至少一个区域内的量化系数的数值减少或增加第五数值;
    将所述至少一个区域内的量化系数的数值乘以或除以非零的第六数值;
    将所述至少一个区域内的量化系数的数值乘以或除以非零偶数。
  13. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,所述至少一个区域包括以下区域中的至少一个:
    所述量化系数块中的全部区域;
    所述量化系数块中指定的至少一个位置;
    所述量化系数块中指定的至少一行;
    所述量化系数块中指定的至少一列;
    所述量化系数块中指定的至少一行和指定的至少一列;
    所述量化系数块中处于至少一条斜线上的位置;
    所述量化系数块中的扫描区域系数编码SRCC区域;
    所述SRCC区域中指定的至少一个位置;
    所述SRCC区域中指定的至少一行;
    所述SRCC区域中指定的至少一列;
    所述SRCC区域中指定的至少一行和指定的至少一列;
    所述SRCC区域中处于至少一条斜线上的位置。
  14. 根据权利要求13所述的视频解码方法,其特征在于,所述SRCC区域中指定的至少一个位置包括:按照扫描顺序的前N个位置或按照扫描顺序处于中间的N个位置或按照扫描顺序的后N个位置,N为非0的自然数。
  15. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,数值为指定值的余数用于表示所述编码块跳过了变换处理过程。
  16. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,所述余数与变换矩阵组合之间的对应关系是根据所述余数的取值,基于离散余弦变换DCT变换核和/或离散正弦变换DST变换核之间的组合进行预先设定的。
  17. 根据权利要求1至9中任一项所述的视频解码方法,其特征在于,所述变换矩阵组合包括DCT2、DCT5、DCT8、DST1、DST7中任意至少一种的组合。
  18. 一种视频解码装置,其特征在于,包括:
    解码单元,配置为对视频图像帧的编码块进行熵解码处理,获得所述编码块对应的残差数据的量化系数块;
    统计单元,配置为对所述量化系数块中的至少一个区域内的量化系数进行统计得到统计结果,计算所述统计结果针对设定值的余数;
    选择单元,配置为根据余数与变换矩阵组合之间的对应关系,选择对应的变换矩阵组合;
    处理单元,配置为基于选择的变换矩阵组合对所述量化系数块的反量化结果进行反变换处理;
    重建单元,配置为根据所述反变换处理的处理结果,重建所述视频图像帧。
  19. 一个或多个存储有计算机可读指令的非易失性计算机可读介质,其上存储有计算机可读指令,其特征在于,所述计算机可读指令被一个或多个处理器执行时实现如权利要求1至17中任一项所述的视频解码方法。
  20. 一种电子设备,其特征在于,包括:
    一个或多个处理器;
    存储装置,用于存储计算机可读指令,当所述计算机可读指令被所述一个或多个处理器执行时,使得所述一个或多个处理器实现如权利要求1至17中任一项所述的视频解码方法。
PCT/CN2021/116451 2020-09-27 2021-09-03 视频解码方法、装置、计算机可读介质及电子设备 WO2022062880A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011033990.0 2020-09-27
CN202011033990.0A CN112565751B (zh) 2020-09-27 2020-09-27 视频解码方法、装置、计算机可读介质及电子设备

Publications (1)

Publication Number Publication Date
WO2022062880A1 true WO2022062880A1 (zh) 2022-03-31

Family

ID=75042687

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/116451 WO2022062880A1 (zh) 2020-09-27 2021-09-03 视频解码方法、装置、计算机可读介质及电子设备

Country Status (2)

Country Link
CN (2) CN112565751B (zh)
WO (1) WO2022062880A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114079772B (zh) * 2020-08-21 2023-04-07 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备
CN112565751B (zh) * 2020-09-27 2021-09-10 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备
CN112533000B (zh) * 2020-10-16 2022-08-05 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2991352A1 (en) * 2009-01-27 2016-03-02 Thomson Licensing Methods and apparatus for transform selection in video encoding and decoding
US20160219290A1 (en) * 2015-01-26 2016-07-28 Qualcomm Incorporated Enhanced multiple transforms for prediction residual
US20180262777A1 (en) * 2015-12-23 2018-09-13 Huawei Technologies Co., Ltd. Method and Apparatus for Transform Coding with Block-Level Transform Selection and Implicit Signaling Within Hierarchical Partitioning
CN110519601A (zh) * 2019-09-02 2019-11-29 北京百度网讯科技有限公司 数字视频的编码方法和装置
CN112533000A (zh) * 2020-10-16 2021-03-19 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备
CN112543337A (zh) * 2020-09-27 2021-03-23 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备
CN112565751A (zh) * 2020-09-27 2021-03-26 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2011004850A (es) * 2008-11-07 2011-05-30 Mitsubishi Electric Corp Codificador de imagenes y decodificador de imagenes.
CN101771868B (zh) * 2008-12-31 2016-03-02 华为技术有限公司 对图像的量化处理方法及装置
CN102045560B (zh) * 2009-10-23 2013-08-07 华为技术有限公司 一种视频编解码方法及设备
JP5741728B2 (ja) * 2010-12-09 2015-07-01 ソニー株式会社 画像処理装置及び画像処理方法
KR102199463B1 (ko) * 2015-08-31 2021-01-06 삼성전자주식회사 스캔 순서를 고려한 영상의 변환 방법 및 그 장치, 및 역변환 방법 및 그 장치
CN111225206B (zh) * 2018-11-23 2021-10-26 华为技术有限公司 视频解码方法和视频解码器
CN109831670B (zh) * 2019-02-26 2020-04-24 北京大学深圳研究生院 一种反量化方法、系统、设备及计算机可读介质

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2991352A1 (en) * 2009-01-27 2016-03-02 Thomson Licensing Methods and apparatus for transform selection in video encoding and decoding
US20160219290A1 (en) * 2015-01-26 2016-07-28 Qualcomm Incorporated Enhanced multiple transforms for prediction residual
US20180262777A1 (en) * 2015-12-23 2018-09-13 Huawei Technologies Co., Ltd. Method and Apparatus for Transform Coding with Block-Level Transform Selection and Implicit Signaling Within Hierarchical Partitioning
CN110519601A (zh) * 2019-09-02 2019-11-29 北京百度网讯科技有限公司 数字视频的编码方法和装置
CN112543337A (zh) * 2020-09-27 2021-03-23 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备
CN112565751A (zh) * 2020-09-27 2021-03-26 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备
CN112533000A (zh) * 2020-10-16 2021-03-19 腾讯科技(深圳)有限公司 视频解码方法、装置、计算机可读介质及电子设备

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
A. SAID (QUALCOMM), X. ZHAO (TENCENT): "CE6: Summary Report on Transforms and Transform Signalling", 12. JVET MEETING; 20181003 - 20181012; MACAO; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), no. JVET-L0026, 1 October 2018 (2018-10-01), XP030194119 *

Also Published As

Publication number Publication date
CN112565751B (zh) 2021-09-10
CN113824957B (zh) 2022-10-28
CN112565751A (zh) 2021-03-26
CN113824957A (zh) 2021-12-21

Similar Documents

Publication Publication Date Title
WO2022078163A1 (zh) 视频解码方法、视频编码方法及相关装置
WO2022062880A1 (zh) 视频解码方法、装置、计算机可读介质及电子设备
WO2022063033A1 (zh) 视频解码方法、视频编码方法、装置、计算机可读介质及电子设备
WO2022078304A1 (zh) 视频解码方法、装置、计算机可读介质、程序及电子设备
WO2022174660A1 (zh) 视频编解码方法、装置、计算机可读介质及电子设备
US12137223B2 (en) Video decoding method, video coding method, and related apparatus
WO2022174637A1 (zh) 视频编解码方法、装置、计算机可读介质及电子设备
WO2022174701A1 (zh) 视频编解码方法、装置、计算机可读介质及电子设备
WO2022037478A1 (zh) 视频解码方法、视频编码方法、装置、介质及电子设备
WO2022037477A1 (zh) 视频解码方法、装置、计算机可读介质及电子设备
CN114979656B (zh) 视频编解码方法、装置、计算机可读介质及电子设备
WO2022063040A1 (zh) 视频编解码方法、装置及设备
WO2022116854A1 (zh) 视频解码方法、装置、可读介质、电子设备及程序产品
WO2023130899A1 (zh) 环路滤波方法、视频编解码方法、装置、介质及电子设备
WO2022174638A1 (zh) 视频编解码方法、装置、计算机可读介质及电子设备
WO2022174659A1 (zh) 视频编解码方法、装置、计算机可读介质及电子设备
CN115209146A (zh) 视频编解码方法、装置、计算机可读介质及电子设备
CN118803237A (zh) 视频编解码方法、装置、计算机可读介质及电子设备
CN115209141A (zh) 视频编解码方法、装置、计算机可读介质及电子设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21871252

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM XXXX DATED 11.08.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21871252

Country of ref document: EP

Kind code of ref document: A1