CN1535027A

CN1535027A - Intra-frame prediction method for video coding

Info

Publication number: CN1535027A
Application number: CNA2004100006663A
Authority: CN
Inventors: 孔德慧; 张楠; 尹宝才; 王雁来; 孙艳丰; 岳文颖
Original assignee: Beijing University of Technology
Current assignee: Beijing University of Technology
Priority date: 2004-01-16
Filing date: 2004-01-16
Publication date: 2004-10-06
Anticipated expiration: 2024-01-16
Also published as: CN100536573C

Abstract

An intra-frame prediction method for video coding that improves video coding quality is disclosed. The original video stream captured by a camera serves as input; it is fed to a computer through a video acquisition card and processed by the computer using the JVT video coding technique. An operation rule is defined for calculating the DC prediction mode from the samples of already decoded pixels in adjacent blocks. Multiple prediction modes can be recombined and re-ordered.

Description

Intra-frame prediction method for video coding

Technical Field

The invention relates to the technical field of computer digital video coding and provides a video coding system. Specifically, it concerns intra-frame prediction.

Background

To transmit and store images within limited transmission bandwidth and storage capacity, the images must be compression-coded. In moving-image compression, coding algorithms fall into two cases: intra-frame coding and inter-frame coding. The first image of a video sequence, or the first image after a scene change, is coded with intra-frame coding; the other images are coded with inter-frame coding. In the prior art, intra-frame coding uses spatial prediction to exploit spatial statistical correlation in the source signal, while inter-frame coding uses block-based inter-frame prediction to exploit temporal statistical correlation. During coding, a prediction mode is specified for each basic processing block of the image. If inter-frame prediction is adopted, the motion vector of the current block is calculated by the corresponding algorithm to obtain the prediction value of the current block; otherwise intra-frame prediction is adopted, and the block is predicted from its adjacent reconstructed pixels according to the corresponding intra-frame prediction technique. The prediction residual is then transformed to remove the spatial correlation within the transform block, and quantized. Finally, the quantized transform coefficients are encoded using the variable-length coding or arithmetic coding of the existing JVT technique.

At present, the video coding standard proposed by JVT, the audio/video standardization body formed jointly by ITU-T and ISO/IEC JTC1, is widely used at home and abroad in television image compression, multimedia communication, multimedia computers, image databases, communication and other fields. In JVT, a macroblock consists of a 16 × 16 block of luminance samples and two corresponding blocks of chrominance samples, and serves as the basic processing unit of the video codec.

In the intra prediction technique of version 5.0 of the video coding standard provided by JVT, luminance and chrominance samples are predicted with a prediction structure based on p × q blocks, where p denotes the number of columns of a block and q the number of rows. The pixel samples of the current block are predicted, according to a set of prediction modes and their calculation rules, from the already reconstructed pixel samples above, above right, above left and below left of the p × q block (FIG. 3). Here t_i (i = 0, 1, ..., 2q-1) denotes the pixel sample in row i of the column to the left of the current block, s_j (j = 0, 1, ..., 2p-1) denotes the pixel sample in column j of the row above the current block, and a_ij denotes the pixel sample in row i, column j of the current block; wherein,

in processing luminance samples, the JVT standard defines that when the blocks are 4 × 4, 4 × 8, 8 × 4 and 8 × 8 blocks, i.e. p is 4 or p is 8 and q is 4 or q is 8, 9 prediction modes are used, these prediction modes and the order being:

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: 45-degree directional prediction (diagonal down/left prediction)

Mode 4: 135-degree directional prediction (diagonal down/right prediction)

Mode 5: 112.5-degree directional prediction (vertical-right prediction)

Mode 6: 157.5-degree directional prediction (horizontal-down prediction)

Mode 7: 67.5-degree directional prediction (vertical-left prediction)

Mode 8: 22.5-degree directional prediction (horizontal-up prediction)

Wherein, except for the DC prediction mode, the remaining 8 prediction modes are called directional prediction modes, the numerals in fig. 4 designate the directions of the respective directional prediction modes, and 2, which is not labeled, denotes the DC prediction mode. The DC prediction mode is defined as:

i. if s_j (j = 0, 1, 2, ..., p-1) and t_i (i = 0, 1, 2, ..., q-1) are all available, then all prediction samples â_ij are equal to the mean of the samples s_j and t_i;

ii. if t_i (i = 0, 1, 2, ..., q-1) is not available and s_j (j = 0, 1, 2, ..., p-1) is available, then all prediction samples â_ij are equal to the mean of the samples s_j;

iii. if s_j (j = 0, 1, 2, ..., p-1) is not available and t_i (i = 0, 1, 2, ..., q-1) is available, then all prediction samples â_ij are equal to the mean of the samples t_i;

iv. if neither s_j (j = 0, 1, 2, ..., p-1) nor t_i (i = 0, 1, 2, ..., q-1) is available, then all prediction samples â_ij are equal to 128,

where i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

the JVT standard also defines that, when processing luminance samples with p = q = 16, 4 prediction modes are used; these modes and their order are:

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: plane prediction

The DC prediction mode definition therein is consistent with the prediction mode definitions of luminance blocks of 4 × 4 blocks, 4 × 8 blocks, 8 × 4 blocks and 8 × 8 blocks.

When processing chroma samples, the JVT standard defines 4 prediction modes for an 8 × 8 block, in the following order:

Mode 0: DC prediction

Mode 1: horizontal prediction

Mode 2: vertical prediction

Mode 3: plane prediction

JVT has a fine-grained prediction structure, but the accuracy of its DC prediction mode is limited, and it uses many prediction modes per sample block. For example, luminance samples in 4 × 4, 4 × 8, 8 × 4 and 8 × 8 blocks each have 9 prediction modes, which makes the overall algorithm highly complex.
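To make the baseline concrete, the conventional DC rule can be sketched as below. The sketch assumes the DC value is the mean of whichever neighboring samples are available, rounded to the nearest integer; the function name `jvt_dc_prediction` and the rounding choice are illustrative assumptions, not taken from the patent text.

```python
def jvt_dc_prediction(s, t, p, q):
    """Baseline DC value for a block with p columns and q rows.

    s: list of reconstructed samples above the block, or None if unavailable.
    t: list of reconstructed samples left of the block, or None if unavailable.
    Assumption of this sketch: the DC value is the rounded mean of the
    available neighbours, and 128 when none are available.
    """
    available = []
    if s is not None:
        available.extend(s[:p])  # first p samples of the row above
    if t is not None:
        available.extend(t[:q])  # first q samples of the left column
    if not available:
        return 128
    n = len(available)
    return (sum(available) + n // 2) // n  # integer mean with rounding
```

Every pixel of the block receives the same value, which is why a flat DC prediction can be inaccurate for blocks with a gradient.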

Disclosure of Invention

The invention aims to overcome the defect of inaccurate prediction of a DC prediction mode, reduce the computational complexity of an intra-frame prediction algorithm in the coding process and provide an intra-frame prediction method for video coding.

The system block diagram of the invention is shown in FIG. 1. In the intra-frame prediction method for video coding, an original video sequence captured by a video camera serves as input; a video capture card converts it into video sequence data that enters a computer, where it is processed using the video coding technology provided by JVT. The method comprises the following steps. The computer system receives the original video stream from the capture card, reads one image of the received video sequence, and divides the pixel samples of the image into 16 × 16 macroblocks from left to right and from top to bottom. Each macroblock read from computer memory is sent to the intra-frame prediction module. During coding, a prediction mode is specified for each basic processing block of the image: if inter-frame prediction is adopted, the motion vector of the current block is calculated by the corresponding algorithm to obtain the prediction value; otherwise intra-frame prediction is adopted and the block is predicted from its adjacent reconstructed pixels, that is, sample prediction is performed either with the JVT prediction modes and the intra-frame prediction method provided by the invention, or with the simplified prediction modes and the prediction mode calculation method provided by the invention. The prediction residual is then transformed according to the JVT standard method to remove the spatial correlation within the transform block, and quantized. The quantized transform coefficients are coded using the variable-length coding or arithmetic coding of the existing JVT technique until the image is completely coded, and the coded bit stream of the image is output. The next image of the received sequence is then read, and so on until all images are coded. The flow is shown in FIG. 6.

In the intra prediction technique of version 5.0 of the video coding standard provided by JVT, luminance and chrominance samples are predicted with a prediction structure based on p × q blocks, where p denotes the number of columns of a block and q the number of rows. The pixel samples of the current block are predicted, according to a set of prediction modes and their calculation rules, from the already reconstructed pixel samples above, above right, above left and below left of the p × q block. Here t_i (i = 0, 1, ..., 2q-1) denotes the pixel sample in row i of the column to the left of the current block, s_j (j = 0, 1, ..., 2p-1) denotes the pixel sample in column j of the row above the current block, and a_ij denotes the pixel sample in row i, column j of the current block; wherein,

1) in processing luminance samples, the JVT standard defines that when the blocks are 4 × 4, 4 × 8, 8 × 4 and 8 × 8, i.e. p is 4 or p is 8 and q is 4 or q is 8, 9 prediction modes are used, these prediction modes and the order being:

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: 45 degree direction prediction

Mode 4: 135 degree direction prediction

Mode 5: 112.5 degree Direction prediction

Mode 6: 157.5 degree direction prediction

Mode 7: 67.5 degree direction prediction

Mode 8: 22.5 degree direction prediction

Wherein, except for the DC prediction mode, the remaining 8 prediction modes are called directional prediction modes;

2) when processing luminance samples with p = q = 16, the JVT standard also defines that 4 prediction modes are used; these modes and their order are:

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: plane prediction

3) when processing chroma samples, the JVT standard defines 4 prediction modes for an 8 × 8 block, in the following order:

Mode 0: DC prediction

Mode 1: horizontal prediction

Mode 2: vertical prediction

Mode 3: plane prediction

The invention is characterized in that, after reading macroblock data from a computer memory, entering an intra prediction module, said predicting of pixel samples in each 16 × 16 macroblock selected for intra prediction consists of the following steps in sequence:

(1) taking a 16 × 16 macroblock as a current prediction macroblock;

(2) dividing the macroblock into p × q blocks in order from left to right and top to bottom, where p is the number of columns of a block and may equal 4, 8 or 16, and q is the number of rows of a block and may equal 4, 8 or 16;

(3) taking a p × q block as a current block;

(4) predicting a pixel luminance or chrominance sample value of the current block p × q;

(5) taking the next p × q block as the current block, and repeating the processes from the step (3) to the step (5) until the macro block is predicted completely;
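The traversal in steps (1) to (5) can be sketched as a raster-order loop over the blocks of one macroblock. The helper name `iter_blocks` and its `mb_size` parameter are illustrative assumptions; the per-block prediction of step (4) is left to the caller.

```python
def iter_blocks(p, q, mb_size=16):
    """Yield (row, col) of the top-left pixel of each p x q block.

    p is the block width in columns and q the block height in rows.
    Blocks are visited left to right, then top to bottom, as in step (2).
    """
    if mb_size % p or mb_size % q:
        raise ValueError("block size must divide the macroblock size")
    for row in range(0, mb_size, q):      # each block spans q rows
        for col in range(0, mb_size, p):  # each block spans p columns
            yield row, col


# Example: the four 8 x 8 blocks of a macroblock, in coding order.
for row, col in iter_blocks(8, 8):
    pass  # step (4): predict the block whose top-left pixel is (row, col)
```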

when predicting the pixel brightness or chroma sample value of the current block p × q, the DC prediction mode method is mainly characterized in that:

the current block is calculated for its DC prediction mode using samples of already decoded pixels in neighboring blocks (U, L, UR, UL, DL), wherein the symbol C is defined to represent the current block, the symbol U to represent an upper block adjacent to the current block, the symbol L to represent a left block adjacent to the current block, the symbol UL to represent an upper left block adjacent to the current block, the symbol UR to represent an upper right block adjacent to the current block, and the symbol DL to represent a lower left block adjacent to the current block;

1) when the upper, upper right, upper left and lower left blocks adjacent to the current block can be used, defining that all pixel predicted values of the current block in the DC prediction mode can be obtained by using a method similar to 8 prediction directions of the JVT standard, but the filtering method is different from the filtering method of the 8 prediction directions;

2) when the upper block adjacent to the current block is available, defining that all pixel predicted values of the current block in the DC prediction mode can be obtained by a method similar to 8 prediction directions of the JVT standard, but different from a filtering method of the 8 prediction directions;

3) when a left block adjacent to the current block is available, defining that all pixel predicted values of the current block in the DC prediction mode can be obtained by using a method similar to 8 prediction directions of the JVT standard, but the filtering method is different from the filtering method of the 8 prediction directions;

4) when the upper and left blocks adjacent to the current block are not available, defining the predicted value of all pixels of the current block to be 128 in the DC prediction mode.

The present invention is further characterized in that, after entering the intra prediction module, the DC prediction mode can be defined by the following method:

1) when the upper, upper right, upper left and lower left blocks adjacent to the current block are all available, the predicted values of all pixels of the current block in the DC prediction mode are defined to be obtained by a bidirectional prediction method, see DC₀ in FIG. 5;

2) when the upper block adjacent to the current block is available, the predicted values of all pixels of the current block in the DC prediction mode can be obtained by an approximately vertical prediction method, see DC₁ in FIG. 5; although this method follows the direction of vertical prediction, the adjacent pixels selected in the calculation and the filtering method are different;

3) when the left block adjacent to the current block is available, the predicted values of all pixels of the current block in the DC prediction mode can be obtained by an approximately horizontal prediction method, see DC₂ in FIG. 5; although this method follows the direction of horizontal prediction, the adjacent pixels selected in the calculation and the filtering method are different;

4) when neither the upper block nor the left block adjacent to the current block is available, the predicted value of all pixels of the current block in the DC prediction mode is defined as 128, which is the same as the existing JVT standard.
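The four cases above amount to a dispatch on neighbor availability. The sketch below keys only on the upper and left neighbors, following the operation rule stated later in terms of s_j and t_i; the function name `dc_variant` and the returned labels are illustrative assumptions.

```python
def dc_variant(up_available, left_available):
    """Pick the DC rule from the availability of the upper and left neighbours.

    Returns a label matching the cases in the text:
    "DC0" bidirectional, "DC1" approximately vertical,
    "DC2" approximately horizontal, "CONST_128" constant fallback.
    """
    if up_available and left_available:
        return "DC0"
    if up_available:
        return "DC1"
    if left_available:
        return "DC2"
    return "CONST_128"
```

For example, the first block of an image has no reconstructed neighbors, so `dc_variant(False, False)` selects the constant fallback.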

The intra prediction method for video coding according to the present invention is further characterized in that, after entering the intra prediction module, the DC prediction mode may specifically define values by using the following method:

(1) first, the reconstructed pixels t_i, s_j and f adjacent to the current block are low-pass filtered at the corresponding points according to the JVT method and placed into an array. The array is denoted EP, and the array element with index m is denoted EP_m. Here i = 0, 1, ..., 2q-1 denotes the pixel row coordinate and j = 0, 1, ..., 2p-1 denotes the pixel column coordinate; p × q denotes the block size, where p is the number of columns of the block and may equal 4, 8 or 16, and q is the number of rows of the block and may equal 4, 8 or 16; t_i denotes the pixel sample in row i of the column to the left of the current block; s_j denotes the pixel sample in column j of the row above the current block; a_ij denotes the pixel sample in row i, column j of the current block; m denotes the index of the array EP;

in the following calculations, the symbol ">>" denotes a bitwise right shift and the symbol "<<" a bitwise left shift;

the EP is derived from the following algorithm:

A. If the current macroblock has reconstructed pixels adjacent to its upper edge, i.e. s_j (j = 0, 1, 2, ..., p-1) is available, then

a) EP_(j+1) = s_(j); j = 0, ..., p-1

b) if the current macroblock has reconstructed pixels adjacent to its upper right, i.e. s_j (j = p, p+1, p+2, ..., 2p-1) is available, then

EP_(1+j+p) = s_(p+j); j = 0, ..., p-1

EP_(1+j+p) = s_(p+p-1); j = p, ..., q-1

otherwise

EP_(1+j+p) = EP_(p); j = 0, ..., p-1

c) EP_(1+j+p) = EP_(p+j); j = p, ..., q+1

d) EP_(0) = s_(0);

B. If the current macroblock has reconstructed pixels adjacent to its left edge, i.e. t_i (i = 0, 1, 2, ..., q-1) is available, then

a) EP_(-1-i) = t_(i); i = 0, ..., q-1

b) if the current macroblock has reconstructed pixels adjacent to its lower left, i.e. t_i (i = q, q+1, q+2, ..., 2q-1) is available, then

EP_(-1-i-q) = t_(q+i); i = 0, ..., q-1

EP_(-1-i-q) = t_(q+q-1); i = q, ..., p-1

otherwise

EP_(-1-i-q) = EP_(-q); i = 0, ..., q-1

c) EP_(-1-i-q) = EP_(-i-q); i = p, p+1

d) EP_(0) = t_(0);

C. If s_j (j = 0, 1, 2, ..., p-1) is available and t_i (i = 0, 1, 2, ..., q-1) is available, then

EP_(0) = f;

D. Define the variable last_pix equal to EP_(-(p+q));

taking i equal to -(p+q), where i is a counter, perform the following steps:

a) let the variable new_pix equal (last_pix + (EP_(i) << 1) + EP_(i+1) + 2) >> 2;

b) let the variable last_pix equal EP_(i);

c) let the array element EP_(i) with index i equal new_pix;

d) increase i by 1 and return to a), until i is greater than (p+q);

(2) operation rule of DC prediction mode

i. If s_j and t_i are all available, then all prediction samples â_ij are equal to (EP_i + EP_j) >> 1, see DC₀ in FIG. 5, where i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

ii. if t_i is not available and s_j is available, then all prediction samples â_ij are equal to EP_j, see DC₁ in FIG. 5, where i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

iii. if s_j is not available and t_i is available, then all prediction samples â_ij are equal to EP_i, see DC₂ in FIG. 5, where i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

iv. if neither s_j nor t_i is available, then all prediction samples â_ij are equal to 128, where i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate.
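The EP construction, the in-place low-pass filter and the DC rule can be sketched together as follows. This is a minimal sketch for square blocks (p = q = n) only. It makes three assumptions that the text does not state: EP entries that no rule defines are given the placeholder value 128, the sample for row i is read from EP[-1-i], and the sample for column j is read from EP[j+1], matching where t_i and s_j are stored. All function and parameter names are illustrative.

```python
def build_filtered_ep(top, top_right, left, bottom_left, corner, n):
    """Build and filter the EP array for an n x n block (p = q = n).

    top, top_right, left, bottom_left: lists of n samples, or None if
    that neighbour is unavailable. corner is the sample f.
    Returns a dict mapping index m to EP_m.
    """
    # Placeholder for undefined entries (assumption of this sketch).
    ep = {m: 128 for m in range(-(2 * n + 2), 2 * n + 3)}
    if top is not None:                                   # rule A
        for j in range(n):
            ep[j + 1] = top[j]
        for j in range(n):
            ep[1 + j + n] = top_right[j] if top_right is not None else ep[n]
        for j in (n, n + 1):                              # pad two entries
            ep[1 + j + n] = ep[n + j]
        ep[0] = top[0]
    if left is not None:                                  # rule B
        for i in range(n):
            ep[-1 - i] = left[i]
        for i in range(n):
            ep[-1 - i - n] = (bottom_left[i] if bottom_left is not None
                              else ep[-n])
        for i in (n, n + 1):
            ep[-1 - i - n] = ep[-i - n]
        ep[0] = left[0]
    if top is not None and left is not None:              # rule C
        ep[0] = corner
    # Rule D: in-place [1, 2, 1] / 4 filter; last_pix holds the
    # unfiltered value of the previous entry.
    last_pix = ep[-2 * n]
    for i in range(-2 * n, 2 * n + 1):
        new_pix = (last_pix + (ep[i] << 1) + ep[i + 1] + 2) >> 2
        last_pix = ep[i]
        ep[i] = new_pix
    return ep


def dc_predict(ep, n, top_available, left_available):
    """Apply the DC operation rule; returns n rows of n predicted samples."""
    pred = [[128] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if top_available and left_available:
                pred[i][j] = (ep[-1 - i] + ep[j + 1]) >> 1   # DC0
            elif top_available:
                pred[i][j] = ep[j + 1]                       # DC1
            elif left_available:
                pred[i][j] = ep[-1 - i]                      # DC2
    return pred
```

Unlike a single mean value, the prediction here varies per pixel: with only the upper neighbors available, each column takes its own filtered sample, so a horizontal gradient in the row above carries into the predicted block.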

The intra prediction method for video coding according to the invention is further characterized as follows. In hardware implementations, when the p × q block is 4 × 4, 4 × 8, 8 × 4 or 8 × 8, the structure of the 9 prediction modes for luminance samples is very complex, and it is desirable to compress the image with simpler prediction modes without reducing compression performance. The invention therefore provides that a subset of the 9 prediction modes may be selected and re-ordered according to coding requirements. For example, an intra luminance prediction method based on 5 prediction modes may be adopted, namely the DC prediction proposed by the invention together with the vertical, horizontal, 45-degree and 135-degree directional prediction modes of JVT:

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction (the DC prediction mode proposed by the invention)

Mode 3: 45-degree directional prediction (diagonal down/left prediction)

Mode 4: 135-degree directional prediction (diagonal down/right prediction)

Compared with the 9 prediction modes of the original JVT, the simplified prediction structure omits the prediction calculations for 4 directions, which greatly reduces computational complexity.

Likewise, for 16 × 16 luma blocks and 8 × 8 chroma blocks, it is possible to use only the DC prediction mode proposed by the invention together with one or two modes selected from the vertical, horizontal and plane prediction modes of JVT.

Compared with the intra-frame prediction method of the JVT standard, the intra-frame prediction method for video coding has the advantages that the DC prediction mode enables prediction to be more accurate, and the coding quality of images is improved; the simplified prediction mode greatly reduces the complexity of calculation under the condition of ensuring that the image coding performance is not reduced.

Drawings

FIG. 1 is a block diagram of a system;

FIG. 2 is a diagram of the locations of a current block and its neighboring blocks;

FIG. 3 is a block diagram of a prediction structure for p × q block samples;

FIG. 4 shows the 8 directional prediction modes for p × q blocks of luminance samples;

FIG. 5 is a schematic diagram of DC prediction mode;

FIG. 6 is a system flow diagram;

FIG. 7 is a diagram of the prediction structure for the luminance samples of an 8 × 8 block;

FIG. 8 is a graph of sample signal-to-noise ratio and bit rate for luminance samples under 9 prediction modes as defined by the present invention and the JVT standard;

FIG. 9 is a graph of sample signal-to-noise ratio and bit rate for luminance samples in 5 prediction modes defined by the present invention and in 9 prediction modes defined by the JVT standard;

FIG. 10 is a graph of signal-to-noise ratio against bit rate for U, the first of the two chroma samples, under the 2 prediction modes (DC prediction and plane prediction) defined by the invention and under the 4 prediction modes defined by the JVT standard;

FIG. 11 is a graph of signal-to-noise ratio against bit rate for V, the second of the two chroma samples, under the 2 prediction modes (DC prediction and plane prediction) defined by the invention and under the 4 prediction modes defined by the JVT standard;

Detailed Description

According to the technical scheme of the invention, as shown in fig. 1 and fig. 6, an original video sequence is obtained by a video camera as input, the input is changed into a video data stream by a video acquisition card and enters a computer, and the intra-frame prediction based on 8 x 8 blocks is carried out on the brightness sample value of an image in the sequence by adopting the video coding technology provided by JVT, and the method comprises the following specific steps:

1. reading an image in the sequence;

2. dividing an image into macroblocks in a size of 16 × 16;

3. taking a 16 x 16 macro block as a current prediction macro block;

4. dividing the macro block into 8 x 8 blocks from left to right and from top to bottom;

5. taking an 8 x 8 block as a current block;

6. predicting the pixel luminance samples of the current block 8 x 8 block;

the positions of the already coded pixel luminance samples around the current 8 × 8 block are shown in fig. 7, and the 9 prediction modes are defined in the following order:

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: 45-degree directional prediction (diagonal down/left prediction)

Mode 4: 135-degree directional prediction (diagonal down/right prediction)

Mode 5: 112.5-degree directional prediction (vertical-right prediction)

Mode 6: 157.5-degree directional prediction (horizontal-down prediction)

Mode 7: 67.5-degree directional prediction (vertical-left prediction)

Mode 8: 22.5-degree directional prediction (horizontal-up prediction)

According to the intra prediction method for video coding proposed by the present invention, the prediction modes of the block in 9 modes are defined as follows:

1. first, the reconstructed pixels t_i, s_j and f adjacent to the current block are low-pass filtered at the corresponding points according to the JVT method and placed into an array. The array is denoted EP, and the array element with index m is denoted EP_m. Here i = 0, 1, ..., 2q-1 denotes the pixel row coordinate and j = 0, 1, ..., 2p-1 denotes the pixel column coordinate; p × q denotes the block size, where p is the number of columns of the block and may equal 4, 8 or 16, and q is the number of rows of the block and may equal 4, 8 or 16; t_i denotes the pixel sample in row i of the column to the left of the current block; s_j denotes the pixel sample in column j of the row above the current block; a_ij denotes the pixel sample in row i, column j of the current block; m denotes the index of the array EP;

in the following calculations, the symbol ">>" denotes a bitwise right shift and the symbol "<<" a bitwise left shift.

The EP is derived from the following algorithm:

A. If the current macroblock has reconstructed pixels adjacent to its upper edge, i.e. s_j (j = 0, 1, 2, ..., 7) is available, then (with p = q = 8)

a) EP_(j+1) = s_(j); j = 0, ..., 7

b) if the current macroblock has reconstructed pixels adjacent to its upper right, i.e. s_j (j = 8, 9, ..., 15) is available, then

EP_(1+j+p) = s_(p+j); j = 0, ..., 7

otherwise

EP_(1+j+p) = EP_(p); j = 0, ..., 7

c) EP_(1+j+p) = EP_(p+j); j = 8, 9

d) EP_(0) = s_(0);

B. If the current macroblock has reconstructed pixels adjacent to its left edge, i.e. t_i (i = 0, 1, 2, ..., 7) is available, then

a) EP_(-1-i) = t_(i); i = 0, ..., 7

b) if the current macroblock has reconstructed pixels adjacent to its lower left, i.e. t_i (i = 8, 9, ..., 15) is available, then

EP_(-1-i-q) = t_(q+i); i = 0, ..., 7

otherwise

EP_(-1-i-q) = EP_(-q); i = 0, ..., 7

c) EP_(-1-i-q) = EP_(-i-q); i = 8, 9

d) EP_(0) = t_(0);

C. If s_j (j = 0, 1, 2, ..., 7) is available and t_i (i = 0, 1, 2, ..., 7) is available, then

EP_(0) = f;

D. Define the variable last_pix equal to EP_(-16);

taking i equal to -16, where i is a counter, perform the following steps:

a) let the variable new_pix equal (last_pix + (EP_(i) << 1) + EP_(i+1) + 2) >> 2;

b) let the variable last_pix equal EP_(i);

c) let the array element EP_(i) with index i equal new_pix;

d) increase i by 1 and return to a), until i is greater than 16;

2. calculating the predicted value in each mode

a. Mode 0: vertical prediction

This mode requires that s_j (j = 0, 1, 2, ..., 7) be available. The prediction samples â_ij are generated by the same rule as in the JVT standard.

b. Mode 1: horizontal prediction

This mode requires that t_i (i = 0, 1, 2, ..., 7) be available. The prediction samples â_ij are generated by the same rule as in the JVT standard.

c. Mode 2: DC prediction

i. If s_j (j = 0, 1, 2, ..., 7) and t_i (i = 0, 1, 2, ..., 7) are all available, then all prediction samples â_ij are equal to (EP_i + EP_j) >> 1;

ii. if t_i (i = 0, 1, 2, ..., 7) is not available and s_j (j = 0, 1, 2, ..., 7) is available, then all prediction samples â_ij are equal to EP_j;

iii. if s_j (j = 0, 1, 2, ..., 7) is not available and t_i (i = 0, 1, 2, ..., 7) is available, then all prediction samples â_ij are equal to EP_i;

iv. if neither s_j (j = 0, 1, 2, ..., 7) nor t_i (i = 0, 1, 2, ..., 7) is available, then all prediction samples â_ij are equal to 128,

where i = 0, 1, 2, ..., 7 denotes the pixel row coordinate and j = 0, 1, 2, ..., 7 denotes the pixel column coordinate;

the other prediction modes and their operation rules are the same as the JVT standard.

7. Determining an optimal prediction mode for a current block

a. Let k denote the current prediction mode, with initial value 0;

b. obtain the prediction residual Δ_k under prediction mode k from the following prediction residual formula:

$$\Delta_k = a_{ij} - \hat{a}_{kij}$$

Here a_ij denotes the luminance sample of the original pixel and â_kij denotes the predicted pixel luminance sample in mode k, where k is the prediction mode number, i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

c. apply the DCT (discrete cosine transform), quantization and entropy coding to the prediction residual of each pixel using the coding method in JVT, and count the number of bits needed to code the current block in the current mode; in addition, after the DCT and quantization, apply inverse quantization and the inverse DCT to the prediction residual of each pixel and add the prediction value â_kij to obtain the reconstructed luminance sample of each pixel in the block, where k is the prediction mode number, i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

d. calculate the rate-distortion cost of the block in the current prediction mode using the method in JVT, denoted rdcost:

$$\text{distortion} = \sum_{i,j} \left(a_{ij} - \hat{a}_{kij}\right)^2$$

rdcost = distortion + lambda × rate;

where distortion is the sum, over all pixels of the current block, of the squared difference between the original luminance sample and the prediction value, lambda is a constant, and rate is the number of bits used to code the current block in the current mode;

e. increase k by 1 and repeat steps b, c, d and e until all prediction modes of the block have been evaluated;

f. compare the rdcost of each mode and select the mode with the smallest rdcost as the current optimal prediction mode;

8. take the prediction values in the optimal prediction mode as the final prediction values of the block, and take the reconstructed values in the optimal prediction mode as the final reconstructed values of the block, where i = 0, 1, 2, ..., q-1 denotes the pixel row coordinate and j = 0, 1, 2, ..., p-1 denotes the pixel column coordinate;

9. taking the next 8 x 8 block as the current block, and repeating the processes of steps 6 to 9 until the macro block is completely encoded;

10. and taking the next macro block as the current prediction macro block, and repeating the processes of the steps 3 to 10 until the whole image is coded.

11. Take the next image and repeat steps 2 to 11 until the whole sequence has been coded.

If the simplified prediction mode is selected, only the prediction values of the prediction modes selected as required in the step 6 need to be calculated each time, for example, only the prediction values of the first 5 prediction modes can be calculated.
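The mode decision of step 7 can be sketched as a loop that evaluates rdcost = distortion + lambda × rate for each candidate mode and keeps the minimum. In this sketch the predicted block and the bit count for each mode are supplied by the caller, since the transform, quantization and entropy coding stages are outside its scope; the names `sum_squared_error` and `choose_mode` are illustrative assumptions.

```python
def sum_squared_error(original, predicted):
    """Sum of squared differences between two equally sized 2-D blocks."""
    return sum((a - b) ** 2
               for row_a, row_b in zip(original, predicted)
               for a, b in zip(row_a, row_b))


def choose_mode(original, candidates, lam):
    """Return (best_mode, best_rdcost).

    candidates maps a mode number k to (predicted_block, rate_bits).
    Modes are tried in increasing order of k; ties keep the lower mode.
    """
    best_mode, best_cost = None, None
    for k in sorted(candidates):
        predicted, rate_bits = candidates[k]
        cost = sum_squared_error(original, predicted) + lam * rate_bits
        if best_cost is None or cost < best_cost:
            best_mode, best_cost = k, cost
    return best_mode, best_cost
```

With the simplified prediction structure, the caller would pass only the selected modes (for example modes 0 to 4) in `candidates`, so fewer predictions are evaluated per block.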

Example results

1. Using the 9-mode prediction structure with the improved DC prediction mode, a 10-frame all-intra prediction test was run on a 1280 × 720 high-definition video sequence with 8 × 8 blocks as the basic processing blocks. The sample signal-to-noise ratio and bit rate were compared with those of the intra prediction technique of the existing JVT standard at different quantization values (table below), and a curve of luminance sample signal-to-noise ratio (PSNRY) against bit rate (Bitrate) was plotted (FIG. 8).

High definition video sequence test results: (frame rate: 30Hz, 10 frames, 1280 x 720)

		qp=29	qp=32	qp=37	qp=43	Gain
JVT	PSNRY	40.74	39.07	36.44	33.65
	Bitrate	49975.51	39797.21	26065.39	16003.66
The invention	PSNRY	40.74	39.07	36.45	33.66
	Bitrate	48273.29	38174.86	24659.59	15053.78	0.317872

As can be seen from the figure, the curve obtained with the 9 prediction modes including the improved DC prediction mode proposed by the present invention lies above the curve obtained with the JVT intra prediction method, which shows that the invention improves the compression performance of the image without any increase in complexity.
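As a rough cross-check of the table above, the relative bit-rate reduction at each quantization value can be computed directly from the listed Bitrate figures. The sketch below uses only numbers from the table; the helper name `bitrate_saving_percent` is hypothetical, and the result is a plain per-qp percentage, not the value reported in the Gain column.

```python
# Bitrate rows copied from the table (qp = 29, 32, 37, 43).
qps = [29, 32, 37, 43]
jvt_bitrate = [49975.51, 39797.21, 26065.39, 16003.66]
inv_bitrate = [48273.29, 38174.86, 24659.59, 15053.78]


def bitrate_saving_percent(reference, test):
    """Percentage by which each test bit rate is below its reference."""
    return [100.0 * (r - t) / r for r, t in zip(reference, test)]


savings = bitrate_saving_percent(jvt_bitrate, inv_bitrate)
for qp, s in zip(qps, savings):
    print(f"qp={qp}: {s:.2f}% lower bit rate")
```

Because PSNRY is equal or 0.01 higher for the invention at every qp in the table, these reductions are obtained at essentially the same luminance quality.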

2. Using the simplified prediction structure provided by the invention, with only the first 5 prediction modes adopted and their original order kept, a 10-frame all-intra test was carried out on a 1280 × 720 high-definition video sequence. The sample signal-to-noise ratio and the bit rate were compared with those of the intra-frame prediction technology in the existing JVT standard under different quantization values (see the following table), and a curve of sample signal-to-noise ratio against bit rate was drawn (FIG. 9).

High definition video sequence test results: (frame rate: 30Hz, 10 frames, 1280 x 720)

		qp=29	qp=32	qp=37	qp=43	Gain
JVT	PSNRY	40.74	39.07	36.44	33.65
	Bitrate	49975.51	39797.21	26065.39	16003.66
The invention	PSNRY	40.71	39.05	36.42	33.62
	Bitrate	49494.07	39244.87	25611.6	15811.37	0.070442

It can be seen from the figure that the curve obtained by the simplified prediction mode method proposed by the present invention substantially coincides with the curve obtained by the JVT intra prediction method, which shows that the present invention can maintain the compression performance of the image well under the condition of reducing a great deal of complexity.

3. Using the simplified prediction structure provided by the invention, the two chrominance samples were predicted with only 2 prediction modes, DC prediction (mode 0) and plane prediction (mode 1), and a 10-frame all-intra test was carried out on a 1280 × 720 high-definition video sequence. The chrominance sample signal-to-noise ratios (marked as PSNRU and PSNRV) and the bit rate were compared with those of the intra-frame prediction technology in the existing JVT standard under different quantization values (see the following table), and curves of chrominance sample signal-to-noise ratio against bit rate were drawn (FIG. 10 and FIG. 11).

		QP=27	QP=30	QP=35	QP=40	Gain
JVT	PSNRU	43.73	42.51	40.39	38.42
	PSNRV	45.02	43.81	41.73	39.75
	Bitrate	10852.48	8886.85	6255.15	4354.17
The invention	PSNRU	43.65	42.44	40.34	38.36
	PSNRV	44.95	43.74	41.69	39.71	-0.09104
	Bitrate	10898.41	8916.32	6299.61	4372.89	-0.08189

Claims

1. An intra-frame prediction method for video coding, wherein the video coding obtains an original video stream as input through a video camera, the original video stream is converted into a video data stream by a video capture card and enters a computer, and the computer processes it using the video coding technique provided by JVT, the method comprising the steps of: the computer system receives the original video stream processed by the capture card, then reads out an image of the received video sequence, and divides the pixel sample values of the image into 16 × 16 macroblocks from left to right and from top to bottom; the macroblock read from the computer memory is sent to the intra-frame prediction module; during coding, the prediction mode of the basic processing block in the image is specified; if inter-frame prediction is adopted, the motion vector of the current block is calculated according to the corresponding algorithm and the predicted value is obtained; otherwise, intra-frame prediction is adopted and the current block is predicted from its adjacent reconstructed pixels according to the corresponding intra-frame prediction technique; the prediction residual is then transformed according to the method of the JVT standard to remove the spatial correlation in the transform block, and then quantized; the quantized transform coefficient information is then coded using the variable length coding or arithmetic coding of the JVT technique until the coding of the image is completed, and finally the coded bit stream of the image is output; the next image in the received sequence is read and the above steps are repeated until all images are coded;

in the intra prediction technique of the version 5.0 video coding standard provided by JVT, the prediction of luminance or chrominance samples uses a prediction structure based on p × q blocks, where p denotes the number of columns of a block and q denotes the number of rows of a block; this structure predicts the pixel samples of the current block according to certain prediction modes and their calculation rules, using the pixel samples that have already been reconstructed above, above right, above left and below left of the p × q block, where i = 0, 1, …, 2q-1 denotes the pixel row coordinate, j = 0, 1, …, 2p-1 denotes the pixel column coordinate, t_i represents the pixel sample in row i of the column to the left of the current block, s_j represents the pixel sample in column j of the row above the current block, and a_ij represents the pixel sample in row i and column j of the current block; wherein,

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: 45 degree direction prediction

Mode 4: 135 degree direction prediction

Mode 5: 112.5 degree Direction prediction

Mode 6: 157.5 degree direction prediction

Mode 7: 67.5 degree direction prediction

Mode 8: 22.5 degree direction prediction

Mode 0: vertical prediction

Mode 1: horizontal prediction

Mode 2: DC prediction

Mode 3: plane prediction

Mode 0: DC prediction

Mode 1: horizontal prediction

Mode 2: vertical prediction

Mode 3: plane prediction

(1) taking a 16 × 16 macroblock as a current prediction macroblock;

(3) taking a p × q block as a current block;

(4) predicting a pixel luma or chroma sample value of the current block p × q;

(5) taking the next p × q block as the current block, and repeating the processes from (3) to (5) until the macro block is predicted;

2. The method of claim 1, wherein the DC prediction mode is defined as follows after entering an intra prediction module:

1) when the upper, upper right, upper left and lower left blocks adjacent to the current block can be used, all pixel predicted values of the current block can be obtained by a bidirectional prediction method under the DC prediction mode;

2) when the upper block adjacent to the current block is available, all pixel predicted values of the current block in the DC prediction mode are defined to be obtained by an approximately vertical prediction method; although its direction is consistent with the vertical prediction direction, the adjacent pixels selected in the operation and the filtering method are different;

3) when the left block adjacent to the current block is available, all pixel predicted values of the current block in the DC prediction mode are defined to be obtained by an approximately horizontal prediction method; although its direction is consistent with the horizontal prediction direction, the adjacent pixels selected in the operation and the filtering method are different;

4) when neither the upper block nor the left block adjacent to the current block is available, the predicted values of all pixels of the current block in the DC prediction mode are defined to be 128.

3. The method according to claim 1 or 2, wherein the DC prediction mode, after entering the intra prediction module, is specifically computed as follows:

(1) firstly, the reconstructed pixels t_i, s_j and f adjacent to the current block are low-pass filtered at the corresponding points according to the JVT method and placed into an array, denoted EP, whose m-th element is denoted EP_m, where i = 0, 1, …, 2q-1 denotes the pixel row coordinate, j = 0, 1, …, 2p-1 denotes the pixel column coordinate, p × q denotes the block size, p denotes the number of columns of the block and may be equal to 4, 8 or 16, q denotes the number of rows of the block and may be equal to 4, 8 or 16, t_i represents the pixel sample in row i of the column to the left of the current block, s_j represents the pixel sample in column j of the row above the current block, and a_ij represents the pixel sample in row i and column j of the current block; m denotes the index of the array EP;

the EP is derived from the following algorithm:

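A. If the current macroblock has reconstructed pixels adjacent to its upper edge, i.e. s_j is available, where j = 0, 1, 2, …, p-1, then

a) EP_(j+1) = s_(j); j = 0, …, p-1

b) If the current macroblock has reconstructed pixels adjacent to the upper right, i.e. s_j is available, where j = p, p+1, p+2, …, 2p-1, then

EP_(1+j+p) = s_(p+j); j = 0, …, p-1

EP_(1+j+p) = s_(p+p-1); j = p, …, q-1

Otherwise

EP_(1+j+p) = EP_(p); j = 0, …, p-1

c) EP_(1+j+p) = EP_(p+j); j = p, …, q+1

d) EP_(0) = s_(0);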

B. If the current macroblock has reconstructed pixels adjacent on the left, i.e. t_i is available, where i = 0, 1, 2, …, q-1, then

a) EP_(-1-i) = t_(i); i = 0, …, q-1

b) If the current macroblock has reconstructed pixels adjacent to the bottom left, i.e. t_i is available, where i = q, q+1, q+2, …, 2q-1, then

EP_(-1-i-q) = t_(q+i); i = 0, …, q-1

EP_(-1-i-q) = t_(q+q-1); i = q, …, p-1

Otherwise

EP_(-1-i-q) = EP_(-q); i = 0, …, q-1

c) EP_(-1-i-q) = EP_(-i-q); i = p, p+1

d) EP_(0) = t_(0);

C. If s_j is available and t_i is available, where i = 0, 1, 2, …, q-1 and j = 0, 1, 2, …, p-1, then

EP_(0) = f;

D. Define the variable last_pix equal to EP_(-(p+q));

b) Let the variable last_pix equal EP_(i);

c) Let the array element EP_(i) with index i equal new_pix;

d) increase i by 1 and return to a), until i is greater than (p + q);
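Steps A to C above, which fill the EP array from the neighbouring reconstructed pixels, can be sketched as follows. This is an illustrative reading, not the reference implementation: the function `build_ep`, its parameters and the use of a dictionary keyed by the signed index m are assumptions, the sketch is restricted to square blocks (p equal to q), and the low-pass filtering loop of step D is left out because its update rule is not fully stated in this text.

```python
from typing import Dict, Optional, Sequence


def build_ep(
    s: Optional[Sequence[int]],   # 2p samples of the row above, or None
    t: Optional[Sequence[int]],   # 2q samples of the left column, or None
    f: Optional[int],             # corner sample used when s and t both exist
    p: int,
    q: int,
    top_right_ok: bool,
    bottom_left_ok: bool,
) -> Dict[int, int]:
    """Fill EP[m] following steps A-C; positive m = above, negative m = left."""
    if p != q:
        raise ValueError("this sketch only covers square blocks")
    ep: Dict[int, int] = {}
    if s is not None:                          # step A
        for j in range(p):
            ep[j + 1] = s[j]                   # A.a
        for j in range(p):                     # A.b (the j = p..q-1 range is
            ep[1 + j + p] = s[p + j] if top_right_ok else ep[p]  # empty here)
        for j in range(p, q + 2):              # A.c: pad two extra entries
            ep[1 + j + p] = ep[p + j]
        ep[0] = s[0]                           # A.d
    if t is not None:                          # step B mirrors step A
        for i in range(q):
            ep[-1 - i] = t[i]                  # B.a
        for i in range(q):                     # B.b
            ep[-1 - i - q] = t[q + i] if bottom_left_ok else ep[-q]
        for i in (p, p + 1):                   # B.c
            ep[-1 - i - q] = ep[-i - q]
        ep[0] = t[0]                           # B.d
    if s is not None and t is not None:        # step C
        ep[0] = f
    return ep


above = list(range(10, 18))   # toy values for a 4 x 4 block
left = list(range(20, 28))
ep_full = build_ep(above, left, 5, 4, 4, True, True)
ep_no_tr = build_ep(above, None, None, 4, 4, False, False)
```

A dictionary is used only so that the negative indices in the text can be written literally; an encoder would more likely use a flat array with an offset.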

(2) operation rule of DC prediction mode

i. If s_j and t_i are both available, every prediction sample is equal to (EP_i + EP_j) >> 1;

ii. If t_i is not available and s_j is available, every prediction sample is equal to EP_j;

iii. If s_j is not available and t_i is available, every prediction sample is equal to EP_i;

iv. If neither s_j nor t_i is available, every prediction sample is equal to 128;

where, in each case, i = 0, 1, 2, …, q-1 denotes the pixel row coordinate and j = 0, 1, 2, …, p-1 denotes the pixel column coordinate.
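The four cases of the DC operation rule can be sketched as a small function. This is a hedged illustration: `dc_predict` and its arguments are hypothetical names, `top[j]` and `left[i]` stand for the filtered EP entries belonging to column j above and row i to the left of the block (the exact EP index mapping is an assumption), and the two-sample average is read as a right shift by one bit.

```python
from typing import List, Optional, Sequence


def dc_predict(
    top: Optional[Sequence[int]],    # filtered samples above, one per column
    left: Optional[Sequence[int]],   # filtered samples to the left, one per row
    p: int,                          # number of columns
    q: int,                          # number of rows
) -> List[List[int]]:
    """Return a q x p block of predicted samples for the DC mode."""
    if top is not None and left is not None:
        # both neighbours available: average the row and column sample
        return [[(left[i] + top[j]) >> 1 for j in range(p)] for i in range(q)]
    if top is not None:
        # only the upper neighbour: copy it down each column
        return [[top[j] for j in range(p)] for _ in range(q)]
    if left is not None:
        # only the left neighbour: copy it across each row
        return [[left[i]] * p for i in range(q)]
    # no neighbour available: mid-grey default
    return [[128] * p for _ in range(q)]


top = [100, 102, 104, 106]
left = [50, 60, 70, 80]
both = dc_predict(top, left, 4, 4)
only_top = dc_predict(top, None, 4, 4)
only_left = dc_predict(None, left, 4, 4)
neither = dc_predict(None, None, 4, 4)
```

Unlike a conventional DC mode that fills the block with a single mean value, the first case here varies per pixel, which matches the bidirectional behaviour described in claim 2.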

4. The method of claim 1, wherein a subset of the 9 prediction modes can be selected and re-ordered according to coding requirements; for example, an intra luminance sample prediction method based on 5 prediction modes can be adopted, namely the DC prediction proposed by the present invention together with the vertical prediction, horizontal prediction, 45-degree direction prediction and 135-degree direction prediction modes adopted in JVT;

likewise, for 16 × 16 luminance blocks and 8 × 8 chrominance blocks, it is also possible to use only one or two of the following prediction modes: the DC prediction proposed by the present invention, and the vertical prediction, horizontal prediction and plane prediction modes used in JVT.