CN103369326A

CN103369326A - Transform coder applicable to the HEVC (High Efficiency Video Coding) standard

Info

Publication number: CN103369326A
Application number: CN2013102833903A
Authority: CN
Inventors: 李甫; 樊春晓; 牛毅; 石光明; 齐飞; 周蕾蕾; 张犁; 宋晓丹; 焦丹丹
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2013-07-05
Filing date: 2013-07-05
Publication date: 2013-10-23
Anticipated expiration: 2033-07-05
Also published as: CN103369326B

Abstract

The invention discloses a transform coder applicable to the HEVC (High Efficiency Video Coding) standard. It mainly addresses the heavy use of multipliers and the resulting circuit complexity in the prior art. The transform coder comprises a one-dimensional DCT (discrete cosine transform) module (1), a transpose buffer module (2) and a top-level control module (3). The one-dimensional DCT module (1) uses several butterfly units and several odd coefficient processing units to carry out every DCT size in the HEVC standard. The odd coefficient processing units break the complex multiplications down into multi-stage circuits built from shifters, adders and subtracters, so that multi-stage shifters, adders and subtracters replace the matrix multiplier and the circuit structure is simplified. The transform coder has a simple and regular structure, high reusability, a short critical path and a high clock frequency, and it is easy to integrate. Transform coding of video residual data is carried out efficiently without using any multipliers.

Description

Transform coder suitable for the High Efficiency Video Coding standard HEVC

Technical field

The invention belongs to the field of electronic circuit technology and relates in particular to the structure of the transform coder in the HEVC video compression coding standard. It can be applied to VLSI (very-large-scale integration) circuit design.

Background technology

With the development of the electronics and information industry, digital video technology is applied ever more widely. As image resolution keeps rising, however, the corresponding data volume grows with it, and the conflict between this mass of data and the available disk and channel capacity becomes increasingly prominent. High data rates and large data volumes therefore pose a major challenge to existing compression algorithms and have become a bottleneck for high-resolution video applications. How to reduce the data volume with no loss of information, or with as little loss as possible, has become an active research problem, and many image and video compression algorithms have been proposed in succession.

Among them, HEVC, as the latest video compression coding standard, adopts many efficient image compression algorithms. Compared with the H.264 video compression coding standard, it uses a finer tree-structured partitioning, so images are divided into blocks more finely. The basic block size also increases from the 16×16 used in H.264 to 64×64, which makes HEVC better suited to compressing large images. The higher compression efficiency comes at the cost of much greater computational complexity. As the basic block size grows, so does the size of the HEVC transform unit: four DCT sizes must be supported, namely 4×4, 8×8, 16×16 and 32×32. The number of multipliers in the corresponding circuit therefore rises sharply and the transform circuit becomes very complex, which makes it a difficult point for hardware implementation. Designing an efficient transform coder is thus very important.

So far, two main transform coding structures have been proposed to reduce the number of multipliers in the transform coding module and lower its complexity:

The first is the structure used in the HEVC test model, which combines a partial butterfly with matrix multipliers. It exploits the symmetry of the basis matrix in transform coding and reduces the number of multipliers by a factor of three. The structure consists of four butterfly structures and four matrix multipliers. Each butterfly structure is made up of a series of adders and subtracters. After the butterfly, the computation splits into an even part and an odd part: the even part is computed by reusing the transform circuit of the next smaller block size, while the odd part is computed with a matrix multiplier. Even after this optimization, the matrix multipliers still contain many multipliers, so the structure is hard to implement in hardware.

The second is the patent application "Transform coder suitable for the HEVC standard" filed by Xidian University (application number 201210251115.9, publication number CN102857756A). That application discloses a transform coder for the HEVC standard whose main purpose is to reduce the excessive use of multipliers in the combined partial-butterfly and matrix-multiplier structure. The structure comprises a one-dimensional DCT/DST module, a transpose buffer module and a top-level control unit. The one-dimensional DCT/DST module combines butterfly structures with a matrix multiplication array to perform the various HEVC transforms. The transpose buffer module transposes the transform data by exploiting the path delay between the registers and the memory and by using different storage and read orders. The top-level control unit generates the reset and enable signals of the one-dimensional DCT/DST module and the transpose buffer module and coordinates their operation. However, the one-dimensional transform module in this structure still uses 48 multipliers. Its circuit structure is complex and ill-suited to efficient hardware implementation, and it needs many clock cycles to perform the larger transforms.

Summary of the invention

The object of the invention is to overcome the above deficiencies of the prior art by providing a transform coder suitable for the High Efficiency Video Coding standard HEVC that reduces the complexity of the circuit structure, reduces the number of clock cycles needed for transform coding, is easy to implement in hardware, and meets the high-performance implementation requirements of the HEVC coding standard.

The technical idea of the invention is to decompose the matrix multiplication in the combined partial-butterfly and matrix-multiplier structure: the complex multiplications are split across a multi-stage circuit and carried out with simple shifters and adders. The computational complexity of each stage is thereby greatly reduced, which shortens the critical path and raises the clock frequency and coding efficiency of the transform coding circuit. The result is a multiplier-free transform coder suitable for the High Efficiency Video Coding standard HEVC.
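As an illustration of this idea (not part of the original disclosure), the Python sketch below multiplies by two constants of the HEVC 4-point transform matrix, 83 and 36, using only shifts and additions. The function names and the particular binary decompositions are assumptions chosen for clarity; the circuit described later uses its own staged decomposition.

```python
def times_83(x: int) -> int:
    """Multiply by 83 without a multiplier: 83 = 64 + 16 + 2 + 1."""
    return (x << 6) + (x << 4) + (x << 1) + x


def times_36(x: int) -> int:
    """Multiply by 36 without a multiplier: 36 = 32 + 4."""
    return (x << 5) + (x << 2)
```

Each shift is free in hardware (it is only wiring), so the cost of a constant multiplication reduces to a few adders, and those adders can be spread over several pipeline stages.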

Following this idea, the transform coder of the invention comprises a one-dimensional DCT module, a transpose buffer module and a top-level control module. The data output of the one-dimensional DCT module is connected to the data input of the transpose buffer module, and its data input is connected to the data output of the transpose buffer module. The top-level control module is connected to the reset and enable terminals of the one-dimensional DCT module and to the reset and enable terminals of the transpose buffer module. The transform coder is characterized in that:

The one-dimensional DCT module comprises:

a 32-point butterfly unit, which adds and subtracts the input coefficients to be transformed in pairs, inputs the 16 data items obtained by addition to the 16-point butterfly unit, and inputs the 16 data items obtained by subtraction to the 32-point odd coefficient processing unit;

a 16-point butterfly unit, which adds and subtracts in pairs the 16 data items input from the 32-point butterfly unit, inputs the 8 data items obtained by addition to the 8-point butterfly unit, and inputs the 8 data items obtained by subtraction to the 16-point odd coefficient processing unit;

a 32-point odd coefficient processing unit, which forms the sums of each of the 16 data items input from the 32-point butterfly unit and left-shifted copies of itself, then shifts, adds and subtracts these sums according to 16 different sets of shift amounts to obtain 16 transform data items, which are input to the transpose buffer module;

an 8-point butterfly unit, which adds and subtracts in pairs the 8 data items input from the 16-point butterfly unit, inputs the 4 data items obtained by addition to the 4-point butterfly unit, and inputs the 4 data items obtained by subtraction to the 8-point odd coefficient processing unit;

a 16-point odd coefficient processing unit, which forms the sums of each of the 8 data items input from the 16-point butterfly unit and left-shifted copies of itself, then shifts, adds and subtracts these sums according to 8 different sets of shift amounts to obtain 8 transform data items, which are input to the transpose buffer module;

a 4-point butterfly unit, which adds and subtracts in pairs the 4 data items input from the 8-point butterfly unit, inputs the 2 data items obtained by addition to the 4-point even coefficient processing unit, and inputs the 2 data items obtained by subtraction to the 4-point odd coefficient processing unit;

an 8-point odd coefficient processing unit, which forms the sums of each of the 4 data items input from the 8-point butterfly unit and left-shifted copies of itself, then shifts, adds and subtracts these sums according to 4 different sets of shift amounts to obtain 4 transform data items, which are input to the transpose buffer module;

a 4-point even coefficient processing unit, which delays the 2 data items input from the 4-point butterfly unit and applies shift, add and subtract operations to obtain 2 transform data items, which are input to the transpose buffer module;

a 4-point odd coefficient processing unit, which forms the sums of each of the 2 data items input from the 4-point butterfly unit and left-shifted copies of itself, then shifts, adds and subtracts these sums according to 2 different sets of shift amounts to obtain 2 transform data items, which are input to the transpose buffer module;

a reset/enable control unit, connected to the top-level control module, which receives the reset and enable signals output by the top-level control module and uses them to control the reset and enabling of each unit in the one-dimensional DCT module.

Compared with the prior art, the invention has the following advantages:

First, the invention adopts a unified transform structure in which the same coder circuit performs the DCTs of all four block sizes. This raises the degree of circuit reuse and greatly reduces the circuit scale.

Second, the one-dimensional DCT module of the invention distributes the complex multiplications over a multi-stage circuit and uses multiplier-free odd coefficient processing units to carry them out. This lowers the complexity of each stage, raises the system clock frequency, and makes the design better suited to hardware implementation.

Description of drawings

Fig. 1 is a block diagram of the overall structure of the transform coder of the invention;

Fig. 2 is a schematic diagram of the structure of the transpose buffer module of the invention;

Fig. 3 is a block diagram of the structure of the one-dimensional DCT module of the invention;

Fig. 4 is a diagram of the structure and interconnection of the 32-point, 16-point, 8-point and 4-point butterfly units of the invention;

Fig. 5 is a structural diagram of the 4-point even coefficient processing unit of the invention;

Fig. 6 is a structural diagram of the 4-point odd coefficient processing unit of the invention;

Fig. 7 is a structural diagram of the 8-point coefficient addition subunit of the invention;

Fig. 8 is a structural diagram of the 16-point coefficient addition subunit of the invention;

Fig. 9 is a structural diagram of the 32-point coefficient addition subunit of the invention.

Embodiment

The invention improves on the one-dimensional transform structure in the existing HEVC standard: it reduces the computational complexity of each pipeline stage, raises the system clock frequency, and lends itself more readily to parallel hardware implementation.

The invention is described in detail below with reference to the drawings and embodiments.

Referring to Fig. 1, the transform coder for the High Efficiency Video Coding standard HEVC according to the invention consists of a one-dimensional DCT module 1, a transpose buffer module 2 and a top-level control module 3. The output of the top-level control module 3 splits into two paths: the first is connected to the one-dimensional DCT module 1 and the second to the transpose buffer module 2. The data input of the one-dimensional DCT module 1 has two sources: the first is the external input data and the second is the data output of the transpose buffer module 2. The data output of the one-dimensional DCT module 1 is connected to the data input of the transpose buffer module 2. The data output of the transpose buffer module 2 splits into two paths: the first is connected to the data input of the one-dimensional DCT module 1 and the second to the external output. In detail:

The top-level control module 3 comprises a reset/enable module 30 and a data flow control module 31. The reset/enable module 30 is connected to the reset/enable control unit 19 of the one-dimensional DCT module 1 and to the transpose reset/enable unit 20 of the transpose buffer module 2, and supplies enable and reset signals to these two modules. The data flow control module 31 is connected to the address control unit 22 of the transpose buffer module 2 and generates the control signals that determine the read/write mode and read/write order of the transpose buffer module 2. Both the reset/enable module 30 and the data flow control module 31 are built from counters and logic circuits. Based on the counter state and the transform type currently being performed, the logic circuits generate the reset and enable signals of the one-dimensional DCT module 1 and the reset, enable and data flow control signals of the transpose buffer module 2. These signals make the one-dimensional DCT module 1 perform the one-dimensional row transform on the input data of the transform coder and make the transpose buffer module 2 receive the row transform results from the one-dimensional DCT module 1. Once all rows of data have been processed, the transpose buffer module 2 is directed to output the transposed row transform results to the one-dimensional DCT module 1 for the one-dimensional column transform.

Referring to Fig. 2, the transpose buffer module 2 comprises a transpose reset/enable unit 20, a RAM memory 21 and an address control unit 22. The transpose reset/enable unit 20 is built from logic circuits; it receives the reset and enable signals sent by the top-level control module 3 and generates the control signals that reset and enable the RAM memory 21 and the address control unit 22. The RAM memory 21 consists of 8 memory arrays, each connected to the one-dimensional DCT module 1. The address control unit 22 is connected to the address port of every memory array in the RAM memory 21 and generates the input and output enables and the input and output addresses of each memory, so that the DCT results input from the one-dimensional DCT module 1 are stored across the 8 memory arrays and then output by row or by column.
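The row pass, transposition and column pass described above can be modelled in software roughly as follows. This is only a behavioural sketch under simplifying assumptions: the class `TransposeBuffer`, the function `transform_2d` and the stand-in 2-point transform `sum_diff` are hypothetical names, the 8 memory arrays and the address generation are not modelled, and the final reordering back to row-major layout is an assumption about how the output is presented.

```python
class TransposeBuffer:
    """Toy model of a transpose buffer: written row by row, read column by column."""

    def __init__(self, size):
        self.size = size
        self.rows = []

    def write_row(self, row):
        if len(row) != self.size:
            raise ValueError("row length does not match buffer size")
        self.rows.append(list(row))

    def read_column(self, index):
        return [row[index] for row in self.rows]


def transform_2d(block, transform_1d):
    """Apply a 1-D transform to every row, transpose, then apply it to every column."""
    n = len(block)
    buffer = TransposeBuffer(n)
    for row in block:                       # first pass: one-dimensional row transform
        buffer.write_row(transform_1d(row))
    columns = [transform_1d(buffer.read_column(c)) for c in range(n)]  # second pass
    # Reorder so that result[r][c] is the coefficient at row r, column c.
    return [[columns[c][r] for c in range(n)] for r in range(n)]


def sum_diff(v):
    """Stand-in 2-point transform (sum and difference), used only for the demo."""
    return [v[0] + v[1], v[0] - v[1]]
```

Calling `transform_2d([[1, 2], [3, 4]], sum_diff)` runs both passes through the same one-dimensional routine, which mirrors how the single one-dimensional DCT module 1 is reused for the row and column transforms.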

The one-dimensional DCT module 1 performs the 4-point, 8-point, 16-point and 32-point one-dimensional DCTs of the HEVC standard. Its structure is shown in Fig. 3.

Referring to Fig. 3, the one-dimensional DCT module 1 comprises a 32-point butterfly unit 10, a 16-point butterfly unit 11, a 32-point odd coefficient processing unit 12, an 8-point butterfly unit 13, a 16-point odd coefficient processing unit 14, a 4-point butterfly unit 15, an 8-point odd coefficient processing unit 16, a 4-point even coefficient processing unit 17, a 4-point odd coefficient processing unit 18 and a reset/enable control unit 19, wherein:

The reset/enable control unit 19 is built from logic circuits. It is connected to the reset/enable module 30 of the top-level control module 3 and to each unit of the one-dimensional DCT module 1. It receives the reset and enable signals output by the top-level control module 3 and uses them to control the reset and enabling of every unit in the one-dimensional DCT module 1.

The 32-point butterfly unit 10 consists of 16 adders and 16 subtracters. The 16 adders are connected to the 16-point butterfly unit 11 and the 16 subtracters to the 32-point odd coefficient processing unit 12, as shown in Fig. 4.

The 16 adders sum the 32 data items arriving at the input of the one-dimensional DCT module 1 in head-to-tail pairs: the sum E₀ of the 1st and 32nd data items, the sum E₁ of the 2nd and 31st data items, and so on up to the sum E₁₅ of the 16th and 17th data items. The 16 sums E₀ to E₁₅ are input to the 16-point butterfly unit 11.

The 16 subtracters take the differences of the same 32 input data items in head-to-tail pairs: the difference O₀ of the 1st and 32nd data items, the difference O₁ of the 2nd and 31st data items, and so on up to the difference O₁₅ of the 16th and 17th data items. The 16 differences O₀ to O₁₅ are input to the 32-point odd coefficient processing unit 12.

The 16-point butterfly unit 11 consists of 8 adders and 8 subtracters. The 8 adders are connected to the 8-point butterfly unit 13 and the 8 subtracters to the 16-point odd coefficient processing unit 14, as shown in Fig. 4.

The 8 adders sum the data E₀ to E₁₅ input from the 32-point butterfly unit 10 in head-to-tail pairs: the sum EE₀ of E₀ and E₁₅, the sum EE₁ of E₁ and E₁₄, and so on up to the sum EE₇ of E₇ and E₈. The 8 sums EE₀ to EE₇ are input to the 8-point butterfly unit 13.

The 8 subtracters take the differences of the data E₀ to E₁₅ in head-to-tail pairs: the difference EO₀ of E₀ and E₁₅, the difference EO₁ of E₁ and E₁₄, and so on up to the difference EO₇ of E₇ and E₈. The 8 differences EO₀ to EO₇ are input to the 16-point odd coefficient processing unit 14.

The 8-point butterfly unit 13 consists of 4 adders and 4 subtracters. The 4 adders are connected to the 4-point butterfly unit 15 and the 4 subtracters to the 8-point odd coefficient processing unit 16, as shown in Fig. 4.

The 4 adders sum the data EE₀ to EE₇ input from the 16-point butterfly unit 11 in head-to-tail pairs: the sum EEE₀ of EE₀ and EE₇, the sum EEE₁ of EE₁ and EE₆, and so on up to the sum EEE₃ of EE₃ and EE₄. The 4 sums EEE₀ to EEE₃ are input to the 4-point butterfly unit 15.

The 4 subtracters take the differences of the data EE₀ to EE₇ in head-to-tail pairs: the difference EEO₀ of EE₀ and EE₇, the difference EEO₁ of EE₁ and EE₆, and so on up to the difference EEO₃ of EE₃ and EE₄. The 4 differences EEO₀ to EEO₃ are input to the 8-point odd coefficient processing unit 16.

The 4-point butterfly unit 15 consists of 2 adders and 2 subtracters. The 2 adders are connected to the 4-point even coefficient processing unit 17 and the 2 subtracters to the 4-point odd coefficient processing unit 18, as shown in Fig. 4.

The 2 adders compute the sum EEEE₀ of the data EEE₀ and EEE₃ input from the 8-point butterfly unit 13 and the sum EEEE₁ of the input data EEE₁ and EEE₂. The 2 sums EEEE₀ and EEEE₁ are input to the 4-point even coefficient processing unit 17.

The 2 subtracters compute the difference EEEO₀ of the input data EEE₀ and EEE₃ and the difference EEEO₁ of the input data EEE₁ and EEE₂. The 2 differences EEEO₀ and EEEO₁ are input to the 4-point odd coefficient processing unit 18.
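The cascade of butterfly units 10, 11, 13 and 15 can be summarized by the following Python sketch. It models only the arithmetic, not the timing or the wiring, and the function names `butterfly` and `butterfly_cascade` are hypothetical; the dictionary keys follow the signal names used in the text.

```python
def butterfly(data):
    """Head-to-tail pairwise sums and differences of an even-length list."""
    n = len(data)
    half = n // 2
    sums = [data[i] + data[n - 1 - i] for i in range(half)]
    diffs = [data[i] - data[n - 1 - i] for i in range(half)]
    return sums, diffs


def butterfly_cascade(x):
    """Run 32 input values through the 32-, 16-, 8- and 4-point butterflies."""
    if len(x) != 32:
        raise ValueError("expected 32 input values")
    e, o = butterfly(x)          # E0..E15 and O0..O15
    ee, eo = butterfly(e)        # EE0..EE7 and EO0..EO7
    eee, eeo = butterfly(ee)     # EEE0..EEE3 and EEO0..EEO3
    eeee, eeeo = butterfly(eee)  # EEEE0..EEEE1 and EEEO0..EEEO1
    # The difference outputs feed the odd coefficient processing units;
    # the last pair of sums feeds the 4-point even coefficient processing unit.
    return {"O": o, "EO": eo, "EEO": eeo, "EEEO": eeeo, "EEEE": eeee}
```

Because each stage halves the number of sum outputs, a smaller transform can enter the cascade at the matching butterfly unit, which is what allows one circuit to serve all four transform sizes.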

Referring to Fig. 5, the 4-point even coefficient processing unit 17 consists of a delay subunit 170, a 2-point butterfly subunit 171 and a shift subunit 172.

The delay subunit 170 delays the data EEEE₀ and EEEE₁ input from the 4-point butterfly unit 15 by 2 clock cycles to obtain the delayed data EEEE_{0_0} and EEEE_{1_0}, and sends these two data items to the 2-point butterfly subunit 171.

The 2-point butterfly subunit 171 consists of 1 adder and 1 subtracter. It adds and subtracts the delayed data EEEE_{0_0} and EEEE_{1_0} input from the delay subunit 170 to obtain the sum EEEEE and the difference EEEEO, and sends them to the shift subunit 172.

The shift subunit 172 consists of 2 shifters. It shifts the data EEEEE and EEEEO input from the 2-point butterfly subunit 171 left by 6 bits and outputs the 2 resulting coefficients to the transpose buffer module 2.
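A minimal arithmetic sketch of this unit follows. The left shift by 6 is a multiplication by 64, which matches the two even rows of the HEVC 4-point transform matrix, (64, 64, 64, 64) and (64, -64, -64, 64). The function name `even_unit_4pt` is hypothetical, and the 2-cycle delay is omitted because it affects only timing, not values.

```python
def even_unit_4pt(eeee0, eeee1):
    """Return the two even-indexed 4-point coefficients from the butterfly sums."""
    eeeee = eeee0 + eeee1      # 2-point butterfly: sum
    eeeeo = eeee0 - eeee1      # 2-point butterfly: difference
    return eeeee << 6, eeeeo << 6   # shift subunit: multiply both by 64
```

For a plain 4-point input (x0, x1, x2, x3), the arguments would be x0 + x3 and x1 + x2, so the outputs equal 64·(x0 + x1 + x2 + x3) and 64·(x0 - x1 - x2 + x3).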

Referring to Fig. 6, the 4-point odd coefficient processing unit 18 consists of one 4-point coefficient operation subunit 180 and two 4-point coefficient addition subunits 181.

The 4-point coefficient operation subunit 180 is a cascade of registers, shifters and adders. It delays the data EEEO₀ and EEEO₁ input from the 4-point butterfly unit 15 to obtain the delay coefficients EEEO_{0_0} and EEEO_{1_0}, and forms the sums of EEEO₀ and of EEEO₁ with copies of themselves left-shifted by different numbers of bits, namely:

the sum of EEEO₀ and EEEO₀ left-shifted by 1 bit gives the first 4-point summation coefficient EEEO_{0_1};

the sum of EEEO₁ and EEEO₁ left-shifted by 1 bit gives the second 4-point summation coefficient EEEO_{1_1};

the sum of EEEO₀ and EEEO₀ left-shifted by 2 bits gives the third 4-point summation coefficient EEEO_{0_2};

the sum of EEEO₁ and EEEO₁ left-shifted by 2 bits gives the fourth 4-point summation coefficient EEEO_{1_2}.

These delay coefficients and summation coefficients are then input to each 4-point coefficient addition subunit 181.

Each 4-point coefficient addition subunit 181 is a cascade of shifters, adders and subtracters and computes one result coefficient of the DCT. It merges the two delay coefficients EEEO_{0_0}, EEEO_{1_0} and the four summation coefficients EEEO_{0_1}, EEEO_{0_2}, EEEO_{1_1}, EEEO_{1_2} input from the 4-point coefficient operation subunit 180 in 3 stages:

Stage 1 merges the following three groups of coefficients simultaneously:

The first group left-shifts the two delay coefficients EEEO_{0_0} and EEEO_{1_0} individually and then adds or subtracts them, giving the first stage-1 4-point merge coefficient COE_{4_101};

The second group left-shifts the two summation coefficients EEEO_{0_1} and EEEO_{1_1} individually and then adds or subtracts them, giving the second stage-1 4-point merge coefficient COE_{4_102};

The third group left-shifts the two summation coefficients EEEO_{0_2} and EEEO_{1_2} individually and then adds or subtracts them, giving the third stage-1 4-point merge coefficient COE_{4_103}.

Stage 2 simultaneously performs a second merge on the three merge coefficients obtained in stage 1:

The first stage-1 merge coefficient COE_{4_101} and the second stage-1 merge coefficient COE_{4_102} are left-shifted individually and then added or subtracted, giving the first stage-2 4-point merge coefficient COE_{4_201};

The third stage-1 merge coefficient COE_{4_103} is left-shifted, giving the second stage-2 4-point merge coefficient COE_{4_202}.

Stage 3 merges the two merge coefficients obtained in stage 2: the first stage-2 merge coefficient COE_{4_201} and the second stage-2 merge coefficient COE_{4_202} are left-shifted individually and then added or subtracted, giving one 4-point result coefficient COEFF₄, which is output to the transpose buffer module 2.
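The text does not list the shift amounts used in each stage, so the following Python sketch picks one set that works, purely as an assumption. It relies on the fact that the summation coefficients are 3 and 5 times the input, and on the identities 83 = 64 + 3·8 - 5 and 36 = 4 + 3·4 + 5·4, to produce the two odd rows of the HEVC 4-point transform, (83, 36) and (36, -83). All function names are hypothetical, and a shift amount of 0 stands for a direct connection.

```python
def operands(x):
    """Delay coefficient (x) and the two summation coefficients (3x and 5x)."""
    return x, x + (x << 1), x + (x << 2)


def odd_unit_4pt(eeeo0, eeeo1):
    """Return (83*eeeo0 + 36*eeeo1, 36*eeeo0 - 83*eeeo1) without multiplying."""
    d0, t0, f0 = operands(eeeo0)
    d1, t1, f1 = operands(eeeo1)

    # First result coefficient: 83*eeeo0 + 36*eeeo1
    c101 = (d0 << 6) + (d1 << 2)    # stage 1, delay coefficients:  64*a +  4*b
    c102 = (t0 << 3) + (t1 << 2)    # stage 1, 3x coefficients:     24*a + 12*b
    c103 = (f1 << 2) - f0           # stage 1, 5x coefficients:     20*b -  5*a
    c201 = c101 + c102              # stage 2: merge first two groups
    c202 = c103                     # stage 2: pass third group (shift by 0)
    first = c201 + c202             # stage 3: final merge

    # Second result coefficient: 36*eeeo0 - 83*eeeo1
    c101 = (d0 << 2) - (d1 << 6)    #  4*a - 64*b
    c102 = (t0 << 2) - (t1 << 3)    # 12*a - 24*b
    c103 = (f0 << 2) + f1           # 20*a +  5*b
    second = (c101 + c102) + c103   # stages 2 and 3

    return first, second
```

Each stage contains at most one addition or subtraction per signal path, which is the property the text relies on to keep the critical path short.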

The 8-point odd coefficient processing unit 16 consists of one 8-point coefficient operation subunit 160 and four 8-point coefficient addition subunits 161.

The 8-point coefficient operation subunit 160 is a cascade of registers, shifters and adders. It delays each of the data EEO₀ to EEO₃ input from the 8-point butterfly unit 13 to obtain the delay coefficients EEO_{0_0} to EEO_{3_0}, and forms the sums of each of EEO₀ to EEO₃ with copies of itself left-shifted by different numbers of bits, namely:

the sum of EEO₀ and EEO₀ left-shifted by 1 bit gives the first 8-point summation coefficient EEO_{0_1};

the sum of EEO₁ and EEO₁ left-shifted by 1 bit gives the second 8-point summation coefficient EEO_{1_1};

the sum of EEO₂ and EEO₂ left-shifted by 1 bit gives the third 8-point summation coefficient EEO_{2_1};

the sum of EEO₃ and EEO₃ left-shifted by 1 bit gives the fourth 8-point summation coefficient EEO_{3_1};

the sum of EEO₀ and EEO₀ left-shifted by 2 bits gives the fifth 8-point summation coefficient EEO_{0_2};

the sum of EEO₁ and EEO₁ left-shifted by 2 bits gives the sixth 8-point summation coefficient EEO_{1_2};

the sum of EEO₂ and EEO₂ left-shifted by 2 bits gives the seventh 8-point summation coefficient EEO_{2_2};

the sum of EEO₃ and EEO₃ left-shifted by 2 bits gives the eighth 8-point summation coefficient EEO_{3_2}.

These eight summation coefficients are sent to each 8-point coefficient addition subunit 161.
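Arithmetically, adding a value to itself shifted left by 1 bit gives 3 times the value, and adding it to itself shifted left by 2 bits gives 5 times the value. The short Python sketch below shows the three operand sets produced by the operation subunit; the function name `coefficient_operands` is hypothetical and the register delays are not modelled.

```python
def coefficient_operands(eeo):
    """Return the delay coefficients and the two sets of summation coefficients."""
    delayed = list(eeo)                        # EEO_{k_0}: the inputs themselves
    sum_shift1 = [x + (x << 1) for x in eeo]   # EEO_{k_1} = x + 2x = 3x
    sum_shift2 = [x + (x << 2) for x in eeo]   # EEO_{k_2} = x + 4x = 5x
    return delayed, sum_shift1, sum_shift2
```

Computing these operands once and sharing them among all addition subunits means each odd transform constant only has to be assembled from shifted copies of x, 3x and 5x.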

Each 8-point coefficient addition subunit 161 is a cascade of shifters, adders and subtracters and computes one result coefficient of the DCT. It applies shift-and-add or shift-and-subtract operations in 4 stages to the coefficients EEO_{0_0} to EEO_{3_0}, EEO_{0_1} to EEO_{3_1} and EEO_{0_2} to EEO_{3_2} input from the 8-point coefficient operation subunit 160:

Stage 1 merges the following six groups of coefficients simultaneously:

The first group left-shifts the two delay coefficients EEO_{0_0} and EEO_{1_0} individually and then adds or subtracts them, giving the first stage-1 8-point merge coefficient COE_{8_101};

The second group left-shifts the two delay coefficients EEO_{2_0} and EEO_{3_0} individually and then adds or subtracts them, giving the second stage-1 8-point merge coefficient COE_{8_102};

The third group left-shifts the two summation coefficients EEO_{0_1} and EEO_{1_1} individually and then adds or subtracts them, giving the third stage-1 8-point merge coefficient COE_{8_103};

The fourth group left-shifts the two summation coefficients EEO_{2_1} and EEO_{3_1} individually and then adds or subtracts them, giving the fourth stage-1 8-point merge coefficient COE_{8_104};

The fifth group left-shifts the two summation coefficients EEO_{0_2} and EEO_{1_2} individually and then adds or subtracts them, giving the fifth stage-1 8-point merge coefficient COE_{8_105};

The sixth group left-shifts the two summation coefficients EEO_{2_2} and EEO_{3_2} individually and then adds or subtracts them, giving the sixth stage-1 8-point merge coefficient COE_{8_106}.

Stage 2 simultaneously performs a second merge on the three pairs of merge coefficients obtained in stage 1:

The first group left-shifts the two merge coefficients COE_{8_101} and COE_{8_102} individually and then adds or subtracts them, giving the first stage-2 8-point merge coefficient COE_{8_201};

The second group left-shifts the two merge coefficients COE_{8_103} and COE_{8_104} individually and then adds or subtracts them, giving the second stage-2 8-point merge coefficient COE_{8_202};

The third group left-shifts the two merge coefficients COE_{8_105} and COE_{8_106} individually and then adds or subtracts them, giving the third stage-2 8-point merge coefficient COE_{8_203}.

Stage 3 simultaneously performs a third merge on the three merge coefficients obtained in stage 2:

The first stage-2 merge coefficient COE_{8_201} and the second stage-2 merge coefficient COE_{8_202} are left-shifted individually and then added or subtracted, giving the first stage-3 8-point merge coefficient COE_{8_301};

The third stage-2 merge coefficient COE_{8_203} is left-shifted, giving the second stage-3 8-point merge coefficient COE_{8_302}.

Stage 4 merges the two merge coefficients obtained in stage 3: the first stage-3 merge coefficient COE_{8_301} and the second stage-3 merge coefficient COE_{8_302} are left-shifted individually and then added or subtracted, giving one 8-point result coefficient COEFF₈, which is output to the transpose buffer module 2, as shown in Fig. 7.
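As with the 4-point case, the shift amounts are not given in the text, so the Python sketch below chooses one workable set as an assumption. It computes the first odd coefficient of the HEVC 8-point transform, whose constants are 89, 75, 50 and 18, using the identities 89 = 3·32 - 5 - 2, 75 = 3·32 - 5·4 - 1, 50 = 4 + 3·2 + 5·8 and 18 = 1 + 3·4 + 5. The function name and the intermediate variable names are hypothetical, and a shift amount of 0 stands for a direct connection.

```python
def odd_coefficient_8pt_first(eeo):
    """Return 89*x0 + 75*x1 + 50*x2 + 18*x3 using only shifts, adds and subtracts."""
    x0, x1, x2, x3 = eeo
    d = [x0, x1, x2, x3]              # delay coefficients (1x)
    t = [v + (v << 1) for v in d]     # summation coefficients, shift by 1 (3x)
    f = [v + (v << 2) for v in d]     # summation coefficients, shift by 2 (5x)

    # Stage 1: six pairwise merges
    c101 = (d[0] << 1) + d[1]         #  2*x0 +  1*x1
    c102 = (d[2] << 2) + d[3]         #  4*x2 +  1*x3
    c103 = (t[0] << 5) + (t[1] << 5)  # 96*x0 + 96*x1
    c104 = (t[2] << 1) + (t[3] << 2)  #  6*x2 + 12*x3
    c105 = f[0] + (f[1] << 2)         #  5*x0 + 20*x1
    c106 = (f[2] << 3) + f[3]         # 40*x2 +  5*x3

    # Stage 2: three merges
    c201 = c102 - c101
    c202 = c103 + c104
    c203 = c106 - c105

    # Stage 3: one merge and one pass-through
    c301 = c201 + c202
    c302 = c203

    # Stage 4: final merge
    return c301 + c302
```

The other three odd coefficients of the 8-point transform would use the same tree with different shift amounts and signs, which corresponds to the four addition subunits 161 working in parallel on the shared operands.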

The 16-point odd coefficient processing unit 14 consists of one 16-point coefficient operation subunit 140 and eight 16-point coefficient addition subunits 141;

The 16-point coefficient operation subunit 140 is a cascade of registers, shifters and adders. It delays each of the data EO₀～EO₇ input by the 16-point butterfly operation unit 11, obtaining the delayed coefficients EO_{0_0}～EO_{7_0}, and computes the sum of each of EO₀～EO₇ and the same datum left-shifted by different numbers of bits, namely:

Compute the sum of EO₀ and EO₀ left-shifted by 1 bit, obtaining the first 16-point sum coefficient EO_{0_1};

Compute the sum of EO₁ and EO₁ left-shifted by 1 bit, obtaining the second 16-point sum coefficient EO_{1_1};

and so on;

Compute the sum of EO₇ and EO₇ left-shifted by 1 bit, obtaining the eighth 16-point sum coefficient EO_{7_1};

Compute the sum of EO₀ and EO₀ left-shifted by 2 bits, obtaining the ninth 16-point sum coefficient EO_{0_2};

Compute the sum of EO₁ and EO₁ left-shifted by 2 bits, obtaining the tenth 16-point sum coefficient EO_{1_2};

and so on;

Compute the sum of EO₇ and EO₇ left-shifted by 2 bits, obtaining the sixteenth 16-point sum coefficient EO_{7_2};

These 16 sum coefficients, together with the delayed coefficients EO_{0_0}～EO_{7_0}, are sent to each 16-point coefficient addition subunit 141;
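A minimal Python sketch of the operation subunit's arithmetic follows. It models only the values produced (the delayed copy plus one sum per shift amount) and not the register timing; the function name `operation_subunit` and its `shifts` parameter are assumptions made for illustration. With shifts of 1 and 2 bits the sums equal 3x and 5x, so the downstream addition subunits can build constant multiples from shifts and adds alone.

```python
def operation_subunit(data, shifts=(1, 2)):
    """Return the delayed copies (key 0) and, per shift s, the sums x + (x << s)."""
    out = {0: list(data)}  # suffix _0: delayed coefficients, value unchanged
    for s in shifts:
        out[s] = [x + (x << s) for x in data]  # suffix _s: sum coefficients
    return out


eo = [4, -1, 0, 7, 2, -3, 5, 1]  # stand-in values for EO_0 .. EO_7
coeffs = operation_subunit(eo)
total = sum(len(values) for values in coeffs.values())
print(total)
```

For the 32-point unit, which also uses a 3-bit shift, the same sketch would be called with `shifts=(1, 2, 3)`.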

Each 16-point coefficient addition subunit 141 is a cascade of shifters, adders and subtractors and computes one result coefficient of the DCT. It applies shift-and-add or shift-and-subtract operations, in 5 stages, to the coefficients EO_{0_0}～EO_{7_0}, EO_{0_1}～EO_{7_1} and EO_{0_2}～EO_{7_2} input by the 16-point coefficient operation subunit 140, wherein:

Stage 1 simultaneously performs a first merge on each of the following 12 groups of coefficients:

Group 1 left-shifts each of the two delayed coefficients EO_{0_0} and EO_{1_0}, then adds or subtracts them, obtaining the first 16-point stage-1 merged coefficient COE_{16_101};

Group 2 left-shifts each of the two delayed coefficients EO_{2_0} and EO_{3_0}, then adds or subtracts them, obtaining the second 16-point stage-1 merged coefficient COE_{16_102};

Group 3 left-shifts each of the two delayed coefficients EO_{4_0} and EO_{5_0}, then adds or subtracts them, obtaining the third 16-point stage-1 merged coefficient COE_{16_103};

Group 4 left-shifts each of the two delayed coefficients EO_{6_0} and EO_{7_0}, then adds or subtracts them, obtaining the fourth 16-point stage-1 merged coefficient COE_{16_104};

Group 5 left-shifts each of the two sum coefficients EO_{0_1} and EO_{1_1}, then adds or subtracts them, obtaining the fifth 16-point stage-1 merged coefficient COE_{16_105};

Group 6 left-shifts each of the two sum coefficients EO_{2_1} and EO_{3_1}, then adds or subtracts them, obtaining the sixth 16-point stage-1 merged coefficient COE_{16_106};

and so on;

Group 11 left-shifts each of the two sum coefficients EO_{4_2} and EO_{5_2}, then adds or subtracts them, obtaining the eleventh 16-point stage-1 merged coefficient COE_{16_111};

Group 12 left-shifts each of the two sum coefficients EO_{6_2} and EO_{7_2}, then adds or subtracts them, obtaining the twelfth 16-point stage-1 merged coefficient COE_{16_112}.

Stage 2 simultaneously performs a second merge on each of the following six groups of coefficients:

Group 1 left-shifts each of the two merged coefficients COE_{16_101} and COE_{16_102}, then adds or subtracts them, obtaining the first 16-point stage-2 merged coefficient COE_{16_201};

Group 2 left-shifts each of the two merged coefficients COE_{16_103} and COE_{16_104}, then adds or subtracts them, obtaining the second 16-point stage-2 merged coefficient COE_{16_202};

Group 3 left-shifts each of the two merged coefficients COE_{16_105} and COE_{16_106}, then adds or subtracts them, obtaining the third 16-point stage-2 merged coefficient COE_{16_203};

Group 4 left-shifts each of the two merged coefficients COE_{16_107} and COE_{16_108}, then adds or subtracts them, obtaining the fourth 16-point stage-2 merged coefficient COE_{16_204};

Group 5 left-shifts each of the two merged coefficients COE_{16_109} and COE_{16_110}, then adds or subtracts them, obtaining the fifth 16-point stage-2 merged coefficient COE_{16_205};

Group 6 left-shifts each of the two merged coefficients COE_{16_111} and COE_{16_112}, then adds or subtracts them, obtaining the sixth 16-point stage-2 merged coefficient COE_{16_206}.

Stage 3 simultaneously performs a third merge on each of the following three pairs of merged coefficients:

Group 1 left-shifts each of the two merged coefficients COE_{16_201} and COE_{16_202}, then adds or subtracts them, obtaining the first 16-point stage-3 merged coefficient COE_{16_301};

Group 2 left-shifts each of the two merged coefficients COE_{16_203} and COE_{16_204}, then adds or subtracts them, obtaining the second 16-point stage-3 merged coefficient COE_{16_302};

Group 3 left-shifts each of the two merged coefficients COE_{16_205} and COE_{16_206}, then adds or subtracts them, obtaining the third 16-point stage-3 merged coefficient COE_{16_303}.

Stage 4 simultaneously performs a fourth merge on the three merged coefficients obtained in stage 3:

The first 16-point stage-3 merged coefficient COE_{16_301} and the second 16-point stage-3 merged coefficient COE_{16_302} are each left-shifted and then added or subtracted, obtaining the first 16-point stage-4 merged coefficient COE_{16_401};

The third 16-point stage-3 merged coefficient COE_{16_303} is left-shifted, obtaining the second 16-point stage-4 merged coefficient COE_{16_402}.

Stage 5 merges the two merged coefficients obtained in stage 4: the first 16-point stage-4 merged coefficient COE_{16_401} and the second 16-point stage-4 merged coefficient COE_{16_402} are each left-shifted and then added or subtracted, obtaining one 16-point result coefficient COEFF₁₆, which is output to the transpose buffer module 2, as shown in Figure 8.

The 32-point odd coefficient processing unit 12 consists of one 32-point coefficient operation subunit 120 and sixteen 32-point coefficient addition subunits 121;

The 32-point coefficient operation subunit 120 is a cascade of registers, shifters and adders. It delays each of the data O₀～O₁₅ input by the 32-point butterfly operation unit 10, obtaining the delayed coefficients O_{0_0}～O_{15_0}, and computes the sum of each of the input data O₀～O₁₅ and the same datum left-shifted by different numbers of bits, namely:

Compute the sum of O₀ and O₀ left-shifted by 1 bit, obtaining the first 32-point sum coefficient O_{0_1};

Compute the sum of O₁ and O₁ left-shifted by 1 bit, obtaining the second 32-point sum coefficient O_{1_1};

and so on;

Compute the sum of O₁₅ and O₁₅ left-shifted by 1 bit, obtaining the 16th 32-point sum coefficient O_{15_1};

Compute the sum of O₀ and O₀ left-shifted by 2 bits, obtaining the 17th 32-point sum coefficient O_{0_2};

Compute the sum of O₁ and O₁ left-shifted by 2 bits, obtaining the 18th 32-point sum coefficient O_{1_2};

and so on;

Compute the sum of O₁₅ and O₁₅ left-shifted by 2 bits, obtaining the 32nd 32-point sum coefficient O_{15_2};

Compute the sum of O₀ and O₀ left-shifted by 3 bits, obtaining the 33rd 32-point sum coefficient O_{0_3};

Compute the sum of O₁ and O₁ left-shifted by 3 bits, obtaining the 34th 32-point sum coefficient O_{1_3};

and so on;

Compute the sum of O₁₅ and O₁₅ left-shifted by 3 bits, obtaining the 48th 32-point sum coefficient O_{15_3};

These 48 sum coefficients, together with the delayed coefficients O_{0_0}～O_{15_0}, are sent to each 32-point coefficient addition subunit 121;

Each 32-point coefficient addition subunit 121 computes one result coefficient of the DCT. The subunit is a cascade of shifters, adders and subtractors and applies shift-and-add or shift-and-subtract operations, in 6 stages, to the coefficients O_{0_0}～O_{15_0}, O_{0_1}～O_{15_1}, O_{0_2}～O_{15_2} and O_{0_3}～O_{15_3} input by the 32-point coefficient operation subunit 120, wherein:

Stage 1 simultaneously performs a first merge on each of the following 32 groups of coefficients:

Group 1 left-shifts each of the two delayed coefficients O_{0_0} and O_{1_0}, then adds or subtracts them, obtaining the first 32-point stage-1 merged coefficient COE_{32_101};

Group 2 left-shifts each of the two delayed coefficients O_{2_0} and O_{3_0}, then adds or subtracts them, obtaining the second 32-point stage-1 merged coefficient COE_{32_102};

Group 3 left-shifts each of the two delayed coefficients O_{4_0} and O_{5_0}, then adds or subtracts them, obtaining the third 32-point stage-1 merged coefficient COE_{32_103};

and so on;

Group 8 left-shifts each of the two delayed coefficients O_{14_0} and O_{15_0}, then adds or subtracts them, obtaining the eighth 32-point stage-1 merged coefficient COE_{32_108};

Group 9 left-shifts each of the two sum coefficients O_{0_1} and O_{1_1}, then adds or subtracts them, obtaining the ninth 32-point stage-1 merged coefficient COE_{32_109};

Group 10 left-shifts each of the two sum coefficients O_{2_1} and O_{3_1}, then adds or subtracts them, obtaining the tenth 32-point stage-1 merged coefficient COE_{32_110};

and so on;

Group 31 left-shifts each of the two sum coefficients O_{12_3} and O_{13_3}, then adds or subtracts them, obtaining the 31st 32-point stage-1 merged coefficient COE_{32_131};

Group 32 left-shifts each of the two sum coefficients O_{14_3} and O_{15_3}, then adds or subtracts them, obtaining the 32nd 32-point stage-1 merged coefficient COE_{32_132}.

Stage 2 simultaneously performs a second merge on each of the following 16 groups of coefficients:

Group 1 left-shifts each of the two merged coefficients COE_{32_101} and COE_{32_102}, then adds or subtracts them, obtaining the first 32-point stage-2 merged coefficient COE_{32_201};

Group 2 left-shifts each of the two merged coefficients COE_{32_103} and COE_{32_104}, then adds or subtracts them, obtaining the second 32-point stage-2 merged coefficient COE_{32_202};

Group 3 left-shifts each of the two merged coefficients COE_{32_105} and COE_{32_106}, then adds or subtracts them, obtaining the third 32-point stage-2 merged coefficient COE_{32_203};

Group 4 left-shifts each of the two merged coefficients COE_{32_107} and COE_{32_108}, then adds or subtracts them, obtaining the fourth 32-point stage-2 merged coefficient COE_{32_204};

Group 5 left-shifts each of the two merged coefficients COE_{32_109} and COE_{32_110}, then adds or subtracts them, obtaining the fifth 32-point stage-2 merged coefficient COE_{32_205};

Group 6 left-shifts each of the two merged coefficients COE_{32_111} and COE_{32_112}, then adds or subtracts them, obtaining the sixth 32-point stage-2 merged coefficient COE_{32_206};

and so on;

Group 15 left-shifts each of the two merged coefficients COE_{32_129} and COE_{32_130}, then adds or subtracts them, obtaining the 15th 32-point stage-2 merged coefficient COE_{32_215};

Group 16 left-shifts each of the two merged coefficients COE_{32_131} and COE_{32_132}, then adds or subtracts them, obtaining the 16th 32-point stage-2 merged coefficient COE_{32_216}.

Stage 3 simultaneously performs a third merge on each of the following eight groups of coefficients:

Group 1 left-shifts each of the two merged coefficients COE_{32_201} and COE_{32_202}, then adds or subtracts them, obtaining the first 32-point stage-3 merged coefficient COE_{32_301};

Group 2 left-shifts each of the two merged coefficients COE_{32_203} and COE_{32_204}, then adds or subtracts them, obtaining the second 32-point stage-3 merged coefficient COE_{32_302};

Group 3 left-shifts each of the two merged coefficients COE_{32_205} and COE_{32_206}, then adds or subtracts them, obtaining the third 32-point stage-3 merged coefficient COE_{32_303};

and so on;

Group 7 left-shifts each of the two merged coefficients COE_{32_213} and COE_{32_214}, then adds or subtracts them, obtaining the seventh 32-point stage-3 merged coefficient COE_{32_307};

Group 8 left-shifts each of the two merged coefficients COE_{32_215} and COE_{32_216}, then adds or subtracts them, obtaining the eighth 32-point stage-3 merged coefficient COE_{32_308}.

Stage 4 simultaneously performs a fourth merge on each of the following four pairs of merged coefficients:

Group 1 left-shifts each of the two merged coefficients COE_{32_301} and COE_{32_302}, then adds or subtracts them, obtaining the first 32-point stage-4 merged coefficient COE_{32_401};

Group 2 left-shifts each of the two merged coefficients COE_{32_303} and COE_{32_304}, then adds or subtracts them, obtaining the second 32-point stage-4 merged coefficient COE_{32_402};

Group 3 left-shifts each of the two merged coefficients COE_{32_305} and COE_{32_306}, then adds or subtracts them, obtaining the third 32-point stage-4 merged coefficient COE_{32_403};

Group 4 left-shifts each of the two merged coefficients COE_{32_307} and COE_{32_308}, then adds or subtracts them, obtaining the fourth 32-point stage-4 merged coefficient COE_{32_404}.

Stage 5 simultaneously performs a fifth merge on the four merged coefficients obtained in stage 4:

The first 32-point stage-4 merged coefficient COE_{32_401} and the second 32-point stage-4 merged coefficient COE_{32_402} are each left-shifted and then added or subtracted, obtaining the first 32-point stage-5 merged coefficient COE_{32_501};

The third 32-point stage-4 merged coefficient COE_{32_403} and the fourth 32-point stage-4 merged coefficient COE_{32_404} are each left-shifted and then added or subtracted, obtaining the second 32-point stage-5 merged coefficient COE_{32_502}.

Stage 6 merges the two merged coefficients obtained in stage 5: the first 32-point stage-5 merged coefficient COE_{32_501} and the second 32-point stage-5 merged coefficient COE_{32_502} are each left-shifted and then added or subtracted, obtaining one 32-point result coefficient COEFF₃₂, which is output to the transpose buffer module 2, as shown in Figure 9.

In every merge stage of each 4-point coefficient addition subunit 181, 8-point coefficient addition subunit 161, 16-point coefficient addition subunit 141 and 32-point coefficient addition subunit 121 described above, the shift amounts and the choice between an adder and a subtractor are determined by actual requirements and by experiment.
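The stage counts given above (4, 5 and 6 stages for the 8-point, 16-point and 32-point addition subunits) follow from pairwise reduction: each stage halves the number of coefficients, and an unpaired coefficient is carried forward with a shift only. A short Python check, under the assumption that every addition subunit receives one delayed coefficient plus one sum coefficient per shift amount for each input (the helper name `stage_widths` is hypothetical):

```python
def stage_widths(n_leaves):
    """Number of coefficients remaining after each pairwise merge stage."""
    widths = []
    while n_leaves > 1:
        n_leaves = (n_leaves + 1) // 2  # an odd one out is carried forward
        widths.append(n_leaves)
    return widths


# inputs * (1 delayed + number of shift amounts) = coefficients entering stage 1
for name, n_leaves in [("8-point", 4 * 3), ("16-point", 8 * 3), ("32-point", 16 * 4)]:
    print(name, stage_widths(n_leaves))
```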

Claims

1. A transform coder suitable for the High Efficiency Video Coding (HEVC) standard, comprising: a one-dimensional DCT module (1), a transpose buffer module (2) and a top-level control module (3), wherein the data output of the one-dimensional DCT module (1) is connected to the data input of the transpose buffer module (2), and its data input is connected to the data output of the transpose buffer module (2); and the top-level control module (3) is connected to the reset terminal and enable terminal of the one-dimensional DCT module (1) and to the reset terminal and enable terminal of the transpose buffer module (2), characterized in that:

the one-dimensional DCT module (1) comprises:

a 32-point butterfly operation unit (10), configured to perform pairwise addition and pairwise subtraction on the input coefficients to be transformed, to input the 16 data obtained by addition to the 16-point butterfly operation unit (11), and to input the 16 data obtained by subtraction to the 32-point odd coefficient processing unit (12);

a 16-point butterfly operation unit (11), configured to perform pairwise addition and pairwise subtraction on the 16 data input by the 32-point butterfly operation unit (10), to input the 8 data obtained by addition to the 8-point butterfly operation unit (13), and to input the 8 data obtained by subtraction to the 16-point odd coefficient processing unit (14);

a 32-point odd coefficient processing unit (12), configured to compute the sums of the 16 data input by the 32-point butterfly operation unit (10) and the same 16 data left-shifted, and to shift, add and subtract the sum results according to 16 different sets of shift amounts, obtaining 16 transform data that are input to the transpose buffer module (2);

an 8-point butterfly operation unit (13), configured to perform pairwise addition and pairwise subtraction on the 8 data input by the 16-point butterfly operation unit (11), to input the 4 data obtained by addition to the 4-point butterfly operation unit (15), and to input the 4 data obtained by subtraction to the 8-point odd coefficient processing unit (16);

a 16-point odd coefficient processing unit (14), configured to compute the sums of the 8 data input by the 16-point butterfly operation unit (11) and the same 8 data left-shifted, and to shift, add and subtract the sum results according to 8 different sets of shift amounts, obtaining 8 transform data that are input to the transpose buffer module (2);

a 4-point butterfly operation unit (15), configured to perform pairwise addition and pairwise subtraction on the 4 data input by the 8-point butterfly operation unit (13), to input the 2 data obtained by addition to the 4-point even coefficient processing unit (17), and to input the 2 data obtained by subtraction to the 4-point odd coefficient processing unit (18);

an 8-point odd coefficient processing unit (16), configured to compute the sums of the 4 data input by the 8-point butterfly operation unit (13) and the same 4 data left-shifted, and to shift, add and subtract the sum results according to 4 different sets of shift amounts, obtaining 4 transform data that are input to the transpose buffer module (2);

a 4-point even coefficient processing unit (17), configured to delay the 2 data input by the 4-point butterfly operation unit (15) and to perform shift, addition and subtraction operations on them, obtaining 2 transform data that are input to the transpose buffer module (2);

a 4-point odd coefficient processing unit (18), configured to compute the sums of the 2 data input by the 4-point butterfly operation unit (15) and the same 2 data left-shifted, and to shift, add and subtract the sum results according to 2 different sets of shift amounts, obtaining 2 transform data that are input to the transpose buffer module (2);

a reset-enable control unit (19), connected to the top-level control module (3), configured to receive the reset and enable signals output by the top-level control module (3) and to control the reset and enabling of each unit in the one-dimensional DCT module (1) according to these signals.

2. The transform coder according to claim 1, characterized in that: the 32-point butterfly operation unit (10) consists of 16 adders and 16 subtractors; the 16 adders sum the input data in head-to-tail pairs and input the 16 addition results E₀～E₁₅ to the 16-point butterfly operation unit (11); the 16 subtractors take the differences of the input coefficients in head-to-tail pairs and input the 16 subtraction results O₀～O₁₅ to the 32-point odd coefficient processing unit (12).

3. The transform coder according to claim 1, characterized in that: the 16-point butterfly operation unit (11) consists of 8 adders and 8 subtractors; the 8 adders sum the data E₀～E₁₅ input by the 32-point butterfly operation unit (10) in head-to-tail pairs and input the 8 addition results EE₀～EE₇ to the 8-point butterfly operation unit (13); the 8 subtractors take the differences of the data E₀～E₁₅ in head-to-tail pairs and input the 8 subtraction results EO₀～EO₇ to the 16-point odd coefficient processing unit (14).

4. The transform coder according to claim 1, characterized in that: the 32-point odd coefficient processing unit (12) consists of one 32-point coefficient operation subunit (120) cascaded with sixteen 32-point coefficient addition subunits (121);

the 32-point coefficient operation subunit (120) is a cascade of registers, shifters and adders, configured to delay the data O₀～O₁₅ input by the 32-point butterfly operation unit (10), obtaining the delayed coefficients O_{0_0}～O_{15_0}, and to compute the sums of O₀～O₁₅ and O₀～O₁₅ themselves left-shifted by 1, 2 and 3 bits, namely O_{0_1}～O_{15_1}, O_{0_2}～O_{15_2} and O_{0_3}～O_{15_3}; these coefficients are sent to each 32-point coefficient addition subunit (121);

each 32-point coefficient addition subunit (121) is a cascade of shifters, adders and subtractors, configured to apply shift-and-add or shift-and-subtract operations to the coefficients O_{0_0}～O_{15_0}, O_{0_1}～O_{15_1}, O_{0_2}～O_{15_2} and O_{0_3}～O_{15_3} input by the 32-point coefficient operation subunit (120), and to output the 1 datum finally obtained to the transpose buffer module (2).

5. The transform coder according to claim 1, characterized in that: the 8-point butterfly operation unit (13) consists of 4 adders and 4 subtractors; the 4 adders sum the data EE₀～EE₇ input by the 16-point butterfly operation unit (11) in head-to-tail pairs and input the 4 addition results EEE₀～EEE₃ to the 4-point butterfly operation unit (15); the 4 subtractors take the differences of the data EE₀～EE₇ in head-to-tail pairs and input the 4 subtraction results EEO₀～EEO₃ to the 8-point odd coefficient processing unit (16).

6. The transform coder according to claim 1, characterized in that: the 16-point odd coefficient processing unit (14) consists of one 16-point coefficient operation subunit (140) cascaded with eight 16-point coefficient addition subunits (141);

the 16-point coefficient operation subunit (140) is a cascade of registers, shifters and adders, configured to delay the data EO₀～EO₇ input by the 16-point butterfly operation unit (11), obtaining the delayed coefficients EO_{0_0}～EO_{7_0}, and to compute the sum coefficients EO_{0_1}～EO_{7_1} of EO₀～EO₇ and EO₀～EO₇ themselves left-shifted by 1 bit, and the sum coefficients EO_{0_2}～EO_{7_2} of EO₀～EO₇ and EO₀～EO₇ themselves left-shifted by 2 bits; these coefficients are sent to each 16-point coefficient addition subunit (141);

each 16-point coefficient addition subunit (141) is a cascade of shifters, adders and subtractors, configured to apply shift-and-add or shift-and-subtract operations to the coefficients EO_{0_0}～EO_{7_0}, EO_{0_1}～EO_{7_1} and EO_{0_2}～EO_{7_2} input by the 16-point coefficient operation subunit (140), and to output the 1 datum finally obtained to the transpose buffer module (2).

7. The transform coder according to claim 1, characterized in that: the 4-point butterfly operation unit (15) consists of 2 adders and 2 subtractors; the 2 adders compute the sum EEEE₀ of the data EEE₀ and EEE₃ input by the 8-point butterfly operation unit (13) and the sum EEEE₁ of the input data EEE₁ and EEE₂, and input the 2 addition results EEEE₀ and EEEE₁ to the 4-point even coefficient processing unit (17); the 2 subtractors compute the difference EEEO₀ of the input data EEE₀ and EEE₃ and the difference EEEO₁ of the input data EEE₁ and EEE₂, and input the 2 subtraction results EEEO₀ and EEEO₁ to the 4-point odd coefficient processing unit (18).

8. The transform coder according to claim 1, characterized in that: the 8-point odd coefficient processing unit (16) consists of one 8-point coefficient operation subunit (160) cascaded with four 8-point coefficient addition subunits (161);

the 8-point coefficient operation subunit (160) is a cascade of registers, shifters and adders, configured to delay the data EEO₀～EEO₃ input by the 8-point butterfly operation unit (13), obtaining the delayed coefficients EEO_{0_0}～EEO_{3_0}, and to compute the sum coefficients EEO_{0_1}～EEO_{3_1} of EEO₀～EEO₃ and EEO₀～EEO₃ themselves left-shifted by 1 bit, and the sum coefficients EEO_{0_2}～EEO_{3_2} of EEO₀～EEO₃ and EEO₀～EEO₃ themselves left-shifted by 2 bits; these coefficients are sent to each 8-point coefficient addition subunit (161);

each 8-point coefficient addition subunit (161) is a cascade of shifters, adders and subtractors, configured to apply shift-and-add or shift-and-subtract operations to the coefficients EEO_{0_0}～EEO_{3_0}, EEO_{0_1}～EEO_{3_1} and EEO_{0_2}～EEO_{3_2} input by the 8-point coefficient operation subunit (160), and to output the 1 datum finally obtained to the transpose buffer module (2).

9. The transform coder according to claim 1, characterized in that: the 4-point even coefficient processing unit (17) consists of a delay subunit (170), a 2-point butterfly operation subunit (171) and a shift subunit (172) in cascade;

the delay subunit (170) delays the data EEEE₀ and EEEE₁ input by the 4-point butterfly operation unit (15) by 2 clock cycles, obtaining the delayed data EEEE_{0_0} and EEEE_{1_0}, and sends these 2 data to the 2-point butterfly operation subunit (171);

the 2-point butterfly operation subunit (171) consists of 1 adder and 1 subtractor, configured to add and to subtract the delayed data EEEE_{0_0} and EEEE_{1_0} input by the delay subunit (170), obtaining the sum datum EEEEE and the difference datum EEEEO, which are sent to the shift subunit (172);

the shift subunit (172) consists of 2 shifters, configured to left-shift the data EEEEE and EEEEO input by the 2-point butterfly operation subunit (171) and to output the 2 resulting data to the transpose buffer module (2).

10. The transform coder according to claim 1, characterized in that: the 4-point odd coefficient processing unit (18) consists of one 4-point coefficient operation subunit (180) and two 4-point coefficient addition subunits (181);

the 4-point coefficient operation subunit (180) is a cascade of registers, shifters and adders, configured to delay the data EEEO₀ and EEEO₁ input by the 4-point butterfly operation unit (15), obtaining the delayed coefficients EEEO_{0_0} and EEEO_{1_0}, and to compute the sums of EEEO₀ and of EEEO₁ with the same datum left-shifted by different numbers of bits, namely:

the sum EEEO_{0_1} of EEEO₀ and EEEO₀ left-shifted by 1 bit,

the sum EEEO_{1_1} of EEEO₁ and EEEO₁ left-shifted by 1 bit,

the sum EEEO_{0_2} of EEEO₀ and EEEO₀ left-shifted by 2 bits,

the sum EEEO_{1_2} of EEEO₁ and EEEO₁ left-shifted by 2 bits;

these coefficients are input to each 4-point coefficient addition subunit (181);

each 4-point coefficient addition subunit (181) is a cascade of shifters, adders and subtractors, configured to apply shift-and-add or shift-and-subtract operations to the coefficients EEEO_{0_0}, EEEO_{1_0}, EEEO_{0_1}, EEEO_{1_1}, EEEO_{0_2} and EEEO_{1_2} input by the 4-point coefficient operation subunit (180), and to output the 1 datum finally obtained to the transpose buffer module (2).
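For illustration only, the butterfly cascade recited in claims 2, 3, 5 and 7 can be sketched in Python. Head-to-tail pairing is read here as combining element i with element n-1-i, which matches the pairs listed in claim 7 (EEE₀ with EEE₃, EEE₁ with EEE₂); the function names `butterfly` and `butterfly_cascade` are assumptions, and the sketch models arithmetic only, not timing.

```python
def butterfly(data):
    """Head-to-tail sums and differences: data[i] +/- data[n-1-i]."""
    n = len(data)
    sums = [data[i] + data[n - 1 - i] for i in range(n // 2)]
    diffs = [data[i] - data[n - 1 - i] for i in range(n // 2)]
    return sums, diffs


def butterfly_cascade(x):
    """Split 32 inputs into the operands of the odd and even processing units."""
    e, o = butterfly(x)          # 32-point unit (10): E_0..E_15, O_0..O_15
    ee, eo = butterfly(e)        # 16-point unit (11): EE_0..EE_7, EO_0..EO_7
    eee, eeo = butterfly(ee)     # 8-point unit (13): EEE_0..EEE_3, EEO_0..EEO_3
    eeee, eeeo = butterfly(eee)  # 4-point unit (15): EEEE_0..1, EEEO_0..1
    return {"O": o, "EO": eo, "EEO": eeo, "EEEO": eeeo, "EEEE": eeee}


parts = butterfly_cascade(list(range(32)))
print({name: len(values) for name, values in parts.items()})
```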