WO2005104094A1 - 符号化装置 - Google Patents
符号化装置 Download PDFInfo
- Publication number
- WO2005104094A1 WO2005104094A1 PCT/JP2005/007498 JP2005007498W WO2005104094A1 WO 2005104094 A1 WO2005104094 A1 WO 2005104094A1 JP 2005007498 W JP2005007498 W JP 2005007498W WO 2005104094 A1 WO2005104094 A1 WO 2005104094A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- tone
- frequency region
- component
- band
- Prior art date
Links
- 238000004364 calculation method Methods 0.000 claims abstract description 62
- 238000000034 method Methods 0.000 claims abstract description 54
- 238000012545 processing Methods 0.000 claims abstract description 24
- 230000001629 suppression Effects 0.000 claims description 12
- 230000005236 sound signal Effects 0.000 description 24
- 238000010586 diagram Methods 0.000 description 17
- 238000001228 spectrum Methods 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 8
- 230000003595 spectral effect Effects 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 5
- 230000007423 decrease Effects 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000001914 filtration Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 238000013507 mapping Methods 0.000 description 4
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 239000004606 Fillers/Extenders Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
Definitions
- the present invention relates to an encoding device that efficiently compresses and encodes the spectrum of an audio signal and decodes the compressed and encoded signal to generate a high-quality audio signal.
- FIG. 1 is a diagram showing the configuration of a conventional encoder 200 and decoder 210 that perform general compression encoding and decoding processing of an audio signal.
- Fig. 1 shows the most common compression method for audio signals.
- the conventional encoder 200 includes a frame division unit 201, a spectrum conversion unit 202, and a statutory encoding unit 203.
- the frame dividing unit 201 divides an input audio signal into a continuous number of frames having a constant sampling power in the time domain.
- the vector converter 202 converts the sample of the input audio signal of each frame into a spectrum signal in the frequency domain.
- the spectrum coding unit 203 quantizes a spectrum signal up to a certain frequency band, which is generally called a bandwidth, and outputs the result as code information (bit stream).
- the output bit stream is sent to the decoder 210 via a transmission path or via a recording medium, for example.
- the decoder 210 that has acquired the code information from the encoder 200 as an input bit stream includes a spectrum decoding unit 204, a spectrum inverse transform unit 205, and a frame combining unit 206.
- the spectrum decoding unit 204 obtains a star signal by dequantizing the code information of the input bit stream.
- FIG. 2 is a diagram illustrating an example of an audio signal in which a high-frequency signal is lost due to a conventional low-bit-rate encoding.
- bit rate which is the code amount per unit time that can be used to represent an audio signal
- the bandwidth 301 of the audio signal to be encoded also decreases.
- the high-frequency component is less perceptually important than the low-frequency component (low-frequency signal). Will be reduced.
- a high-frequency tone signal 303 and a high-frequency component 304 existing as a low-frequency component harmonic structure (her monitor) are missing.
- the range 302 decoded by the conventional decoder is equal to the bandwidth 301 of the signal to be coded, and the audible sound quality is also reduced.
- Band width extension technology (Band Width Extension) is a technology for compensating for the high frequency components lost due to the above-mentioned reasons in low bit rate coding.
- ISOZIEC 14496-3 MPEG-4 There is an SBR (Spectral Band Replication) method defined as a standard method for Audio. This technology is also described in Patent Document 1.
- FIG. 3 is a block diagram showing a configuration of a decoder 400 that decodes an encoded bit stream according to the SBR method.
- the decoder 400 is a decoder having a function of extending a band by the SBR method, and includes a bit stream separation unit 401, a core audio decoding unit 402, an analysis sub-band filter unit 403, a band extension unit 404, and a synthesis sub-band filter unit. 405.
- an input bit stream is converted by a bit stream separation section 401 into a bit stream of a core audio section, which is obtained by encoding a low-band audio stereo signal, and a low-order stream encoded by the core audio section. It is separated into a bit stream of a band extension unit obtained by encoding band extension information for generating a signal of a high band using the signal of the band unit.
- the core audio decoding unit 402 decodes the bit stream of the core audio unit and generates a low-frequency component time signal.
- any existing decoding unit may be used. For example, in the case of MPEG-4 Audio, the AAC system which is also the MPEG-4 standard is used.
- the decoded low-band component signal is divided into M-channel sub-band signals in the analysis sub-band filter unit 403. Subsequent bandwidth extension processing This is performed on a sub-band signal (low-frequency sub-band signal).
- the band extension unit 404 processes the low band sub-band signal using the band extension information included in the band extension unit in the bit stream, and generates a new high band sub-band signal representing the signal of the high band component. .
- the generated high-band sub-band signal is input to the synthesis sub-band filter unit 405 as an N-channel sub-band signal together with the low-frequency sub-band signal, and becomes an output audio signal through a synthesis process.
- the output audio signal of the synthesis filter M to the synthesis filter N-1 is a signal whose band has been extended.
- the subband signal used here can be regarded as a representation of the audio signal, which is a time signal, by dividing the subband in the frequency direction and the two-dimensional arrangement of time samples included in each subband.
- FIG. 4 is a diagram showing a process in which the band extending section 404 shown in FIG. 3 processes the low-band sub-band signal to generate the high-band sub-band signal.
- the copied high band sub-band signal 501 is generated by copying the low band sub-band signal 502 to the high band side.
- the inverse filtering process 503 suppresses the tone characteristics of the low-frequency sub-band signal.
- the degree of suppression of the tone property is controlled by a value called a chirp factor 504 (corresponding to the “adjustment coefficient” in the claims).
- a group consisting of a plurality of consecutive subbands and the ability to apply the same chirp factor to the group is referred to as a chirp factor band hereinafter.
- a typical D-order inverse filter is shown in the following equation.
- X high (t, k) X low (t, p (k)) + Bja, X low (t- i, p (k))
- Xhigh (t, k) is a generated high band subband signal
- Xlow (t, k) is a low band subband signal
- t is a time sample position
- k is a subband number
- ai is The Xlow (t, k) force is also a linear prediction coefficient calculated by linear prediction
- p (k) is a mapping function for providing a low-band subband signal corresponding to the kth high-band subband signal
- Bj Is the chirp factor corresponding to the chirp factor band bj set for the high band subband signal Xhigh (t, k).
- the technical details of the inverse filtering and the method of determining the mapping function p (k) are not included in the content disclosed in the present invention, and therefore, description thereof will be omitted.
- the grouping information of the chirp factor bands and the chirp factor for each chirp factor band are encoded, transmitted in a bit stream.
- the envelope shape (roughly represented signal energy distribution) of the generated high-frequency sub-band signal is adjusted so as to have frequency characteristics similar to the high-frequency sub-band signal of the original sound.
- Patent Document 2 is an example showing such an envelope shape adjustment method.
- the high band sub-band signal which is a two-dimensional representation of the time Z frequency, is first divided into “time segments” in the time direction, and then into “frequency bands” in the frequency direction.
- FIG. 5 shows this high frequency sub-band signal division processing.
- FIG. 5 is a diagram showing an example of a dividing method for dividing a high-frequency sub-band signal into a time segment and a frequency band.
- Arrow 601 indicates the division of the high band subband signal in the time direction
- arrow 602 indicates the division in the frequency direction.
- the high band sub-band signals in each region (referred to as "energy band") divided in the time and frequency directions are scaled to correspond to the energy value given for each region.
- the time used for the envelope shape adjustment The division information in the Z frequency direction and the energy value for each of the divided areas are encoded in the encoder 200, incorporated into a bit stream, and transmitted.
- the tone Z noise ratio of the generated high band sub-band signal also enhances the expressiveness of the generated signal, realizing sound quality closer to the input signal. It is an important factor to do. If the generated high frequency sub-band signal partially lacks noise components, it is necessary to add artificial noise components to compensate for this. Similarly, when the tone component is partially insufficient, an artificial tone component (sine wave) is added. The addition of noise components is performed on an area called “noise band”, and the addition of sine signals is performed on an area called “tone band”.
- FIGS. 6 (a) to 6 (c) show an example of division of a high-frequency sub-band signal obtained when the high-frequency area divided as shown in FIG.
- FIG. 5 is grouped according to energy, noise and tone.
- FIG. FIGS. 6A to 6C show the relationship between the energy band, the noise band, and the tone band.
- the division of the time-frequency space in Fig. 6 (a) It shows a region where the same energy value is given for adjusting the envelope shape of the subband signal.
- the classification of noise band and the classification of chirp factor band are common.
- the sub-band 704 to which the sine-wave tone signal is added in FIG. For the subband at The division information of the noise band and the tone band, the amount of noise added to each noise band, and the presence / absence of the power-generating signal in each tone band are encoded by the encoder, and are incorporated into the bit stream and transmitted. .
- B (t, k), E (t, k), Q (t, k) and H (t, k) are the time in the time Z A flag indicating the chirp factor, the energy value, the ratio of the noise component in the signal, and the presence or absence of the power gain signal with respect to the signal represented by the sample t and the frequency band k.
- E (t, k) Ei for all signal points (samples) indicated by (t, k) included in a certain energy band ei.
- FIG. 7 is a table showing, in the same energy band, an energy ratio between a high-frequency sub-band signal whose low-frequency sub-band signal power is also duplicated and a noise component or a tone component artificially added. The energy value for each of the high-frequency sub-band signal copied from the low-frequency sub-band signal, the artificially added noise component, and the artificially-added tone component is calculated as shown in FIG.
- the important points in this energy value calculation are the three components of the high-frequency sub-band signal copied from the low-frequency sub-band signal, an artificially added noise component, and an artificially added tone component.
- the sum of the energy values is always equal to E (t, k).
- noise The component ratio Q (t, k) is responsible for separating the total signal energy E (t, k) into two components: a duplicated high-band subband signal and an artificially added noise or tone component. Play, will be.
- the parameters necessary for the band extension processing described above must be appropriately set in the encoder in order to generate a grammatically correct bit stream with high sound quality.
- a method of analyzing the input signal expressed in time-Z frequency is required. If these information are not calculated correctly, for example, if the proportion of the noise component is too high, the reproduced sound will be noisy, and if the addition of inappropriate tone components or inverse filtering will result in a muffled sound quality, In the worst case, the sound will be distorted.
- Patent Document 3 discloses an example of a method of calculating a chirp factor. According to this method, the tone Z noise ratio of the high frequency signal of the input signal is compared with the tone Z noise ratio of the signal generated by duplicating the low frequency signal in the high frequency range, and a simple mathematical expression is obtained. By fitting, the chirp factor can be calculated.
- Patent Document 4 discloses an example of a method of calculating the ratio of noise components. According to this method, an input signal, which is a time signal, is divided into time frames and converted into spectral coefficients by Fourier transform.
- a pointer called a “peak follower” or “dip follower” is set to represent the peaks and valleys of the spectral coefficients, respectively.
- the ratio of the noise component is determined from the spectral energy value of the noise component that is derived.
- Patent Document 1 International Patent Publication No. W098Z57436
- Patent Document 2 International Patent Publication WO01Z26095
- Patent Document 3 U.S. Patent Publication US2002Z0087304
- Patent Document 4 International Patent Publication WO00Z45379
- the tone Z noise ratio of the high band signal and the low band signal When the chirp factor is calculated by applying the tone-z noise ratio of the duplicated high-frequency signal to a simple mathematical formula, the tone-Z noise ratio of the high-frequency signal of the original sound is extremely low in the calculation of the chirp factor.
- the tone / noise ratio of the high-frequency signal duplicated from the low-frequency signal is very low, an appropriate chirp factor may not be calculated. As a result, there is a problem that sound quality is reduced as a result of using an inappropriate chirp factor.
- an object of the present invention is to provide a coding device capable of obtaining an appropriate chirp factor without using a process having a high calculation load such as a Fourier transform. .
- an encoding device provides information for generating a signal belonging to a high frequency domain by copying a signal belonging to a low frequency domain in a divided time frequency domain.
- Tone z noise ratio calculating means for calculating a tone z noise ratio and a tone z noise ratio of the signal in the low frequency region replicated in the high frequency region using a linear prediction process; and An adjustment coefficient calculation for calculating an adjustment coefficient for adjusting the tone property of the signal in the low frequency region to be replicated in the high frequency region, based on the tone Z noise ratio calculated for the signals in the region and the region And encoding means for generating an encoded signal including the calculated adjustment coefficient.
- a more appropriate chirp factor can be calculated and applied by multidimensionally evaluating the tone Z noise ratio of the input signal and the duplicated signal and the appropriate chirp factor. . Therefore, the quality of the reproduced sound can be improved.
- appropriate information can be obtained with a smaller processing amount. .
- FIG. 1 is a diagram showing a configuration of a conventional encoder and decoder that perform general compression encoding and decoding processing of an audio signal.
- FIG. 2 is a diagram showing an example of an audio signal in which a high-frequency signal has been lost due to a conventional low bit rate encoding.
- FIG. 3 is a block diagram showing a configuration of a conventional decoder that decodes an encoded bit stream according to the SBR method.
- FIG. 4 is a diagram showing a process in which the band extending section shown in FIG. 3 processes a low-band sub-band signal to generate a high-band sub-band signal.
- FIG. 5 is a diagram showing an example of a dividing method for dividing a high-frequency sub-band signal into a time segment and a frequency band.
- FIGS. 6 (a) to 6 (c) show high-frequency sub-bands obtained when the high-frequency region divided as shown in FIG. 5 is grouped according to energy, noise and tone.
- FIG. 3 is a diagram illustrating an example of signal division.
- FIG. 7 is a table showing an energy ratio between a high-frequency sub-band signal copied from a low-frequency sub-band signal and an artificially added noise component or tone component in the same energy band.
- FIG. 8 is a block diagram showing a configuration of an encoder according to the present embodiment.
- FIG. 9 is a block diagram showing a configuration of a band extension information encoding unit shown in FIG. 8.
- FIG. 10 shows the necessity of suppressing the tone characteristic of the low band sub-band signal based on the tone Z noise ratio of the input high band sub-band signal and the tone Z noise ratio of the low band sub-band signal.
- FIG. 11 illustrates the relationship between the calculated chirp factor Bi and the two-tone Z noise ratio of the low-frequency sub-band signal and the input high-frequency sub-band signal.
- FIGS. 12 (a) to 12 (c) compare the energy of adjacent subband signals
- FIG. 9 is a diagram showing an example of determining the position of a tone component in a command.
- FIG. 13 is a table for determining whether or not there is a tone component in a subband by comparing the energy of adjacent subbands.
- FIG. 14 is a flowchart showing an operation of a chirp factor calculation unit shown in FIG.
- FIG. 15 is a flowchart showing an operation of a tone signal addition determining section shown in FIG.
- the low-frequency sub-band signal is copied to the high-frequency sub-band, and A case where a high-frequency sub-band signal is generated by superimposing a signal or noise will be described.
- FIG. 8 is a block diagram showing a configuration of encoder 100 according to the present embodiment.
- the encoder according to the present embodiment analyzes the input high-frequency sub-band signal by a simple method without using a calculation method with a high load such as Fourier transform, and outputs the low-frequency sub-band signal and the high-frequency sub-band signal.
- This is an encoder that encodes band extension information for generating RBs, and includes a core audio encoding unit 901, an analysis subband filter 902, a band extension information encoding unit 903, and a bit stream multiplexing unit 904.
- the analysis subband filter 902 includes N sets of an analysis filter and a 1 / N downsampling unit, and divides an input audio signal into N-channel subband signals.
- band extension information encoding section 903 extracts and encodes information necessary for subband signal power band extension processing. The configuration and operation of the band extension information coding unit 903 will be described later in detail.
- core audio encoding section 901 extracts and encodes only a signal representing a low-frequency component of the input signal.
- the coding result of the low-frequency component and the coding result of the band extension information are multiplexed in a bitstream multiplexing unit 904 to generate an output bitstream.
- FIG. 9 is a block diagram showing a configuration of band extension information coding section 903 shown in FIG.
- Band extension information encoding section 903 of the present embodiment uses a calculation with a high processing load such as Fourier transform for band extension information for generating a high band sub-band signal by duplicating a low band sub-band signal.
- the processing unit includes a region dividing unit 101, an energy calculating unit 103, a chirp factor calculating unit 104, a tone signal addition determining unit 105, and a noise component calculating unit 106.
- the chirp factor calculator 104 includes a signal component calculator 111 and a component energy calculator 112. Further, the noise component calculation unit 106 includes a component energy calculation unit 113.
- the sub-band signal input to the band extension information coding unit 903 is In the dividing unit 101, the high frequency part is divided into a plurality of areas.
- the space representing the subband signal is divided into the time direction and the frequency direction, and the energy value calculation, the chirp factor calculation, the noise component calculation, and the tone component calculation are performed. Group doodle for each of the.
- the area division information ei, bi, qi, hi determined for each of the energy value calculation, the chirp factor calculation, the noise component calculation, and the tone component calculation are output to the bit stream multiplexing unit 904.
- a method of dividing the area a predetermined fixed dividing method may be used, or the input subband signals may be divided and adaptively divided so that similar signals fall in the same area.
- the determined area division information is also coded and transmitted by the decoder in order to perform the same area division on the sub-band signal represented by the time Z frequency.
- the subsequent processes of energy calculation, chirp factor calculation, tone component calculation, and noise component calculation are performed in this order on the corresponding areas.
- the energy value Ei in the energy band ei may be calculated by the energy calculation unit 103 for the average energy of the input high frequency sub-band signal for each energy band ei.
- FIG. 14 is a flowchart showing the operation of the chirp factor calculation unit 104.
- the strength of the inverse filtering process for the low-band subband signal is duplicated so that the tone / noise ratio q_lo (i) of the duplicate signal approaches the tone-Z noise ratio q_hi (i) of the high-frequency signal of the input signal. It depends on the degree to which the tone characteristics of the low-frequency signal should be suppressed. The extent to which the tone of the low-frequency signal should be suppressed is controlled by the chirp factor calculated by the chirp factor calculating unit 104.
- the basis of the method disclosed in the present invention is that despite the low tone Z noise ratio q_hi (i) of the input high band subband signal, the tone Z noise ratio q_lo ( When i) is high, the tone characteristic of the low band sub-band signal is suppressed.
- FIG. 10 shows the necessity of suppressing the tone characteristic of the low band sub-band signal based on the tone Z noise ratio of the input high band sub-band signal and the tone Z noise ratio of the low band sub-band signal.
- the tone Z noise ratio qjo (i) or q_hi (i) When the tone Z noise ratio qjo (i) or q_hi (i) is large in both the low band subband signal and the high band subband signal, the tone / noise ratio q_lo (i) or q_hi (i) becomes This indicates that the subband signal has high tone characteristics. Conversely, if the tone / noise ratio qjo (i) or q_hi (i) is small, then the tone Z noise ratio qJo (i) or q_hi (i) will result in a subband signal with poor tonality (i.e., High noise).
- the low-frequency sub-band signal having a high tone characteristic (q_lo is large) is converted into the high-frequency sub-band signal of the original high-frequency sub-band signal having a low tone characteristic (q_hi is small). It can be seen that it is necessary to suppress the tone characteristics of the low-frequency sub-band signal when replicating the data in the subband.
- the tone Z noise ratio of the input high band sub-band signal can be calculated by using a linear prediction process. Assuming that the high-frequency subband signal is represented by S (t, k), this signal can be separated into tone components St (t, k) and noise components Sn (t, k) by using linear prediction. .
- the signal component calculation unit 111 applies the linear prediction to all the high frequency sub-bands k included in the chirp factor band bi, thereby converting the high frequency sub-band signal S (t, k) into the tone component St. (t, k) and the noise component Sn (t, k).
- the total energy of the tone components is included in this chirp factor band.
- T (i) is the number of samples in the time direction of the target chirp factor band bi.
- the chirp factor calculation unit 104 calculates the tone Z noise ratio q_hi (i) of the input high-frequency subband signal in the chirp factor band bi by: Calculate using the formula (S1401)
- the total energy of the tone component Sn 2 (t, k) and the total energy of the noise component Sn, k) can be calculated as follows using the linear prediction processing.
- the component energy calculation unit 112 calculates the total energy of the tone component St 2 (t, k) of the high frequency sub-band signal in the chirp factor band bi and the energy of the noise component Sn 2 (t, k). Calculate the sum.
- the subband signal power of the highband subband k, the lowband subband signal power represented by the mapping function P (k) is generated.
- the data calculation unit 104 calculates the tone Z noise ratio qJoG) of the copied low-frequency sub-band signal from the following equation ( 2 ) (S1402).
- the total energy of the tone component St 2 (t, p (k)) of the low-band sub-band signal copied to the high-frequency sub-band k, and the noise components Sn, p (k )) Is calculated as the sum of the energy of the tone component St 2 (t, k) of the input high frequency sub-band signal in the high frequency sub-band k and the noise component Sn, k) of the input high frequency sub-band signal. It is self-evident that it can be calculated using linear prediction processing in the same way as the energy total.
- the tone Z noise ratio of the input high-frequency sub-band signal and the low-frequency sub-band signal copied to the high-frequency sub-band calculated as described above is evaluated by evaluating the magnitude relationship between the two. , The necessary degree of tone suppression can be determined.
- the tone Z noise ratio q_hi (i) of the input high-frequency sub-band signal is smaller than the first threshold Tr (Yes in S1403), and the low-frequency sub-band signal to be copied is If the tone / noise ratio q_lo (i) is larger than the second threshold value Tr2 (Yes in S1404), the chirp factor calculation unit 104 determines that tone property suppression processing is necessary (S1405). Also, the degree of suppression of tone characteristics, that is, the chirp factor Bi is obtained as in the following equation (S1406).
- Bi min (Bi, 1), which is the second equation of Equation 7, indicates that the smaller of Bi and “1”, which also obtained the first equation force of Equation 7, is selected.
- FIG. 11 illustrates the relationship between the calculated chirp factor Bi and the two-tone Z-noise ratio of the low-frequency sub-band signal and the input high-frequency sub-band signal.
- the chirp factor Bi increases as q_lo (i) increases, and conversely, decreases as q_hi (i) increases. That is, the chirp factor Bi increases as the tone of the low-band sub-band signal increases, and conversely, decreases as the tone of the high-band sub-band signal increases.
- the chirp factor calculation unit 104 determines that the tone suppression processing is not necessary, and thus the chirp factor is set to “0”. Become. As described above, the calculated capture factor Bi is mapped to the high frequency sub-band included in the relevant capture factor band, and is represented as B (t, k). The process of calculating the chirp factor is repeated until the chirp factor is calculated for all the chirp factor bands. Each calculated chirp factor is encoded, and encoded information is sent to the bitstream multiplexing unit 107.
- Equation 7 shown in the above embodiment is an empirical equation, and shows one of the most preferable examples for calculating the chirp factor. Therefore, the formula for calculating the chirp factor is not limited to this.
- FIG. 15 is a flowchart showing the operation of tone signal addition determining section 105 shown in FIG.
- Whether or not it is necessary to add an artificial tone signal to each of the tone bands hi described above is determined by duplicating the tone Z noise ratio q_hi of the high band sub-band signal corresponding to the target tone band. Can be determined based on whether or not the tone Z noise ratio qJo of the low-frequency sub-band signal exceeds a predetermined value.
- two additional conditions are required for adding a tone signal.
- One is that the tone Z noise ratio of the high band sub-band signal is an absolutely large value.
- the tone Z noise ratio of the duplicated low-band subband signal is not absolutely large (rather than relatively high compared to the high-band subband signal). If the tone Z noise ratio of the low-band sub-band signal is very large, that is, if the signal has a very strong tone, the tone of the high-band sub-band signal will It is considered that there is no need to add a new artificial tone signal because it is maintained by the included tone signal components. Note that the tone Z noise ratio of the low-frequency sub-band signal to be copied is affected by the tone suppression processing described above, and it is necessary to consider the influence.
- tone signal addition determination section 105 calculates the tone Z noise ratio of the high band sub-band signal and the copied low band sub-band signal (S1501). At this time, the tone component St (t, k) and the noise component Sn (t, k) calculated by the chirp factor calculation unit 104 can be used for the tone / noise ratio of the high band sub-band signal.
- the processing is different for the tone Z noise ratio of the copied low band sub-band signal because the effect of the tone suppression processing needs to be considered. Since the energy reduction of the tone component due to the tone suppression process can be approximated by multiplying by approximately (1 ⁇ B (t, k)), the tone Z noise ratio of the low-frequency subband signal is calculated as follows: Yes (S1502 [0048] [Equation 9] (t, k))
- tone signal addition determining section 105 determines that it is necessary to add an artificial tone signal to the tone band. (S1503-S1505). That is,
- Tr4, Tr5, Tr6 are predetermined thresholds.
- Tone signal addition determining section 105 makes this determination for all tone bands hi, and sends the information to bit stream multiplexing section 107 as to whether or not a tone signal has been added in each tone band.
- bit stream multiplexing section 107 Only “information on whether or not a tone signal is added” is sent to bit stream multiplexing section 107, but “information indicating a frequency position in a tone band to which a tone signal is added” is also included. May be sent to
- the tone signal addition determining unit 105 another configuration can be used.
- an artificial tone signal is added only when there is a clear tone component in the input high-band subband signal, regardless of the shape of the low-band subband signal.
- the detection of apparent tone components is performed by judging whether or not there is a prominently high energy subband signal among a plurality of relatively low energy subband signals.
- FIGS. 12 (a) to 12 (c) are diagrams showing an example in which the energy of adjacent subband signals is compared to determine the position of a tone component in a tone band. That is, FIGS. 12 (a) to 12 (c) It represents three patterns that serve as criteria for tone component determination. The three patterns are when the tone component is (1) near the center of the subband frequency, (2) when it is near the upper frequency limit of the subband, and (3) when it is near the lower frequency limit of the subband. is there.
- each force indicates that a tone component exists in a certain subband k.
- the tone component of the subband energy 1101 is the center frequency of the subband k. This shows a case in which it exists in the vicinity.
- FIG. 12B shows a case where the tone component of the sub-band energy 1102 exists near the upper limit frequency of the sub-band k. In this case, a part of the signal energy leaks to the adjacent sub-band due to the characteristics of the general sub-band filter, so that the energy of the sub-band (k + 1) also increases.
- FIG. 12 (c) shows a case where the tone component of the subband energy 1103 exists near the lower limit frequency of the subband k. In this case, the energy of the subband (k-1) increases.
- FIG. 13 is a table for determining whether or not there is a tone component in the subband by comparing the energy of adjacent subbands. Based on such a phenomenon, whether or not an obvious tone component exists in subband k can be determined by the relational expression shown in the table of FIG.
- Ethres and Qthres indicate predetermined energy and tone Z noise ratio thresholds
- E (k) is an energy value calculated by the following equation.
- Tone signal addition determination section 105 makes a determination based on the three conditions shown in Fig. 13 for all high band subbands k included in tone band hi, and determines at least one high band subband in at least one high band subband. If the two conditions are satisfied, the tone band is determined to be a signal having a clear tone characteristic, and a flag for adding an artificial tone signal is set ( (SI 506 in Figure 15). This determination is made for all tone bands hi, and the flag information indicating whether or not the determined ability to add an artificial tone signal is sent to bit stream multiplexing section 107. In this example, the same value is used as the determination threshold value for the target subband k and its adjacent subbands, but different threshold values may be used for each subband. good.
- the logical operation of “AND” and “OR” for integrating the determination results in each subband can be selected and used based on the correlation with the set threshold. Also, in the evaluation of tone characteristics, consider the case where tone components are spread over a relatively wide range, and evaluate the tone Z noise ratio of several subbands above and below the target subband k. May be.
- the noise component calculation unit 106 will be described. If the sum of the noise components contained in the duplicated signal is almost equal to the sum of the noise components contained in the input signal, the sound texture expressed by the noise components of the input signal and the duplicated signal will be close. . Further, since the noise component is generally a signal having a wide band in frequency, it should be considered in a band covering a wider band (referred to as a noise band) than the tone band described above. Good. Therefore, since a certain noise band includes a plurality of tone bands, to calculate a correct noise component, the noise component in the tone band to which the tone signal is added and the noise component in the tone band to which the tone signal is not added are calculated. Both noise components must be considered.
- the noise component such that the total value of the noise components composed of these two components is equal to the total value of the noise components in the high-frequency sub-band of the input signal The amount is determined. In this process, it is necessary to consider the influence of the tone suppression process described above.
- the sum of the noise components of the input high-frequency sub-band signal is calculated by the following equation.
- a subband signal to be copied is represented as Qi, where the noise component amount in noise band qi is Qi.
- a noise component derived from a tone band signal to which a tone signal is added is represented by the following equation.
- TB (i) represents a set of tone bands to which a tone is added, which is included in the noise band qi.
- r (t, k) is the ratio of the noise component contained in the copied high-frequency subband signal, and takes into account the effect of the tone suppression processing performed on St (t, p (k), and expressed.
- the amount of noise component caused by the tone band signal to which the tone signal is not added is represented by the following equation.
- NTB (i) represents a set of tone bands to which no tone signal is added, which is included in noise band qi.
- TB (i) U NTB) is all tone bands included in the noise band qi.
- the noise band qi In order for the sum of all noise components included in the copied subband signal to be equal to the noise component of the corresponding input high-frequency subband signal, the following equation must be satisfied.
- the noise component amount QU can be calculated as in the following equation.
- the process of calculating the noise component amount is performed for all the noise bands, and the calculated noise component amount QU is encoded and sent to the bit stream multiplexing unit 107.
- the component energy calculator 113 like the component energy calculator 112 in the chirp factor calculator 104, calculates the energy sum of the tone component St 2 (t, k) of the high-frequency sub-band signal in the noise band qi. , And the total energy of the noise components Sn, k) are calculated.
- the component energy calculation unit 113 of the noise component calculation unit 106 performs processing by the component energy calculation unit 112 of the chirp factor calculation unit 104 as well as addition of the chirp noise and the tone signal in the same noise band. Since the noise component is corrected in consideration of the increase / decrease of the tone component, a noise component closer to the original sound can be calculated.
- the noise component amount Qi In the calculation of the noise component amount Qi., It is possible to omit the noise component derived from the tone band to which the tone signal is added, and to reduce the amount of calculation required for the calculation. This is because, in the tone band to which the tone signal is added, the proportion of the tone component in the signal is very large, so even if a relatively small noise component is set to “0”, the effect on the calculation result is small. .
- the formula for calculating Qi. In this case is expressed by the following formula.
- the present invention is a means useful for improving the quality of a reproduced audio signal in an apparatus for efficiently encoding and decoding an audio signal by separating the spectrum of the audio signal into a tone component and a noise component. That is, the present invention is useful as an encoder that calculates information for extending the bandwidth of an audio signal in a decoder with a method that requires less calculation load, more accurately, and encodes the information together with the low-frequency signal.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/575,452 US7668711B2 (en) | 2004-04-23 | 2005-04-20 | Coding equipment |
JP2006512555A JP4741476B2 (ja) | 2004-04-23 | 2005-04-20 | 符号化装置 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004-128961 | 2004-04-23 | ||
JP2004128961 | 2004-04-23 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005104094A1 true WO2005104094A1 (ja) | 2005-11-03 |
Family
ID=35197222
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/007498 WO2005104094A1 (ja) | 2004-04-23 | 2005-04-20 | 符号化装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US7668711B2 (ja) |
JP (1) | JP4741476B2 (ja) |
WO (1) | WO2005104094A1 (ja) |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007187905A (ja) * | 2006-01-13 | 2007-07-26 | Sony Corp | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
JP2008129542A (ja) * | 2006-11-24 | 2008-06-05 | Fujitsu Ltd | 復号化装置および復号化方法 |
WO2009084221A1 (ja) * | 2007-12-27 | 2009-07-09 | Panasonic Corporation | 符号化装置、復号装置およびこれらの方法 |
JP2010020251A (ja) * | 2008-07-14 | 2010-01-28 | Ntt Docomo Inc | 音声符号化装置及び方法、音声復号化装置及び方法、並びに、音声帯域拡張装置及び方法 |
JP2011059685A (ja) * | 2009-09-04 | 2011-03-24 | Thomson Licensing | 基本層及び拡張層を有する音声信号を検出する方法 |
JP2011215198A (ja) * | 2010-03-31 | 2011-10-27 | Sony Corp | 復号装置および復号方法、符号化装置および符号化方法、並びにプログラム |
JP2012515362A (ja) * | 2009-01-16 | 2012-07-05 | ドルビー インターナショナル アーベー | クロス生成物により向上された高調波転換 |
JP2012234184A (ja) * | 2006-01-06 | 2012-11-29 | Realnetworks Inc | 通信網を介して加入者端末機に送信されるオーディオ信号の出力品質改善のためのオーディオ信号の処理方法およびこの方法を採用したオーディオ信号処理装置 |
JP2014013408A (ja) * | 2009-05-27 | 2014-01-23 | Dolby International Ab | 信号の高周波成分を生成するためのシステム及び方法 |
WO2014115225A1 (ja) * | 2013-01-22 | 2014-07-31 | パナソニック株式会社 | 帯域幅拡張パラメータ生成装置、符号化装置、復号装置、帯域幅拡張パラメータ生成方法、符号化方法、および、復号方法 |
JP2014523548A (ja) * | 2011-06-30 | 2014-09-11 | サムスン エレクトロニクス カンパニー リミテッド | 帯域幅拡張信号生成装置及びその方法 |
JP2014240974A (ja) * | 2014-08-06 | 2014-12-25 | ソニー株式会社 | 符号化装置、符号化方法、およびプログラム |
US9183847B2 (en) | 2010-09-15 | 2015-11-10 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding and decoding signal for high frequency bandwidth extension |
JP2016105174A (ja) * | 2010-12-29 | 2016-06-09 | サムスン エレクトロニクス カンパニー リミテッド | 高周波数帯域幅拡張のための符号化/復号化装置及びその方法 |
US10002621B2 (en) | 2013-07-22 | 2018-06-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding an encoded audio signal using a cross-over filter around a transition frequency |
US10896684B2 (en) | 2017-07-28 | 2021-01-19 | Fujitsu Limited | Audio encoding apparatus and audio encoding method |
US11657788B2 (en) | 2009-05-27 | 2023-05-23 | Dolby International Ab | Efficient combined harmonic transposition |
US12112765B2 (en) | 2015-03-09 | 2024-10-08 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal |
US12142251B2 (en) | 2024-02-12 | 2024-11-12 | Dolby International Ab | Efficient combined harmonic transposition |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0709044D0 (en) | 2007-05-11 | 2007-06-20 | Teradyne Diagnostic Solutions | Signal detection |
KR101161866B1 (ko) * | 2007-11-06 | 2012-07-04 | 노키아 코포레이션 | 오디오 코딩 장치 및 그 방법 |
US9082397B2 (en) * | 2007-11-06 | 2015-07-14 | Nokia Technologies Oy | Encoder |
WO2009059632A1 (en) * | 2007-11-06 | 2009-05-14 | Nokia Corporation | An encoder |
EP2301015B1 (en) * | 2008-06-13 | 2019-09-04 | Nokia Technologies Oy | Method and apparatus for error concealment of encoded audio data |
JP4932917B2 (ja) * | 2009-04-03 | 2012-05-16 | 株式会社エヌ・ティ・ティ・ドコモ | 音声復号装置、音声復号方法、及び音声復号プログラム |
WO2011114192A1 (en) * | 2010-03-19 | 2011-09-22 | Nokia Corporation | Method and apparatus for audio coding |
US8560330B2 (en) * | 2010-07-19 | 2013-10-15 | Futurewei Technologies, Inc. | Energy envelope perceptual correction for high band coding |
US8849663B2 (en) | 2011-03-21 | 2014-09-30 | The Intellisis Corporation | Systems and methods for segmenting and/or classifying an audio signal from transformed audio information |
US9142220B2 (en) | 2011-03-25 | 2015-09-22 | The Intellisis Corporation | Systems and methods for reconstructing an audio signal from transformed audio information |
US8620646B2 (en) | 2011-08-08 | 2013-12-31 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal using harmonic envelope |
US8548803B2 (en) | 2011-08-08 | 2013-10-01 | The Intellisis Corporation | System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain |
US9183850B2 (en) | 2011-08-08 | 2015-11-10 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal |
US9058820B1 (en) | 2013-05-21 | 2015-06-16 | The Intellisis Corporation | Identifying speech portions of a sound model using various statistics thereof |
EP3731226A1 (en) * | 2013-06-11 | 2020-10-28 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Device and method for bandwidth extension for acoustic signals |
US9484044B1 (en) | 2013-07-17 | 2016-11-01 | Knuedge Incorporated | Voice enhancement and/or speech features extraction on noisy audio signals using successively refined transforms |
US9530434B1 (en) | 2013-07-18 | 2016-12-27 | Knuedge Incorporated | Reducing octave errors during pitch determination for noisy audio signals |
US9208794B1 (en) | 2013-08-07 | 2015-12-08 | The Intellisis Corporation | Providing sound models of an input signal using continuous and/or linear fitting |
US9870785B2 (en) | 2015-02-06 | 2018-01-16 | Knuedge Incorporated | Determining features of harmonic signals |
US9922668B2 (en) | 2015-02-06 | 2018-03-20 | Knuedge Incorporated | Estimating fractional chirp rate with multiple frequency representations |
US9842611B2 (en) | 2015-02-06 | 2017-12-12 | Knuedge Incorporated | Estimating pitch using peak-to-peak distances |
TWI771266B (zh) * | 2015-03-13 | 2022-07-11 | 瑞典商杜比國際公司 | 解碼具有增強頻譜帶複製元資料在至少一填充元素中的音訊位元流 |
CN108496221B (zh) * | 2016-01-26 | 2020-01-21 | 杜比实验室特许公司 | 自适应量化 |
JP6769299B2 (ja) * | 2016-12-27 | 2020-10-14 | 富士通株式会社 | オーディオ符号化装置およびオーディオ符号化方法 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07295594A (ja) * | 1994-04-28 | 1995-11-10 | Sony Corp | オーディオ信号符号化方法 |
JPH07336231A (ja) * | 1994-06-13 | 1995-12-22 | Sony Corp | 信号符号化方法及び装置、信号復号化方法及び装置、並びに記録媒体 |
WO2003046891A1 (en) * | 2001-11-29 | 2003-06-05 | Coding Technologies Ab | Methods for improving high frequency reconstruction |
JP2004080635A (ja) * | 2002-08-21 | 2004-03-11 | Sony Corp | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
WO2004027368A1 (en) * | 2002-09-19 | 2004-04-01 | Matsushita Electric Industrial Co., Ltd. | Audio decoding apparatus and method |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE512719C2 (sv) | 1997-06-10 | 2000-05-02 | Lars Gustaf Liljeryd | En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion |
SE9903553D0 (sv) | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing percepptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL) |
US6978236B1 (en) | 1999-10-01 | 2005-12-20 | Coding Technologies Ab | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
US7187907B2 (en) * | 2000-05-09 | 2007-03-06 | Bernard Widrow | Simultaneous two-way transmission of information signals in the same frequency band |
SE0004163D0 (sv) | 2000-11-14 | 2000-11-14 | Coding Technologies Sweden Ab | Enhancing perceptual performance of high frequency reconstruction coding methods by adaptive filtering |
KR100841096B1 (ko) * | 2002-10-14 | 2008-06-25 | 리얼네트웍스아시아퍼시픽 주식회사 | 음성 코덱에 대한 디지털 오디오 신호의 전처리 방법 |
KR100754439B1 (ko) * | 2003-01-09 | 2007-08-31 | 와이더댄 주식회사 | 이동 전화상의 체감 음질을 향상시키기 위한 디지털오디오 신호의 전처리 방법 |
-
2005
- 2005-04-20 WO PCT/JP2005/007498 patent/WO2005104094A1/ja active Application Filing
- 2005-04-20 US US10/575,452 patent/US7668711B2/en not_active Expired - Fee Related
- 2005-04-20 JP JP2006512555A patent/JP4741476B2/ja not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07295594A (ja) * | 1994-04-28 | 1995-11-10 | Sony Corp | オーディオ信号符号化方法 |
JPH07336231A (ja) * | 1994-06-13 | 1995-12-22 | Sony Corp | 信号符号化方法及び装置、信号復号化方法及び装置、並びに記録媒体 |
WO2003046891A1 (en) * | 2001-11-29 | 2003-06-05 | Coding Technologies Ab | Methods for improving high frequency reconstruction |
JP2004080635A (ja) * | 2002-08-21 | 2004-03-11 | Sony Corp | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
WO2004027368A1 (en) * | 2002-09-19 | 2004-04-01 | Matsushita Electric Industrial Co., Ltd. | Audio decoding apparatus and method |
Cited By (59)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012234184A (ja) * | 2006-01-06 | 2012-11-29 | Realnetworks Inc | 通信網を介して加入者端末機に送信されるオーディオ信号の出力品質改善のためのオーディオ信号の処理方法およびこの方法を採用したオーディオ信号処理装置 |
JP2007187905A (ja) * | 2006-01-13 | 2007-07-26 | Sony Corp | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
JP4736812B2 (ja) * | 2006-01-13 | 2011-07-27 | ソニー株式会社 | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
JP2008129542A (ja) * | 2006-11-24 | 2008-06-05 | Fujitsu Ltd | 復号化装置および復号化方法 |
WO2009084221A1 (ja) * | 2007-12-27 | 2009-07-09 | Panasonic Corporation | 符号化装置、復号装置およびこれらの方法 |
JP2010020251A (ja) * | 2008-07-14 | 2010-01-28 | Ntt Docomo Inc | 音声符号化装置及び方法、音声復号化装置及び方法、並びに、音声帯域拡張装置及び方法 |
JP2012515362A (ja) * | 2009-01-16 | 2012-07-05 | ドルビー インターナショナル アーベー | クロス生成物により向上された高調波転換 |
US11200874B2 (en) | 2009-05-27 | 2021-12-14 | Dolby International Ab | Efficient combined harmonic transposition |
JP2014013408A (ja) * | 2009-05-27 | 2014-01-23 | Dolby International Ab | 信号の高周波成分を生成するためのシステム及び方法 |
US11935508B2 (en) | 2009-05-27 | 2024-03-19 | Dolby International Ab | Efficient combined harmonic transposition |
US11657788B2 (en) | 2009-05-27 | 2023-05-23 | Dolby International Ab | Efficient combined harmonic transposition |
US10657937B2 (en) | 2009-05-27 | 2020-05-19 | Dolby International Ab | Efficient combined harmonic transposition |
US9190067B2 (en) | 2009-05-27 | 2015-11-17 | Dolby International Ab | Efficient combined harmonic transposition |
US10304431B2 (en) | 2009-05-27 | 2019-05-28 | Dolby International Ab | Efficient combined harmonic transposition |
JP2011059685A (ja) * | 2009-09-04 | 2011-03-24 | Thomson Licensing | 基本層及び拡張層を有する音声信号を検出する方法 |
JP2011215198A (ja) * | 2010-03-31 | 2011-10-27 | Sony Corp | 復号装置および復号方法、符号化装置および符号化方法、並びにプログラム |
US10152983B2 (en) | 2010-09-15 | 2018-12-11 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding/decoding for high frequency bandwidth extension |
US9183847B2 (en) | 2010-09-15 | 2015-11-10 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding and decoding signal for high frequency bandwidth extension |
US9837090B2 (en) | 2010-09-15 | 2017-12-05 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding and decoding signal for high frequency bandwidth extension |
US10811022B2 (en) | 2010-12-29 | 2020-10-20 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding/decoding for high frequency bandwidth extension |
JP2016105174A (ja) * | 2010-12-29 | 2016-06-09 | サムスン エレクトロニクス カンパニー リミテッド | 高周波数帯域幅拡張のための符号化/復号化装置及びその方法 |
US10453466B2 (en) | 2010-12-29 | 2019-10-22 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding/decoding for high frequency bandwidth extension |
US9734843B2 (en) | 2011-06-30 | 2017-08-15 | Samsung Electronics Co., Ltd. | Apparatus and method for generating bandwidth extension signal |
US10037766B2 (en) | 2011-06-30 | 2018-07-31 | Samsung Electronics Co., Ltd. | Apparatus and method for generating bandwith extension signal |
JP2014523548A (ja) * | 2011-06-30 | 2014-09-11 | サムスン エレクトロニクス カンパニー リミテッド | 帯域幅拡張信号生成装置及びその方法 |
JP2018025830A (ja) * | 2011-06-30 | 2018-02-15 | サムスン エレクトロニクス カンパニー リミテッド | 帯域幅拡張信号の復号化装置 |
JP2016197271A (ja) * | 2011-06-30 | 2016-11-24 | サムスン エレクトロニクス カンパニー リミテッド | 帯域幅拡張信号の復号化装置 |
US9349380B2 (en) | 2011-06-30 | 2016-05-24 | Samsung Electronics Co., Ltd. | Apparatus and method for generating bandwidth extension signal |
WO2014115225A1 (ja) * | 2013-01-22 | 2014-07-31 | パナソニック株式会社 | 帯域幅拡張パラメータ生成装置、符号化装置、復号装置、帯域幅拡張パラメータ生成方法、符号化方法、および、復号方法 |
JPWO2014115225A1 (ja) * | 2013-01-22 | 2017-01-19 | パナソニック株式会社 | 帯域幅拡張パラメータ生成装置、符号化装置、復号装置、帯域幅拡張パラメータ生成方法、符号化方法、および、復号方法 |
US9424847B2 (en) | 2013-01-22 | 2016-08-23 | Panasonic Corporation | Bandwidth extension parameter generation device, encoding apparatus, decoding apparatus, bandwidth extension parameter generation method, encoding method, and decoding method |
US10593345B2 (en) | 2013-07-22 | 2020-03-17 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus for decoding an encoded audio signal with frequency tile adaption |
US11735192B2 (en) | 2013-07-22 | 2023-08-22 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework |
US10347274B2 (en) | 2013-07-22 | 2019-07-09 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
US10332539B2 (en) | 2013-07-22 | 2019-06-25 | Fraunhofer-Gesellscheaft zur Foerderung der angewanften Forschung e.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
US10515652B2 (en) | 2013-07-22 | 2019-12-24 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding an encoded audio signal using a cross-over filter around a transition frequency |
US10573334B2 (en) | 2013-07-22 | 2020-02-25 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding or decoding an audio signal with intelligent gap filling in the spectral domain |
US10311892B2 (en) | 2013-07-22 | 2019-06-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding or decoding audio signal with intelligent gap filling in the spectral domain |
US10276183B2 (en) | 2013-07-22 | 2019-04-30 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding or encoding an audio signal using energy information values for a reconstruction band |
US10147430B2 (en) | 2013-07-22 | 2018-12-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection |
US10847167B2 (en) | 2013-07-22 | 2020-11-24 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework |
US12142284B2 (en) | 2013-07-22 | 2024-11-12 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework |
US10984805B2 (en) | 2013-07-22 | 2021-04-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection |
US11049506B2 (en) | 2013-07-22 | 2021-06-29 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
US11996106B2 (en) | 2013-07-22 | 2024-05-28 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E. V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
US11222643B2 (en) | 2013-07-22 | 2022-01-11 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus for decoding an encoded audio signal with frequency tile adaption |
US11250862B2 (en) | 2013-07-22 | 2022-02-15 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding or encoding an audio signal using energy information values for a reconstruction band |
US11257505B2 (en) | 2013-07-22 | 2022-02-22 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework |
US11289104B2 (en) | 2013-07-22 | 2022-03-29 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding or decoding an audio signal with intelligent gap filling in the spectral domain |
US10134404B2 (en) | 2013-07-22 | 2018-11-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework |
US10332531B2 (en) | 2013-07-22 | 2019-06-25 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding or encoding an audio signal using energy information values for a reconstruction band |
US11769513B2 (en) | 2013-07-22 | 2023-09-26 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding or encoding an audio signal using energy information values for a reconstruction band |
US11769512B2 (en) | 2013-07-22 | 2023-09-26 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection |
US11922956B2 (en) | 2013-07-22 | 2024-03-05 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding or decoding an audio signal with intelligent gap filling in the spectral domain |
US10002621B2 (en) | 2013-07-22 | 2018-06-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for decoding an encoded audio signal using a cross-over filter around a transition frequency |
JP2014240974A (ja) * | 2014-08-06 | 2014-12-25 | ソニー株式会社 | 符号化装置、符号化方法、およびプログラム |
US12112765B2 (en) | 2015-03-09 | 2024-10-08 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal |
US10896684B2 (en) | 2017-07-28 | 2021-01-19 | Fujitsu Limited | Audio encoding apparatus and audio encoding method |
US12142251B2 (en) | 2024-02-12 | 2024-11-12 | Dolby International Ab | Efficient combined harmonic transposition |
Also Published As
Publication number | Publication date |
---|---|
US7668711B2 (en) | 2010-02-23 |
US20070156397A1 (en) | 2007-07-05 |
JPWO2005104094A1 (ja) | 2008-03-13 |
JP4741476B2 (ja) | 2011-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2005104094A1 (ja) | 符号化装置 | |
RU2473140C2 (ru) | Устройство для микширования множества входных данных | |
JP3762579B2 (ja) | デジタル音響信号符号化装置、デジタル音響信号符号化方法及びデジタル音響信号符号化プログラムを記録した媒体 | |
JP5485909B2 (ja) | オーディオ信号処理方法及び装置 | |
KR100986150B1 (ko) | 합성된 스펙트럼 성분을 적용하기 위하여 디코딩된 신호의 특성을 사용하는 오디오 코딩 시스템 | |
US9646616B2 (en) | System and method for audio coding and decoding | |
JP5539203B2 (ja) | 改良された音声及びオーディオ信号の変換符号化 | |
JP5224017B2 (ja) | オーディオ符号化装置、オーディオ符号化方法およびオーディオ符号化プログラム | |
JP5048697B2 (ja) | 符号化装置、復号化装置、符号化方法、復号化方法、プログラムおよび記録媒体 | |
JP6517723B2 (ja) | 高度なスペクトラム拡張を使用して量子化ノイズを低減するための圧縮伸張装置および方法 | |
JP3926726B2 (ja) | 符号化装置および復号化装置 | |
JP5267362B2 (ja) | オーディオ符号化装置、オーディオ符号化方法及びオーディオ符号化用コンピュータプログラムならびに映像伝送装置 | |
US20060031075A1 (en) | Method and apparatus to recover a high frequency component of audio data | |
JP5480274B2 (ja) | 信号処理方法及び装置 | |
CA2489443C (en) | Audio coding system using characteristics of a decoded signal to adapt synthesized spectral components | |
JP2010540990A (ja) | 埋め込み話声およびオーディオコーデックにおける変換情報の効率的量子化のための方法および装置 | |
JP4021124B2 (ja) | デジタル音響信号符号化装置、方法及び記録媒体 | |
JP2008519990A (ja) | 信号符号化の方法 | |
AU2021303726B2 (en) | Audio quantizer and audio dequantizer and related methods | |
JP7447085B2 (ja) | コンパンディングによる密過渡イベントの符号化 | |
JP2008026372A (ja) | 符号化データの符号化則変換方法および装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006512555 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10575452 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Country of ref document: DE |
|
122 | Ep: pct application non-entry in european phase | ||
WWP | Wipo information: published in national office |
Ref document number: 10575452 Country of ref document: US |