JP2005532585A

JP2005532585A - Audio coding

Info

Publication number: JP2005532585A
Application number: JP2004519077A
Authority: JP
Inventors: ロベルトジェイスルエイテル; アンドレアスジェイケッリツ; ゲラルドエイチホト; ブリンケルアルベルトゥスシーデン
Original assignee: Koninklijke Philips Electronics NV
Current assignee: Koninklijke Philips NV
Priority date: 2002-07-08
Filing date: 2003-06-18
Publication date: 2005-10-27
Also published as: AU2003237010A1; EP1522063B1; ATE356404T1; CN1666256A; EP1522063A1; DE60312336D1; US20050259822A1; WO2004006225A1

Abstract

オーディオ信号（ｘ）の符号化（１）は、複数の連続したセグメントの各々について、サンプリングされた信号値のそれぞれの組を提供するステップを有する。前記サンプリングされた信号値は分析され（１３０）、前記複数の連続したセグメントの各々について１つ又は複数の正弦波成分が生成される。正弦波成分は、複数の連続したセグメント間でリンクされる。正弦波コード（ＣＳ）は、前記複数の連続したセグメントの各々についてのリンクされた正弦波成分のトラックを有する。前記トラックのそれぞれは、トラックの開始セグメントに正弦波成分の周波数及び振幅を有する一方で、選択されたトラックは、前記開始セグメントの位相が含まれないことのインジケータを含む。The encoding (1) of the audio signal (x) comprises providing a respective set of sampled signal values for each of a plurality of consecutive segments. The sampled signal value is analyzed 130 and one or more sinusoidal components are generated for each of the plurality of consecutive segments. A sinusoidal component is linked between a plurality of consecutive segments. A sinusoidal code (CS) has a track of linked sinusoidal components for each of the plurality of consecutive segments. Each of the tracks has a frequency and amplitude of a sinusoidal component in the starting segment of the track, while the selected track includes an indicator that the phase of the starting segment is not included.

Description

本発明は、オーディオ信号の符号化及び復号に関する。 The present invention relates to encoding and decoding of audio signals.

パラメトリック符号化方式、特に正弦波コーダが、国際特許出願第ＷＯ００／７９５１９−Ａ１号（整理番号ＰＨＮ０１７５０２）及びＰＣＴ特許出願第ＩＢ０２／０１２９７（整理番号ＰＨＮＬ０１０２５２）において説明されている。このコーダでは、オーディオセグメント又はフレームが、正弦波コーダによって、振幅、周波数及び位相パラメータによって表される多くの正弦波を用いてモデル化される。セグメントの正弦波が推定されたら、トラッキングアルゴリズムが開始される。このアルゴリズムは、正弦波をセグメントごとに互いにリンクさせるよう試みる。このように、連続したセグメントからの適当な正弦波からの正弦波パラメータがリンクされ、いわゆるトラックが得られる。リンクの基準は、２つの連続したセグメントの周波数に基づくが、更に、振幅及び／又は位相情報も用いられることができる。この情報は、リンクされるべき正弦波を決定する費用関数において組み合わせられる。このように、トラッキングアルゴリズムは、特定の時間インスタンスで開始し、複数の時間セグメントに亘って或る時間の間発展し、そして停止するような正弦波トラックを生じさせる。 Parametric coding schemes, particularly sinusoidal coders, are described in International Patent Application No. WO 00 / 79519-A1 (reference number PHN017502) and PCT Patent Application No. IB02 / 01297 (reference number PHNL010252). In this coder, an audio segment or frame is modeled by a sine wave coder with a number of sine waves represented by amplitude, frequency and phase parameters. Once the sine wave of the segment is estimated, the tracking algorithm is started. This algorithm attempts to link sine waves together segment by segment. In this way, sine wave parameters from the appropriate sine wave from successive segments are linked to obtain a so-called track. The link criteria is based on the frequency of two consecutive segments, but also amplitude and / or phase information can be used. This information is combined in a cost function that determines the sine wave to be linked. Thus, the tracking algorithm produces a sinusoidal track that starts at a particular time instance, develops for a period of time over multiple time segments, and stops.

この方式においては、正弦波トラックのために、初期位相が送信され、このトラック中の他の正弦波の位相は、この初期位相及びこれら他の正弦波の周波数から取得される。正弦波の振幅及び周波数は、前の正弦波とは異なって符合化されることもできる。更に、非常に短いトラックは、除去されることができる。 In this scheme, the initial phase is transmitted for a sine wave track, and the phase of the other sine waves in the track is obtained from this initial phase and the frequency of these other sine waves. The amplitude and frequency of the sine wave can also be encoded differently than the previous sine wave. Furthermore, very short tracks can be removed.

このように、トラッキングに起因して、正弦波コーダのビットレートは、相当に低下されうる。 Thus, due to tracking, the bit rate of the sine wave coder can be significantly reduced.

本発明によれば、請求項１に記載のオーディオ信号を符合化する方法が提供される。 According to the invention, there is provided a method for encoding an audio signal according to claim 1.

本発明の好適な実施例（図１）において、エンコーダは、国際特許出願公開第０１／６９５９３−Ａ１号（整理番号ＰＨＮＬ０００１２０）に説明される種類の正弦波コーダである。このコーダ及びその対応するデコーダの動作は、よく説明されており、ここでは、本発明に関連する部分の説明のみが与えられる。 In the preferred embodiment of the present invention (FIG. 1), the encoder is a sine wave coder of the type described in WO 01 / 69593-A1 (reference number PHNL000120). The operation of this coder and its corresponding decoder is well described, and only a description of the parts relevant to the present invention is given here.

前の場合及び好適な実施例の両方において、オーディオコーダ１は、入力オーディオ信号を特定のサンプリング周波数でサンプリングし、結果として、オーディオ信号のデジタル表示ｘ（ｔ）が得られる。次にコーダ１は、サンプリングされた入力信号を３つの成分、即ち、過渡信号成分、持続決定的成分及び持続確率的成分に分離する。オーディオコーダ１は、過渡コーダ１１、正弦波コーダ１３及び雑音コーダ１４を有する。オーディオコーダは、随意に、利得圧縮機構（ＧＣ）１２を有する。 In both the previous case and the preferred embodiment, the audio coder 1 samples the input audio signal at a specific sampling frequency, resulting in a digital representation x (t) of the audio signal. The coder 1 then separates the sampled input signal into three components: a transient signal component, a persistent deterministic component and a persistent stochastic component. The audio coder 1 includes a transient coder 11, a sine wave coder 13, and a noise coder 14. The audio coder optionally has a gain compression mechanism (GC) 12.

過渡コーダ１１は、過渡検出器（ＴＤ）１１０、過渡解析器（ＴＡ）１１１及び過渡合成器（ＴＳ）１１２を有する。まず、信号ｘ（ｔ）が過渡検出器１１０に入る。この検出器１１０は、過渡信号成分が存在するか、及び、当該信号成分の位置を推定する。この情報は、過渡解析器１１１に供給される。過渡信号成分の位置が決定された場合、過渡解析器１１１は、該過渡信号成分（の主要部分）を抽出するよう試みる。該解析器は形状関数を、好ましくは推定された開始位置で始まる信号セグメントと整合し、例えば（少）数の正弦波成分を利用して該形状関数の下の内容を決定する。この情報は、過渡コードＣＴに含まされる。過渡コードＣＴを生成するためのより詳細な情報は、国際特許出願第０１／６９５９３−Ａ１号に与えられる。 The transient coder 11 includes a transient detector (TD) 110, a transient analyzer (TA) 111, and a transient synthesizer (TS) 112. First, the signal x (t) enters the transient detector 110. The detector 110 estimates whether there is a transient signal component and the position of the signal component. This information is supplied to the transient analyzer 111. If the position of the transient signal component is determined, the transient analyzer 111 attempts to extract the (main part) of the transient signal component. The analyzer aligns the shape function with a signal segment, preferably starting at the estimated starting position, and determines the content under the shape function using, for example, a (small) number of sinusoidal components. This information is included in the transient code CT. More detailed information for generating the transient code CT is given in International Patent Application No. 01 / 69593-A1.

過渡コードＣＴは、過渡合成器１１２に供給される。合成された過渡信号成分は減算器１６において入力信号ｘ（ｔ）から減算され、結果として信号ｘ１が得られる。ＧＣ１２が省略された場合は、ｘ１＝ｘ２となる。 The transient code CT is supplied to the transient synthesizer 112. The synthesized transient signal component is subtracted from the input signal x (t) in the subtracter 16, and as a result, the signal x1 is obtained. When GC 12 is omitted, x1 = x2.

信号ｘ２は、正弦波コーダ１３に供給されて、正弦波解析器（１３０）において解析され、該解析器は（決定的）正弦波成分を決定する。従って、過渡解析器の存在は望ましいが、これは必須なわけではなく、本発明は、このような解析器無しでも実現可能であることが分かるであろう。いずれにせよ、正弦波コーディングの結果は正弦波コードＣＳであり、例示的な正弦波コードＣＳの従来型の生成を示すより詳細な例は、国際特許出願公開第００／７９５１９−Ａ１号において与えられる。 The signal x2 is fed to a sine wave coder 13 and analyzed in a sine wave analyzer (130), which determines the (deterministic) sine wave component. Thus, although the presence of a transient analyzer is desirable, this is not essential and it will be appreciated that the present invention can be implemented without such an analyzer. In any case, the result of sinusoidal coding is a sinusoidal code CS, and a more detailed example showing the conventional generation of an exemplary sinusoidal code CS is given in WO 00 / 79519-A1. It is done.

しかし、要するに、このような正弦波コーダは、入力信号ｘ２を、１つのフレームセグメントから次のフレームセグメントにリンクされる正弦波成分のトラックとして、符合化する。従来技術においては、トラックは、最初に、所与のセグメントで始まる正弦波についての開始周波数、開始振幅及び開始位相によって表される（発生（birth））。 In essence, however, such a sine wave coder encodes the input signal x2 as a track of a sine wave component that is linked from one frame segment to the next. In the prior art, a track is first represented by a starting frequency, starting amplitude and starting phase for a sine wave starting in a given segment (birth).

本発明の好適な実施例において、開始位相は、トラックについて、当該トラックの長さの関数として選択的に符合化される。より詳しくは、開始位相は、長い存続期間のトラックについてのみ使用される。これはなぜなら、長い存続期間のトラックには、おそらく音的情報（tonal information）が符合化してあり、このような場合には、トラックの開始位相を送信することによって可能な限りトラックの音的特性を保存することが重要であると推定されるからである。短い存続期間のトラックは、非音的情報（non-tonal information）を符号化していると推定され、従って、開始位相をこのようなトラックと共に送信することは、実際には、トラックに音的特性を追加し、そのため、符号化されたビットストリームを再生するときに歪みの知覚を与えると推定される。 In the preferred embodiment of the invention, the starting phase is selectively encoded as a function of the length of the track. More specifically, the start phase is only used for long duration tracks. This is because a long-lived track is probably encoded with tonal information, and in such a case, the acoustic characteristics of the track are as much as possible by transmitting the starting phase of the track. This is because it is presumed that it is important to preserve. Short-lived tracks are presumed to encode non-tonal information, so transmitting a start phase with such a track is actually a sonic property to the track. Is thus estimated to give a perception of distortion when playing the encoded bitstream.

短いトラックについては開始位相を送信しないことによって、ビットレートの大幅な節減が起こり得ることが分かるであろう。なぜなら、短いトラックについての開始位相データのオーバヘッドは、より長いトラックについてよりも比例的に高いからである。 It will be appreciated that by not transmitting the start phase for short tracks, significant bit rate savings can occur. This is because the overhead of starting phase data for short tracks is proportionally higher than for longer tracks.

トラックが開始位相を必要とするのに十分長いか、又はこれに対応して、開始位相を必要としないのに十分短いか、を決定するための多くの代替基準がある。 There are many alternative criteria for determining whether a track is long enough to require a starting phase or correspondingly short enough not to require a starting phase.

最も単純な基準は絶対トラック長を選択することである。実験的に、４０ｍｓ未満のトラックが開始位相を必要としない一方で、これよりも長いトラックは、有利には、開始位相と共に送信されることが発見された。８ｍｓの更新間隔を有するエンコーダにおいては、これは、長さがセグメント５つ分未満のトラックは、開始位相を含むのではなく、該トラックによって開始位相が使用されないことのインジケータを含むことを意味する。（このようなインジケータを符合化することは、開始位相値と比較してより効率的であると推定される。）代替的に、エンコーダが、該エンコーダが作成する符合化された信号が互換デコーダによって復号されると推定する場合には、該エンコーダは、開始位相が使用されないという表示を含む必要はなく、デコーダに、開始位相のないトラックをどのように処理するかの決定を任せることができる。 The simplest criterion is to select an absolute track length. Experimentally, it has been discovered that tracks shorter than 40 ms do not require a start phase, while longer tracks are advantageously transmitted with a start phase. In an encoder with an update interval of 8 ms, this means that a track that is less than 5 segments in length does not contain a start phase, but an indicator that the start phase is not used by that track. . (Encoding such an indicator is presumed to be more efficient compared to the starting phase value.) Alternatively, the encoded signal produced by the encoder is a compatible decoder. The encoder does not have to include an indication that the starting phase is not used, but can leave the decoder to determine how to process the track without the starting phase. .

代替基準は、トラックが存在している時間間隔が、有声か（voiced）又は非有声か（non-voiced）を決定することに基づく。時間間隔が有声であると決定されたら、この、事実上非音的である時間間隔は（従ってトラックも）、開始位相を含まないべきであり、非有声の時間間隔についてはその逆が成り立つ、と推定される。L.R.
Rabiner, M.J. Cheng, A.E. Rosenberg及びC.A.
McGonegalによる「A
Comparative Performance Study of Several Pitch Detection Algorithms」(IEEE
Transactions on Acoustics, Speech and Signal Processing、ＡＳＳＰ−２４巻、ページ３９９〜４１７、１９７６年１０月）は、このような決定を行う方法を開示し、この方法においては、このような方法を実現する成分をトラッキングアルゴリズム中に含めることによって、このトラッキングアルゴリズムは、音的時間間隔内に存在しているトラックについては開始位相情報を含む一方で、非音的時間間隔内に存在しているトラックについては、開始位相は、符合化されたビットストリームに含まれない。この基準は、音的時間間隔においては、トラックは非音的時間間隔におけるよりも長くなる傾向にあり、従って、トラックが開始位相を含むべきか否かの決定がなされる前に、トラックの最終的な長さが知られている必要はない、と仮定する。 The alternative criteria is based on determining whether the time interval in which a track is present is voiced or non-voiced. If the time interval is determined to be voiced, this effectively silent time interval (and thus the track) should not include the start phase, and vice versa for unvoiced time intervals, It is estimated to be. LR
Rabiner, MJ Cheng, AE Rosenberg and CA
“A” by McGonegal
Comparative Performance Study of Several Pitch Detection Algorithms '' (IEEE
Transactions on Acoustics, Speech and Signal Processing, ASSP-24, pp. 399-417, October 1976) discloses a method for making such a determination, and in this method, components that realize such a method. Is included in the tracking algorithm so that the tracking algorithm includes start phase information for tracks that exist within a sound time interval, while for tracks that exist within a non-sound time interval, The starting phase is not included in the encoded bitstream. This criterion is that in a sound time interval, the track tends to be longer than in a non-sound time interval, so the final of the track is determined before a determination is made whether the track should contain a start phase. Suppose the specific length need not be known.

時間間隔が音的オーディオ信号を表示するか又は非音的オーディオ信号を表示するかを決定する代替の方法は、以下に述べるように、信号の雑音成分のエネルギーレベルを調べることである。雑音エネルギーの正弦波成分エネルギーに対する比が所与の時間間隔の間所与の閾値を超えると分かったら、上記と同様に、オーディオ信号は非音的であり、開始位相情報は、トラックに含まれる必要はないと仮定されることができる。また、雑音エネルギーの正弦波成分エネルギーに対する比が所与の閾値を下回ったときについて、逆もまた同じである。再び、信号が音的であると決定される場合には、トラックは非音的信号におけるよりも長いと仮定される。 An alternative way of determining whether the time interval represents a sonic or non-audio audio signal is to examine the energy level of the noise component of the signal, as described below. Once the ratio of noise energy to sinusoidal component energy is found to exceed a given threshold for a given time interval, the audio signal is silent and the start phase information is included in the track, as above. It can be assumed that there is no need. The reverse is also true when the ratio of noise energy to sinusoidal component energy falls below a given threshold. Again, if the signal is determined to be sonic, the track is assumed to be longer than in the non-phonic signal.

好適な実施例及び従来技術の両方において、トラックは、トラックが終了する（死亡）セグメントまで、周波数差及び振幅差によって（長いトラックについては更に位相差（継続）によって）連続したセグメントで表される。実際には、長いトラックについてさえ位相差を符号化することにはほとんど利得がないと決定され得る。従って、位相情報は、継続のために符号化される必要は全くなく、長いトラックのための位相情報は、継続的位相復元を用いて再生成されることができる。 In both the preferred embodiment and the prior art, a track is represented by a continuous segment by frequency and amplitude differences (or by phase difference (continuation) for longer tracks) until the track ends (dead) segment. . In practice, it can be determined that there is little gain in encoding the phase difference even for long tracks. Thus, the phase information need not be encoded for continuation, and the phase information for long tracks can be regenerated using continuous phase recovery.

従来技術と同様に、本発明の改善された正弦波コーダによって生成される正弦波コードＣＳから、正弦波合成器（ＳＳ）１３１によって正弦波信号成分が復元される。この信号は、減算器１７において正弦波コーダ１３への入力ｘ２から減算され、結果として（大きな）遷移信号成分及び（主な）決定的正弦波成分を含まない残存信号ｘ３が得られる。 Similar to the prior art, the sine wave signal component is recovered by the sine wave synthesizer (SS) 131 from the sine wave code CS generated by the improved sine wave coder of the present invention. This signal is subtracted from the input x2 to the sine wave coder 13 in a subtractor 17, resulting in a residual signal x3 that does not contain a (large) transition signal component and a (main) deterministic sine wave component.

残存信号ｘ３は、主に雑音を有すると推定され、好適な実施例の雑音解析器１４は、例えば国際特許出願第０１／８９０８６−Ａ１号に説明されるように、この雑音の表示である雑音コードＣＮを生じさせる。再び、このような解析器の使用は、本発明の実現のためには必須ではなく、このような使用に対しては補完的であることが分かるであろう。 The residual signal x3 is presumed to have mainly noise, and the noise analyzer 14 of the preferred embodiment is a noise that is an indication of this noise, as described, for example, in International Patent Application No. 01 / 89086-A1. The code CN is generated. Again, it will be appreciated that the use of such an analyzer is not essential for the realization of the present invention and is complementary to such use.

最後に、マルチプレクサ１５で、コードＣＴ、ＣＳ及びＣＮを含むオーディオストリームＡＳが構成される。オーディオストリームＡＳは、例えばデータバス、アンテナシステム、記憶媒体等に供給される。 Finally, the multiplexer 15 forms an audio stream AS including codes CT, CS, and CN. The audio stream AS is supplied to, for example, a data bus, an antenna system, a storage medium, and the like.

図２は、本発明によるオーディオプレーヤ３を示す。例えば図１によるエンコーダによって生成されるオーディオストリームＡＳ’は、データバス、アンテナシステム、記憶媒体等から得られる。オーディオストリームＡＳは、デマルチプレクサ３０において多重分離され、コードＣＴ、ＣＳ及びＣＮが得られる。これらのコードは、過渡合成器３１、正弦波合成器３２及び雑音合成器３３にそれぞれ供給される。過渡信号成分は、過渡合成器３１において過渡コードＣＴから計算される。過渡コードが形状関数を示す場合、形状は受信されたパラメータに基づいて計算される。更に、形状内容は、正弦波成分の周波数及び振幅に基づいて計算される。過渡コードＣＴがステップを示す場合、過渡は計算されない。合計過渡信号ｙＴは、全ての過渡の合計である。 FIG. 2 shows an audio player 3 according to the invention. For example, the audio stream AS ′ generated by the encoder according to FIG. 1 is obtained from a data bus, an antenna system, a storage medium or the like. The audio stream AS is demultiplexed by the demultiplexer 30, and codes CT, CS, and CN are obtained. These codes are supplied to the transient synthesizer 31, the sine wave synthesizer 32, and the noise synthesizer 33, respectively. The transient signal component is calculated from the transient code CT in the transient synthesizer 31. If the transient code indicates a shape function, the shape is calculated based on the received parameters. Further, the shape content is calculated based on the frequency and amplitude of the sine wave component. If the transient code CT indicates a step, no transient is calculated. The total transient signal yT is the sum of all transients.

正弦波コードＣＳが、信号ｙＳを生成するために用いられ、これは、所与のセグメントの正弦波の合計として説明される。デコーダにおいて、正弦波トラックの正弦波の位相は、２つの方法のうちの１つで決定される。トラックが開始位相を含む場合には、従来技術と同様に、位相は、生じる正弦波の位相及び中間正弦波の周波数から計算される。開始位相が提供されないという表示をトラックが含む好適な実施例において、デコーダは、トラックの全ての正弦波についてランダムな開始位相を生成して、トラックを前と同様にに合成する。（デコーダは、代替的に、生じる正弦波についてのみランダムな開始位相を計算してもよく、残りの位相は従来技術と同様に計算してもよい。）このような表示又は開始位相が提供されない場合には、デコーダは、トラックの正弦波のためのランダムな開始位相を作成することが必要とされると推定する。 A sine wave code CS is used to generate the signal yS, which is described as the sum of the sine waves of a given segment. At the decoder, the phase of the sine wave of the sine wave track is determined in one of two ways. If the track contains a starting phase, the phase is calculated from the resulting sine wave phase and the intermediate sine wave frequency, as in the prior art. In a preferred embodiment where the track includes an indication that no start phase is provided, the decoder generates a random start phase for all sine waves of the track and synthesizes the track as before. (The decoder may alternatively calculate a random starting phase only for the resulting sine wave, and the remaining phases may be calculated as in the prior art.) No such indication or starting phase is provided. In some cases, the decoder assumes that it is required to create a random starting phase for the sine wave of the track.

本発明の１つの側面が、非音的オーディオフラグメントの非音性（non-tonality）を保存することであることは理解されるであろう。従って、本発明を利用するときには、エンコーダが、非音的オーディオフラグメントのためには非常に短いトラックを保存し、デコーダが、これらの短いトラックをランダムな開始位相で再生することが望ましい可能性がある。これは、非常に短いトラックがビットストリームのどこにも含まれない従来技術とは異なる。 It will be appreciated that one aspect of the invention is to preserve the non-tonality of non-audio audio fragments. Thus, when utilizing the present invention, it may be desirable for the encoder to store very short tracks for non-audio audio fragments and for the decoder to play these short tracks with a random starting phase. is there. This is different from the prior art where very short tracks are not included anywhere in the bitstream.

同時に、雑音コードＣＮは、雑音のスペクトラムに近似した周波数応答を有する主にフィルタである雑音合成器ＮＳ３３に供給される。ＮＳ３３は、雑音コードＣＮによって白色雑音信号をフィルタすることによって復元された雑音ｙＮを生成する。 At the same time, the noise code CN is supplied to the noise synthesizer NS33, which is mainly a filter having a frequency response approximating the noise spectrum. The NS 33 generates a noise yN restored by filtering the white noise signal by the noise code CN.

合計信号ｙ（ｔ）は、過渡信号ｙＴ並びにあらゆる振幅伸長（ｇ）と正弦波信号ｙＳ及びノイズ信号ｙＮの合計との積の合計を有する。オーディオプレーヤは、それぞれの信号の和を取るための２つの加算器３６及び３７を有する。
合計信号は出力ユニット３５に供給され、これは例えばスピーカである。 The total signal y (t) has the sum of the product of the transient signal yT and any amplitude expansion (g) and the sum of the sinusoidal signal yS and the noise signal yN. The audio player has two adders 36 and 37 for taking the sum of the respective signals.
The total signal is supplied to the output unit 35, which is for example a speaker.

図３は、図１で示すオーディオコーダ１及び図２で示すオーディオプレーヤ３を有する本発明によるオーディオシステムを示す。このようなシステムは、再生及び記録機能を提供する。オーディオストリームＡＳは、オーディオコーダから通信チャネル２を通じてオーディオプレーヤに供給され、この通信チャネルは、ワイヤレス接続、データ２０バス又は記憶媒体であってもよい。通信チャネル２が記憶媒体である場合には、記憶媒体はシステムに固定されていてもよく、又は、取り外し可能なディスク、メモリスティック等であってもよい。通信チャネル２は、オーディオシステムの一部であってもよいが、多くの場合オーディオシステムの外部にある。 FIG. 3 shows an audio system according to the invention having the audio coder 1 shown in FIG. 1 and the audio player 3 shown in FIG. Such a system provides playback and recording functions. The audio stream AS is supplied from the audio coder to the audio player through the communication channel 2, which may be a wireless connection, a data 20 bus or a storage medium. When the communication channel 2 is a storage medium, the storage medium may be fixed to the system, or may be a removable disk, memory stick, or the like. The communication channel 2 may be part of the audio system, but is often external to the audio system.

本発明は、いかなる正弦波オーディオコーダにおいても用いられることができる。従って、本発明は、このようなコーダが使用されるどこにでも適用可能である。 The present invention can be used in any sinusoidal audio coder. Thus, the present invention is applicable wherever such a coder is used.

上述の実施例は、本発明を制限するのではなく説明しているのであり、当業者は、添付の請求項の範囲から離れることなく代替の実施例を設計することができることに注意されたい。請求項において、括弧内に配置されたいかなる引用符号も当該請求項を制限するように解釈されてはならない。「有する（comprising）」なる用語は、請求項に記載されたもの以外の要素又はステップの存在を排除するものではない。本発明は、幾つかの個別素子を有するハードウェアにより、及び適切にプログラムされたコンピュータによって実行されることができる。幾つかの手段を列挙している装置請求項において、複数のこれらの手段を、ハードウェアの全く同一のアイテムによって具体化することもできる。特定の手段が相互に異なる従属請求項において記載されているという事実のみでは、これらの手段の組合せが有利に用いられることができないということを示すことにはならない。 It should be noted that the above-described embodiments illustrate rather than limit the invention, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word “comprising” does not exclude the presence of elements or steps other than those listed in a claim. The present invention can be implemented by hardware having several individual elements and by a suitably programmed computer. In the device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measured cannot be used to advantage.

本発明によるオーディオコーダの実施例を示す。1 shows an embodiment of an audio coder according to the present invention. 本発明によるオーディオプレーヤの実施例を示す。1 shows an embodiment of an audio player according to the present invention. 本発明によるオーディオコーダ及びオーディオプレーヤを有するシステムを示す。1 shows a system having an audio coder and an audio player according to the present invention.

Claims

In a method of encoding an audio signal,
Providing a respective set of sampled signal values for each of a plurality of consecutive segments;
Analyzing the sampled signal value to generate one or more sinusoidal components for each of the plurality of consecutive segments;
Linking sinusoidal components between a plurality of consecutive segments;
Generating a sine wave code having linked sine wave component tracks for each of the plurality of consecutive segments, each of the tracks having a frequency and amplitude of a sine wave component of a start segment of the track; And the selected track does not include the phase of the start segment; and
Generating an encoded audio stream including the sine wave code;
Having a method.

2. The method of claim 1, wherein the selected track includes an indicator that the phase of the starting segment is not included.

The method of claim 1, wherein the selected track is less than 5 segments in length.

The method of claim 1, wherein the selected track is less than 40 ms in length.

The method of claim 1, wherein the selected track represents a silent component of an audio signal.

The method of claim 1, wherein the selected track represents a component of a voiced time interval of the audio signal.

The method of claim 1, wherein the selected track represents a noisy interval component of the audio signal.

The method of claim 1, wherein each of the tracks has a frequency and amplitude difference for each sinusoidal component of the next continuation segment of the track.

In a method for decoding an audio stream,
Reading an encoded audio stream including a sinusoidal code having a track of linked sinusoidal components for each of a plurality of consecutive segments, each of the tracks being a sinusoid of a start segment of the track Having a frequency and amplitude of components, and the selected track does not include the phase of the start segment;
Generating a random starting phase for the selected track;
Synthesizing the audio signal using the sine wave code comprising restoring a sine wave component over a plurality of consecutive segments;
Having a method.

10. The method of claim 9, wherein the generating step includes generating a random phase for each sine wave component of the selected track.

In an audio coder configured to process a respective set of sampled signal values for each of a plurality of consecutive segments of an audio signal,
An analyzer configured to analyze the sampled signal value to generate one or more sinusoidal components for each of the plurality of consecutive segments;
A linker configured to link sinusoidal components between a plurality of consecutive segments;
A component configured to generate, for each of the plurality of consecutive segments, a sine wave code having a track of linked sine wave components, each of the tracks being a sine wave component of a start segment of the track The selected track does not include the phase of the start segment; and
A bitstream generator for generating an encoded audio stream including the sine wave code;
Audio coder with.

Means for reading an encoded audio stream comprising a sinusoidal code having a track of linked sinusoidal components for each of a plurality of consecutive segments, each of said tracks comprising a start segment of the track Means having a frequency and amplitude of a sinusoidal component, wherein the selected track does not include the phase of the start segment;
A phase generator configured to generate a random starting phase for the selected track;
A synthesizer for synthesizing the audio signal using the sine wave code, comprising restoring a sine wave component over a plurality of consecutive segments;
An audio player.

An audio system comprising the audio coder according to claim 11 and the audio player according to claim 12.

In an audio stream having a sinusoidal code representing at least a component of the audio signal, the code has a track of sinusoidal components linked between the plurality of consecutive segments, each of the tracks being a start segment of the track. An audio stream having a frequency and amplitude of a sinusoidal component and the selected track does not include the phase of the start segment.

A storage medium in which the audio stream according to claim 14 is stored.