JPH0519789A

JPH0519789A - Voice encoding device

Info

Publication number: JPH0519789A
Application number: JP3171445A
Authority: JP
Inventors: Motoaki Koyama; 山元昭児
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1991-07-11
Filing date: 1991-07-11
Publication date: 1993-01-29
Anticipated expiration: 2017-09-17
Also published as: JP3325277B2

Abstract

PURPOSE:To facilitate addressing in voice reproduction and enable natural voice reproduction by compressing no-voice sections of a voice signal into frames which have constant length regardless of the length of the no-voice sections. CONSTITUTION:A no-voice detection part 1 detects the no-voice parts from the voice signal S1 and a no-voice counter 2 counts the number of the detected frames. A voice encoding part 4 encodes the voice to be encoded in the voice signal S1. Namely, the voice encoding part 4 encodes the voice parts as they are and compresses the no-voice parts to output data together with the counted value of the no-voice counter 2, so that they are recorded in a memory 5 for code storage. When it is judged that the no-voice part is longer than the specific memory 5 for code storage, and an address generator 3 stops advancing the address from the point of time. Then when the voice part of the voice signal S1 is inputted, the advance of the address is restarted and the encoded voice signal is stored in the memory 5.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音声符号化装置に関
し、特に、無音部分の符号化に着目した音声符号化装置
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coder, and more particularly to a speech coder which focuses on coding a silent portion.

【０００２】[0002]

【従来の技術】従来、音声信号を符号化する場合、高効
率で符号化するために、無音部分の処理について特別な
処理を行なってきた。2. Description of the Related Art Conventionally, in the case of encoding a voice signal, special processing has been performed for processing a silent portion in order to encode with high efficiency.

【０００３】例えば、音声のない無音区間を符号化する
に当って、単位時間当りの符号化ビット数の少ない低ビ
ットレートで符号化する、いわゆる、アダプティブな方
式はその一つである。これは、例えば以下のようなもの
である。即ち、図５のタイミングチャートに示すよう
に、時間ｔの経過と共に順次入力される音声符号のｎフ
レームからｎ＋７フレームまでを低メモリアドレスから
高メモリアドレスに向かって順次符号化するとする。ｎ
＋３フレームからｎ＋５フレームまでの間は無音区間で
ある。このため、これらの区間は低ビットレートで符号
化することにより、通常の音声部分の符号化に対して、
圧縮率を１／ｋにしている。[0003] For example, when encoding a silent section with no voice, one of them is a so-called adaptive method in which encoding is performed at a low bit rate with a small number of encoding bits per unit time. This is as follows, for example. That is, as shown in the timing chart of FIG. 5, it is assumed that n frames to n + 7 frames of the voice code sequentially input with the elapse of time t are sequentially encoded from the low memory address to the high memory address. n
There is a silent section from the +3 frame to the n + 5 frame. Therefore, by encoding these sections at a low bit rate, compared with the encoding of the normal voice part,
The compression rate is 1 / k.

【０００４】一方、音声のない無音区間を全く符号化し
ないという、ランレングス的な符号化も知られている。
これは、以下のようなものである。即ち、図６のタイム
チャートに示すように、時間ｔの経過と共に入力される
音声波形Ｗのなかで、同図（ａ）に示すように、ある閾
値ＴＨ以上のｎ＋１区間、ｎ＋３区間のみを符号化の対
象とし、閾値ＴＨ以下のｎ区間、ｎ＋２区間は全く符号
化の対象としない。そして、同図（ｂ）に示すように、
音声のあるｎ＋１区間、ｎ＋３区間のみをつなげて符号
化するという方式である。On the other hand, there is also known run-length coding in which a silent section without voice is not coded at all.
This is as follows. That is, as shown in the time chart of FIG. 6, in the speech waveform W input with the elapse of the time t, only the n + 1 section and the n + 3 section having a certain threshold value TH or more are coded, as shown in FIG. The n sections and the n + 2 sections below the threshold TH are not targeted for encoding at all. Then, as shown in FIG.
This is a method in which only the n + 1 section and the n + 3 section having voice are connected and encoded.

【０００５】従来の音声符号化装置は以上のいずれかの
方法により、符号化効率を高めてきた。The conventional speech coding apparatus has improved coding efficiency by any of the above methods.

【０００６】[0006]

【発明が解決しようとする課題】無音区間を音声区間に
比べて低ビットレートで符号化する第１の方式の場合、
音声区間と無音区間の単位時間当りのメモリ量が異な
る。そのため、復号化時にスキップ等の特殊再生を行な
うと、符号化されたデータを音声か否かで単位時間当り
のアドレススキップ量が異なってくる。そのため、スキ
ップ時のアドレッシングの計算を行なう場合、メモリ内
容を一旦読み出す必要がある。これにより、回路規模が
大きくなり、演算時間も増大するという問題点がある。In the case of the first method of encoding the silent section at a bit rate lower than that of the speech section,
The memory amount per unit time in the voice section and the silent section is different. Therefore, if special reproduction such as skipping is performed at the time of decoding, the address skip amount per unit time varies depending on whether the encoded data is voice or not. Therefore, when calculating the addressing at the time of skip, it is necessary to read the memory contents once. As a result, there is a problem that the circuit scale increases and the calculation time also increases.

【０００７】一方、無音区間を全く符号化しないという
第２の方式の場合、無音区間のデータが全く欠落してい
る。このため、復号化した音声を再生すると、無音区間
が欠落した再生音となる。このため、録音時の音声の忠
実な再生ができず、不自然な再生音になってしまうとい
う問題がある。On the other hand, in the case of the second method in which the silent section is not coded at all, the data of the silent section is completely lost. For this reason, when the decoded voice is played back, the played sound becomes a playback sound in which a silent section is missing. For this reason, there is a problem in that the sound at the time of recording cannot be faithfully reproduced, resulting in an unnatural reproduced sound.

【０００８】以上のように、従来の音声符号化では、音
声の復号化時に処理が複雑になったり、再生音が不自然
になったりするという問題がある。As described above, the conventional speech coding has a problem that the processing becomes complicated at the time of decoding the speech and the reproduced sound becomes unnatural.

【０００９】本発明は、上記に鑑みてなされたもので、
その目的は、音声再生時のアドレッシングを容易にする
と共に自然な音声再生を可能とすることにある。The present invention has been made in view of the above,
The purpose is to facilitate addressing at the time of voice reproduction and enable natural voice reproduction.

【００１０】[0010]

【課題を解決するための手段】本発明の第１の音声符号
化装置は、入力音声信号の無音部分を検出して無音検出
信号を出力する無音検出手段と、前記無音検出信号出力
中のクロック数を計数してカウンタ値を出力する無音カ
ウンタと、前記無音検出信号が出力されないときの入力
音声信号の有音部分を符号化音声信号とすると共に、前
記無音検出信号出力中は無音符号を発生する、符号化手
段と、前記符号化手段からの符号化音声信号および無音
符号と前記無音カウンタ手段からのカウンタ値とを格納
するメモリと、前記カウンタ値が予め定めた基準値を超
えたときは前記メモリのアドレスの歩進を停止させ、前
記カウンタ値が前記基準値を超えないときは前記アドレ
スを歩進させる、アドレス発生手段とを備えるものとし
て構成される。A first speech coding apparatus according to the present invention comprises a silence detecting means for detecting a silent portion of an input speech signal and outputting a silence detecting signal, and a clock outputting the silence detecting signal. A silence counter that counts the number and outputs a counter value, and a voiced part of the input voice signal when the silence detection signal is not output is used as an encoded voice signal, and a silence code is generated during the output of the silence detection signal. The encoding means, the memory for storing the encoded voice signal and the silence code from the encoding means and the counter value from the silence counter means, and when the counter value exceeds a predetermined reference value, Address generation means for stopping the stepping of the address of the memory and stepping up the address when the counter value does not exceed the reference value.

【００１１】本発明の第２の音声符号化装置は、前記第
１の装置において、前記アドレス発生手段は、前記カウ
ンタ値が前記基準値を超えたときには、前記メモリのア
ドレスの歩進を停止させるだけでなく、前記アドレスを
前記無音検出信号の出力開始時点のアドレスまで戻す機
能を有するものとして構成されているものとして構成さ
れる。A second speech coding apparatus according to the present invention is the speech coding apparatus according to the first apparatus, wherein the address generating means stops the stepping of the address of the memory when the counter value exceeds the reference value. In addition, it is configured to have a function of returning the address to the address at the output start time of the silence detection signal.

【００１２】本発明の第３の音声符号化装置は、前記第
１又は第２の装置において、前記アドレス発生手段は、
無音部分のフレーム長と有音部分のフレーム長とが等し
くなるようにアドレッシングするものとして構成され
る。A third voice encoding apparatus of the present invention is the first or second apparatus, wherein the address generating means is
The addressing is configured so that the frame length of the silent portion and the frame length of the voiced portion are equal.

【００１３】[0013]

【作用】入力音声信号の無音部分が無音検出手段で検出
される。無音部分が所定の長さよりも長いと判断された
ときには、符号化手段からの無音符号がメモリに格納さ
れる。ただし、このときからアドレス発生手段はアドレ
スの歩進を停止する。この後、入力音声手段の音声部分
が入力すると、アドレスの歩進が再開する。そして、そ
の音声部分が符号化音声信号としてメモリに格納され
る。The silent part of the input voice signal is detected by the silent detecting means. When it is determined that the silent portion is longer than the predetermined length, the silent code from the encoding means is stored in the memory. However, from this time, the address generating means stops the stepping of the address. After that, when the voice part of the input voice means is input, the stepping of the address is restarted. Then, the audio part is stored in the memory as an encoded audio signal.

【００１４】[0014]

【実施例】以下、図面を参照しながら本発明の実施例を
説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１５】図１は、本発明の一実施例に係る音声符号
化装置のブロック図である。図１に示すように、無音検
出部１は、音声信号Ｓ１から無音部を検出する。この検
出フレーム数は、無音カウンタ２によってカウントされ
る。音声符号化部４は、音声信号Ｓ１の中の符号化対象
となる音声を符号化する。つまり、音声符号化部４は、
音声部はそのまま符号化し、無音部は圧縮して無音カウ
ンタ２からのカウント値と共にデータ出力する。音声符
号化部４からのデータは、符号蓄積用メモリ５に記録さ
れる。この記録に使われるメモリアドレスＡｍは、アド
レス発生器３が無音カウンタ２からのカウント信号に基
づいて発生する。FIG. 1 is a block diagram of a speech coder according to an embodiment of the present invention. As shown in FIG. 1, the silence detector 1 detects a silence from the audio signal S1. The number of detected frames is counted by the silence counter 2. The audio encoding unit 4 encodes the audio to be encoded in the audio signal S1. That is, the voice encoding unit 4
The voice part is encoded as it is, the silent part is compressed, and data is output together with the count value from the silent counter 2. The data from the voice encoding unit 4 is recorded in the code storage memory 5. The memory address Am used for this recording is generated by the address generator 3 based on the count signal from the silence counter 2.

【００１６】図２は、図１の無音検出部１の一例を示す
ブロック図である。図２に示すように、検波器１１は入
力された音声信号Ｓ１を検波する。この場合の検波方式
としては、絶対値検波や２乗検波等の方式がとられる。
検波器１１で検波された信号は、ローパスフィルタ１２
で平滑された後、比較器１３で固定閾値θ１と比較され
る。そして、固定閾値θ１よりも小さい信号の区間につ
いては、無音検出信号Ｓ２が出力される。FIG. 2 is a block diagram showing an example of the silence detector 1 of FIG. As shown in FIG. 2, the detector 11 detects the input audio signal S1. As the detection method in this case, methods such as absolute value detection and square detection are used.
The signal detected by the detector 11 is the low-pass filter 12
After being smoothed by, the comparator 13 compares it with a fixed threshold value θ1. Then, the silence detection signal S2 is output for the section of the signal smaller than the fixed threshold θ1.

【００１７】図３は、図１の無音カウンタ２の一例を示
すブロック図である。図３に示すように、無音カウンタ
２はクロックＣ１ｋ１によりカウント動作し、無音検出
信号Ｓ２により同期リセットされる。カウンタ値θｎ
が、無音カウンタ２から出力される。FIG. 3 is a block diagram showing an example of the silence counter 2 of FIG. As shown in FIG. 3, the silence counter 2 counts by the clock C1k1 and is synchronously reset by the silence detection signal S2. Counter value θn
Is output from the silence counter 2.

【００１８】図４は、図１のアドレス発生器３の一例を
示すブロック図である。図４に示すように、比較器３１
は、無音カウンタ２からのカウンタ値θｎと固定閾値θ
２を比較し、その結果を出力する。論理ゲート３６は、
この比較結果とクロックＣｌｋ２とのナンド論理をと
り、アドレスレジスタ３２にストローブ信号を与える。
インクリメント部３３は、アドレスレジスタ３２の出力
データを入力し、そのデータに＋１したデータをメモリ
アドレスＡｍとして出力する。減算器３４は、アドレス
レジスタ３２の出力データを入力し、そのデータからデ
ータＤを減算した値をメモリアドレスＡｍとして出力す
る。切替器３５は、比較器３１からの出力に基づいて、
インクリメント部３３の出力と減算器３４の出力とのう
ちのいずれかのデータを切り替え出力し、アドレスレジ
スタ３２に与える。FIG. 4 is a block diagram showing an example of the address generator 3 of FIG. As shown in FIG.
Is the counter value θn from the silence counter 2 and the fixed threshold value θ.
2 is compared and the result is output. The logic gate 36 is
The NAND logic of this comparison result and the clock Clk2 is taken and a strobe signal is given to the address register 32.
The increment unit 33 inputs the output data of the address register 32 and outputs the data obtained by adding 1 to the data as the memory address Am. The subtractor 34 inputs the output data of the address register 32 and outputs a value obtained by subtracting the data D from the data as the memory address Am. The switch 35 is based on the output from the comparator 31,
Data of either the output of the increment unit 33 or the output of the subtractor 34 is switched and output, and given to the address register 32.

【００１９】比較器３１の出力は、符号蓄積用メモリ５
に対するライト信号ＷＲとしても用いられる。The output of the comparator 31 is the code storage memory 5
It is also used as a write signal WR for.

【００２０】以上述べたような構成において、次にその
動作を図７のタイミングチャートにしたがって説明す
る。ちなみに、図７（ａ）は入力される音声波形Ｗ、
（ｂ）は無音検出部１による無音検出信号、（ｃ）は無
音カウンタ２のカウンタ値、（ｄ）はアドレス発生器３
における比較器３１の出力であるライト信号ＷＲ、
（ｅ）はアドレス発生器３のアドレスレジスタ３２のカ
ウント値、（ｆ）はアドレスレジスタ３２のストローブ
信号をそれぞれ示すものである。The operation of the above-described structure will be described with reference to the timing chart of FIG. By the way, FIG. 7A shows an input voice waveform W,
(B) is a silence detection signal by the silence detector 1, (c) is a counter value of the silence counter 2, and (d) is an address generator 3.
The write signal WR which is the output of the comparator 31 in
(E) shows the count value of the address register 32 of the address generator 3, and (f) shows the strobe signal of the address register 32, respectively.

【００２１】先ず、音声信号Ｓ１が入力され、音声波形
Ｗが固定閾値θ１を上回っているとする。この場合は、
無音検出部１が無音を検出しない。このため、音声符号
化部４は、音声信号Ｓ１を符号化し、符号蓄積用メモリ
５に符号化データとして書き込む。この時、アドレスレ
ジスタ３２には、切替器３５を通じて、インクリメント
部３３の出力データが与えられる。このため、クロック
ＣＬＫ２に基づいてアドレスは順次インクリメントさ
れ、符号蓄積用メモリ５には順次符号化データが書き込
まれる。このとき、無音カウンタ２のカウント値は
“０”のままで歩進しない。First, it is assumed that the audio signal S1 is input and the audio waveform W exceeds the fixed threshold value θ1. in this case,
The silence detector 1 does not detect silence. Therefore, the voice encoding unit 4 encodes the voice signal S1 and writes it into the code storage memory 5 as encoded data. At this time, the output data of the increment unit 33 is given to the address register 32 through the switch 35. Therefore, the address is sequentially incremented based on the clock CLK2, and the coded data is sequentially written in the code storage memory 5. At this time, the count value of the silent counter 2 remains “0” and does not advance.

【００２２】一方、音声信号Ｓ１が入力され、音声波形
Ｗが固定閾値θ１以下とする。この場合、無音検出部１
からは、図７（ｂ）に示すように無音検出信号Ｓ２が出
力される。これにより、無音カウンタ２は、同図（ｃ）
に示すように、クロックＣＬＫ１のカウントを開始し、
カウンタ値θｎを出力する。このカウンタ値θｎが固定
閾値θ２に達すると、符号化圧縮すべきということが判
定される。この判定は、比較器３１で行われる。その結
果、同図７（ｄ）に示すように、比較器３１から、ライ
ト信号ＷＲと切替信号ＣＨが送出され、切替器３５が切
り替わる。切替器３５が切り替わると、図７（ｅ）に示
すように、アドレスレジスタ３２には、減算器３４によ
って、現在のアドレスｋよりデータＤだけ減算したアド
レス（ｋ−Ｄ）がセットされる。一方、図７（ｆ）に示
すように、論理ゲート３６に与えられていたクロックＣ
ｌＫ２がマスクされる。このため、アドレスレジスタ３
２はアドレス（ｋ−Ｄ）をキープすることになる。ちな
みに、ここでの減算データＤは、比較器３１に設定され
る固定閾値θ２に相当するデータに設定される。一方、
アドレス発生器３から、カウンタ値θｎが固定閾値θ２
を越えたことを示すライト信号ＷＲが出力されたとす
る。このときには、音声符号化部４は、音声信号Ｓ１の
符号化データの代わりに、無音マークを符号蓄積用メモ
リ５に書き込む。無音区間が続く限りアドレスレジスタ
３２はアドレス（ｋ−Ｄ）を保持する。無音カウンタ２
は無音区間のカウントを継続する。次に、音声信号Ｓ１
の音声波形Ｗが固定閾値θ１を越えると、無音検出部１
からの無音検出信号Ｓ２がなくなる。その結果、無音カ
ウンタ２はその時点のカウント値ｎのままでカウント動
作を停止し、次に同期リセットされてカウンタ値ｎを
“０”に戻す。これに先立って、音声符号化部４は、無
音カウンタ２のカウント値ｎを符号蓄積用メモリ５に記
録する。その結果、符号蓄積用メモリ５は、図９に示す
ように、（ａ）の状態までは音声符号化部４の符号化デ
ータを記録するが、無音カウンタ２のカウント値θｎが
固定閾値θ２に達すると無音と判定し、この時点で
（ｂ）に示すように、アドレスをデータＤだけ戻したメ
モリアドレスに無音マークを記録し、その後はアドレス
を進めずに、次に音声区間が始まった時点で無音カウン
タ２のカウンタ値θｎであるｎを記録する。On the other hand, the voice signal S1 is input and the voice waveform W is set to a fixed threshold value θ1 or less. In this case, the silence detector 1
Outputs a silence detection signal S2 as shown in FIG. 7 (b). As a result, the silent counter 2 is displayed in the same figure (c).
, The clock CLK1 starts counting,
The counter value θn is output. When the counter value θn reaches the fixed threshold value θ2, it is determined that encoding and compression should be performed. This judgment is performed by the comparator 31. As a result, as shown in FIG. 7D, the write signal WR and the switching signal CH are sent from the comparator 31, and the switching device 35 is switched. When the switch 35 is switched, as shown in FIG. 7E, the address register 32 is set to the address (k−D) obtained by subtracting the data D from the current address k by the subtractor 34. On the other hand, as shown in FIG. 7F, the clock C supplied to the logic gate 36
1K2 is masked. Therefore, the address register 3
2 will keep the address (k-D). Incidentally, the subtraction data D here is set to data corresponding to the fixed threshold value θ2 set in the comparator 31. on the other hand,
The counter value θn from the address generator 3 is a fixed threshold value θ2.
It is assumed that the write signal WR indicating that the value exceeds the limit is output. At this time, the voice encoding unit 4 writes a silence mark in the code storage memory 5 instead of the encoded data of the voice signal S1. The address register 32 holds the address (k−D) as long as the silent section continues. Silence counter 2
Keeps counting silent periods. Next, the audio signal S1
When the voice waveform W of the sound exceeds the fixed threshold θ1, the silence detector 1
The silence detection signal S2 from is lost. As a result, the silent counter 2 stops the counting operation while keeping the count value n at that time, and is then synchronously reset to return the counter value n to "0". Prior to this, the voice encoding unit 4 records the count value n of the silence counter 2 in the code storage memory 5. As a result, as shown in FIG. 9, the code storage memory 5 records the encoded data of the voice encoding unit 4 up to the state of (a), but the count value θn of the silence counter 2 becomes the fixed threshold θ2. When it reaches, it is judged as silence, and at this time, as shown in (b), a silence mark is recorded in the memory address where the address is returned by the data D, and thereafter the address is not advanced and the next voice section starts. Then, n, which is the counter value θn of the silent counter 2, is recorded.

【００２３】さて、音声区間に戻り、カウンタ値θｎが
“０”に戻ることに伴い、カウンタ値θｎは固定閾値θ
２より小さくなる。このため、ライト信号ＷＲがなくな
り、切替器３５も減算器３４からインクリメント部３３
側に切り替わる。その結果、アドレスレジスタ３２は、
再び、クロックＣｌｋ２に同期してアドレス（ｋ−Ｄ）
から（ｋ−Ｄ＋１）とインクリメント動作を始める。こ
れにより符号蓄積用メモリ５には、音声符号化部４で符
号化された音声信号Ｓ１が順次書き込まれる。By returning to the voice section and the counter value θn returning to “0”, the counter value θn is fixed to the fixed threshold value θ.
It becomes smaller than 2. Therefore, the write signal WR disappears, and the switching unit 35 also moves from the subtractor 34 to the increment unit 33
Switch to the side. As a result, the address register 32 is
Again, the address (k-D) is synchronized with the clock Clk2.
Then, the increment operation is started from (k-D + 1). As a result, the voice signal S1 encoded by the voice encoder 4 is sequentially written in the code storage memory 5.

【００２４】以上のようにして符号蓄積用メモリ５に書
き込まれた音声符号化データは、図１０に示すように、
無音区間で無音マークと無音区間フレーム数が無音デー
タフォーマットで記録され、音声区間では符号化音声デ
ータが音声データフォーマットで記録される。The voice coded data written in the code storage memory 5 as described above is as shown in FIG.
The silence mark and the number of frames in the silent section are recorded in the silent data format in the silent section, and the encoded audio data is recorded in the audio data format in the audio section.

【００２５】さて、以上のようなフォーマットで記録さ
れた符号化音声データを再生するには、音声データフォ
ーマット区間では符号蓄積用メモリ５のアドレスを進め
ながら符号化音声データをそのまま再生すればよい。一
方、無音データフォーマット区間では、無音マークによ
って無音フレームであることを検出し、その区間では符
号蓄積用メモリ５のアドレスを停止して、無音データを
再生し続ければよい。同時に、無音区間フレーム数をカ
ウントし、カウント値が一杯になった時点で符号蓄積用
メモリ５のアドレスを止めずに進めればよい。なお、無
音区間をスキップさせたければ符号蓄積用メモリ５のア
ドレスを止めずに進めればよい。なお、図１０に示すよ
うに、無音データフォーマットと音声データフォーマッ
トでデータサイズを同じにしておけば無音部と音声部で
アドレス制御を切り替える必要はない。In order to reproduce the coded voice data recorded in the above format, the coded voice data may be reproduced as it is while advancing the address of the code storage memory 5 in the voice data format section. On the other hand, in the silent data format section, it is only necessary to detect the silent frame by the silent mark, stop the address of the code storage memory 5 in that section, and continue to reproduce the silent data. At the same time, the number of silent section frames is counted, and when the count value becomes full, the address of the code storage memory 5 may be advanced without stopping. If it is desired to skip the silent section, it is sufficient to proceed without stopping the address of the code storage memory 5. As shown in FIG. 10, if the data size is the same in the silent data format and the audio data format, it is not necessary to switch the address control between the silent part and the audio part.

【００２６】なお、無音データフォーマットと音声デー
タフォーマットでデータサイズを同じにした場合、アド
レッシングが容易になり、高速サーチ、早送り、巻き戻
しなどの特殊再生を、データの内容を読み出す必要なく
実現することができる。When the silent data format and the audio data format have the same data size, addressing is facilitated, and special reproduction such as high-speed search, fast-forward, and rewind is realized without reading the contents of the data. You can

【００２７】以上のような構成によれば、連続無音区間
の長さに拘らずに無音区間は１フレームに圧縮されるの
で、無音区間が長ければ長い程データの圧縮率が高くな
る。今、音声区間と無音区間の比が１：ｕであるとす
る。図５のような圧縮方式をとって音声部に対して無音
部を１／ｋに圧縮した場合と比較してみる。図５のよう
な方式で実現できる圧縮率α１が、 α１＝（１＋ｕ）／｛１＋ｕ（１／ｋ）｝＝（１＋ｕ）・ｋ／（ｋ＋ｕ）となり、本実施例の方式では圧縮率α２が α２＝（１＋ｕ）／２となる。ｕ＞ｋであるとすると、 α１＜α２となり、本実施例の場合、無音区間が長ければ長い程、
データの圧縮率が高くなる。また、無音区間をスキップ
する場合も無音区間の展開をしなくて済むという利点が
ある。According to the above configuration, the silent section is compressed into one frame regardless of the length of the continuous silent section. Therefore, the longer the silent section, the higher the data compression rate. Now, it is assumed that the ratio between the voice section and the silent section is 1: u. A comparison will be made with the case where the silent part is compressed to 1 / k with respect to the voice part by using the compression method as shown in FIG. The compression rate α1 that can be realized by the method as shown in FIG. 5 is α1 = (1 + u) / {1 + u (1 / k)} = (1 + u) · k / (k + u), and the compression rate α2 in the method of this embodiment is α2 = (1 + u) / 2. If u> k, then α1 <α2, and in the present embodiment, the longer the silent section,
Higher data compression rate. Further, there is an advantage that it is not necessary to expand the silent section even when skipping the silent section.

【００２８】なお、図８は、アドレス発生器３のの構成
例を示す。同図８の構成と図４の構成の違いは、減算器
３４と切替器３５を省略した点にあり、構成が簡単にな
っている。無音区間を判別したときに、既に無音カウン
タ２は固定閾値θ２までカウントを進めている。しか
し、この例では、アドレスレジスタ３２のアドレスを元
に戻さずに単にストップさせるだけに留めている。その
結果、符号蓄積用メモリ５には、固定閾値θ２に対応す
るフレームだけ無音データが記録されてしまい、圧縮率
の点でも不利となる。しかし、回路構成が簡単でコスト
が安くなるという利点を有する。FIG. 8 shows a configuration example of the address generator 3. The difference between the configuration of FIG. 8 and the configuration of FIG. 4 is that the subtractor 34 and the switch 35 are omitted, and the configuration is simple. When the silent section is discriminated, the silent counter 2 has already advanced the count to the fixed threshold θ2. However, in this example, the address of the address register 32 is not returned but is simply stopped. As a result, silent data is recorded in the code storage memory 5 only for the frames corresponding to the fixed threshold value θ2, which is also disadvantageous in terms of compression rate. However, there are advantages that the circuit configuration is simple and the cost is low.

【００２９】なお、上記実施例では、それぞれの構成要
素を個別の回路構成として示した。しかし、ソフトウェ
アにより制御されるマイクロコンピュータやディジタル
シグナルプロセッサを用いても同様の機能を実現するこ
とが可能である。また、カウンタ等もディジタル的なカ
ウントに代えて、積分器等のアナログ的な回路でも実現
可能である。また、上記実施例では、アドレスレジスタ
３２にアドレス値を与えるために、減算器３４とインク
リメント部３３の２つの回路の出力値を切替器３５で切
り替える構成を例示した。しかし、インクリメント部３
３と減算器３４を共通の演算器で構成し、切替器３５に
対する切り替え信号により演算値を切り替える構成とし
てもよい。In the above embodiment, each component is shown as an individual circuit configuration. However, the same function can be realized by using a microcomputer or digital signal processor controlled by software. Further, the counter and the like can be realized by an analog circuit such as an integrator instead of the digital count. Further, in the above-described embodiment, in order to give the address value to the address register 32, the switch 35 switches the output values of the two circuits of the subtractor 34 and the incrementer 33. However, the increment unit 3
3 and the subtractor 34 may be configured by a common arithmetic unit, and the arithmetic value may be switched by a switching signal to the switching unit 35.

【００３０】なお、音声符号化装置４としては、ＰＣＭ
（パルスコードモジュレーション）、ＡＤＭ（アダプテ
ィブデルタモジュレーション）、ＡＤＰＣＭ（アダプテ
ィブヂッファレンシャルパルスコードモジュレーショ
ン）、ＡＰＣ−ＡＢ（アダプティブプレヂクティブコー
ディング−アダプティブビットアロケーション）等の各
種の符号化方式が適用可能である。また、音声符号化部
４に入力される音声信号Ｓ１は、アナログ信号でも、デ
ィジタル信号でもよい。It should be noted that the voice encoding device 4 is a PCM.
Various encoding methods such as (pulse code modulation), ADM (adaptive delta modulation), ADPCM (adaptive differential pulse code modulation), and APC-AB (adaptive predictive coding-adaptive bit allocation) are applicable. . The voice signal S1 input to the voice encoding unit 4 may be an analog signal or a digital signal.

【００３１】また、符号蓄積用メモリ５としては、半導
体メモリはもちろん、磁気ディスクや磁気テープでもよ
く、データを記録できる媒体ならあらゆるものが適用可
能である。また、リアルタイムな記録を要求されなけれ
ば、もう少し低速のメモリが適用可能である。As the code storage memory 5, not only a semiconductor memory but also a magnetic disk or a magnetic tape may be used, and any medium capable of recording data can be applied. Also, if a real-time recording is not required, a slightly slower memory can be applied.

【００３２】さらに、無音カウンタ２は、音声信号Ｓ１
の大きさが固定閾値θ１を越えた時点でリセットされる
が、固定閾値θ１を越えた時間が一定の時間を越えた場
合場合に無音カウンタ２をリセットするような構成とし
てもよい。また、無音カウンタ２は、アップカウンタに
限らず、ダウンカウンタとしてもよい。Further, the silence counter 2 is operated by the voice signal S1.
Is reset when the magnitude exceeds the fixed threshold θ1, but the silent counter 2 may be reset when the time over the fixed threshold θ1 exceeds a certain time. The silence counter 2 is not limited to the up counter, but may be the down counter.

【００３３】[0033]

【発明の効果】以上述べたように、本発明の実施例によ
れば、音声信号の無音区間をその長さに拘わらず一定の
長さのフレームに圧縮するようにしたので、無音区間が
長く続く場合の圧縮率を高めることが可能であり、また
再生を行なう場合のデータの取り扱いが容易で回路構成
が簡単になり、更に無音カウンターで一定時間以上無音
区間が続かないと無音と判定しないようにしたので再生
音声の品質を高めることができる。As described above, according to the embodiment of the present invention, the silent section of the audio signal is compressed into a frame of a constant length regardless of its length, so that the silent section is long. It is possible to increase the compression rate in the case of continuing, and the data handling during playback is easy and the circuit configuration is simple, and the silence counter will not judge that there is no silence unless it continues for a certain period of time or more. As a result, the quality of the reproduced voice can be improved.

[Brief description of drawings]

【図１】本発明の一実施例の音声符号化装置のブロック
図。FIG. 1 is a block diagram of a speech coder according to an embodiment of the present invention.

【図２】図１の無音検出部の一例を示すブロック図。FIG. 2 is a block diagram showing an example of a silence detecting section in FIG.

【図３】図１の無音カウンタの一例を示すブロック図。FIG. 3 is a block diagram showing an example of the silence counter of FIG.

【図４】図１のアドレス発生器３の一例を示すブロック
図。FIG. 4 is a block diagram showing an example of an address generator 3 of FIG.

【図５】従来の音声の符号化方式の一例を示すタイミン
グチャート。FIG. 5 is a timing chart showing an example of a conventional audio encoding method.

【図６】従来の音声の符号化方式の他の例を示すタイミ
ングチャート。FIG. 6 is a timing chart showing another example of a conventional audio encoding system.

【図７】図１〜図４の回路の動作を説明するためのタイ
ミングチャート。FIG. 7 is a timing chart for explaining the operation of the circuits of FIGS.

【図８】本発明の他の実施例の音声符号化装置の部分ブ
ロック図。FIG. 8 is a partial block diagram of a speech coder according to another embodiment of the present invention.

【図９】符号蓄積用メモリでの無音区間の記録状態の説
明図。FIG. 9 is an explanatory diagram of a recorded state of a silent section in the code storage memory.

【図１０】符号蓄積用メモリへの記録フォーマットの説
明図。FIG. 10 is an explanatory diagram of a recording format in a code storage memory.

[Explanation of symbols]

１無音検出部２無音カウンタ３アドレス発生器４音声符号化部５符号蓄積用メモリ１１検波器１２ローパスフィルタ１３比較器３１比較器３２アドレスレジスタ３３インクリメント部３４減算器３５切替器３６論理ゲート 1 silence detector 2 silence counter 3 address generator 4 Speech coding unit 5 Code storage memory 11 detector 12 Low-pass filter 13 Comparator 31 Comparator 32 address register 33 increment unit 34 Subtractor 35 switch 36 logic gates

Claims

[Claims]

1. A silence detecting means for detecting a silence portion of an input voice signal and outputting a silence detection signal; a silence counter for counting the number of clocks during the silence detection signal output and outputting a counter value; The voiced part of the input voice signal when the detection signal is not output is a coded voice signal, and a silence code is generated during the output of the silence detection signal, a coding means, and a coded voice from the coding means. A memory for storing a signal, a silence code, and a counter value from the silence counter means; and when the counter value exceeds a predetermined reference value, the stepping of the address of the memory is stopped, and the counter value is the A voice encoding device, comprising: address generating means for incrementing the address when the reference value is not exceeded.

2. When the counter value exceeds the reference value, the address generating means not only stops the stepping of the address of the memory, but also sets the address to the address at the time of starting the output of the silence detection signal. 3. The device of claim 2, configured as having a rewind function.

3. The apparatus according to claim 1, wherein the address generating means performs addressing so that a frame length of a silent portion and a frame length of a voiced portion become equal to each other.