JP2004061567A

JP2004061567A - Noise canceller

Info

Publication number: JP2004061567A
Application number: JP2002215931A
Authority: JP
Inventors: Yasumasa Morishita; 森下　康正
Original assignee: NEC Engineering Ltd
Current assignee: NEC Engineering Ltd
Priority date: 2002-07-25
Filing date: 2002-07-25
Publication date: 2004-02-26

Abstract

<P>PROBLEM TO BE SOLVED: To provide a noise canceller having superior quality of telephone call by suppressing the change in the tone of the background noise. <P>SOLUTION: In the noise canceller for attenuating the noise of the time region of an input signal, a time region signal is converted into frequency domain data in FFT4, an attenuation control signal e is smoothed with a smoother 1, when it is detected in a threshold circuit 2 with a voice content ratio d being less than a prescribed value, in the case that attenuation is controlled in response to a voice content ratio d to perform noise cancellation in the frequency region data; otherwise the smoother 1 is bypass controlled. Thereby the tone change in the background noise is suppressed. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明はノイズキャンセラに関し、特に据置型電話機、携帯型電話機、音声会議装置等のように音響信号を入力信号として取り扱う装置に用いて好適なノイズキャンセラに関する。
【０００２】
【従来の技術】
雑音を含む信号から雑音成分を減衰させ、所望の音声などの信号を取り出す装置は、ノイズキャンセラと呼ばれている。一般的に、背景雑音は白色雑音であり、パワーとしてみた場合すべての周波数領域に対して時間的変動が少ない。これに対し、信号成分は当然時間的パワー変動を持っている。特に、信号成分が音声信号である場合は、瞬間、瞬間をとってみるとひとつの周波数の音とその高調波のみである。従って、比較的容易にノイズキャンセラを構成することができる。
【０００３】
しかしながら、近年、据置型電話機や携帯型電話機等において、信号成分として音声信号ばかりでなく、音声信号以外の信号が多く含まれるようになってきており、このために、ノイズキャンセラの構成を根本的に見直す必要が生じてきている。その一つの方法として、雑音を含む信号に対し、周波数分析を行ってスペクトル（スペクトルパワー）を求め、周波数帯域別に時間変動を監視して帯域別信号対雑音比を求め、その信号対雑音（Ｓ／Ｎ）比に応じて減衰量を計算して、帯域別に減衰させることによって雑音成分を減衰させる方法がある。
【０００４】
すなわち、本質的には入力信号のスペクトル成分を刻々と変化させて、入力信号の中に含まれている雑音成分を除去するノイズキャンセラが提案されている。また、特開２０００−４７６９７号公報には、減衰量が周波数帯域別に制御されており、時間領域において推定した入力信号の信号対雑音比を信号（音声）含有率と考えて、各周波数帯域の減衰量を計算する際に、帯域別信号対雑音比に加味して減衰量を計算するノイズキャンセラが提案されている。
【０００５】
図５に、特開２０００−４７６９７号公報に開示されたノイズキャンセラのブロック図を示す。図５を参照すると、このノイズキャンセラは、入力信号ａからデータを切り出す窓関数回路３と、この窓関数回路３の出力を周波数領域データに、変換するＦＦＴ（Ｆａｓｔ　Ｆｏｕｒｉｅｒ　Ｔｒａｎｓｆｏｒｍ；高速フーリエ変換）回路４と、周波数領域毎に減衰を与える減衰器５と、周波数領域データを時系列データに変換するＩＦＦＴ（Ｉｎｖｅｒｓｅ　Ｆａｓｔ　Ｆｏｕｒｉｅｒ　Ｔｒａｎｓｆｏｒｍ；逆高速フーリエ変換）回路６と、ＩＦＦＴ回路６の出力を連続データにつなぎ合わす窓関数オーバラップ回路７とを含んでいる。
【０００６】
また、このノイズキャンセラは、入力信号ａの信号（音声）成分の含有率を計算する音声含有率計算部９と、減衰器５に与える周波数領域毎の減衰量を計算する減衰量計算部１５とを含んで構成されている。
【０００７】
以下、音声以外の非雑音信号も、代表して音声あるいは音声信号と表現する。入力データａは窓関数回路３に供給され、連続して入力される信号列の中からＦＦＴ４のポイント数個のデータが切り出される。ＦＦＴ４の入力は時系列順序に並んでいる実時間領域データであり、ＦＦＴ４にて時間周波数解析処理され、ＦＦＴポイント数個の帯域の周波数領域データに変換されて出力ｃされる。
【０００８】
この周波数領域データに対して演算を行うために、減衰量計算部１５及び減衰器５は帯域数個分設けられている。周波数領域データは、減衰器５によって、減衰量計算部１５にて計算された減衰量に応じてその雑音成分が減衰され、音声成分は減衰されることなく、雑音成分だけが減衰された後、ＩＦＦＴ６において周波数領域データから実時間領域データに変換される。さらに、切り出されている状態のデータは、窓関数オーバラップ回路７によって、連続データにつなぎ合わされて出力ｂされる。
【０００９】
音声含有率計算部９は、入力信号ａの信号パワー（変動分）を監視する信号パワー監視器１０と、入力信号ａの定常パワー（定常分；一定時間以上の平均分）を監視する定常パワー監視器１１と、信号パワー監視器１０と定常パワー監視器１１との出力から、信号対雑音比を計算する信号対雑音比計算器１２と、聴覚特性に合わせて対数化する対数計算器１３と、音声含有率ｄなる尺度に変換する音声含有率変換器１４とを含んで構成されている。
【００１０】
音声含有率計算部９は、時間周波数解析を行う前の入力信号ａのパワー変動を常に監視し、入力信号ａの信号対雑音比を実時間領域信号のパワー変動速度から推定して、音声含有率ｄなる尺度に変換する。このとき、音声含有率計算部９は推定した信号対雑音比を聴覚特性に合わせて対数化し、聴感上の信号対雑音比と線形的に対応させる。その後に、音声含有率計算部９は、この対数値から入力信号ａ中に音声成分が含まれている比率を確率計算し、音声含有率ｄなる尺度に変換して出力する。
【００１１】
音声含有率計算部９の信号パワー監視器１０は、入力信号ａのパワー変動に対して高速に追従し、実時間領域信号の音声成分のパワーを推定する。定常パワー監視器１１は、入力信号ａのパワー変動に対して低速にて追従し、実時間領域信号の背景雑音成分のパワーを推定する。信号対雑音比計算器１２は、それらのパワー変動から信号対雑音比を計算し、対数計算器１３は、信号対雑音比計算器１２にて計算された信号対雑音比を対数化する。音声含有率変換器１４は、対数計算器１３にて対数化された信号対雑音比を音声含有率ｄに換算し、その音声含有率ｄを減衰量計算部１５の帯域別減衰量計算器２０へ出力する。
【００１２】
減衰量計算部１５は、周波数領域の信号パワーを監視する信号パワー監視器１６と、周波数領域の定常パワーを監視する定常パワー監視器１７と、信号パワー監視器１６と定常パワー監視器１７との出力から、信号対雑音比を計算する信号対雑音比計算器１８と、計算された信号対雑音比を対数化する対数計算器１９と、周波数領域（帯域）別の減衰量を計算する帯域別減衰量計算２０とを含んで構成されている。
【００１３】
減衰量計算部１５は、信号パワー監視器１６と定常パワー監視器１７とを用いて、すべての周波数領域データのパワー変動を監視する。減衰量計算部１５は、周波数領域におけるパワー変動の監視のみでは判断しがたいある帯域の微弱なパワー変動に対して、音声含有率ｄを音声成分と見なすか、雑音成分と見なすかの判断の一助として、減衰量を計算する。これによって、例えば、音声成分に含まれる倍音成分を損なうことなく、雑音成分の減衰が可能となる。すなわち、減衰量計算部１５は実時間信号の信号対雑音比が大きい場合に、音声成分が多く含まれている確率が高いため、微弱なパワー変動を倍音成分が含まれると判断する可能性が高い。
【００１４】
また、減衰量計算部１５は実時間信号の信号対雑音比が小さい場合に、音声成分が多く含まれている確率が低く、微弱なパワー変動を雑音成分であると判断する可能性が高い。信号パワー監視器１６は、周波数領域データｃのパワー変動に対して高速に追従し、周波数領域データｃの音声成分のパワーを推定する。定常パワー監視器１７は、周波数領域データｃのパワー変動に対して低速にて追従し、周波数領域データｃの背景雑音成分のパワーを推定する。
【００１５】
信号対雑音比計算器１８は、それらのパワー変動から信号対雑音比を計算し、対数計算器１９は、信号対雑音比計算器１８にて計算された信号対雑音比を対数化する。帯域別減衰量計算器２０は、この対数化された信号対雑音比を基に帯域別に減衰量を計算する。このとき、帯域別減衰量計算器２０は、音声含有率変換器１４からの音声含有率ｄを参照パラメータとして用いる。信号対雑音比計算器１２，１８は、信号パワーを雑音パワーにて割る除算動作を行う。その実現方法は、実際に除算計算を行う方法でも良いし、簡略的に、逆数値が記憶されている変換テーブルを参照しても良い。
【００１６】
また、対数計算器１３，１９の実現方法は、実際に対数計算を行う方法でも良いし、簡略的に、対数値が記憶されている変換テーブルを参照しても良い。帯域別減衰量計算器２０が、対数化された信号対雑音比を基に帯域別に減衰量を計算する際、音声含有率ｄを参照パラメータとして用いることによって正確な帯域別減衰量を設定でき、良好なノイズキャンセラの特性を実現できる。
【００１７】
【発明が解決しようとする課題】
特開２０００−４７６９７号公報記載のノイズキャンセラは、それなりに良好なノイズキャンセラの特性を実現しているが、より良いものを求める要求はますます強くなってきた。
【００１８】
一般に、入力信号に音声成分が存在しない時間（非音声区間）には、入力信号に含まれているのは殆どが背景雑音であると見なせるが、特開２０００−４７６９７号公報記載のノイズキャンセラにおいては、この非音声区間においても減衰量を帯域別に計算し常に更新している。従って、非音声区間においても、入力信号のスペクトルを短時間にて変化させて、雑音成分を除去する動作をする。
【００１９】
しかしながら、スペクトルを短時間にて刻々と変化させると、背景雑音の持つ音色が変化することになり、「ひゅるひゅる」、「しゅるしゅる」、「きゅるきゅる」といった人工的な音色を生じ、聴感上不自然になる場合がある問題がある。すなわち、音質の悪い携帯電話の様な感じの音色になってしまうことがある。従って、背景雑音を減衰させる処理においては、雑音パワーを減衰させる過程にて雑音の音色は変化させないことが聴感上望ましい。
【００２０】
本発明の目的は、背景雑音の音色変化を抑えて、通話品質の優れたノイズキャンセラを提供することである。
【００２１】
【課題を解決するための手段】
本発明によるノイズキャンセラは、入力時間領域信号の背景雑音を減衰させるノイズキャンセラであって、前記入力時間領域信号を複数の周波数領域データに変換する時間／周波数領域変換手段と、前記複数の周波数領域データに減衰制御信号にて減衰を与える減衰手段と、前記減衰手段の出力を時間領域信号に変換して出力する周波数／時間領域変換手段と、前記入力時間領域信号の音声含有率を算出する音声含有率算出手段と、前記複数の周波数領域データの信号対雑音比を算出する信号対雑音比算出手段と、前記音声含有率及び前記周波数領域データの信号対雑音比を基に前記減衰制御信号を導出する減衰制御信号導出手段と、前記減衰制御信号を平滑化する平滑手段と、前記音声含有率に応じて前記平滑手段をバイパス制御する制御手段とを含むことを特徴とする。
【００２２】
本発明による他のノイズキャンセラは、入力時間領域信号の背景雑音を減衰させるノイズキャンセラであって、前記入力時間領域信号を複数の周波数領域データに変換する時間／周波数領域変換手段と、前記複数の周波数領域データに減衰制御信号にて減衰を与える減衰手段と、前記減衰手段の出力を時間領域信号に変換して出力する周波数／時間領域変換手段と、前記入力時間領域信号の音声含有率を算出する音声含有率算出手段と、前記複数の周波数領域データの信号対雑音比を算出する信号対雑音比算出手段と、前記音声含有率及び前記周波数領域データの信号対雑音比を基に前記減衰制御信号を導出する減衰制御信号導出手段と、前記減衰制御信号を平滑化する平滑手段と、前記複数の周波数領域データからピッチを検出するピッチ検出手段と、前記ピッチ検出結果と前記音声含有率とに応じて前記平滑制御手段を制御する手段とを含むことを特徴とする。
【００２３】
本発明の作用は次の通りである。入力信号の時間領域の雑音を減衰させるためのノイズキャンセラにおいて、時間領域信号を周波数領域データに変換して、この周波数領域データにおいて、音声含有率に応じた減衰制御をなすことによりノイズキャンセルを行う場合、この音声含有率が所定値未満の場合には減衰制御信号を平滑化する。他の方法として、周波数領域データからピッチを検出して、このピッチと音声含有率とに応じて減衰制御信号の平滑制御をなすようにする。こうすることによって、背景雑音の音色変化を抑えるものである。
【００２４】
【発明の実施の形態】
以下に、本発明の実施例について図面を参照して説明する。図１は本発明によるノイズキャンセラの実施例の構成を示すブロック図であり、図２〜５と同等部分は同一符号にて示している。図１を参照すると、本発明によるノイズキャンセラは、入力（時間領域）信号ａからデータを切り出す窓関数回路３と、窓関数回路３の出力を周波数領域データに変換するＦＦＴ（Ｆａｓｔ　Ｆｏｕｒｉｅｒ　Ｔｒａｎｓｆｏｒｍ；高速フーリエ変換）回路４と、周波数領域毎に減衰を与える減衰器５と、周波数領域データを時系列（時間領域）データに変換するＩＦＦＴ（Ｉｎｖｅｒｓｅ　Ｆａｓｔ　Ｆｏｕｒｉｅｒ　Ｔｒａｎｓｆｏｒｍ；逆高速フーリエ変換）回路６とを含む。
【００２５】
さらに、ＩＦＦＴ回路６の出力を連続データにつなぎ合わす窓関数オーバラップ回路７と、入力信号ａの信号（音声）成分の含有率を計算する音声含有率計算部９と、減衰器５に与える周波数領域毎の減衰量を計算する減衰量計算部１５と、減衰器５に与える周波数領域毎の減衰量を平滑化する平滑器と１、音声含有率計算部９の出力ｄに、閾値（しきい値）を与えるしきい値回路２とを含んで構成される。
【００２６】
本発明の実施例の動作を説明する。なお、以下、音声以外の非雑音信号も代表して音声あるいは音声信号と表現する。
【００２７】
図１において、入力（時間領域）データａは窓関数回路３に供給され、連続して入力される信号列の中からＦＦＴ４のポイント数個のデータが切り出される。ＦＦＴ４の入力は時系列順序に並んでいる実時間領域データであり、ＦＦＴ４により、時間周波数解析処理されてＦＦＴポイント数個の帯域の周波数領域データに変換されて出力ｃされる。
【００２８】
この周波数領域データに対して演算を行うために、減衰量計算部１５及び減衰器５は帯域数個分設けられている。周波数領域データは減衰器５によって、減衰量計算部１５において計算された減衰量に応じてその雑音成分が減衰され、音声成分は減衰されることなく雑音成分だけが減衰された後、ＩＦＦＴ６にて周波数領域データから実時間領域データに変換される。さらに、切り出されている状態のデータは窓関数オーバラップ回路７によって、連続データにつなぎ合わされて出力ｂされる。
【００２９】
音声含有率計算部９は、入力信号ａの信号パワー（変動分）を監視する信号パワー監視器１０と、入力信号ａの定常パワー（定常分；一定時間以上の平均分）を監視する定常パワー監視器１１と、信号パワー監視器１０と定常パワー監視器１１との出力から信号対雑音比を計算する信号対雑音比計算器１２と、聴覚特性に合わせて対数化する対数計算器１３と、音声含有率ｄなる尺度に変換する音声含有率変換器１４とを含んで構成されている。
【００３０】
音声含有率計算部９は、時間周波数解析を行う前の入力信号ａのパワー変動を常に監視し、入力信号ａの信号対雑音比を実時間領域信号のパワー変動速度から推定して、音声含有率ｄなる尺度に変換する。このとき、音声含有率計算部９は推定した信号対雑音比を聴覚特性に合わせて対数化し、聴感上の信号対雑音比と線形的に対応させる。その後に、音声含有率計算部９は、この対数値から入力信号ａ中に音声成分が含まれている比率を確率計算し、音声含有率ｄなる尺度に変換して出力する。
【００３１】
音声含有率計算部９の信号パワー監視器１０は、入力信号ａのパワー変動に対して高速に追従し、実時間領域信号の音声成分のパワーを推定する。定常パワー監視器１１は、入力信号ａのパワー変動に対して低速にて追従し、実時間領域信号の背景雑音成分の（平均）パワーを推定する。信号対雑音比計算器１２は、それらのパワー変動から信号対雑音比を計算し、対数計算器１３は、信号対雑音比計算器１２にて計算された信号対雑音比を対数化する。音声含有率変換器１４は、対数計算器１３にて対数化された信号対雑音比を音声含有率ｄに換算し、その音声含有率ｄを減衰量計算部１５の帯域別減衰量計算器２０へ出力する。
【００３２】
減衰量計算部１５は、周波数領域の信号パワーを監視する信号パワー監視器１６と、周波数領域の定常パワーを監視する定常パワー監視器１７と、信号パワー監視器１６と定常パワー監視器１７との出力から信号対雑音比を計算する信号対雑音比計算器１８と、計算された信号対雑音比を対数化する対数計算器１９と、周波数領域（帯域）別の減衰量を計算する帯域別減衰量計算２０とを含んで構成されている。
【００３３】
減衰量計算部１５は、信号パワー監視器１６と定常パワー監視器１７とを用いて、すべての周波数領域データのパワー変動を監視する。減衰量計算部１５は、周波数領域におけるパワー変動の監視のみでは判断しがたいある帯域の微弱なパワー変動に対して、音声含有率ｄを音声成分と見なすか、雑音成分と見なすかの判断の一助として、減衰量を計算する。これによって、例えば音声成分に含まれる倍音成分を損なうことなく、雑音成分の減衰が可能となる。すなわち、減衰量計算部１５は実時間信号の信号対雑音比が大きい場合に、音声成分が多く含まれている確率が高いため、微弱なパワー変動を倍音成分が含まれると判断する可能性が高い。
【００３４】
また、減衰量計算部１５は実時間信号の信号対雑音比が小さい場合に、音声成分が多く含まれている確率が低く、微弱なパワー変動を雑音成分であると判断する可能性が高い。信号パワー監視器１６は、周波数領域データｃのパワー変動に対して高速に追従し、周波数領域データｃの音声成分のパワーを推定する。定常パワー監視器１７は、周波数領域データｃのパワー変動に対して低速にて追従し、周波数領域データｃの背景雑音成分の（平均）パワーを推定する。
【００３５】
信号対雑音比計算器１８は、それらのパワー変動から信号対雑音比を計算し、対数計算器１９は、信号対雑音比計算器１８にて計算された信号対雑音比を対数化する。帯域別減衰量計算器２０は、この対数化された信号対雑音比を基に帯域別に減衰量を計算する。このとき、帯域別減衰量計算器２０は、音声含有率変換器１４からの音声含有率ｄを参照パラメータとして用いる。
【００３６】
信号対雑音比計算器１２，１８は、信号パワーを雑音パワーにて割る除算動作を行う。その実現方法は、実際に除算計算を行う方法でも良いし、簡略的に、逆数値が記憶されている変換テーブルを参照しても良い。また、対数計算器１３，１９の実現方法は、実際に対数計算を行う方法でも良いし、簡略的に、対数値が記憶されている変換テーブルを参照しても良い。
【００３７】
帯域別減衰量計算器２０の出力と減衰器５との間に平滑器１を挿入する。これによって、非音声区間においては減衰量の時間変動を平滑化して、減衰量が急峻に変化しないようにする。また、音声含有率変換器１４の出力ｄをしきい値回路２に供給し、その出力を平滑器１に供給する。しきい値回路２は、閾値（しきい値）以上の値が入力された時には平滑器１の平滑動作を停止（バイパス）させ、閾値未満の値が入力された時には、音声成分が存在しない（背景雑音のみ）として、平滑器１を平滑動作させる働きをする。平滑器１は、信号を滑らかにする働きをする一種のローパスフィルタ（積分器）であり、この様な動作をするものであればどんな回路でも良い。
【００３８】
例えば、図３に示すリーク積分器と呼ばれるものが広く知られている。ここでは平滑器１としてリーク積分器を用いたものとして説明する。図３において、リーク積分器は、１サンプル遅延回路３３、乗算（一種の増幅）器３１，３４、加算器３２にて構成され、乗算器３１，３４の乗算係数を用途に合わせて適当に選ぶことにより、適当な積分特性を実現できる。また、この積分特性を利用して信号の平滑化を行い、平滑化回路あるいは簡略的なパワー計算回路として広く用いられる。
【００３９】
本発明の他の実施例について図２を参照して説明する。図２において、本発明によるノイズキャンセラは、ＶＡＤ（Ｖｏｉｃｅ　Ａｃｔｉｖａｔｅ　Ｄｅｔｅｃｔｏｒ：　有音検出器）　８を有し、音声区間と非音声区間とにて減衰量の変化速度を変化させる。音声を所望入力信号とする場合、音声は概ね周期的な信号であり、自己相関を用いて音声の基本周期（ピッチ）を検出することができる。このピッチという情報は、音声区間と非音声区間との判別に用いることができる。この判別機能（装置）をＶＡＤと呼ぶ。
【００４０】
図２において、本発明によるノイズキャンセラは、図１に比べ、ＦＦＴ４の出力ｃを新たに設けたＶＡＤ８に供給し、その出力を平滑器１に供給する。ＶＡＤ８の詳細を図４に示す。図４を参照すると、ＶＡＤ８は、ＦＦＴ４の出力ｃ（周波数領域信号）から自己相関を利用してピッチを検出するピッチ検出器２１と、ピッチの変動を判定するピッチ変動判定器２２とを有している。また、ピッチ変動判定器２２は、図１に示されるのと同一（機能）のしきい値回路２と、ピッチの変動を判定するピッチ変動判定器２３と、ピッチの範囲を判定するピッチ範囲判定器２４と、論理積をとるＡＮＤ回路２５とを有している。
【００４１】
一般に、ＶＡＤの実現方法には様々なものが提案されているが、ここでは、例えば、しきい値回路２において、入力ｄの時間的パワー変動の緩急を監視する（急激な変化は音声、緩慢な変化は雑音）。さらに、ピッチ変動判定器２３において、ピッチの連続性を検証する（音声のピッチの時間変動は連続である）。さらにはまた、ピッチ範囲判定器２４において、ピッチの適正検証をする（音声のピッチは或る範囲内にしか存在しない）。
【００４２】
これらしきい値回路２、ピッチ変動判定器２３、ピッチ範囲判定器２４の出力のすべてにおいて、「入力は音声である」と見なした場合に、（論理積２５をとって）音声区間と判別するものとする。なお、入力の時間的パワー変動の緩急を監視する部分は音声含有率変換器１４の出力ｄを利用する。また、ＶＡＤ８が音声区間と判別した時に、平滑器１の平滑動作を停止（バイパス）させ、ＶＡＤ８が非音声区間と判別した時に、平滑器１を平滑動作させる。
【００４３】
【発明の効果】
以上説明したように本発明によれば、信号を減衰させるための制御信号に対する平滑器を設けることによって、雑音成分が多いと思われる区間、すなわち非音声区間では減衰量の変化が緩やかになるという効果がある。従って、非音声区間のノイズキャンセル後のスペクトルも緩やかに変化するため、会話の合間に発生する非音声区間においても、装置使用者に不自然な感じを与えない。
【００４４】
また、ＶＡＤを設けることによって、人間の音声特有の特性を利用して音声区間と非音声区間との判別基準に用い、判別精度を高めることができるという効果がある。すなわち、非音声区間にての減衰量の変化が緩やかになる様に制御して、ノイズキャンセル後のスペクトル変化の低減効果がさらに高められ、所期の目的を達成することができる。
【００４５】
さらに、入力はＦＦＴによって周波数領域に変換されており、ＶＡＤの構成要素である自己相関処理を周波数領域にて実施することによって、ＦＦＴ前の時間領域の信号に対して自己相関処理を行うよりも、はるかに処理量を軽減できるという効果もある。
【図面の簡単な説明】
【図１】本発明の実施例のブロック図である。
【図２】本発明の他の実施例のブロック図である。
【図３】リーク積分器のブロック図である。
【図４】ＶＡＤのブロック図である。
【図５】従来のノイズキャンセラのブロック図である。
【符号の説明】
１　平滑器
２　しきい値回路
３　窓関数回路
４　ＦＦＴ
５　減衰器
６　ＩＦＦＴ
７　窓関数オーバラップ回路
９　音声含有率計算部
１０，１６　信号パワー監視器
１１，１７　定常パワー監視器
１２，１８　信号対雑音比計算器
１３，１９　対数計算器
１４　音声含有率変換器
１５　減衰量計算部
２０　帯域別減衰量計算器[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a noise canceller, and more particularly to a noise canceller suitable for use in a device that handles an audio signal as an input signal, such as a stationary telephone, a portable telephone, and a voice conference device.
[0002]
[Prior art]
A device that attenuates a noise component from a signal containing noise and extracts a signal such as a desired sound is called a noise canceller. Generally, background noise is white noise, and when viewed as power, there is little temporal variation in all frequency regions. On the other hand, the signal component naturally has a temporal power fluctuation. In particular, when the signal component is an audio signal, only momentary sound is a sound of one frequency and its harmonics. Therefore, the noise canceller can be configured relatively easily.
[0003]
However, in recent years, in a stationary telephone, a portable telephone, and the like, not only an audio signal but also a lot of signals other than an audio signal have been included as signal components, and therefore, the configuration of the noise canceller has been fundamentally changed. There is a need to review it. As one of the methods, a spectrum (spectral power) is obtained by performing frequency analysis on a signal containing noise, a time variation is monitored for each frequency band, and a signal-to-noise ratio is obtained for each band. / N) There is a method of calculating the amount of attenuation according to the ratio and attenuating the noise component by attenuating each band.
[0004]
That is, a noise canceller has been proposed that essentially changes the spectral component of the input signal momentarily to remove noise components contained in the input signal. In Japanese Patent Application Laid-Open No. 2000-47697, the amount of attenuation is controlled for each frequency band, and the signal-to-noise ratio of the input signal estimated in the time domain is considered as the signal (speech) content, and There has been proposed a noise canceller that calculates the amount of attenuation in consideration of the signal-to-noise ratio for each band when calculating the amount of attenuation.
[0005]
FIG. 5 shows a block diagram of a noise canceller disclosed in Japanese Patent Application Laid-Open No. 2000-47697. Referring to FIG. 5, the noise canceller includes a window function circuit 3 for extracting data from an input signal a, and an FFT (Fast Fourier Transform) circuit 4 for converting an output of the window function circuit 3 into frequency domain data. An attenuator 5 for attenuating each frequency domain, an IFFT (Inverse Fast Fourier Transform) circuit 6 for converting frequency domain data into time series data, and an output of the IFFT circuit 6 connected to continuous data. And a window function overlapping circuit 7 for matching.
[0006]
The noise canceller includes a voice content calculator 9 for calculating the content of the signal (voice) component of the input signal a, and an attenuation calculator 15 for calculating the attenuation provided to the attenuator 5 for each frequency domain. It is comprised including.
[0007]
Hereinafter, non-noise signals other than voice are also represented as voice or voice signal. The input data a is supplied to the window function circuit 3, and data of several points of the FFT 4 is cut out from a signal sequence that is continuously input. The input of the FFT 4 is real-time domain data arranged in a time-series order, is subjected to time-frequency analysis processing in the FFT 4, is converted into frequency domain data of several FFT point bands, and is output c.
[0008]
In order to perform an operation on the frequency domain data, the attenuation calculator 15 and the attenuator 5 are provided for several bands. In the frequency domain data, the noise component is attenuated by the attenuator 5 in accordance with the attenuation calculated by the attenuation calculator 15, and only the noise component is attenuated without attenuating the voice component. In the IFFT 6, the frequency domain data is converted into real time domain data. Further, the data in the cut-out state is connected to continuous data by the window function overlap circuit 7 and output b.
[0009]
The voice content calculating unit 9 includes a signal power monitor 10 for monitoring the signal power (variation) of the input signal a and a steady power for monitoring the steady power (steady; average over a certain time) of the input signal a. A monitor 11, a signal-to-noise ratio calculator 12 for calculating a signal-to-noise ratio from outputs of the signal power monitor 10 and the steady-state power monitor 11, and a logarithmic calculator 13 for performing logarithm according to auditory characteristics. , And a voice content rate converter 14 that converts the voice content rate into a scale of d.
[0010]
The voice content calculating unit 9 constantly monitors the power fluctuation of the input signal a before performing the time-frequency analysis, estimates the signal-to-noise ratio of the input signal a from the power fluctuation speed of the real-time domain signal, and It is converted into a measure of the rate d. At this time, the voice content calculating unit 9 converts the estimated signal-to-noise ratio into a logarithm in accordance with the auditory characteristics, and makes the signal-to-noise ratio linearly correspond to the audible signal-to-noise ratio. Thereafter, the voice content calculating unit 9 calculates the probability that the voice signal is included in the input signal a from the logarithmic value, converts the probability to the scale of the voice content d, and outputs the scale.
[0011]
The signal power monitor 10 of the voice content calculating unit 9 follows the power fluctuation of the input signal a at high speed and estimates the power of the voice component of the real-time domain signal. The stationary power monitor 11 follows the power fluctuation of the input signal a at a low speed and estimates the power of the background noise component of the real time domain signal. The signal-to-noise ratio calculator 12 calculates a signal-to-noise ratio from the power fluctuations, and the logarithmic calculator 13 converts the signal-to-noise ratio calculated by the signal-to-noise ratio calculator 12 into a logarithm. The voice content rate converter 14 converts the signal-to-noise ratio logarithmized by the logarithmic calculator 13 into a voice content rate d, and converts the voice content rate d into a band-specific attenuation calculator 20 of the attenuation calculator 15. Output to
[0012]
The attenuation calculator 15 includes a signal power monitor 16 for monitoring the signal power in the frequency domain, a steady power monitor 17 for monitoring the steady power in the frequency domain, and a signal power monitor 16 and the steady power monitor 17. From the output, a signal-to-noise ratio calculator 18 for calculating a signal-to-noise ratio, a logarithmic calculator 19 for logarithmizing the calculated signal-to-noise ratio, and a band for calculating an attenuation amount for each frequency domain (band) And an attenuation calculation 20.
[0013]
The attenuation calculator 15 monitors the power fluctuation of all the frequency domain data using the signal power monitor 16 and the steady power monitor 17. The attenuation amount calculation unit 15 determines whether to consider the audio content rate d as an audio component or a noise component for a weak power variation in a certain band, which cannot be determined only by monitoring the power variation in the frequency domain. To help, calculate the attenuation. Thus, for example, it is possible to attenuate the noise component without damaging the overtone component included in the audio component. That is, when the signal-to-noise ratio of the real-time signal is large, the attenuation amount calculation unit 15 has a high possibility that a large amount of voice component is included. Therefore, there is a possibility that the weak power fluctuation is determined to include a harmonic component. high.
[0014]
Further, when the signal-to-noise ratio of the real-time signal is small, the attenuation amount calculation unit 15 has a low probability that many audio components are included, and it is highly likely that a weak power fluctuation is determined to be a noise component. The signal power monitor 16 quickly follows the power fluctuation of the frequency domain data c and estimates the power of the audio component of the frequency domain data c. The stationary power monitor 17 follows the power fluctuation of the frequency domain data c at a low speed and estimates the power of the background noise component of the frequency domain data c.
[0015]
The signal-to-noise ratio calculator 18 calculates the signal-to-noise ratio from the power fluctuations, and the logarithmic calculator 19 converts the signal-to-noise ratio calculated by the signal-to-noise ratio calculator 18 into a logarithm. The band-specific attenuation calculator 20 calculates the attenuation for each band based on the logarithmic signal-to-noise ratio. At this time, the band-specific attenuation calculator 20 uses the voice content d from the voice content converter 14 as a reference parameter. The signal-to-noise ratio calculators 12 and 18 perform a division operation of dividing the signal power by the noise power. The realization method may be a method of actually performing a division calculation, or may simply refer to a conversion table storing reciprocal values.
[0016]
As a method of realizing the logarithmic calculators 13 and 19, a method of actually performing logarithmic calculation may be used, or a conversion table storing logarithmic values may be simply referred to. When the band-specific attenuation calculator 20 calculates the attenuation for each band based on the logarithmic signal-to-noise ratio, the accurate band-specific attenuation can be set by using the voice content d as a reference parameter, Good noise canceller characteristics can be realized.
[0017]
[Problems to be solved by the invention]
The noise canceller described in Japanese Patent Application Laid-Open No. 2000-47697 realizes a reasonably good noise canceller characteristic, but the demand for a better noise canceller has been increasingly strong.
[0018]
Generally, during a time when no audio component exists in the input signal (non-speech section), most of the input signal can be considered to be background noise. However, in the noise canceller described in JP-A-2000-47697, Also in this non-voice section, the attenuation is calculated for each band and constantly updated. Therefore, even in the non-voice section, the operation of changing the spectrum of the input signal in a short time to remove the noise component is performed.
[0019]
However, if the spectrum is changed every moment in a short period of time, the timbre of the background noise will change, and artificial sounds such as "hyperhyuru", "syurusyuru", and "kyarikyaru" will occur. There is a problem that an unnatural tone is generated and the sound may be unnatural. That is, the tone may be like a mobile phone having poor sound quality. Therefore, in the process of attenuating the background noise, it is desirable from the viewpoint of hearing that the timbre of the noise is not changed in the process of attenuating the noise power.
[0020]
SUMMARY OF THE INVENTION An object of the present invention is to provide a noise canceller which suppresses a change in timbre of background noise and has excellent speech quality.
[0021]
[Means for Solving the Problems]
A noise canceller according to the present invention is a noise canceller that attenuates background noise of an input time-domain signal, and includes a time / frequency domain conversion unit that converts the input time-domain signal into a plurality of frequency domain data; Attenuating means for attenuating with an attenuation control signal, frequency / time domain converting means for converting an output of the attenuating means into a time domain signal and outputting the same, and a voice content rate for calculating a voice content rate of the input time domain signal Calculating means, signal-to-noise ratio calculating means for calculating a signal-to-noise ratio of the plurality of frequency domain data, and deriving the attenuation control signal based on the voice content and the signal-to-noise ratio of the frequency domain data. Attenuation control signal deriving means, smoothing means for smoothing the attenuation control signal, and control for bypass-controlling the smoothing means according to the audio content. Characterized in that it comprises a stage.
[0022]
Another noise canceller according to the present invention is a noise canceller that attenuates background noise of an input time domain signal, wherein the time / frequency domain conversion means converts the input time domain signal into a plurality of frequency domain data; Attenuating means for attenuating data with an attenuation control signal, frequency / time domain converting means for converting the output of the attenuating means into a time domain signal and outputting the same, and voice for calculating the voice content of the input time domain signal Content ratio calculation means, signal-to-noise ratio calculation means for calculating a signal-to-noise ratio of the plurality of frequency domain data, and the attenuation control signal based on the voice content rate and the signal-to-noise ratio of the frequency domain data. Attenuation control signal deriving means for deriving, a smoothing means for smoothing the attenuation control signal, and a pitch for detecting a pitch from the plurality of frequency domain data Means out, characterized in that it comprises a means for controlling the smoothing control means in response to said pitch detection result and the speech content.
[0023]
The operation of the present invention is as follows. When noise is canceled by converting a time-domain signal into frequency-domain data in a noise canceller for attenuating time-domain noise of an input signal and performing attenuation control in accordance with a voice content rate in the frequency-domain data. If the voice content is less than a predetermined value, the attenuation control signal is smoothed. As another method, a pitch is detected from the frequency domain data, and smoothing control of the attenuation control signal is performed according to the pitch and the voice content. By doing so, the timbre change of the background noise is suppressed.
[0024]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a configuration of an embodiment of a noise canceller according to the present invention, and portions equivalent to those in FIGS. Referring to FIG. 1, a noise canceller according to the present invention includes a window function circuit 3 for extracting data from an input (time domain) signal a, and an FFT (Fast Fourier Transform; fast Fourier transform) for converting an output of the window function circuit 3 into frequency domain data. A conversion circuit 4, an attenuator 5 for attenuating each frequency domain, and an IFFT (Inverse Fast Fourier Transform) circuit 6 for converting frequency domain data into time-series (time domain) data.
[0025]
Further, a window function overlap circuit 7 for connecting the output of the IFFT circuit 6 to continuous data, a voice content calculation unit 9 for calculating the content of the signal (voice) component of the input signal a, and a frequency to be given to the attenuator 5 An attenuation calculator 15 for calculating the attenuation for each region, a smoother for smoothing the attenuation for each frequency region given to the attenuator 5, and an output d of the voice content calculator 9 are provided with a threshold (threshold). And a threshold value circuit 2 for giving a value).
[0026]
The operation of the embodiment of the present invention will be described. Hereinafter, non-noise signals other than voice are also represented as voice or voice signal.
[0027]
In FIG. 1, input (time domain) data a is supplied to a window function circuit 3, and data of several points of the FFT 4 are cut out from a continuously input signal sequence. The input of the FFT 4 is real-time domain data arranged in a time series order. The FFT 4 performs time-frequency analysis processing, converts the data into frequency domain data of several FFT points, and outputs the data.
[0028]
In order to perform an operation on the frequency domain data, the attenuation calculator 15 and the attenuator 5 are provided for several bands. The noise component of the frequency domain data is attenuated by the attenuator 5 in accordance with the amount of attenuation calculated by the attenuation amount calculator 15, and only the noise component is attenuated without attenuating the voice component. It is converted from frequency domain data to real time domain data. Further, the cut-out data is connected to continuous data by the window function overlap circuit 7 and output as b.
[0029]
The voice content calculating unit 9 includes a signal power monitor 10 for monitoring the signal power (variation) of the input signal a and a steady power for monitoring the steady power (steady; average over a certain time) of the input signal a. A monitor 11, a signal-to-noise ratio calculator 12 for calculating a signal-to-noise ratio from outputs of the signal power monitor 10 and the steady-state power monitor 11, and a logarithmic calculator 13 for performing logarithm according to auditory characteristics, And a voice content converter 14 for converting the voice content into a scale of voice content d.
[0030]
The voice content calculating unit 9 constantly monitors the power fluctuation of the input signal a before performing the time-frequency analysis, estimates the signal-to-noise ratio of the input signal a from the power fluctuation speed of the real-time domain signal, and It is converted into a measure of the rate d. At this time, the voice content calculating unit 9 converts the estimated signal-to-noise ratio into a logarithm in accordance with the auditory characteristics, and makes the signal-to-noise ratio linearly correspond to the audible signal-to-noise ratio. Thereafter, the voice content calculating unit 9 calculates the probability that the voice signal is included in the input signal a from the logarithmic value, converts the probability to the scale of the voice content d, and outputs the scale.
[0031]
The signal power monitor 10 of the voice content calculating unit 9 follows the power fluctuation of the input signal a at high speed and estimates the power of the voice component of the real-time domain signal. The stationary power monitor 11 follows the power fluctuation of the input signal a at a low speed and estimates the (average) power of the background noise component of the real time domain signal. The signal-to-noise ratio calculator 12 calculates a signal-to-noise ratio from the power fluctuations, and the logarithmic calculator 13 converts the signal-to-noise ratio calculated by the signal-to-noise ratio calculator 12 into a logarithm. The voice content rate converter 14 converts the signal-to-noise ratio logarithmized by the logarithmic calculator 13 into a voice content rate d, and converts the voice content rate d into a band-specific attenuation calculator 20 of the attenuation calculator 15. Output to
[0032]
The attenuation calculator 15 includes a signal power monitor 16 for monitoring the signal power in the frequency domain, a steady power monitor 17 for monitoring the steady power in the frequency domain, and a signal power monitor 16 and the steady power monitor 17. A signal-to-noise ratio calculator 18 for calculating a signal-to-noise ratio from an output, a logarithmic calculator 19 for logarithmizing the calculated signal-to-noise ratio, and a band-specific attenuation for calculating an attenuation amount for each frequency domain (band). And a quantity calculation 20.
[0033]
The attenuation calculator 15 monitors the power fluctuation of all the frequency domain data using the signal power monitor 16 and the steady power monitor 17. The attenuation amount calculation unit 15 determines whether to consider the audio content rate d as an audio component or a noise component for a weak power variation in a certain band, which cannot be determined only by monitoring the power variation in the frequency domain. To help, calculate the attenuation. This makes it possible to attenuate noise components, for example, without damaging the overtone components included in the audio components. That is, when the signal-to-noise ratio of the real-time signal is large, the attenuation amount calculation unit 15 has a high possibility that a large amount of voice component is included. Therefore, there is a possibility that the weak power fluctuation is determined to include a harmonic component. high.
[0034]
Further, when the signal-to-noise ratio of the real-time signal is small, the attenuation amount calculation unit 15 has a low probability that many audio components are included, and it is highly likely that a weak power fluctuation is determined to be a noise component. The signal power monitor 16 quickly follows the power fluctuation of the frequency domain data c and estimates the power of the audio component of the frequency domain data c. The stationary power monitor 17 follows the power fluctuation of the frequency domain data c at a low speed and estimates the (average) power of the background noise component of the frequency domain data c.
[0035]
The signal-to-noise ratio calculator 18 calculates the signal-to-noise ratio from the power fluctuations, and the logarithmic calculator 19 converts the signal-to-noise ratio calculated by the signal-to-noise ratio calculator 18 into a logarithm. The band-specific attenuation calculator 20 calculates the attenuation for each band based on the logarithmic signal-to-noise ratio. At this time, the band-specific attenuation calculator 20 uses the voice content d from the voice content converter 14 as a reference parameter.
[0036]
The signal-to-noise ratio calculators 12 and 18 perform a division operation of dividing the signal power by the noise power. The realization method may be a method of actually performing a division calculation, or may simply refer to a conversion table storing reciprocal values. As a method of realizing the logarithmic calculators 13 and 19, a method of actually performing logarithmic calculation may be used, or a conversion table storing logarithmic values may be simply referred to.
[0037]
The smoother 1 is inserted between the output of the band-specific attenuation calculator 20 and the attenuator 5. As a result, the time variation of the attenuation is smoothed in the non-voice section, so that the attenuation does not change sharply. The output d of the voice content converter 14 is supplied to the threshold circuit 2, and the output is supplied to the smoother 1. The threshold circuit 2 stops (bypasses) the smoothing operation of the smoother 1 when a value equal to or more than the threshold (threshold) is input, and when a value less than the threshold is input, there is no voice component ( The function of smoothing the smoothing device 1 is performed as background noise only). The smoother 1 is a kind of low-pass filter (integrator) that functions to smooth a signal, and any circuit that performs such an operation may be used.
[0038]
For example, a leak integrator shown in FIG. 3 is widely known. Here, a description will be given assuming that a leak integrator is used as the smoother 1. In FIG. 3, the leak integrator includes a one-sample delay circuit 33, multiplying (a kind of amplifying) units 31 and 34, and an adder 32, and appropriately selects a multiplication coefficient of the multipliers 31 and 34 according to the application. Thereby, an appropriate integration characteristic can be realized. In addition, the signal is smoothed using this integration characteristic, and is widely used as a smoothing circuit or a simple power calculation circuit.
[0039]
Another embodiment of the present invention will be described with reference to FIG. In FIG. 2, the noise canceller according to the present invention has a VAD (Voice Activate Detector: voiced detector) 8, and changes the rate of change of the amount of attenuation between a voice section and a non-voice section. When speech is used as a desired input signal, the speech is a substantially periodic signal, and the fundamental period (pitch) of the speech can be detected using autocorrelation. The information of the pitch can be used for discriminating between a voice section and a non-voice section. This determination function (apparatus) is called VAD.
[0040]
2, the noise canceller according to the present invention supplies the output c of the FFT 4 to the newly provided VAD 8 and supplies the output to the smoother 1 as compared with FIG. FIG. 4 shows the details of VAD8. Referring to FIG. 4, the VAD 8 includes a pitch detector 21 for detecting a pitch from an output c (frequency domain signal) of the FFT 4 using autocorrelation, and a pitch variation determiner 22 for determining a variation in pitch. ing. The pitch variation determiner 22 includes a threshold circuit 2 having the same (function) as that shown in FIG. 1, a pitch variation determiner 23 for determining a pitch variation, and a pitch range determination for determining a pitch range. And an AND circuit 25 for calculating a logical product.
[0041]
In general, various methods of realizing VAD have been proposed. In this case, for example, the threshold circuit 2 monitors the fluctuation of the temporal power fluctuation of the input d (a sudden change is caused by voice, A big change is noise). Further, the pitch variation determination unit 23 verifies the continuity of the pitch (the time variation of the voice pitch is continuous). Furthermore, the pitch range determination unit 24 verifies the properness of the pitch (the voice pitch exists only within a certain range).
[0042]
In all of the outputs of the threshold circuit 2, the pitch fluctuation determiner 23, and the pitch range determiner 24, when it is considered that the input is a voice, the voice section is determined (using the logical product 25). It shall be. The output d of the voice content converter 14 is used for monitoring the fluctuation of the input temporal power fluctuation. In addition, when the VAD 8 is determined to be a voice section, the smoothing operation of the smoothing device 1 is stopped (bypass), and when the VAD 8 is determined to be a non-voice section, the smoothing operation is performed.
[0043]
【The invention's effect】
As described above, according to the present invention, by providing a smoother for a control signal for attenuating a signal, a change in the amount of attenuation is gradual in a section where noise components are considered to be large, that is, in a non-voice section. effective. Therefore, the spectrum of the non-voice section after noise cancellation also changes gradually, so that the user of the apparatus does not feel unnatural even in the non-voice section generated between conversations.
[0044]
Further, by providing the VAD, there is an effect that it is possible to use the characteristic peculiar to human voice as a criterion for distinguishing between a voice section and a non-voice section, thereby improving the accuracy of the determination. That is, control is performed so that the change of the attenuation amount in the non-voice section becomes gentle, the effect of reducing the spectrum change after the noise cancellation is further enhanced, and the intended purpose can be achieved.
[0045]
Further, the input is converted into the frequency domain by the FFT, and by performing the autocorrelation processing which is a component of the VAD in the frequency domain, the autocorrelation processing is performed on the signal in the time domain before the FFT. Also, there is an effect that the processing amount can be greatly reduced.
[Brief description of the drawings]
FIG. 1 is a block diagram of an embodiment of the present invention.
FIG. 2 is a block diagram of another embodiment of the present invention.
FIG. 3 is a block diagram of a leak integrator.
FIG. 4 is a block diagram of a VAD.
FIG. 5 is a block diagram of a conventional noise canceller.
[Explanation of symbols]
1 smoother 2 threshold circuit 3 window function circuit 4 FFT
5 Attenuator 6 IFFT
7 Window Function Overlap Circuit 9 Speech Content Calculation Unit 10, 16 Signal Power Monitor 11, 17 Steady State Power Monitor 12, 18 Signal-to-Noise Ratio Calculator 13, 19 Logarithmic Calculator 14 Speech Content Converter 15 Attenuation Calculator 20 Attenuation calculator for each band

Claims

A noise canceller for attenuating background noise of an input time domain signal, a time / frequency domain conversion means for converting the input time domain signal into a plurality of frequency domain data, and attenuating the plurality of frequency domain data with an attenuation control signal. A frequency / time domain conversion means for converting an output of the attenuation means into a time domain signal and outputting the converted signal; a voice content calculating means for calculating a voice content of the input time domain signal; Signal-to-noise ratio calculation means for calculating the signal-to-noise ratio of the frequency domain data, and attenuation control signal derivation means for deriving the attenuation control signal based on the voice content and the signal-to-noise ratio of the frequency domain data. A noise control means for smoothing the attenuation control signal, and control means for bypass-controlling the smoothing means in accordance with the audio content. Canceller.

A noise canceller for attenuating background noise of an input time domain signal, a time / frequency domain conversion means for converting the input time domain signal into a plurality of frequency domain data, and attenuating the plurality of frequency domain data with an attenuation control signal. A frequency / time domain conversion means for converting an output of the attenuation means into a time domain signal and outputting the converted signal; a voice content calculating means for calculating a voice content of the input time domain signal; Signal-to-noise ratio calculation means for calculating the signal-to-noise ratio of the frequency domain data, and attenuation control signal derivation means for deriving the attenuation control signal based on the voice content and the signal-to-noise ratio of the frequency domain data. A smoothing means for smoothing the attenuation control signal; a pitch detection means for detecting a pitch from the plurality of frequency domain data; Noise canceller characterized in that it comprises a means for controlling the smoothing control means in response to the voice content.

3. The noise canceller according to claim 1, wherein the input time domain signal includes a voice signal.

The noise canceller according to claim 1, wherein the voice content is calculated based on a signal-to-noise ratio of the input time-domain signal.

The voice content calculating means and the signal-to-noise ratio calculating means calculate a signal-to-noise ratio by dividing a signal power by the averaged signal power. Noise canceller.