JP3965543B2 - Communication device

Info

Publication number: JP3965543B2
Application number: JP378099A
Authority: JP
Inventors: 進千田
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1999-01-11
Filing date: 1999-01-11
Publication date: 2007-08-29
Anticipated expiration: 2019-01-11
Also published as: JP2000209309A

Description

[0001]
[Technical Field of the Invention]
The present invention relates to a communication device, and in particular to a communication device in which a reference for separating noise from voice is easy to set and the set reference is easy to understand.
[0002]
[Background]
In recent years, telephone devices with a voice recognition function have been developed that, when a call arrives, can automatically close the line in response to a spoken reply without the handset being picked up.
[0003]
In many such telephone devices, an expected noise level is set in advance in order to separate noise from voice; when sound exceeding this noise level is input from the microphone, the device judges that a voice response has been given and automatically closes the line.
[0004]
In some of these devices, the user can set the sound level that serves as the reference for separating noise from voice.
[0005]
[Problems to be solved by the invention]
Conventionally, however, the sound level is set as internal processing of the device in accordance with the user's setting operation. It was therefore difficult for the user to tell what sound level had actually been set. It was also difficult for the user to tell what sound level would need to be set to separate noise from voice.
[0006]
The present invention has been made in view of the above problems, and an object of the present invention is to provide a communication device that makes it easy to set a reference for separating noise and speech and makes the set reference easy to understand.
[0007]
[Means for Solving the Problems]
In order to achieve the above object, the communication device according to claim 1 comprises: voice recognition means that measures the sound level of external sound input through a microphone and recognizes, from that external sound, external sound exceeding a predetermined sound level; automatic closing means that automatically closes the line based on the recognition result of the voice recognition means while a call signal is being received from the line; switching means for switching to a level setting mode for setting the sound level that serves as the reference for voice recognition in the voice recognition means; sound level setting means that sets the sound level based on manual input when the device has been switched to the level setting mode; and level display means that visibly displays the sound level of the external sound input through the microphone and the sound level set by the sound level setting means.
[0008]
According to the communication device of claim 1, the level display means visibly displays both the sound level of the external sound and the sound level being set, so the two can be compared visually. The user can therefore see immediately what sound level must be set to separate noise from voice, which makes the setting operation with the sound level setting means easier. The set sound level also becomes easy for the user to understand.
[0009]
In particular, when the user speaks into the microphone, the sound level of that voice is also displayed as the sound level of external sound, so the user can visually compare the level of his or her own voice with the level of the ambient noise. The sound level that serves as the reference for voice recognition can therefore easily be set to an appropriate level, for example midway between the sound level of the noise and the sound level of the user's own voice.
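As an illustration only, the idea of choosing a reference level midway between the noise level and the user's voice level can be sketched in Python. The function name `suggest_reference_level` and the use of integer division are assumptions made for this sketch; the description gives the midpoint only as one example of an appropriate level.

```python
def suggest_reference_level(noise_level: int, voice_level: int) -> int:
    """Return a level midway between the noise level and the voice level.

    Hypothetical helper: the patent names the midpoint only as an example
    of an appropriate setting and does not prescribe a formula.
    """
    if voice_level <= noise_level:
        raise ValueError("voice level must be higher than the noise level")
    # Integer midpoint, rounded down, so the result stays on the step scale.
    return (noise_level + voice_level) // 2


print(suggest_reference_level(2, 8))
```

With the levels used in the figures of the embodiment (noise level 2, input level 8), the sketch yields 5.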
[0010]
The communication device according to claim 2 is the communication device according to claim 1, wherein the level display means displays, side by side, the sound level of the external sound input through the microphone and the sound level set by the sound level setting means.
[0011]
According to the communication device of claim 2, the sound level of the external sound and the sound level being set are displayed side by side, so the two can be compared still more easily and reliably. In addition, the sound level serving as the reference for voice recognition can be set while the sound level of the external sound is checked by eye (in this case, one display is shared for the sound level of the noise and the sound level of the user's input voice). It is therefore possible, while keeping the configuration of the display means simple, to obtain a communication device in which the sound level serving as the reference for separating noise from voice is still easier to set and in which it is easy to see whether the set sound level is higher or lower than the sound level of the external sound.
[0012]
The communication device according to claim 3 is the communication device according to claim 1 or claim 2, wherein the external sound includes a first external sound input through the microphone and a second external sound that is input through the microphone after the first external sound and has a sound level higher than that of the first external sound, and the level display means displays the sound level of the first external sound and the sound level of the second external sound side by side as the sound level of the external sound.
[0013]
According to the communication device of claim 3, the first external sound and the second external sound, whose level is higher than that of the first, are displayed side by side on the level display means, so the sound level of the ambient noise and the sound level of the user's voice can be compared side by side. This comparison shows at a glance what sound level should be set as the reference, which makes the sound level setting operation easier still.
[0014]
[Embodiments of the Invention]
Next, the communication apparatus according to the present invention will be specifically described with reference to the drawings, taking as an example a case where the communication apparatus is applied to a facsimile apparatus.
[0015]
First, FIG. 2 shows an external perspective view of the facsimile apparatus of this embodiment.
[0016]
In FIG. 2, the facsimile apparatus 10 consists of a main body 12 and a handset 28 placed on the left side of the main body 12. The handset 28 is connected to the main body 12 by a cord (not shown). A speaker 26 is provided on the right side of the main body 12. A key input unit 16 is provided at the front of the upper surface of the main body 12, with an LCD 18 at the left rear of the key input unit 16 and a microphone 27 at the right rear. A document insertion slot 20 is provided behind the LCD 18 and the microphone 27; a document inserted there is read by a scanner 52 (see FIG. 3) inside the main body 12 and then discharged from a document discharge port 14 provided on the front surface of the main body 12, below the key input unit 16. A recording paper holder 24 capable of holding a stack of recording paper sheets is detachably attached behind the document insertion slot 20. Recording paper supplied from the recording paper holder 24 and used for printing is discharged from a recording paper discharge port 22 provided below the document discharge port 14.
[0017]
The facsimile apparatus 10 has the electrical structure shown in the block diagram of FIG. 3. Specifically, a CPU 32, a ROM 34, an EEPROM 36, a RAM 38, an image memory 40, a voice memory 42, a timer unit 43, a sensor 44, a voice recognition unit 45, a network control unit (hereinafter "NCU") 46, a modem 48, a buffer 50, a scanner 52, an encoding unit 54, a decoding unit 56, a printer 58, the key input unit 16, the LCD 18, and amplifiers 60 and 61 are provided and connected to one another via a system bus 30. The handset 28 and the telephone line 64 are connected to the NCU 46, as is the modem 48. The speaker 26 is connected to the amplifier 60, and the microphone 27 is connected to the amplifier 61.
[0018]
More specifically, the CPU 32 controls each unit connected via the system bus 30. The control program executed by the CPU 32 and various data necessary for executing the control program are stored in the ROM 34 and the EEPROM 36.
[0019]
The control programs stored in the ROM 34 include, for example, a level setting program for setting the sound level that serves as the reference for voice recognition in the voice recognition unit 45, a mode switching program for switching to the level setting mode in which the level setting program is executed, and an automatic closing program for automatically closing the line based on the voice recognition result of the voice recognition unit 45. Various data referred to during processing in the CPU 32, the voice recognition unit 45, and so on, such as sound data and numerical data, are also stored in the ROM 34.
[0020]
The EEPROM 36 stores voice level data, speed dial data, data for outputting various voice messages, and the like.
[0021]
The RAM 38 stores sound captured from the handset 28, from the microphone 27, and from the telephone line 64 via the NCU 46, and also temporarily stores various data used while the CPU 32 executes operations.
[0022]
The image memory 40 stores the communication history, image data, and bit images for printing, and the voice memory 42 stores response messages to be sent to the counterpart device and incoming messages sent from the counterpart device. The timer unit 43 measures the current time. The sensor 44 detects whether the recording paper cover is open or closed.
[0023]
The voice recognition unit 45 recognizes a voice that exceeds a predetermined voice level from the voice captured by the microphone 27. The voice recognition unit 45 can be configured by a voice LSI or the like, for example.
[0024]
The NCU 46 transmits and receives signals to and from a partner device (not shown) connected via the telephone lines 64 and 66 and the exchange 62.
[0025]
The modem 48 modulates and demodulates communication data, such as image data and voice data, transmitted and received by the NCU 46. The buffer 50 temporarily stores data, including encoded image information, exchanged with the counterpart device. The scanner 52 reads the characters and graphics on the reading surface of a document inserted into the document insertion slot 20 as image data, and the encoding unit 54 encodes the image data read by the scanner 52. The decoding unit 56, in turn, reads out the image data stored in the buffer 50 or the image memory 40 and decodes it. The printer 58 prints the decoded data on recording paper. The key input unit 16 includes a numeric keypad and function keys, with which various setting operations, telephone number entry, and the like can be performed. The amplifier 60 amplifies the sound signal to be output from the speaker 26 as a ringing sound or call voice, and the amplifier 61 amplifies the external sound picked up by the microphone 27.
[0026]
In the present embodiment, the microphone 27 corresponds to the microphone, and the voice recognition unit 45 corresponds to the voice recognition means. The CPU 32 and the key input unit 16 correspond to the switching means and the sound level setting means, and the LCD 18 corresponds to the level display means. The CPU 32 and the NCU 46 correspond to the automatic closing means.
[0027]
In such a facsimile apparatus 10, the voice level that is a reference for voice recognition in the voice recognition unit 45 is set according to the procedure shown in the flowchart of FIG. 4.
[0028]
In FIG. 4, the facsimile machine 10 is in a reception standby state at the start time. In this state, when the mode is switched to the level setting mode (S1), the surrounding external sound is taken in by the microphone 27 and the sound level is measured, and the result is displayed on the LCD 18 as a noise level (S2). The external sound captured by the microphone 27 in S2 corresponds to the first external sound.
[0029]
The display screen of the LCD 18 at this time is shown in FIG. 1(A). FIG. 1 is a diagram showing examples of the LCD display in the present embodiment.
[0030]
In FIG. 1(A), the rectangles arranged in a row to the right of the label "noise level" form a scale that represents the noise level in 10 steps; each additional filled rectangle indicates that the level is one step higher. In this example, therefore, the noise level is 2.
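The ten-step scale of FIG. 1(A) can be pictured with a short Python sketch. The function name `render_level_bar` and the characters used for filled and empty rectangles are assumptions made purely for illustration; the description does not say how the LCD 18 is driven.

```python
def render_level_bar(label: str, level: int, steps: int = 10) -> str:
    """Return a one-line text rendering of a level scale.

    Each filled rectangle stands for one step of the level, as in the
    description of FIG. 1(A). The glyphs are an assumption of this sketch.
    """
    if not 0 <= level <= steps:
        raise ValueError("level must lie between 0 and the number of steps")
    filled = "■" * level            # one filled rectangle per step reached
    empty = "□" * (steps - level)   # remaining steps shown as empty rectangles
    return f"{label} {filled}{empty}"


print(render_level_bar("Noise level", 2))
```

A noise level of 2 is rendered as two filled rectangles followed by eight empty ones.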
[0031]
Switching to the level setting mode is performed by the CPU 32 according to a mode switching program stored in the ROM 34 based on manual input from the key input unit 16. Further, the sound level of the sound taken in by the microphone 27 is measured by the sound recognition unit 45 and stored in the RAM 38. Further, the image display on the LCD 18 is executed under the control of the CPU 32 based on the program stored in the ROM 34 and the data stored in the EEPROM 36 and the RAM 38.
[0032]
Next, it is determined whether a voice with a sound level higher than the noise level has been input (S3). If there is no such input (S3: NO), an input instruction message prompting voice input through the microphone 27 is output (S10). The determination in S3 is made by the CPU 32 in accordance with a program stored in the ROM 34. The specific content and output form of the input instruction message (display output, voice output, and so on) are not particularly limited, as long as the message prompts the user to input a voice, for example "Please input voice". The external sound measured in S3 corresponds to the second external sound.
[0033]
When a voice with a level higher than the noise level is input in S3 (S3: YES), its level is displayed on the LCD 18 as the input level (S4). The display screen of the LCD 18 at this time is shown in FIG. 1(B). In the illustrated example, the input level is 8.
[0034]
After the input level has been displayed for a predetermined time, a sound level setting screen is displayed on the LCD 18 (S5). In the present embodiment, the currently set sound level is displayed at this point and can be changed by input from the key input unit 16. This makes it visually easy to set a level that separates the noise level from the input level. Specifically, the display screen is, for example, as shown in FIG. 1(C). In the illustrated example, the currently set sound level is 5.
[0035]
Thereafter, it is determined whether a sound level setting input has been made using the key input unit 16 (S6). If there has been an input (S6: YES), the sound level display on the LCD 18 is changed based on the input (S7). When an input indicating confirmation is then made from the key input unit 16 (S8: YES), the sound level is newly set based on the input made in S6 and stored in the EEPROM 36 as sound level data (S9). At this time, the old sound level data is erased from the EEPROM 36. If, on the other hand, no input indicating confirmation is made in S8 (S8: NO), the level setting mode ends without a new sound level being set. In this case, the sound level setting displayed in S5 remains in effect. Alternatively, a plurality of sound level data items may be stored in advance, and the sound level data corresponding to the set level may be selected.
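The S6 to S9 portion of the procedure can be sketched in Python as follows. This is a simplified model under stated assumptions: the event names "up", "down", and "confirm", the clamping range of 1 to 10, and the function name `run_level_setting` are all hypothetical, since the description does not specify which keys of the key input unit 16 are used or how the displayed level is adjusted.

```python
def run_level_setting(current_level: int, key_events: list[str],
                      steps: int = 10) -> int:
    """Return the sound level in effect after a level setting session.

    The displayed level follows the key input (S6, S7); it is stored only
    when a confirming input arrives (S8: YES, S9). Without confirmation
    the previously set level remains in effect (S8: NO).
    """
    displayed = current_level
    for event in key_events:
        if event == "up":
            displayed = min(steps, displayed + 1)   # S7: update the display
        elif event == "down":
            displayed = max(1, displayed - 1)       # S7: update the display
        elif event == "confirm":
            return displayed                        # S9: store the new level
    return current_level                            # nothing confirmed


print(run_level_setting(5, ["up", "up", "confirm"]))
```

In this model, adjusting the displayed level without confirming leaves the stored level unchanged, which mirrors the S8: NO branch.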
[0036]
If there is no input for setting the sound level in S6 (S6: NO), it is determined whether the current sound level setting displayed in S5 needs to be changed (S11). Specifically, for example, the CPU 32 determines, in accordance with a determination program stored in the ROM 34, whether a sound level between the noise level and the input level has been set. In the illustrated example, it is determined whether the sound level is in the range of 3 to 7. If a setting change is necessary (S11: YES), a setting instruction message prompting the user to set the sound level, for example "Please set the sound level", is output (S12), and the process returns to S6. If no setting change is necessary (S11: NO), the level setting mode ends without the current sound level being changed. The specific content and output form of the message in S12 are not particularly limited.
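The S11 determination can be illustrated with a minimal Python sketch. The function name `setting_change_needed` is an assumption, and the strict inequalities reflect the illustrated example (noise level 2, input level 8, acceptable range 3 to 7); the description presents this check only as one example of the determination.

```python
def setting_change_needed(noise_level: int, input_level: int,
                          set_level: int) -> bool:
    """Return True when the set level does not lie strictly between
    the measured noise level and the measured input (voice) level."""
    return not (noise_level < set_level < input_level)


# Values from the illustrated example: noise level 2, input level 8.
for level in (2, 3, 5, 7, 8):
    print(level, setting_change_needed(2, 8, level))
```

For the illustrated values, levels 3 through 7 need no change, while levels 2 and 8 would trigger the setting instruction message of S12.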
[0037]
Voice recognition is then performed using the sound level set by this procedure as the reference. In the facsimile apparatus 10 of the present embodiment, when a call signal is received through the line 64 and a voice exceeding the set sound level is recognized by the voice recognition unit 45, the CPU 32 automatically closes the line based on the line closing program stored in the ROM 34.
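The condition for automatic line closing can be sketched in Python as below. This models only the level comparison; the actual voice recognition unit 45 also recognizes the voice itself, which is outside the scope of this sketch. The function and parameter names are assumptions made for illustration.

```python
def should_close_line(call_signal_received: bool, measured_level: int,
                      set_level: int) -> bool:
    """Return True when the line should be closed automatically.

    Simplified model: the line is closed only while a call signal is being
    received and the measured sound level exceeds the set reference level.
    """
    return call_signal_received and measured_level > set_level


print(should_close_line(True, 8, 5))
print(should_close_line(True, 2, 5))
print(should_close_line(False, 8, 5))
```

With a reference level of 5, a voice at level 8 during an incoming call closes the line, whereas noise at level 2, or any sound while no call signal is being received, does not.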
[0038]
As described above, in the present embodiment the ambient noise level and the level of the user's voice can be compared visually when the sound level is set, so a sound level appropriate as the reference for voice recognition can be set easily. In particular, because the noise level and the input level are displayed as images on the LCD 18, both levels are easy to grasp, and the user can judge immediately what sound level must be set to separate noise from a voice response and have the voice response recognized reliably. Furthermore, because the sound level being set is also displayed as an image, the set sound level is easy for the user to understand and the desired sound level is easy to set.
[0039]
In the present embodiment, the current sound level setting is displayed in S5 of FIG. 4, so the user can see what sound level is currently set. Furthermore, when there is no sound level setting input in S6, it is determined whether the setting needs to be changed, and if no change is needed the level setting mode simply ends, so unnecessary work is avoided. When a setting change is needed, a setting instruction message is output, which prevents the facsimile apparatus 10 from malfunctioning because voice recognition was performed with an inappropriate sound level.
[0040]
The present invention is not limited to the example described above, and various modifications are possible.
[0041]
For example, as shown in FIG. 5, an LCD 100 with a wider display area than the LCD 18 may be used, and the sound level may be displayed (FIG. 5(B)) after the noise level and the input level have been displayed side by side (FIG. 5(A)). FIG. 5 is a diagram showing another example of the LCD display. In the case of FIG. 5(A), the noise level and the input level can be compared while actually being viewed, without relying on memory, so the appropriate sound level range can be determined still more quickly. When, as in FIG. 5(B), the result of determining whether the sound level is appropriate is displayed together with the sound level, an inappropriate sound level can be prevented from being set through user error. This determination is made by the CPU 32 based on a program stored in the ROM 34.
[0042]
Alternatively, as shown in FIG. 6, the levels may be displayed as numbers instead of on a scale. FIG. 6 is a diagram showing still another example of the LCD display. In the case of FIG. 6, little space is needed to display each level, so the noise level, the input level, and the sound level can all be displayed side by side even on an LCD 110 with a relatively small display area. The level relationship between the set sound level and external sound such as noise and the user's voice is therefore easy to understand, and the sound level can be set still more easily and reliably. Ease of use can thus be improved while an increase in cost is suppressed.
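A numeric display of the kind described for FIG. 6 might look like the following Python sketch. The label wording, the separator, and the function name `format_numeric_levels` are assumptions; the actual layout of FIG. 6 is not reproduced in the text.

```python
def format_numeric_levels(noise_level: int, input_level: int,
                          set_level: int) -> str:
    """Return all three levels as numbers on a single compact line."""
    return f"Noise:{noise_level} Input:{input_level} Set:{set_level}"


print(format_numeric_levels(2, 8, 5))
```

Showing the three values on one line is what allows a display with a small area to present them together.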
[0043]
It is also possible to omit the user's voice input, and the display of the input level based on it, when the sound level is set. Even in this case, as long as the external noise level and the sound level being set are displayed visually, a sound level at which noise is not misrecognized can be set easily.
[0044]
[Effects of the Invention]
As described above, according to the communication device of claim 1, the level display means visibly displays both the sound level of the external sound and the sound level being set, so the two can be compared visually and the sound level setting operation by the sound level setting means becomes easier. The set sound level also becomes easy for the user to understand.
[0045]
According to the communication device of claim 2, the sound level of the external sound and the sound level being set are displayed side by side, so it is possible to obtain a communication device in which the sound level serving as the reference for separating noise from voice is still easier to set and in which it is easy to see whether the set sound level is higher or lower than the sound level of the external sound.
[0046]
According to the communication device of claim 3, the first external sound and the second external sound, whose level is higher than that of the first, are displayed side by side on the level display means, so the sound level of the ambient noise and the sound level of the user's voice can be compared side by side. This comparison shows at a glance what sound level should be set as the reference, which makes the sound level setting operation easier still.
[Brief description of the drawings]
FIG. 1 is a diagram showing an example of an LCD display in the present embodiment.
FIG. 2 is an external perspective view showing the facsimile apparatus of the present embodiment.
FIG. 3 is a block diagram showing an electrical configuration of the present embodiment.
FIG. 4 is a flowchart showing an audio level setting procedure in the present embodiment.
FIG. 5 is a diagram showing another example of LCD display.
FIG. 6 is a diagram showing still another example of the LCD display.
[Explanation of symbols]
16 Key input unit (switching means, sound level setting means)
18 LCD (level display means)
27 Microphone
32 CPU (automatic closing means, switching means, sound level setting means)
45 Voice recognition unit (voice recognition means)
46 NCU (automatic closing means)

Claims

1. A communication device comprising:
voice recognition means for measuring the sound level of external sound input through a microphone and recognizing, from that external sound, external sound exceeding a predetermined sound level;
automatic closing means for automatically closing a line based on a recognition result of the voice recognition means while a call signal is being received from the line;
switching means for switching to a level setting mode for setting the sound level that serves as a reference for voice recognition in the voice recognition means;
sound level setting means for setting the sound level based on manual input when the device has been switched to the level setting mode; and
level display means for visibly displaying the sound level of the external sound input through the microphone and the sound level set by the sound level setting means.

2. The communication device according to claim 1, wherein the level display means displays, side by side, the sound level of the external sound input through the microphone and the sound level set by the sound level setting means.

3. The communication device according to claim 1 or 2, wherein
the external sound includes a first external sound input through the microphone and a second external sound that is input through the microphone after the first external sound and has a sound level higher than that of the first external sound, and
the level display means displays the sound level of the first external sound and the sound level of the second external sound side by side as the sound level of the external sound.