JPH11261683A

JPH11261683A - Telephone device, recording medium on which a program is recorded, and recording medium on which data is recorded

Info

Publication number: JPH11261683A
Application number: JP10058504A
Authority: JP
Inventors: Katsumi Shiono; 勝美塩野
Original assignee: NEC Saitama Ltd
Current assignee: NEC Saitama Ltd
Priority date: 1998-03-10
Filing date: 1998-03-10
Publication date: 1999-09-24

Abstract

PROBLEM TO BE SOLVED: To allow various function features and service features to be executed easily by voice input, by storing the function names of the function features and service features, together with registered names and telephone numbers, as recognition words for voice recognition, extracting the feature quantity of the input voice, and comparing it with the stored recognition words. SOLUTION: When voice is input through a microphone 8, the voice signal is supplied to a main CPU 5 via an A/D converter 7, and a voice recognition unit 6 performs recognition processing. Based on the recognition result from the voice recognition unit 6, the main CPU 5 refers to a memory dial and identifies the name and dial number of the other party, presents the name and dial number for confirmation through speaker output via a D/A converter 10 or through display output, and dials the number detected by the main CPU 5. Settings are made in accordance with voice guidance, so the function features and service features can be set easily.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to a telephone device that operates in response to a user's voice instructions, and to a recording medium on which programs and data used in the telephone device are recorded.

[0002]

[Prior art] Various mobile telephones and car telephones have been proposed in which a voice input from a microphone is decoded by a voice recognition circuit to output a code signal corresponding to the voice, the dial number entered by voice is looked up on the basis of this code signal under the control of processing means, and communication is established with the party corresponding to the retrieved dial number.

[0003] A conventional example in a technical field similar to that of the present invention is the "automatic-dial in-vehicle mobile telephone" of Japanese Patent Application Laid-Open No. 3-45057. As shown in FIG. 3, this conventional example comprises a voice input unit 11, voice analysis means 12, a speaker 13, a voice synthesis unit 14, a display unit 15, a dial number sending unit 16, voice recognition means 17, a search processing unit 18, a key information input unit 19, a voice standard pattern memory 20, an information memory 21, and so on. To dial automatically by voice input, the user speaks the name of the other party into the voice input unit 11, such as a microphone, and the voice analysis means 12 extracts the feature quantity of that voice. The voice recognition means 17 searches for address information registered in advance in the voice standard pattern memory 20 on the basis of the extracted feature quantity. The search processing unit 18 then searches for the name and telephone number of the other party registered in advance in the information memory 21 on the basis of that address information. The retrieved name and telephone number are displayed on the display unit 15 and are also synthesized into speech by the voice synthesis unit 14 and uttered from the speaker 13. The user listens to this utterance and says "correct" or "wrong". If "correct", the dial number sending unit 16 dials; if "wrong", the user starts over from the voice input of the other party. A sound is also emitted from the speaker 13 when the address information has been obtained through extraction of the feature quantity. Registration in the memories 20 and 21 is performed in advance by the user in a registration mode, using the voice input unit 11 and key operations on the key information input unit 19.

[0004]

[Problems to be solved by the invention] However, the conventional example described above has the following problems. The first problem is that only the dialing function can be performed by voice. As a result, the functions available are limited when the mobile telephone is used eyes-free or by a visually impaired person.

[0005] The second problem is that, when recognition is wrong, the recognition operation must be performed again. As a result, misrecognition keeps being repeated for words that are easily misrecognized.

[0006] The third problem is that the words to be recognized by voice recognition must be registered in advance. Registration must therefore be performed every time a recognition word is added, and because the environment in which the recording is made affects the recognition rate, new recognition words cannot always be registered wherever the user happens to be. In addition, because recognition is performed by uttering the previously registered speech, the user must remember the words uttered at registration.

[0007] An object of the present invention is to solve the above problems and to provide a telephone device with which various function features and service features can easily be carried out by voice input, and a recording medium on which programs and data used in the telephone device are recorded.

[0008]

[Means for solving the problems] To achieve this object, the telephone device of the present invention comprises: voice input means for inputting the voice of an operator; storage means for storing the function names of function features and service features, together with registered names and telephone numbers, as recognition words for voice recognition; voice recognition means for recognizing the input voice by extracting a feature quantity from the voice input through the voice input means and comparing it with the recognition words stored in the storage means; display means for displaying the recognition result of the voice recognition means; voice output means for outputting the recognition result of the voice recognition means as speech; and control means for controlling each of these means.

[0009] The telephone device of the present invention preferably has operation input means for inputting settings from the operator. When a scroll instruction is input from the voice input means or the operation input means after the recognition result of the voice recognition means has been displayed by the display means or output by the voice output means, the control means preferably causes the voice recognition means to scroll through the information stored in the storage means and perform voice recognition again, and causes the recognition result to be displayed again on the display means or output from the voice output means.

[0010] When an instruction to execute processing is input from the voice input means or the operation input means after the recognition result of the voice recognition means has been displayed on the display means or output from the voice output means, the control means preferably acts on the recognition result: if the result is a registered name or telephone number, it performs dial processing to dial the recognized telephone number; if the result is a function feature or a service feature, it performs control to carry out that function feature or service feature.
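The branching described in this paragraph can be illustrated with a minimal Python sketch. The `Recognition` type, its `kind` labels, and the returned action strings are hypothetical names introduced only for illustration; the publication does not specify any data format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Recognition:
    """Hypothetical container for a confirmed recognition result."""
    kind: str   # "contact" for a registered name or number, "function" otherwise
    value: str  # telephone number for a contact, function name for a function


def execute(result: Recognition) -> str:
    """Choose the action once the operator has instructed the device to proceed."""
    if result.kind == "contact":
        # Registered name or telephone number: dial the recognized number.
        return f"DIAL {result.value}"
    if result.kind == "function":
        # Function feature or service feature: carry out that function.
        return f"RUN {result.value}"
    raise ValueError(f"unknown recognition kind: {result.kind!r}")
```

A real handset would trigger the radio unit or a settings routine instead of returning a string; the string simply makes the two branches visible.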

[0011] The storage means preferably stores a plurality of recognition words for each function name.

[0012] The registered telephone numbers and names are preferably stored in the storage means as individual single-sound characters and digits, and the voice recognition means preferably performs the search using speech, input from the voice input means, of a part of the other party's name and/or a part of the dial number.

[0013] The storage means preferably stores voice guidance words indicating the procedure for operating and setting each function, and the control means preferably performs control to output, from the voice output means, the voice guidance words indicating the operating procedure of the function input from the voice input means or the operation input means.

[0014] The control means preferably performs control to switch whether or not the voice guidance words are output from the voice output means, in accordance with a setting input from the operation input means.

[0015] When a new function name is input from the operation input means, the control means preferably performs control to register the input function name in the storage means.

[0016] The voice recognition processing by the voice recognition means is preferably performed by a speaker-independent method.

[0017] The recording medium on which the program of the present invention is recorded is characterized in that it records a program for executing: input voice storage processing for storing the voice input from the voice input means; voice recognition processing for extracting a feature quantity from the voice input from the voice input means and comparing it with the recognition words stored in the storage means to recognize the input voice; display processing for displaying the recognition result of the voice recognition processing on the display means; and voice output processing for outputting the recognition result of the voice recognition processing as speech from the voice output means.

[0018] The recording medium on which the above program is recorded preferably records a program for executing: correct/incorrect judgment input processing in which, after the display processing or voice output processing that outputs the recognition result of the voice recognition processing has finished, the operator's judgment as to whether the result is correct is input from the voice input means or the operation input means; processing in which, when the correct/incorrect judgment input processing receives input that the recognition result is wrong, the voice recognition processing is performed and its recognition result is again displayed on the display means by the display processing or output from the voice output means by the voice output processing; and processing in which, when the correct/incorrect judgment input processing receives input that the recognition result is correct, the recognition result is acted upon, namely dial processing that dials the recognized telephone number if the result is a registered name or telephone number, or processing that carries out the function feature or service feature if the result is a function feature or service feature.

[0019] The recording medium on which the data of the present invention is recorded is characterized in that it records, as data, the function names of the function features and service features of a telephone device.

[0020] There are preferably a plurality of recognition words for each function name.

[0021] The recording medium on which the data of the present invention is recorded is characterized in that it records registered names and telephone numbers as data in the form of single-sound characters and digits.

[0022] The recording medium on which the data of the present invention is recorded is characterized in that it records, as data, guidance words for setting the function features and service features of a telephone device.

[0023]

[Embodiments of the invention] Embodiments of the telephone device of the present invention and of the recording medium on which the programs and data used in the telephone device are recorded will now be described in detail with reference to the accompanying drawings. FIGS. 1 and 2 show one embodiment of the telephone device and the recording medium on which programs and data are recorded according to the present invention. FIG. 1 is a block diagram showing the configuration of an embodiment in which the telephone device and the recording medium of the present invention are applied to a mobile telephone, and FIG. 2 is a flowchart showing the operating procedure of the embodiment shown in FIG. 1.

[0024] The embodiment shown in FIG. 1 comprises a radio unit 1, an operation unit 2, a display unit 3, a storage device 4A, a ROM 4B, a main CPU 5, a voice recognition unit 6, an A/D converter 7, a microphone 8, a speaker 9, and a D/A converter 10.

[0025] The radio unit 1 transmits and receives radio signals to and from a base station (not shown). The operation unit 2 has a numeric keypad and other function keys and is used for key operation of the telephone. It is also used to carry out the various function features and service features other than dialing. The storage device 4A stores the names, telephone numbers, and other details of other parties entered through the operation unit, and so functions as a memory dial. To allow the function features and service features to be carried out by voice input, the storage device 4A also stores a plurality of recognition words for the function names of the function features and service features. The ROM 4B stores the control program of the main CPU 5 and the like, including the processing of the flowchart of FIG. 2 described later. The voice recognition unit 6 recognizes the voice input from the microphone 8 and sends the recognition result to the main CPU 5. The A/D converter 7 converts the analog voice signal from the microphone 8 into a digital signal and sends it to the main CPU 5 or the voice recognition unit. The D/A converter 10 reads out the name of the other party or the function name corresponding to the recognition result of the voice recognition unit, converts it into an analog signal, and outputs it from the speaker.

[0026] The storage device 4A and the ROM 4B constitute the recording medium on which data is recorded and the recording medium on which the program is recorded according to the present invention. A semiconductor memory, an optical disk, a magneto-optical disk, a magnetic recording medium, or the like may be used as this recording medium, and it may take the form of a memory card, a CD-ROM, a floppy disk, or the like.

[0027] The operation of the above configuration will now be described. To detect the other party's telephone number by voice input and dial it, the user first operates a key on the operation unit 2 to start voice recognition. When the user speaks into the microphone 8, the voice signal is supplied to the main CPU 5 through the A/D converter 7, and the voice recognition unit 6 then performs recognition processing. The main CPU 5 refers to the memory dial on the basis of the recognition result from the voice recognition unit 6 and identifies the other party's name and dial number. The identified name and dial number are output from the speaker through the D/A converter 10 or displayed on the display unit. When the user judges the recognition result to be correct, the number detected by the main CPU 5 is dialed.

[0028] Furthermore, in this embodiment, even if misrecognition occurs when the voice input is recognized and the memory dial is searched, the memory dial entries corresponding to the voice recognition can be scrolled by operating the scroll key or by voice input. The memory dial is searched in this way, each scrolled entry is shown on the display unit 3, and the dial name is announced by voice from the speaker 9. Consequently, even if misrecognition occurs, voice recognition processing does not have to be performed again.
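One way to picture scrolling without a second recognition pass is a cursor over the matches produced by a single pass. The following Python sketch is an illustration under assumptions: the publication does not say how candidates are ordered or whether scrolling wraps around, and the class and method names are hypothetical.

```python
class CandidateList:
    """Matches from one recognition pass, with a cursor the user can scroll."""

    def __init__(self, candidates):
        if not candidates:
            raise ValueError("at least one candidate is required")
        self._candidates = list(candidates)
        self._index = 0

    @property
    def current(self):
        """The entry currently shown on the display and announced by voice."""
        return self._candidates[self._index]

    def scroll(self):
        """Move to the next entry (wrapping is an assumption); no re-recognition."""
        self._index = (self._index + 1) % len(self._candidates)
        return self.current
```

For example, if the first entry presented is not the intended one, one call to `scroll()` presents the next stored entry while the original utterance is left untouched.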

[0029] Numbers and names are detected by voice input as described above. In addition, in this embodiment the memory dial can be searched even when the telephone number or name of the other party spoken into the microphone is only a part of that telephone number or name. This is because single-sound characters and digits are provided as the voice recognition dictionary, so the voice recognition unit can find the other party's telephone number stored in the storage device even from a part of the telephone number or name. The voice recognition processing uses a speaker-independent method, in which the speaker who utters the words to be recognized is not limited to a specific person.
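The partial search can be sketched as substring matching over entries stored character by character. This is a simplified illustration, not the recognizer itself: it assumes the recognition step has already turned the utterance into a sequence of single characters or digits, and the function name, entry layout, and sample data are hypothetical.

```python
def search_memory_dial(entries, spoken_units):
    """Return the (name, number) entries that contain the spoken fragment.

    entries:      list of (name, number) string pairs from the memory dial
    spoken_units: recognized single characters or digits, in spoken order
    """
    fragment = "".join(spoken_units)
    if not fragment:
        return []
    return [
        (name, number)
        for name, number in entries
        if fragment in name or fragment in number
    ]


# Hypothetical memory dial contents, for illustration only.
MEMORY_DIAL = [("TANAKA", "0312345678"), ("TAKAHASHI", "0398765432")]
```

With this data, the units `["T", "A", "N"]` select only the first entry, while `["T", "A"]` matches both, which is the situation in which scrolling through the matches becomes useful.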

[0030] Next, the case where function features and service features are carried out by voice input will be described. A plurality of function names for the function features and service features are registered in the storage device 4A as a dictionary for voice recognition. The user can therefore call up a function by uttering a word that corresponds to the desired function, without having to memorize one specific word. Guidance words indicating the order in which the called function is set by voice, key operation, and so on are also registered. The user can therefore set the function easily by performing voice input and key operations in sequence while listening to the guidance. For the recognition words of function feature and service feature names, the user can also enter and register recognition words for each function through the operation unit 2, in addition to the words registered in advance. Thus, even if the device cannot correctly recognize a pre-registered recognition word, the user can remedy the problem by registering another word that is easier to recognize.
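The dictionary with several recognition words per function, extended by user registration, can be sketched as a word-to-function map. The class name, the normalization rule, and the function names in the usage note are hypothetical; the publication names no concrete functions or words.

```python
class FunctionDictionary:
    """Maps recognition words to function names; several words may share a function."""

    def __init__(self, preset):
        # preset: {function_name: [recognition words registered in advance]}
        self._word_to_function = {}
        for function_name, words in preset.items():
            for word in words:
                self._word_to_function[self._normalize(word)] = function_name

    @staticmethod
    def _normalize(word):
        # Case and surrounding whitespace are ignored (an assumption of this sketch).
        return word.strip().lower()

    def register(self, function_name, word):
        """Add a user-entered recognition word for an existing function."""
        if function_name not in self._word_to_function.values():
            raise KeyError(function_name)
        self._word_to_function[self._normalize(word)] = function_name

    def lookup(self, word):
        """Return the function name for a recognized word, or None if unknown."""
        return self._word_to_function.get(self._normalize(word))
```

For instance, a dictionary preset with `{"silent_mode": ["silent", "mute"]}` resolves either word to the same function, and after `register("silent_mode", "quiet")` the new word resolves to it as well.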

[0031] To carry out a function feature or service feature by voice input, the user speaks the desired function name into the microphone 8. The voice recognition unit recognizes the voice input from the microphone, and the main CPU 5 reads the corresponding function name from the storage device on the basis of the recognition result. The function name read out is displayed on the display unit 3 or output from the speaker 9. When the user judges the recognition result to be correct, the voice inputs and key operations needed to set the function are output in sequence from the speaker 9 as voice guidance. By making the settings in accordance with the voice guidance, the user can operate the device easily even without knowing how the function is set. The user can also turn the guidance output from the speaker 9 ON or OFF.
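The guidance behaviour, including the ON/OFF setting, can be sketched as a lookup of stored prompts that is skipped when guidance is disabled. The class name, the `enabled` flag, and the prompt texts in the usage note are hypothetical illustrations.

```python
class GuidancePlayer:
    """Holds the stored guidance words per function and honours the ON/OFF setting."""

    def __init__(self, scripts, enabled=True):
        # scripts: {function_name: [prompt, prompt, ...]} in setting order
        self._scripts = scripts
        self.enabled = enabled

    def prompts_for(self, function_name):
        """Return the prompts to speak, in order; empty when guidance is OFF."""
        if not self.enabled:
            return []
        return list(self._scripts.get(function_name, []))
```

With `GuidancePlayer({"alarm": ["Say the hour", "Say the minute"]})`, the two prompts are returned in order while guidance is ON, and an empty list is returned once `enabled` is set to `False`.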

[0032] The processing procedure for detecting a number or name by voice input and for executing function features and service features will be described with reference to the flowchart shown in FIG. 2. First, when operation of the key for voice recognition is detected (step S1), a sound announcing the start of voice recognition is emitted from the speaker 9 (step S2). Next, the user's voice is input (step S3) and the input voice is recognized (step S4). This recognition is performed by the voice recognition unit 6, which extracts a feature quantity from the input voice and compares it with the recognition words stored in the storage device. When the recognition result is determined (step S5/YES), the main CPU performs control to display the result (step S6). The recognition result is also output from the speaker (step S7). The user then judges whether the recognition result is correct. If it is correct, the main CPU executes the processing corresponding to the recognition result (step S8). If the recognition result is the telephone number or name of a registered party, the recognized telephone number is dialed. If the recognition result is the name of a function feature or service feature and voice guidance is set to ON, the voice guidance for that function is output from the speaker and input from the operator following the guidance is accepted. If the recognition result is not correct, the memory dial entries corresponding to the voice recognition are scrolled by operating the scroll key or by voice input; the memory dial is searched in this way, each scrolled entry is shown on the display unit 3, and the dial name is announced by voice from the speaker 9. Consequently, even if misrecognition occurs, voice recognition processing does not have to be performed again.
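The order of steps S1 to S8 can be traced with a short Python sketch that logs each step. It rests on several assumptions: the recognizer and the user's confirmation are passed in as hypothetical callables, the recognizer returns an ordered list of candidates, and the session simply ends when no result is determined at S5, a branch the text above does not describe.

```python
def run_voice_session(recognize, confirm, utterance):
    """Walk steps S1 to S8 once and return a log of the steps taken.

    recognize(utterance) -> list of candidate results (hypothetical recognizer)
    confirm(candidate)   -> True if the user accepts the candidate
    """
    log = ["S1 key pressed", "S2 start tone", "S3 voice input"]
    candidates = recognize(utterance)
    log.append("S4 recognize")
    if not candidates:
        # S5/NO: what follows is not described; this sketch ends the session.
        log.append("S5 no result")
        return log
    log.append("S5 result fixed")
    for candidate in candidates:
        log.append(f"S6 display {candidate}")
        log.append(f"S7 speak {candidate}")
        if confirm(candidate):
            log.append(f"S8 execute {candidate}")
            return log
        # Rejected: scroll to the next candidate without recognizing again.
    log.append("no candidate accepted")
    return log
```

Running it with a recognizer that returns two candidates and a user who accepts the second shows S4 in the log only once, followed by two S6/S7 pairs and a single S8.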

[0033] In the embodiment described above, a plurality of function and service feature names are registered as the voice recognition dictionary. When a function or service feature name is input by voice, the input function name is recognized using the recognition words stored in the storage means, and the result is displayed on the display unit or output from the speaker, so a function can be called up reliably by voice input. Even when the user does not know how to set a function of the device, the function can be set easily because voice guidance for the setting is output. Moreover, because the guidance sound can be set to be output or not, a user who is familiar with the settings can turn the voice guidance off.

[0034] Furthermore, function and service feature names are prepared in advance as the voice recognition dictionary, and because several words rather than a single word are prepared as recognition words for each function, a function can be called up without memorizing one specific word. If a registered recognition word is difficult to recognize, the recognition rate can be improved by the user registering a new recognition word through the operation unit.

[0035] In addition, because single-sound characters and digits are provided as the voice recognition dictionary, the voice recognition unit can recognize the desired party's telephone number or name even when the spoken words are only a part of that party's name or telephone number. The user therefore does not need to remember all of the information of the memory dial entry being searched for.

[0036] Also, because the voice recognition processing is speaker-independent, the device is not tied to a particular user and can be used by anyone, and the effort of registration is saved.

[0037]

[Effects of the invention] As is clear from the above description, in the telephone device of the present invention a plurality of function and service feature names are registered as the voice recognition dictionary. When a function or service feature name is input by voice, the input function name is recognized using the recognition words stored in the storage means, and the result is displayed on the display unit or output from the voice output means, so function features can be called up reliably by voice input. Even when the user does not know how to set a function of the device, the function can be set easily because voice guidance for the setting is output. Moreover, because the guidance sound can be set to be output or not, a user who is familiar with the settings can turn the voice guidance off.

[0038] In addition, function and service feature names are prepared in advance as the voice recognition dictionary, and because several words rather than a single word are prepared as recognition words for each function, a function can be called up without memorizing one specific word. If a registered recognition word is difficult to recognize, the recognition rate can be improved by the user registering a new recognition word through the operation unit.

[0039] In addition, because single-sound characters and digits are provided as the voice recognition dictionary, the voice recognition means can recognize the desired party's telephone number or name even when the spoken words are only a part of that party's name or telephone number. The user therefore does not need to remember all of the information of the memory dial entry being searched for.

[0040] Also, by making the voice recognition processing speaker-independent, the device can be used by anyone rather than by one specific user, and the effort of registration is saved.

[Brief description of the drawings]

FIG. 1 is a block diagram showing an embodiment of the present invention.

FIG. 2 is a flowchart showing the processing for realizing functions by voice recognition.

FIG. 3 is a block diagram showing the configuration of a conventional mobile telephone capable of automatic dialing by voice.

[Explanation of symbols]

2 operation unit
3 display unit
4A storage device
4B ROM
5 main CPU
6 voice recognition unit
8 microphone
9 speaker

─────────────────────────────────────────────────────

[Procedure amendment]

[Submission date] April 19, 1999

[Procedure amendment 1]

[Document name to be amended] Specification

[Item to be amended] Full text

[Amendment method] Change

[Contents of amendment]

[Document name] Specification

[Title of the invention] Telephone device, recording medium recording program, and recording medium recording data

[Claims]

[Detailed description of the invention]

[0001]

[0002]

[0003] As a conventional example whose technical field is similar to that of the present invention, there is the "automatic-dialing in-vehicle mobile telephone" of Japanese Patent Application Laid-Open No. 3-45057. As shown in FIG. 3, this conventional example comprises a voice input unit 11, voice analysis means 12, a speaker 13, a voice synthesis unit 14, a display unit 15, a dial number sending unit 16, voice recognition means 17, a search processing unit 18, a key information input unit 19, a voice standard pattern memory 20, an information memory 21, and so on. When automatic dialing by voice input is performed, the user inputs the name of the other party using the voice input unit 11, such as a microphone, and the voice analysis means 12 extracts the feature quantity of that voice. The voice recognition means 17 searches for address information registered in advance in the voice standard pattern memory 20 on the basis of the extracted feature quantity. Next, the search processing unit 18 searches for the name and telephone number of the other party registered in advance in the information memory 21 on the basis of the address information. The retrieved name and telephone number are displayed on the display unit 15 and are also synthesized into speech by the voice synthesis unit 14 and uttered from the speaker 13. The user listens to this utterance and inputs "correct" or "incorrect" by voice. If the input is "correct", the dial number sending unit 16 performs the dialing operation; if it is "incorrect", the user starts over from the voice input of the other party. A sound notifying the user is also emitted from the speaker 13 when the address information has been obtained through the feature extraction. Registration in the memories 20 and 21 is performed in advance by the user, in a registration mode, through the voice input unit 11 and key operations on the key information input unit 19.
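The two-stage lookup of this conventional example can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the memory contents, the function names `analyze` and `conventional_auto_dial`, and the use of lower-cased text in place of a real acoustic feature quantity are hypothetical and do not come from the cited publication.

```python
# Hypothetical stand-ins for the two memories of FIG. 3 (contents are invented).
STANDARD_PATTERN_MEMORY = {"tanaka": 0, "suzuki": 1}   # memory 20: pattern -> address
INFORMATION_MEMORY = {                                  # memory 21: address -> (name, number)
    0: ("Tanaka", "0312345678"),
    1: ("Suzuki", "0487654321"),
}


def analyze(utterance: str) -> str:
    """Stand-in for feature extraction (means 12): normalise the spoken text."""
    return utterance.strip().lower()


def conventional_auto_dial(utterances, answers):
    """Repeat the name input until the user answers 'correct'.

    Returns the number that would be dialed, or None if no attempt is confirmed.
    """
    for utterance, answer in zip(utterances, answers):
        address = STANDARD_PATTERN_MEMORY.get(analyze(utterance))  # means 17
        if address is None:
            continue                       # nothing recognised, user tries again
        name, number = INFORMATION_MEMORY[address]                 # unit 18
        if answer == "correct":            # confirmation given by voice
            return number                  # unit 16 would dial this number
    return None
```

Note that in this sketch an "incorrect" answer forces a completely new utterance, which is the inconvenience of the conventional arrangement.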

[0004]

[0008]

[Means for solving the problem] To achieve this object, the telephone device of the present invention is characterized by comprising: voice input means for inputting the voice of an operator; storage means for storing the function names of function features and service functions as recognition words for voice recognition; voice recognition means for recognizing the input voice by extracting a feature quantity of the voice input from the voice input means and comparing it with the recognition words stored in the storage means; display means for displaying the recognition result of the voice recognition means; voice output means for outputting the recognition result of the voice recognition means as voice; and control means for controlling each of the above means.
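The comparison of an extracted feature quantity with stored recognition words can be pictured as a nearest-template search. The following is only a sketch under stated assumptions: the three-element feature vectors, the Euclidean distance measure, the threshold `max_distance`, and the name `recognize` are all hypothetical, since the specification does not state how the feature quantity is computed or compared.

```python
import math

# Hypothetical reference feature vectors for stored recognition words.
RECOGNITION_WORDS = {
    "call forwarding": (0.9, 0.1, 0.3),
    "volume": (0.2, 0.8, 0.5),
    "tanaka": (0.4, 0.4, 0.9),
}


def recognize(features, max_distance=0.5):
    """Return the closest stored recognition word, or None if none is close enough."""
    best_word, best_dist = None, float("inf")
    for word, reference in RECOGNITION_WORDS.items():
        dist = math.dist(features, reference)   # Euclidean distance (assumed measure)
        if dist < best_dist:
            best_word, best_dist = word, dist
    return best_word if best_dist <= max_distance else None
```

A rejection threshold is included so that an utterance unlike every stored word yields no result instead of a forced match; this is a design choice of the sketch, not something the specification requires.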

[0009] The above control means preferably performs control such that, after the recognition result of the voice recognition means has been displayed by the display means or output by the voice output means, when a voice instructing scrolling is input through the voice input means, the information stored in the storage means is scrolled, the voice recognition means performs voice recognition again, and the recognition result is again displayed by the display means or output from the voice output means.

[0010] The telephone device of the present invention preferably further comprises operation input means for inputting settings from the operator, and the control means preferably performs control such that, after the recognition result of the voice recognition means has been displayed by the display means or output by the voice output means, when a scroll instruction is input through the operation input means, the information stored in the storage means is scrolled, the voice recognition means performs voice recognition again, and the recognition result is again displayed on the display means or output from the voice output means.

[0011] The above control means preferably performs control such that, after the recognition result of the voice recognition means has been displayed on the display means or output from the voice output means, when an instruction to execute processing is input through the voice input means or the operation input means, then, in accordance with the recognition result of the voice recognition means, if the recognition result is a function feature or service function, that function feature or service function is realized.

[0012] The above storage means preferably stores a plurality of recognition words for one function name.

[0013] Preferably, the above storage means stores voice guidance words indicating the operation setting procedures of the function features and service functions, and the control means performs control to output, from the voice output means, the voice guidance words indicating the operation procedure of the function input through the voice input means or the operation input means.

[0017] The recording medium recording a program of the present invention is characterized by recording a program for executing: input voice storage processing for storing, as recognition words, the voices of function feature names, service function names, names, and telephone numbers input from voice input means; voice recognition processing for extracting a feature quantity of the voice input from the voice input means and recognizing the input voice by comparing it with the recognition words stored by the input voice storage processing; display processing for displaying the recognition result of the voice recognition processing on display means; and voice output processing for outputting the recognition result of the voice recognition processing as voice from voice output means.

[0018] The recording medium recording the above program preferably records a program for executing: correctness judgment input processing for inputting, through the voice input means, the operator's judgment of whether the result is correct, after the display processing or voice output processing that outputs the recognition result of the voice recognition processing has finished; processing for, upon receiving through the correctness judgment input processing an input indicating that the recognition result is incorrect, performing the voice recognition processing and again displaying its recognition result on the display means by the display processing or outputting it from the voice output means by the voice output processing; and processing for, upon receiving through the correctness judgment input processing an input indicating that the recognition result is correct, acting in accordance with the recognition result of the voice recognition processing, namely executing dial processing for dialing the recognized telephone number if the recognition result is a registered name or telephone number, and executing processing for realizing the function feature or service function if the recognition result is a function feature or service function.

[0021] The recording medium recording data of the present invention is characterized in that guidance words for setting the function features and service functions of a telephone device are recorded as data.

[0022]

[0023] The present embodiment shown in FIG. 1 comprises a radio unit 1, an operation unit 2, a display unit 3, a storage device 4A, a ROM 4B, a main CPU 5, a voice recognition unit 6, an A/D converter 7, a microphone 8, a speaker 9, and a D/A converter 10.

[0024] The radio unit 1 transmits and receives radio signals to and from a base station (not shown). The operation unit 2 has a numeric keypad and other function keys and is used for key operation of the telephone. It is also used to realize various function features and service functions besides the dial function. The storage device 4A stores the names, telephone numbers, and the like of other parties that have been input and set through the operation unit, and functions as a memory dial. In addition, so that function features and service functions can be realized by voice input, the storage device 4A stores a plurality of function names of the function features and service functions as recognition words. The ROM 4B stores a control program and the like for the main CPU 5, including the processing of the flowchart of FIG. 2 described later. The voice recognition unit 6 recognizes the voice input from the microphone 8 and transmits the recognition result to the main CPU 5. The A/D converter 7 converts the analog voice signal from the microphone 8 into a digital signal and transmits it to the main CPU 5 or the voice recognition unit. The D/A converter 10 reads out the corresponding party name or function name on the basis of the recognition result of the voice recognition unit, converts it into an analog signal, and causes it to be output from the speaker.

[0025] The storage device 4A and the ROM 4B constitute the recording medium recording data and the recording medium recording a program according to the present invention. As this recording medium, a semiconductor memory, an optical disk, a magneto-optical disk, a magnetic recording medium, or the like may be used, and these may be configured and used as a memory card, a CD-ROM, a floppy disk, or the like.

[0026] Next, the operation of the above configuration will be described. In the processing that detects the telephone number of the other party by voice input and dials it, voice recognition is first started by a key operation on the operation unit 2. When the user inputs a voice using the microphone 8, the voice signal is supplied to the main CPU 5 via the A/D converter 7, and recognition processing is then performed by the voice recognition unit 6. The main CPU 5 refers to the memory dial on the basis of the recognition result from the voice recognition unit 6 and identifies the name and dial number of the other party. The identified name and dial number are then output from the speaker via the D/A converter 10 or displayed on the display unit. When the user judges that the recognition result is correct, the main CPU 5 dials the detected number.

[0027] Furthermore, in this embodiment, even if a misrecognition occurs when a voice input is recognized and the memory dial is searched, the memory dial entries corresponding to the voice recognition can be scrolled by operating a scroll key or by voice input, so that the memory dial is searched. The scrolled memory dial entry is displayed on the display unit 3, and the dial name is announced by voice from the speaker 9. As a result, even if a misrecognition occurs, voice recognition processing does not have to be performed again.
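The recovery from a misrecognition by scrolling can be sketched as follows. This is a hypothetical illustration: the memory dial contents, the one-entry step, the wrap-around at the end of the list, and the names `scroll` and `select_by_scrolling` are assumptions, because the specification does not state the order in which entries are scrolled.

```python
# Hypothetical memory dial contents (invented for illustration).
MEMORY_DIAL = [
    ("Sato", "0311112222"),
    ("Suzuki", "0487654321"),
    ("Tanaka", "0312345678"),
]


def scroll(index, step=1):
    """Move to a neighbouring memory-dial entry, wrapping around (assumed behaviour)."""
    return (index + step) % len(MEMORY_DIAL)


def select_by_scrolling(start_index, is_correct):
    """Start at the (possibly misrecognised) entry and scroll until the user accepts one.

    `is_correct` stands in for the user's judgment after each entry is shown
    on the display and announced from the speaker.
    """
    index = start_index
    for _ in range(len(MEMORY_DIAL)):
        name, number = MEMORY_DIAL[index]
        if is_correct(name):
            return name, number
        index = scroll(index)
    return None
```

For example, if the recognizer wrongly lands on "Sato" (index 0), `select_by_scrolling(0, lambda name: name == "Suzuki")` reaches the intended entry after one scroll without a new utterance.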

[0028] The number or name is detected by voice input as described above. In addition, in this embodiment, the memory dial can be searched even if the telephone number or name of the other party input from the microphone is only a part of that telephone number or name. This is because single-sound characters and digits are prepared as the dictionary for voice recognition, so the voice recognition unit can search for the telephone number of the other party stored in the storage device even from a part of the telephone number or name. The voice recognition processing uses a speaker-independent method, in which the speaker who utters the words to be recognized is not limited to a specific person.
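Searching the memory dial from only a part of a name or number can be sketched with a simple substring match. This is a simplification made for illustration: the specification achieves the effect by recognizing single sounds and digits, whereas the sketch assumes the recognized fragment is already available as text. The memory dial contents and the name `search_partial` are hypothetical.

```python
# Hypothetical memory dial: registered name -> telephone number.
MEMORY_DIAL = {
    "Tanaka": "0312345678",
    "Takahashi": "0455550000",
    "Suzuki": "0487654321",
}


def search_partial(fragment):
    """Return every entry whose name or number contains the recognised fragment."""
    key = fragment.lower()
    return [
        (name, number)
        for name, number in MEMORY_DIAL.items()
        if key in name.lower() or key in number
    ]
```

A fragment may match several entries, as with "ta" here, in which case the candidates could be presented one at a time for the user to confirm.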

[0029] Next, the case where function features and service functions are operated by voice input will be described. In the storage device 4A, a plurality of function names of the function features and service functions are registered as a dictionary for voice recognition. This allows the user to call a desired function by uttering a word corresponding to it, without memorizing a specific word. In addition, guidance words are registered that indicate the order in which the called function is set by voice, key operation, and so on. The user can therefore set the function easily by performing voice input and key operations in sequence while listening to the guidance. Moreover, for the recognition words of function feature names and service function names, the user can, in addition to the words registered in advance, input and register recognition words for each function from the operation unit 2. Thus, even if the device cannot correctly recognize a recognition word registered in advance, the problem can be remedied by the user registering another word that is easier to recognize.
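A dictionary that maps several recognition words to one function, and that accepts further words registered by the user, might look like the following sketch. The class name `FunctionDictionary`, its methods, and the example words are hypothetical and chosen only to illustrate the idea.

```python
class FunctionDictionary:
    """Maps recognition words to function names; several words may share one function."""

    def __init__(self):
        self._word_to_function = {}

    def register(self, function_name, *words):
        """Register one or more recognition words for a function."""
        for word in words:
            self._word_to_function[word.lower()] = function_name

    def lookup(self, spoken_word):
        """Return the function called by the spoken word, or None if unknown."""
        return self._word_to_function.get(spoken_word.lower())


dictionary = FunctionDictionary()
# Words prepared in advance (invented examples).
dictionary.register("ringer volume", "volume", "ringer", "loudness")
# A word added later by the user because the preset words were hard to recognise.
dictionary.register("ringer volume", "bell level")
```

Because every word points to the same function name, the user may say any of them, and adding a word never disturbs the ones already registered.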

[0030] When a function feature or service function is operated by voice input, the user utters the desired function name into the microphone 8. The voice input from the microphone is recognized by the voice recognition unit, and the main CPU 5 reads the corresponding function name from the storage device on the basis of the recognition result. The function name that has been read out is displayed on the display unit 3 or output from the speaker 9. When the user judges that the recognition result is correct, the voice inputs and key operations needed to set that function are output in sequence from the speaker 9 as voice guidance. By making the settings in accordance with the voice guidance, the user can operate the device easily even without knowing how to set the function. The user can also switch the guidance output from the speaker 9 ON or OFF.
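The output of guidance steps and the ON/OFF setting can be sketched as below. The guidance texts, the table `GUIDANCE`, and the function `guidance_prompts` are hypothetical; the specification states only that guidance words are stored per function and that their output can be switched.

```python
# Hypothetical guidance words stored per function (texts are invented).
GUIDANCE = {
    "ringer volume": [
        "Say or press a level from 1 to 5.",
        "Press the set key to confirm.",
    ],
}


def guidance_prompts(function_name, guidance_on=True):
    """Return the prompts to be spoken, in order, for the called function.

    When the user has switched guidance OFF, nothing is spoken.
    """
    if not guidance_on:
        return []
    return list(GUIDANCE.get(function_name, []))
```

Returning a copy of the stored list keeps the stored guidance words unchanged if the caller modifies the result.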

[0031] The processing procedure for detecting a number or name by voice input and for executing function features and service functions will be described using the flowchart shown in FIG. 2. First, when it is detected that the key for starting voice recognition has been operated (step S1), a sound announcing the start of voice recognition is emitted from the speaker 9 (step S2). Next, the user's voice is input (step S3), and the input voice is recognized (step S4). This voice recognition is performed by having the voice recognition unit 6 extract the feature quantity of the input voice and compare it with the recognition words stored in the storage device. When the recognition result is determined (step S5/YES), the main CPU performs control to display the recognized result (step S6). The recognition result is also output from the speaker (step S7). The user then judges whether the recognition result is correct. If it is correct, the main CPU executes the processing corresponding to the recognition result (step S8). If the recognition result is the telephone number or name of a registered party, the recognized telephone number is dialed. If the recognition result is the name of a function feature or service function and the voice guidance setting is ON, the voice guidance for that function is output from the speaker, and input from the operator following the guidance is accepted. If the recognition result is not correct, the memory dial entries corresponding to the voice recognition are scrolled by operating the scroll key or by voice input, so that the memory dial is searched; the scrolled memory dial entry is displayed on the display unit 3, and the dial name is announced by voice from the speaker 9. As a result, even if a misrecognition occurs, voice recognition processing does not have to be performed again.
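The branching in steps S1 to S8 can be traced with a short sketch. It is an illustration under assumptions: the function `run_voice_command`, its parameters, and the log strings are hypothetical, recognition itself is taken as already done, and the behaviour when no result is determined at step S5 is not described in the text, so the sketch simply stops there.

```python
def run_voice_command(recognized, confirmed, memory_dial, functions, guidance_on=True):
    """Return a trace of the FIG. 2 steps for one already-recognised word.

    recognized  -- recognised word, or None if no result was determined
    confirmed   -- the user's judgment of whether the result is correct
    memory_dial -- dict of registered name -> telephone number
    functions   -- set of function names that can be called by voice
    """
    log = ["S1 key detected", "S2 start tone", "S3 voice input", "S4 recognition"]
    if recognized is None:
        log.append("S5 no result")          # assumed handling; not specified in the text
        return log
    log += [f"S6 display {recognized}", f"S7 speak {recognized}"]
    if not confirmed:
        log.append("scroll memory dial")    # recovery without a new utterance
        return log
    if recognized in memory_dial:
        log.append(f"S8 dial {memory_dial[recognized]}")
    elif recognized in functions:
        suffix = " with guidance" if guidance_on else ""
        log.append(f"S8 run {recognized}{suffix}")
    return log
```

For instance, `run_voice_command("Tanaka", True, {"Tanaka": "0312345678"}, set())` ends with the entry `"S8 dial 0312345678"`.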

[0032] In the embodiment described above, a plurality of function names and service function names are registered as a dictionary for voice recognition. When a function name or service function name is input by voice, the input function name is recognized using the recognition words stored in the storage means, and the recognized result is displayed on the display unit or output from the speaker, so a function can be reliably called by voice input. Also, even when the user does not know how to set a function provided in the device, the function can be set easily because voice guidance for the setting is output. Furthermore, since the user can set whether or not this guidance is output, a user accustomed to setting functions can turn the voice guidance off.

[0033] Furthermore, function names and service function names are prepared in advance as a dictionary for voice recognition. By preparing not one word but a plurality of words as recognition words for each function, the user can call a function without having to memorize a specific word. Further, when a registered recognition word is difficult to recognize by voice, the user can register a new recognition word from the operation unit, which improves the recognition rate.

[0034] Also, because single-sound characters and digits are prepared as the dictionary for voice recognition, the voice recognition unit can recognize the desired party's telephone number or name even if the word input by voice is only a part of that party's name or telephone number. The user therefore no longer needs to remember all of the information of the memory dial entry to be searched.

[0035] Also, by making the voice recognition processing speaker-independent, the device can be used by anyone rather than by one specific user, and the effort of registration is saved.

[0036]

[0037] Also, function names and service function names are prepared in advance as a dictionary for voice recognition. By preparing not one word but a plurality of words as recognition words for each function, the user can call a function without having to memorize a specific word. Further, when a registered recognition word is difficult to recognize by voice, the user can register a new recognition word from the operation unit, which improves the recognition rate.

[0038] Also, because single-sound characters and digits are prepared as the dictionary for voice recognition, the voice recognition means can recognize the desired party's telephone number or name even if the word input by voice is only a part of that party's name or telephone number. The user therefore no longer needs to remember all of the information of the memory dial entry to be searched.

[0039] Also, by making the voice recognition processing speaker-independent, the device can be used by anyone rather than by one specific user, and the effort of registration is saved.

[Brief description of the drawings]

[Explanation of symbols]

2 operation unit
3 display unit
4A storage device
4B ROM
5 main CPU
6 voice recognition unit
8 microphone
9 speaker

Claims

[Claims]

1. A telephone device comprising: voice input means for inputting a voice of an operator; storage means for storing function names of function features and service functions, registered names, and telephone numbers as recognition words for voice recognition; voice recognition means for recognizing the input voice by extracting a feature quantity of the voice input from the voice input means and comparing it with the recognition words stored in the storage means; display means for displaying a recognition result of the voice recognition means; voice output means for outputting the recognition result of the voice recognition means as voice; and control means for controlling each of the above means.

2. The telephone device according to claim 1, further comprising operation input means for inputting settings from the operator, wherein, after the recognition result of the voice recognition means has been displayed on the display means or output by the voice output means, when a scroll instruction is input from the voice input means or the operation input means, the control means performs control to scroll the information stored in the storage means, cause the voice recognition means to perform voice recognition again, and cause the recognition result to be displayed again on the display means or output from the voice output means.

3. The telephone device according to claim 1, wherein, after the recognition result of the voice recognition means has been displayed on the display means or output from the voice output means, when an instruction to execute processing is input from the voice input means or the operation input means, the control means, in accordance with the recognition result of the voice recognition means, performs dial processing for dialing the recognized telephone number if the recognition result is a registered name or telephone number, and performs control for realizing the function feature or service function if the recognition result is a function feature or service function.

4. The telephone device according to claim 1, wherein said storage means stores a plurality of recognition words for one function name.

5. The telephone device according to any one of claims 1 to 4, wherein the storage means stores the registered telephone numbers and names as single-sound characters or digits, and the voice recognition means performs a search using a voice, input from the voice input means, of a part of the name of the other party or a part of the dial number.

6. The telephone device according to any one of claims 1 to 5, wherein the storage means stores voice guidance words indicating the operation setting procedure of each function, and the control means performs control to output, from the voice output means, the voice guidance words indicating the operation procedure of the function input from the voice input means or the operation input means.

7. The telephone device according to claim 6, wherein the control means performs control for switching whether or not to output the voice guidance words from the voice output means in accordance with a setting input from the operation input means.

8. The telephone device according to any one of claims 1 to 7, wherein, when a function name is newly input from the operation input means, the control means performs control for registering the input function name in the storage means.

9. The telephone device according to claim 1, wherein the voice recognition processing by the voice recognition means is performed by an unspecified speaker system.

10. A recording medium recording a program for executing: input voice storage processing for storing a voice input from voice input means; voice recognition processing for extracting a feature quantity of the voice input from the voice input means and recognizing the input voice by comparing the extracted feature quantity with the stored recognition words; display processing for displaying a recognition result of the voice recognition processing on display means; and voice output processing for outputting the recognition result of the voice recognition processing as voice from voice output means.

11. The recording medium recording a program according to claim 10, recording a program for executing: correctness judgment input processing for inputting a judgment of correctness from the operator through the voice input means or operation input means after the display processing or the voice output processing that outputs the recognition result of the voice recognition processing; processing for, upon receiving through the correctness judgment input processing an input indicating that the recognition result is incorrect, performing the voice recognition processing and again displaying its recognition result on the display means by the display processing or outputting it from the voice output means by the voice output processing; and processing for, upon receiving through the correctness judgment input processing an input indicating that the recognition result is correct, acting in accordance with the recognition result of the voice recognition processing, namely executing dial processing for dialing the recognized telephone number if the recognition result is a registered name or telephone number, and executing processing for realizing the function feature or service function if the recognition result is a function feature or service function.

12. A recording medium for recording data, wherein function names of function functions and service functions of the telephone device are recorded as data.

13. The recording medium according to claim 12, wherein a plurality of the recognition words are provided for one function name.

14. A recording medium recording data, characterized in that registered names and telephone numbers are recorded as data of single-sound characters and digits.

15. A recording medium for recording data, wherein guidance words for setting a function function and a service function of a telephone device are recorded as data.