JP5556529B2

JP5556529B2 - In-vehicle speech recognition device

Info

Publication number: JP5556529B2
Application number: JP2010207766A
Authority: JP
Inventors: 聡田中; 剛宏津田; 邦雄横井
Original assignee: Denso Corp
Current assignee: Denso Corp
Priority date: 2010-09-16
Filing date: 2010-09-16
Publication date: 2014-07-23
Anticipated expiration: 2030-09-16
Also published as: JP2012063582A

Description

本発明は、音声認識した音声コマンドに応じた制御信号を制御対象機器へ送出する車載音声認識装置に関するものである。 The present invention relates to an in-vehicle voice recognition device that sends a control signal corresponding to a voice command that has been voice-recognized to a control target device.

従来、車載情報端末においては、安全性や利便性向上のために音声認識機能を用いたＨＭＩ（ＨｕｍａｎＭａｃｈｉｎｅＩｎｔｅｒｆａｃｅ）が多数採用されている。 Conventionally, in an in-vehicle information terminal, many HMIs (Human Machine Interfaces) using a voice recognition function have been adopted in order to improve safety and convenience.

このような車載情報端末として、例えば、音声認識機能を用いて乗員の発話する音声から音声コマンドを認識し、認識結果に応じた動作を行うようにしたものがある（例えば、特許文献１参照）。 As such an in-vehicle information terminal, for example, there is one that recognizes a voice command from a voice spoken by an occupant using a voice recognition function and performs an operation according to a recognition result (for example, see Patent Document 1). .

特開２００３−１５２８８４号公報JP 2003-152848 A

上記特許文献１に記載されたような装置では、音声認識した音声コマンドに対する動作が一律に規定されている。すなわち、音声認識した音声コマンドと、この音声コマンドに対応する動作とが１対１の関係となっている。 In an apparatus such as that described in Patent Document 1, the operations for voice commands that have been voice-recognized are uniformly defined. That is, there is a one-to-one relationship between a voice command that has been voice-recognized and an operation corresponding to this voice command.

しかし、このように音声認識した音声コマンドと、この音声コマンドに対応する動作とが１対１の関係となっている構成では、１つの音声コマンドで１つの動作しか実施することかできないため、利便性の高い機能を実現することはできないといった問題がある。 However, in the configuration in which the voice command recognized in this way and the operation corresponding to the voice command have a one-to-one relationship, only one operation can be performed with one voice command. There is a problem that a highly functional function cannot be realized.

本発明は上記問題に鑑みたもので、より利便性の高い機能を実現できるようにすることを目的とする。 The present invention has been made in view of the above problems, and an object thereof is to realize a more convenient function.

上記目的を達成するため、請求項１に記載の発明は、音声信号入力手段を介して入力される音声信号から音声コマンドを音声認識し、当該音声コマンドに応じた制御信号を制御対象機器へ送出する車載音声認識装置であって、音声信号入力手段の１つは車室内マイクであり、音声信号入力手段の１つは、通信インタフェースを介して接続された携帯型通信機器であり、音声信号入力手段を介して入力される音声信号を予め定められた条件に従って区別し、当該音声信号の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出する制御信号送出手段と、音声信号が入力される音声信号入力手段の区別に従って音声コマンドに応じた制御信号を異なるように規定した情報テーブルを記憶する記憶手段と、を備え、制御信号送出手段は、記憶手段に記憶された情報テーブルを参照して、音声信号が入力される音声信号入力手段の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出することを特徴としている。 In order to achieve the above object, according to the first aspect of the present invention, a voice command is recognized from a voice signal input via the voice signal input means, and a control signal corresponding to the voice command is sent to the control target device. One of the voice signal input means is a vehicle interior microphone, and one of the voice signal input means is a portable communication device connected via a communication interface, and the voice signal input A control signal sending means for distinguishing a voice signal input through the means according to a predetermined condition, sending a control signal according to a voice command according to the distinction of the voice signal to a control target device, and a voice signal Storage means for storing an information table that defines different control signals according to voice commands according to the distinction of voice signal input means to Output means, characterized by referring to the information table stored in the storage unit, that transmits to the control target device by varying a control signal corresponding to the voice command according to the distinction between the speech signal input means to which the audio signal is inputted It is said.

このような構成によれば、音声信号入力手段を介して入力される音声信号を予め定められた条件に従って区別し、当該音声信号の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出するので、１つの音声コマンドであっても音声信号の区別に従って異なる制御信号を制御対象機器へ送出することができ、より利便性の高い機能を実現することができる。
また、音声信号が入力される音声信号入力手段の区別に従って音声コマンドに応じた制御信号を異なるように規定した情報テーブルを参照して、音声信号が入力される音声信号入力手段の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出することができる。
また、音声信号入力手段の１つを車室内マイクとし、音声信号入力手段の１つは、通信インタフェースを介して接続された携帯型通信機器とすることができる。 According to such a configuration, the audio signal input via the audio signal input unit is distinguished according to a predetermined condition, and the control target device is differentiated according to the audio command according to the audio signal distinction. Therefore, even for one voice command, different control signals can be sent to the control target device according to the distinction of the voice signal, and a more convenient function can be realized.
In addition, referring to the information table in which the control signal according to the voice command is defined differently according to the distinction of the voice signal input means to which the voice signal is inputted, the voice command according to the distinction of the voice signal input means to which the voice signal is inputted The control signal corresponding to the signal can be made different and sent to the control target device.
Further, one of the audio signal input means can be a vehicle interior microphone, and one of the audio signal input means can be a portable communication device connected via a communication interface.

また、請求項３に記載の発明は、複数の音声信号入力手段を介して音声信号が別々に入力されるようになっており、制御信号送出手段は、音声信号入力手段を介して入力される音声信号を、音声信号が入力される音声信号入力手段毎に区別することを特徴としている。In the invention according to claim 3, the audio signals are separately input via the plurality of audio signal input means, and the control signal sending means is input via the audio signal input means. The audio signal is distinguished for each audio signal input means to which the audio signal is input.

このように、音声信号入力手段を介して入力される音声信号を、音声信号が入力される音声信号入力手段毎に区別し、音声信号の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出することができる。 In this way, the audio signal input through the audio signal input means is distinguished for each audio signal input means to which the audio signal is input, and control is performed by varying the control signal corresponding to the audio command according to the audio signal distinction. It can be sent to the target device.

また、請求項２に記載の発明は、車室内の乗員の有無を判定する乗員有無判定手段と、乗員有無判定手段により車室内に乗員がいると判定された場合、携帯型通信機器のマイクを介して入力される音声信号より音声認識した音声コマンドに応じた制御信号の送出を禁止する制御信号送出禁止手段と、を備えたことを特徴としている。 According to a second aspect of the present invention, when it is determined by the occupant presence / absence determining means for determining the presence / absence of an occupant in the vehicle interior and the occupant presence / absence determining means, the microphone of the portable communication device is used. Control signal transmission prohibiting means for prohibiting transmission of a control signal in accordance with a voice command recognized by voice from a voice signal input via the voice signal.

このような構成によれば、車室内に乗員がいると判定された場合、携帯型通信機器のマイクを介して入力される音声信号より音声認識した音声コマンドに応じた制御信号の送出が禁止されるので、車室内にいる搭乗者の意に反して携帯型通信機器の使用者によって制御対象機器の設定等が変更されてしまうといった問題を防止することができる。 According to such a configuration, when it is determined that there is an occupant in the vehicle compartment, transmission of a control signal according to a voice command recognized by voice from a voice signal input via the microphone of the portable communication device is prohibited. Therefore, it is possible to prevent the problem that the setting of the control target device is changed by the user of the portable communication device against the intention of the passenger in the passenger compartment.

本発明の第１実施形態に係る車載音声認識装置のブロック構成を示す図である。It is a figure which shows the block configuration of the vehicle-mounted speech recognition apparatus which concerns on 1st Embodiment of this invention. 情報テーブルの構成について説明するための図である。It is a figure for demonstrating the structure of an information table. 本発明の第１実施形態に係る制御部の処理を示すフローチャートである。It is a flowchart which shows the process of the control part which concerns on 1st Embodiment of this invention. 本発明の第２実施形態に係る車載音声認識装置のブロック構成を示す図である。It is a figure which shows the block configuration of the vehicle-mounted speech recognition apparatus which concerns on 2nd Embodiment of this invention. データベースの構成についえ説明するための図である。It is a figure for demonstrating also about the structure of a database. 本発明の第２実施形態に係る制御部の処理を示すフローチャートである。It is a flowchart which shows the process of the control part which concerns on 2nd Embodiment of this invention.

（第１実施形態）
本発明の第１実施形態に係る車載音声認識装置のブロック構成を図１に示す。本車載音声認識装置１は、マイク１０、通信部１２、車内通信制御部１３、ディスプレイ１４、スピーカ１５および制御部１６を備えている。本車載音声認識装置１は、車内通信制御部１３に接続された車内ＬＡＮ５を介してエアコンＥＣＵ２、オーディオＥＣＵ３等と接続されている。 (First embodiment)
FIG. 1 shows a block configuration of the in-vehicle speech recognition device according to the first embodiment of the present invention. The in-vehicle voice recognition device 1 includes a microphone 10, a communication unit 12, an in-vehicle communication control unit 13, a display 14, a speaker 15, and a control unit 16. The in-vehicle voice recognition device 1 is connected to an air conditioner ECU 2, an audio ECU 3, and the like via an in-vehicle LAN 5 connected to the in-vehicle communication control unit 13.

マイク１０は、車室内の音声を集音するためのものであり、車室内の音声に応じた音声信号を制御部１６へ送出する。 The microphone 10 is for collecting sound in the vehicle interior and sends out an audio signal corresponding to the sound in the vehicle interior to the control unit 16.

通信部１２は、無線通信網を介して外部通信機器と通信するためのものである。本車載音声認識装置１は、通信部１２を介して携帯電話４と通信することが可能となっている。 The communication unit 12 is for communicating with an external communication device via a wireless communication network. The in-vehicle voice recognition device 1 can communicate with the mobile phone 4 via the communication unit 12.

車内通信制御部１３は、車内ＬＡＮ５を介して接続されたエアコンＥＣＵ２、オーディオＥＣＵ３等の制御対象機器と通信するためのものである。 The in-vehicle communication control unit 13 is for communicating with control target devices such as the air conditioner ECU 2 and the audio ECU 3 connected via the in-vehicle LAN 5.

制御部１６は、この車内通信制御部１３を介してエアコンＥＣＵ２およびオーディオＥＣＵ３等の制御対象機器を制御することが可能となっている。 The control unit 16 can control controlled devices such as the air conditioner ECU 2 and the audio ECU 3 through the in-vehicle communication control unit 13.

ディスプレイ１４は、液晶等の表示部を有しており、制御部１６より入力される映像信号に応じた映像を表示部に表示させる。 The display 14 has a display unit such as a liquid crystal, and displays a video corresponding to the video signal input from the control unit 16 on the display unit.

スピーカ１５は、車室内に取り付けられており、制御部１６より入力される音声信号に応じた音声を出力する。 The speaker 15 is attached to the vehicle interior and outputs sound corresponding to the sound signal input from the control unit 16.

制御部１６は、音声認識処理を行うための音声認識エンジン１６ａと、ＲＯＭ、ＲＡＭ、フラッシュメモリから成る記憶部１６ｂと、ＣＰＵ（図示せず）を備えたコンピュータとして構成されている。ＣＰＵは、記憶部１６ｂに記憶されたプログラムに従って各種処理を実施する。 The control unit 16 is configured as a computer including a speech recognition engine 16a for performing speech recognition processing, a storage unit 16b including a ROM, a RAM, and a flash memory, and a CPU (not shown). The CPU performs various processes according to the program stored in the storage unit 16b.

制御部１６は、マイク１０あるいは携帯電話４を介して入力される音声信号から音声コマンドを音声認識し、当該音声コマンドに応じた制御信号を各制御対象機器２、３へ送出する。 The control unit 16 recognizes a voice command from a voice signal input via the microphone 10 or the mobile phone 4, and sends a control signal corresponding to the voice command to the control target devices 2 and 3.

本車載音声認識装置１の制御部１６には、マイク１０より音声信号が入力されるとともに、携帯電話４より音声信号が入力されるようになっている。制御部１６は、マイク１０より入力される音声信号から音声コマンドを音声認識することも、携帯電話４より入力される音声信号から音声コマンドを音声認識することも可能となっている。 An audio signal is input from the microphone 10 and an audio signal is input from the mobile phone 4 to the control unit 16 of the in-vehicle voice recognition device 1. The control unit 16 can recognize a voice command from a voice signal input from the microphone 10 or can recognize a voice command from a voice signal input from the mobile phone 4.

本実施形態における制御部１６は、マイク１０より入力される音声信号と携帯電話４より入力される音声信号を区別し、マイク１０より入力される音声信号から音声認識した音声コマンドと携帯電話４より入力される音声信号から音声認識した音声コマンドが同一であっても、マイク１０より入力される音声信号から音声認識した音声コマンドに応じて送出する制御信号と、携帯電話４より入力される音声信号から音声認識した音声コマンドに応じて送出する制御信号とを異ならせることが可能となっている。 The control unit 16 in the present embodiment distinguishes between the voice signal input from the microphone 10 and the voice signal input from the mobile phone 4, and the voice command recognized from the voice signal input from the microphone 10 and the mobile phone 4. Even if the voice command recognized from the input voice signal is the same, the control signal sent in response to the voice command recognized from the voice signal inputted from the microphone 10 and the voice signal inputted from the mobile phone 4 Thus, it is possible to make the control signal to be transmitted different in accordance with the voice command recognized from the voice.

制御部１６の記憶部１６ｂには、音声信号の区別に従って音声コマンドに応じた動作、すなわち音声コマンドに応じて送出する制御信号を異ならせるように規定した情報テーブルが記憶されている。 The storage unit 16b of the control unit 16 stores an information table that defines an operation according to the voice command according to the distinction of the voice signal, that is, a control signal transmitted according to the voice command.

制御部１６は、記憶部１６ｂに記憶された情報テーブルを参照してマイク１０より入力される音声信号から音声認識した音声コマンドに応じて送出する制御信号と、携帯電話４より入力される音声信号から音声認識した音声コマンドに応じて送出する制御信号とを異ならせて送出する。 The control unit 16 refers to the information table stored in the storage unit 16b, and transmits a control signal sent in response to a voice command recognized from a voice signal input from the microphone 10, and a voice signal input from the mobile phone 4. The control signal to be sent out in response to the voice command recognized from the voice is sent out differently.

情報テーブルの構成を図２に示す。この図に示すように、情報テーブルには、各音声コマンドに対する動作が入力インタフェース毎に規定されている。 The configuration of the information table is shown in FIG. As shown in this figure, the operation for each voice command is defined for each input interface in the information table.

例えば、「エアコンオン」という音声コマンドに対する動作について、入力インタフェースがマイク１０の場合には「前回使用時の送風箇所からの送風でエアコンをオン」すると規定され、入力インタフェースが携帯電話４の場合には「全ての送風箇所からの送風でエアコンをオン」すると規定されている。 For example, regarding the operation for the voice command “air conditioner on”, when the input interface is the microphone 10, it is defined that “the air conditioner is turned on by blowing air from the blowing portion at the previous use”, and the input interface is the mobile phone 4. Stipulates that “the air conditioner is turned on by blowing air from all the air blowing locations”.

したがって、車室内でマイク１０に向かって「エアコンオン」と発声した場合には、車両の前回使用時の送風箇所からの送風でエアコンがオンするような制御信号がエアコンＥＣＵ２に送出され、例えば、車両に乗り込む前に車両から離れた場所から携帯電話４を用いて車載音声認識装置１に通信接続して「エアコンオン」と発声した場合には、車両の全ての送風箇所からの送風でエアコンがオンするような制御信号がエアコンＥＣＵ２に送出される。 Therefore, when “air conditioner on” is uttered toward the microphone 10 in the passenger compartment, a control signal is sent to the air conditioner ECU 2 so that the air conditioner is turned on by blowing air from the blowing area at the previous use of the vehicle. If the mobile phone 4 is used for communication connection to the in-vehicle speech recognition device 1 from a place away from the vehicle before getting into the vehicle and “air-conditioner is turned on” is uttered, the air conditioner is blown from all the air blowing locations of the vehicle. A control signal that turns on is sent to the air conditioner ECU 2.

また、「インフォメーション」という音声コマンドに対する動作について、入力インタフェースがマイク１０の場合には「交通情報、ガソリン価格等の情報を提供」すると規定され、入力インタフェースが携帯電話４の場合には「鍵が掛かっているかどうか、窓の開閉状態等の情報を提供」すると規定されている。 Further, regarding the operation for the voice command “information”, it is defined that “information such as traffic information and gasoline price” is provided when the input interface is the microphone 10, and “key is locked” when the input interface is the mobile phone 4. "Provides information such as whether it is hung and the open / closed state of the window".

したがって、車室内でマイク１０に向かって「インフォメーション」と発声した場合には、交通情報、ガソリン価格等の情報を表示させるような制御信号がオーディオＥＣＵ３に送出され、例えば、車両から離れた場所から携帯電話４を用いて車載音声認識装置１に通信接続して「インフォメーション」と発声した場合には、「鍵が掛かっているかどうか」、「窓の開閉状態」等の情報が携帯電話４へ通知される。 Therefore, when “information” is uttered toward the microphone 10 in the passenger compartment, a control signal for displaying information such as traffic information and gasoline price is sent to the audio ECU 3, for example, from a place away from the vehicle. When the mobile phone 4 is connected to the in-vehicle speech recognition apparatus 1 and utters “information”, information such as “whether it is locked”, “open / closed state of window”, etc. is notified to the mobile phone 4 Is done.

次に、図３に従って制御部１６の処理について説明する。本車載音声認識装置１は、車両のイグニッションスイッチがオン状態になると通常モードで動作するようになり、車両のイグニッションスイッチがオフ状態になると低消費電力モードで動作するようになる。制御部１６は、音声認識の開始を指示するためのＰＴＴ（ＰｕｓｈＴａｌｋＳｗｉｔｃｈ）の押下操作、あるいは、外部機器からの着呼があると、動作モードと関係なく図３に示す処理を実施する。 Next, processing of the control unit 16 will be described with reference to FIG. The vehicle-mounted speech recognition device 1 operates in the normal mode when the vehicle ignition switch is turned on, and operates in the low power consumption mode when the vehicle ignition switch is turned off. When there is an operation of pressing a PTT (Push Talk Switch) for instructing the start of voice recognition or an incoming call from an external device, the control unit 16 performs the process shown in FIG. 3 regardless of the operation mode.

まず、入力インタフェースの判定を行う（Ｓ１００）。本実施形態では、入力インタフェースとしてマイク１０が用いられているか携帯電話４が用いられているかを判定する。 First, the input interface is determined (S100). In the present embodiment, it is determined whether the microphone 10 or the mobile phone 4 is used as the input interface.

ここで、入力インタフェースとしてマイク１０が用いられている場合、Ｓ１００の判定は「マイク」となり、次に、マイクを入力インタフェースとした設定にする（Ｓ１０２）。具体的には、ＲＡＭの予め定められた領域に入力インタフェースがマイクであることを示すフラグを設定する。 Here, when the microphone 10 is used as the input interface, the determination in S100 is “microphone”, and then the microphone is set as the input interface (S102). Specifically, a flag indicating that the input interface is a microphone is set in a predetermined area of the RAM.

次に、コマンド発話を促すガイダンスを再生する（Ｓ１０４）。例えば、「音声コマンドを発声して下さい」といったガイダンスをスピーカ１５より音声出力させる。 Next, the guidance for prompting command utterance is reproduced (S104). For example, a guidance such as “Please speak a voice command” is output from the speaker 15.

このガイダンスに従ってユーザの発話が行われると、次に、音声認識処理を実施する（Ｓ１０６）。例えば、ユーザが「エアコンオン」と発話すると、「エアコンオン」という音声コマンドが音声認識される。 If the user utters according to this guidance, then a speech recognition process is performed (S106). For example, when the user speaks “air conditioner on”, a voice command “air conditioner on” is recognized.

次に、認識結果に応じた制御信号を送出する（Ｓ１０８）。具体的には、Ｓ１０２にて設定したフラグを参照して、入力インタフェースとしてマイク１０が用いられているか携帯電話４が用いられているかを区別するとともに、情報テーブルを参照して、音声信号が入力される入力インタフェースの区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出する。 Next, a control signal corresponding to the recognition result is sent (S108). Specifically, referring to the flag set in S102, it is discriminated whether the microphone 10 or the mobile phone 4 is used as an input interface, and an audio signal is input by referring to the information table. The control signal corresponding to the voice command is made different according to the distinction of the input interface to be transmitted to the control target device.

上記したように、入力インタフェースとしてマイク１０が用いられており、Ｓ１０６にて「エアコンオン」という音声コマンドが音声認識された場合、車両の全ての送風箇所からの送風でエアコンがオンするような制御信号がエアコンＥＣＵ２に送出される。 As described above, when the microphone 10 is used as the input interface, and the voice command “air conditioner on” is recognized in S106, the air conditioner is turned on by the air blowing from all the air blowing portions of the vehicle. A signal is sent to the air conditioner ECU 2.

また、入力インタフェースとしてマイク１０が用いられており、Ｓ１０６にて「インフォメーション」という音声コマンドが音声認識された場合、交通情報やガソリン価格等の情報を表示させるような制御信号がオーディオＥＣＵ３に送出され、本処理を終了する。 In addition, when the microphone 10 is used as an input interface and a voice command “information” is recognized in S106, a control signal for displaying information such as traffic information and gasoline price is sent to the audio ECU 3. This process is terminated.

また、入力インタフェースとして携帯電話４が用いられている場合、Ｓ１００の判定は「携帯電話」となり、次に、搭乗者が１人でもいるか否かを判定する（Ｓ１１０）。具体的には、車両の各座席の着座面に乗員の座席への着座を検出する着座センサが設けられており、これらの着座センサより出力される信号に基づいて搭乗者が１人でもいるか否かを判定する。 When the mobile phone 4 is used as an input interface, the determination in S100 is “mobile phone”, and then it is determined whether or not there is even one passenger (S110). Specifically, seating sensors for detecting seating of occupants on the seating surface of each seat of the vehicle are provided, and whether or not there is even one passenger based on signals output from these seating sensors. Determine whether.

ここで、搭乗者が１人もいない場合、Ｓ１１０の判定はＮＯとなり、次に、携帯電話４を入力インタフェースとして設定にする（Ｓ１１２）。具体的には、ＲＡＭの予め定められた領域に入力インタフェースが携帯電話４であることを示すフラグを設定し、Ｓ１０４へ進む。 Here, if there is no passenger, the determination in S110 is NO, and then the mobile phone 4 is set as an input interface (S112). Specifically, a flag indicating that the input interface is the mobile phone 4 is set in a predetermined area of the RAM, and the process proceeds to S104.

この場合、Ｓ１０４にてコマンド発話を促すガイダンスが再生され、このガイダンスに従って、例えば、ユーザが「エアコンオン」と発話すると、「エアコンオン」という音声コマンドが音声認識され、Ｓ１０８にて、認識結果に応じた制御信号が送出される。 In this case, the guidance for prompting the command utterance is reproduced in S104. According to this guidance, for example, when the user utters “air conditioner on”, the voice command “air conditioner on” is recognized and the recognition result is obtained in S108. A corresponding control signal is sent out.

ここで、入力インタフェースとして携帯電話４が用いられており、Ｓ１０６にて「エアコンオン」という音声コマンドが音声認識された場合、車両の全ての送風箇所からの送風でエアコンがオンするような制御信号がエアコンＥＣＵ２に送出される。 Here, when the cellular phone 4 is used as an input interface, and the voice command “air conditioner on” is recognized in S106, a control signal that turns on the air conditioner by blowing air from all the blowing locations of the vehicle. Is sent to the air conditioner ECU 2.

また、入力インタフェースとして携帯電話４が用いられており、Ｓ１０６にて「インフォメーション」という音声コマンドが音声認識された場合、「鍵が掛かっているかどうか」、「窓の開閉状態」等の情報が携帯電話４へ通知される。 In addition, when the mobile phone 4 is used as an input interface and the voice command “Information” is recognized in S106, information such as “whether it is locked”, “open / closed state of window”, etc. is carried. The telephone 4 is notified.

また、入力インタフェースとして携帯電話４が用いられており、搭乗者が１人でもいる場合、Ｓ１１０の判定はＹＥＳとなり、次に、携帯電話でのアクセスを禁止する（Ｓ１１４）。具体的には、携帯電話４のマイクにより集音された音声に応じた音声コマンドに応じた制御信号の送出を禁止する。これにより、車室内にいる搭乗者の意に反して携帯電話４を所有する他者によって勝手に制御対象機器の設定等が変更されてしまうといったことが防止される。 If the mobile phone 4 is used as the input interface and there is even one passenger, the determination in S110 is YES, and access by the mobile phone is prohibited (S114). Specifically, transmission of a control signal according to a voice command according to the voice collected by the microphone of the mobile phone 4 is prohibited. Thereby, it is prevented that the setting etc. of a control object apparatus will be changed without permission by the other person who owns the mobile telephone 4 against the will of the passenger in a vehicle interior.

次に、音声認識終了のガイダンスを再生して（Ｓ１１６）、本処理を終了する。 Next, the voice recognition end guidance is reproduced (S116), and this process is ended.

上記した構成によれば、音声信号入力手段を介して入力される音声信号を予め定められた条件に従って区別し、当該音声信号の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出するので、１つの音声コマンドであっても音声信号の区別に従って異なる制御信号を制御対象機器へ送出することができ、より利便性の高い機能を実現することができる。 According to the above-described configuration, the audio signal input via the audio signal input unit is distinguished according to a predetermined condition, and the control signal according to the audio command is changed according to the audio signal to the control target device. Since it is transmitted, even if it is one voice command, different control signals can be sent to the control target device according to the distinction of the voice signal, and a more convenient function can be realized.

具体的には、入力される音声信号を、音声信号が入力される音声信号入力手段毎に区別し、音声信号の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出することができる。 Specifically, the input audio signal is distinguished for each audio signal input means to which the audio signal is input, and the control signal corresponding to the audio command is differentiated according to the audio signal and sent to the control target device. Can do.

また、音声信号が入力される音声信号入力手段の区別に従って音声コマンドに応じた制御信号を異なるように規定した情報テーブルを参照して、音声信号が入力される音声信号入力手段の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出することができる。 In addition, referring to the information table in which the control signal according to the voice command is defined differently according to the distinction of the voice signal input means to which the voice signal is inputted, the voice command according to the distinction of the voice signal input means to which the voice signal is inputted The control signal corresponding to the signal can be made different and sent to the control target device.

また、音声信号入力手段の１つを車室内マイクとし、音声信号入力手段の１つは、通信インタフェースを介して接続された携帯型通信機器とすることができる。 Further, one of the audio signal input means can be a vehicle interior microphone, and one of the audio signal input means can be a portable communication device connected via a communication interface.

また、車室内に乗員がいると判定された場合、携帯型通信端末のマイクを介して入力される音声信号より音声認識した音声コマンドに応じた制御信号の送出が禁止されるので、車室内にいる搭乗者の意に反して携帯型通信端末の使用者によって制御対象機器の設定等が変更されてしまうといった問題を防止することができる。 In addition, when it is determined that there is an occupant in the passenger compartment, transmission of a control signal according to a voice command recognized by voice from an audio signal input via the microphone of the portable communication terminal is prohibited. It is possible to prevent the problem that the setting of the control target device is changed by the user of the portable communication terminal against the intention of the passenger.

（第２実施形態）
本実施形態に係る車載音声認識装置の構成を図４に示す。本車載音声認識装置１は、マイク１０、車内通信制御部１３、ディスプレイ１４、スピーカ１５および制御部１６を備えている。本車載音声認識装置１は、通信部１２を介して携帯電話４と通信することが可能となっている。なお、上記第１実施形態と同一部分には同一符号を付して説明を省略し、以下、異なる部分を中心に説明する。 (Second Embodiment)
The configuration of the in-vehicle speech recognition device according to this embodiment is shown in FIG. The in-vehicle voice recognition device 1 includes a microphone 10, an in-vehicle communication control unit 13, a display 14, a speaker 15, and a control unit 16. The in-vehicle voice recognition device 1 can communicate with the mobile phone 4 via the communication unit 12. In addition, the same code | symbol is attached | subjected to the same part as the said 1st Embodiment, description is abbreviate | omitted and it demonstrates centering on a different part hereafter.

本実施形態では、マイク１０より出力される音声信号が制御部１６へ直接入力されるようになっている。 In the present embodiment, an audio signal output from the microphone 10 is directly input to the control unit 16.

上記第１実施形態では、入力される音声信号を、マイク１０より入力されたものであるか携帯電話４より入力されたものであるかによって区別し、この区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出したが、本実施形態では、入力される音声信号を解析して音声のトーンを区別し、この区別に従って音声コマンドに応じて制御対象機器へ送出する制御信号を異ならせるようになっている。 In the first embodiment, the input voice signal is distinguished depending on whether it is input from the microphone 10 or input from the mobile phone 4, and the control signal corresponding to the voice command is determined according to this distinction. In this embodiment, the input audio signal is analyzed to distinguish voice tones, and the control signal sent to the control target device according to the voice command is different according to this distinction. It comes to let you.

本実施形態における制御部１６は、マイク１０を介して入力される音声信号を解析して音声のトーンを区別するとともに、当該区別した音声のトーンと制御対象機器の設定内容を関連付けして蓄積記憶したデータベースを作成して記憶部１６ｂに蓄積記憶させるとともに、このデータベースを参照して、解析した音声のトーンと同じトーンで過去に高い頻度で設定された設定内容を特定し、この特定した操作内容を示す音声コマンドを示す制御信号を制御対象機器へ送出する処理を実施する。この処理については後で詳細に説明する。 The control unit 16 according to the present embodiment analyzes a voice signal input via the microphone 10 to distinguish a voice tone, and stores and stores the distinguished voice tone and the setting contents of the control target device in association with each other. The stored database 16b is stored and stored in the storage unit 16b, the setting content set in the past with the same tone as the analyzed voice tone is specified with reference to the database, and the specified operation content is specified. A process of sending a control signal indicating a voice command indicating to the control target device is performed. This process will be described later in detail.

図５に、データベースの構成を示す。図に示すように、ユーザ毎に、かつ、音声コマンドを複数のトーンパターンに分類して、音声コマンドと動作履歴を関連付けしたデータベースが作成される。 FIG. 5 shows the configuration of the database. As shown in the figure, a database in which voice commands are classified into a plurality of tone patterns and associated with voice commands and operation histories is created for each user.

なお、トーンパターンは、音声のトーンの高さを表している。本実施形態では、音声コマンドの周波数を解析して、音声コマンドのトーンパターンを特定する。なお、「エアコンオン」という音声コマンドについては、Ａ、Ｂ、Ｃの３つのトーンパターンに分類して音声コマンドと動作履歴が関連付けてデータベースに蓄積記憶され、「オーディオオン」という音声コマンドについてはＡ、Ｂの２つのトーンパターンに分類して音声コマンドと動作履歴が関連付けてデータベースに蓄積記憶される。なお、トーンパターンＡ、Ｂ、Ｃの順にトーンが低くなる。 The tone pattern represents the tone level of the voice. In the present embodiment, the tone pattern of the voice command is specified by analyzing the frequency of the voice command. The voice command “air conditioner on” is classified into three tone patterns A, B, and C, and the voice command and the operation history are associated with each other and accumulated and stored in the database. , B are classified into two tone patterns, and voice commands and operation histories are associated and stored in a database. Note that the tone decreases in the order of tone patterns A, B, and C.

例えば、ユーザＸが、「エアコンオン」という音声コマンドに続いてエアコンの設定温度の入力を示す「設定温度変更」、「２５℃」という音声コマンドをトーンパターンＡに分類されるトーンで発声すると、図５に示すように「エアコンオン」という音声コマンドに対して、ユーザＸ、トーンパターンＡ、動作履歴２５℃の項目の登録回数が１つ増加するようになっている。 For example, when the user X utters the voice command “set temperature change” and “25 ° C.” indicating the input of the set temperature of the air conditioner following the voice command “air conditioner on” with a tone classified into the tone pattern A, As shown in FIG. 5, the number of registrations of the items of user X, tone pattern A, and operation history 25 ° C. is increased by one for the voice command “air conditioner ON”.

また、ユーザＸの体調が悪く、「エアコンオン」という音声コマンドに続いて、「設定温度変更」、「２８℃」という音声コマンドをトーンパターンＢに分類されるトーンで発声すると、図５に示すように「エアコンオン」という音声コマンドに対して、ユーザＸ、トーンパターンＢ、動作履歴２８℃の項目の登録回数が１つ増加するようになっている。 Further, when the user X is in a poor physical condition and the voice command “set temperature change” and “28 ° C.” are uttered by the tone classified into the tone pattern B following the voice command “air conditioner ON”, it is shown in FIG. Thus, the number of registrations of the items of user X, tone pattern B, and operation history 28 ° C. is increased by one for the voice command “air conditioner ON”.

また、ユーザがＹさんとなっている場合には、ユーザＹの音声コマンドと動作履歴を関連付けしたデータベースが蓄積記憶される。 When the user is Mr. Y, a database in which the voice command of the user Y and the operation history are associated is stored.

また、「オーディオオン」という音声コマンドについては、ユーザ毎に、かつ、音声コマンドをトーンパターンＡ、Ｂに分類して、音声コマンドと動作履歴を関連付けしたデータベースが作成される。 For the voice command “audio on”, a database in which the voice command is classified into tone patterns A and B for each user and the voice command and the operation history are associated is created.

次に、制御部１６の処理について説明する。図６に、制御部１６のフローチャートを示す。本実施形態に係る制御部１６は、音声認識の開始を指示するためのＰＴＴ（ＰｕｓｈＴａｌｋＳｗｉｔｃｈ）が押下操作されると、図６に示す処理を実施する。 Next, processing of the control unit 16 will be described. FIG. 6 shows a flowchart of the control unit 16. When a PTT (Push Talk Switch) for instructing the start of speech recognition is pressed, the control unit 16 according to the present embodiment performs the process illustrated in FIG.

まず、ユーザ名の発話を促すガイダンスを生成する（Ｓ２００）。例えば、「ユーザ名を発声して下さい」といったガイダンスをスピーカ１５より音声出力させる。 First, a guidance for prompting a user name is generated (S200). For example, a guidance such as “Please speak your user name” is output from the speaker 15.

このガイダンスに従ってユーザ名の発話が行われると、次に、音声認識処理を実施する（Ｓ２０２）。例えば、ユーザが「Ｘ」と発話すると、「Ｘ」というユーザ名が音声認識される。 When the user name is uttered according to this guidance, a voice recognition process is performed (S202). For example, when the user utters “X”, the user name “X” is recognized by voice.

次に、音声認識したユーザ名に従って使用ユーザを設定する（Ｓ２０４）。Ｓ２０２にて「Ｘ」というユーザ名が音声認識された場合、使用ユーザを「Ｘ」として登録する。 Next, a user to be used is set according to the user name that has been recognized (S204). When the user name “X” is recognized by voice in S202, the user in use is registered as “X”.

次に、コマンド発話を促すガイダンスを再生する（Ｓ２０６）。例えば、「音声コマンドを発声して下さい」といったガイダンスをスピーカ１５より音声出力させる。 Next, guidance for prompting command utterance is reproduced (S206). For example, a guidance such as “Please speak a voice command” is output from the speaker 15.

このガイダンスに従ってユーザの発話が行われると、次に、音声認識処理を実施する（Ｓ２０８）。例えば、「エアコンオン」という音声コマンドが発声されると、「エアコンオン」という音声コマンドが音声認識される。 When the user utters according to this guidance, next, speech recognition processing is performed (S208). For example, when a voice command “air conditioner on” is uttered, the voice command “air conditioner on” is recognized.

次に、マイク１０より入力される音声信号を解析して音声コマンドのトーンを検出する（Ｓ２１０）。ここでは、「エアコンオン」という音声コマンドのトーンパターンを特定する。 Next, the voice signal input from the microphone 10 is analyzed to detect the tone of the voice command (S210). Here, the tone pattern of the voice command “air conditioner on” is specified.

次に、トーンパターンに対応した動作履歴内で登録回数が規定値を超えているものがあるか否かを判定する（Ｓ２１２）。具体的には、データベースを参照して、Ｓ２０８にて認識した音声コマンドで、かつ、Ｓ２１０にて特定されたトーンパターンで登録されている設定温度で、規定値を超えて登録されているものがあるか否かを判定する。 Next, it is determined whether or not there is an operation history corresponding to the tone pattern whose registration count exceeds a specified value (S212). Specifically, the voice command recognized in S208 with reference to the database and the preset temperature registered with the tone pattern specified in S210 and registered exceeding the specified value. It is determined whether or not there is.

ここで、Ｓ２０８にて認識した音声コマンドで、かつ、Ｓ２１０にて特定されたトーンパターンで登録されている設定温度で、規定値を超えて登録されているものがない場合、Ｓ２１２の判定はＮＯとなり、エアコンの設定温度を標準温度設定にする（Ｓ２２０）。本実施形態では、２５℃が標準温度となっている。 Here, if there is no voice command recognized in S208 and the set temperature registered with the tone pattern specified in S210 that exceeds the specified value, the determination in S212 is NO. Thus, the set temperature of the air conditioner is set to the standard temperature setting (S220). In this embodiment, 25 ° C. is the standard temperature.

次に、設定温度の変更指示があるか否かを判定する（Ｓ２２２）。具体的には、「設定温度変更」という音声コマンドを音声認識した否かに基づいて設定温度の変更指示があるか否かを判定する。 Next, it is determined whether there is an instruction to change the set temperature (S222). Specifically, it is determined whether or not there is an instruction to change the set temperature based on whether or not the voice command “set temperature change” has been recognized.

ここで、「設定温度変更」という音声コマンドが発声され、「設定温度変更」という音声コマンドを音声認識した場合、Ｓ２２２の判定はＹＥＳとなり、次に、音声認識処理を実施する（Ｓ２２４）。ここで、ユーザＸが、「２７℃」と発声すると、「２７℃」という音声コマンドを音声認識する。 Here, when the voice command “change in set temperature” is uttered and the voice command “change in set temperature” is voice-recognized, the determination in S222 is YES, and then the voice recognition process is performed (S224). Here, if the user X utters “27 ° C.”, the voice command “27 ° C.” is recognized.

次に、ユーザＸが発声した音声コマンドのトーンを検出する（Ｓ２２６）。ここでは、「２７℃」という音声コマンドのトーンパターンを特定する。このように、Ｓ２１０とは別に、再度、音声コマンドのトーンパターンを特定することで、音声コマンドのトーンの検出精度を高くしている。 Next, the tone of the voice command uttered by the user X is detected (S226). Here, the tone pattern of the voice command “27 ° C.” is specified. Thus, apart from S210, the tone pattern of the voice command is specified again to increase the accuracy of the voice command tone detection.

次に、トーンパターンと設定温度を関連付けてデータベースに登録する（Ｓ２２８）。例えば、Ｓ２２４にて「２７℃」という音声コマンドが音声認識されており、Ｓ２２６にてトーンパターンがＡとして特定されている場合、ユーザＸに対して、トーンパターンＡと設定温度「２７℃」を関連付けてデータベースに登録する。なお、同条件のものが既にデータベースに登録されている場合には、同条件の登録回数に１を加算する。 Next, the tone pattern and the set temperature are associated and registered in the database (S228). For example, when the voice command “27 ° C.” is recognized in S224 and the tone pattern is specified as A in S226, the tone pattern A and the set temperature “27 ° C.” are set to the user X. Associate and register in the database. If the same condition is already registered in the database, 1 is added to the number of registrations under the same condition.

次に、認識結果に応じた制御信号をエアコンＥＣＵ２へ送信する（Ｓ２１８）。ここでは、エアコンの設定温度が２７℃になるような制御信号をエアコンＥＣＵ２へ送信する。
Next, a control signal corresponding to the recognition result is transmitted to the air conditioner ECU 2 (S218). Here, a control signal is sent to the air conditioner ECU 2 so that the set temperature of the air conditioner becomes 27 ° C.

また、「設定温度変更」という音声コマンドが一定時間内に発声されない場合、Ｓ２２２の判定はＮＯとなり、Ｓ２１０にて検出されたトーンパターンを標準温度と関連付けてデータベースに登録する（Ｓ２３０）。例えば、Ｓ２１０にて「エアコンオン」という音声コマンドのトーンパターンがＣとして特定されている場合、トーンパターンを標準温度（２５℃）と関連付けてデータベースに登録する。 If the voice command “change set temperature” is not uttered within a predetermined time, the determination in S222 is NO, and the tone pattern detected in S210 is registered in the database in association with the standard temperature (S230). For example, when the tone pattern of the voice command “air conditioner on” is specified as C in S210, the tone pattern is registered in the database in association with the standard temperature (25 ° C.).

次に、認識結果に応じた制御信号をエアコンＥＣＵ２へ送信する（Ｓ２１８）。ここでは、設定温度が標準温度（２５℃）で、エアコンをオンする制御信号をエアコンＥＣＵ２へ送信する。 Next, a control signal corresponding to the recognition result is transmitted to the air conditioner ECU 2 (S218). Here, the set temperature is the standard temperature (25 ° C.), and a control signal for turning on the air conditioner is transmitted to the air conditioner ECU 2.

また、Ｓ２０８にて、ユーザＸによる「エアコンオン」という音声コマンドが音声認識され、Ｓ２１０にて、「エアコンオン」という音声コマンドのトーンパターンを特定し、データベースにＳ２０８にて認識した音声コマンドで、かつ、Ｓ２１０にて特定されたトーンパターンで登録されている設定温度で、規定値を超えて登録されているものがある場合、Ｓ２１２の判定はＹＥＳとなり、次に、最多の設定温度に設定する（Ｓ２１３）。すなわち、規定値を超えて登録されているものの中で、最多の設定温度にする。 In S208, the voice command “air conditioner on” by the user X is recognized as voice, and in S210, the tone pattern of the voice command “air conditioner on” is specified, and the voice command recognized in S208 in the database, If there is a set temperature registered with the tone pattern specified in S210 that exceeds the specified value, the determination in S212 is YES, and then the highest set temperature is set. (S213). That is, the highest set temperature among the registered values exceeding the specified value is set.

従って、図５に示したデータベース構成となっている場合、ユーザＸが「エアコンオン」という音声コマンドをトーンパターンＡに分類されるトーンで発声した場合、登録回数の最も多い２５℃を設定温度として設定する。 Therefore, in the case of the database configuration shown in FIG. 5, when the user X utters the voice command “air conditioner ON” with a tone classified as tone pattern A, the preset temperature is 25 ° C. Set.

また、ユーザＸが「エアコンオン」という音声コマンドをトーンパターンＢに分類されるトーンで発声した場合、トーンパターンＢで、規定値以上、かつ、最も登録回数の多い２８℃を設定温度として設定する。 Further, when the user X utters a voice command “air conditioner ON” with a tone classified into the tone pattern B, the set temperature is set to 28 ° C., which is the specified value or more and the most frequently registered, in the tone pattern B. .

次に、設定温度の変更があるか否かを判定する（Ｓ２１４）。ここで、ユーザ操作により設定温度が変更されると、Ｓ２１４の判定はＹＥＳとなり、Ｓ２２４へ進む。また、設定温度が変更されない場合、Ｓ２１４の判定はＮＯとなり、次に、音声コマンドとトーンパターンを関連付けてデータベースを更新する（Ｓ２１５）。例えば、ユーザＸに対して、「エアコンオン」という音声コマンドとトーンパターンを関連付けてデータベースに登録する。なお、同条件のものが既にデータベースに登録されている場合には、同条件の登録回数に１を加算する。 Next, it is determined whether there is a change in the set temperature (S214). Here, if the set temperature is changed by a user operation, the determination in S214 is YES, and the process proceeds to S224. If the set temperature is not changed, the determination in S214 is NO, and the database is then updated by associating the voice command with the tone pattern (S215). For example, for the user X, a voice command “air conditioner ON” and a tone pattern are associated with each other and registered in the database. If the same condition is already registered in the database, 1 is added to the number of registrations under the same condition.

次に、認識結果に応じた制御信号をエアコンＥＣＵ２へ送信する（Ｓ２１８）。ユーザＸが「エアコンオン」という音声コマンドをトーンパターンＡに分類されるトーンで発声した場合には、設定温度を２５℃として、エアコンをオンする制御信号をエアコンＥＣＵ２へ送信する。また、ユーザＸが「エアコンオン」という音声コマンドをトーンパターンＢに分類されるトーンで発声した場合には、設定温度を２８℃として、エアコンをオンする制御信号をエアコンＥＣＵ２へ送信する。 Next, a control signal corresponding to the recognition result is transmitted to the air conditioner ECU 2 (S218). When the user X utters a voice command “air conditioner on” with a tone classified as the tone pattern A, the control signal for turning on the air conditioner is transmitted to the air conditioner ECU 2 at a set temperature of 25 ° C. When the user X utters the voice command “air conditioner on” with a tone classified as the tone pattern B, the control signal for turning on the air conditioner is transmitted to the air conditioner ECU 2 with the set temperature set at 28 ° C.

上記したように、音声信号入力手段を介して入力される音声信号を解析して音声の特徴を特定する音声特徴特定手段を備え、音声信号入力手段を介して入力される音声信号を、音声特徴特定手段により特定された音声の特徴に従って区別し、音声の特徴の区別に従って音声コマンドに応じた制御信号を異ならせて制御対象機器へ送出することができる。 As described above, the audio signal specifying unit that analyzes the audio signal input through the audio signal input unit and specifies the audio feature is provided, and the audio signal input through the audio signal input unit is converted into the audio feature. It is possible to discriminate according to the voice feature specified by the specifying means, and to send the control signal according to the voice command differently according to the voice feature distinction and send it to the control target device.

また、音声特徴特定手段により特定された音声の特徴と制御対象機器の設定内容を関連付けしたデータベースが作成され、このデータベースを参照して、音声特徴特定手段により特定された音声の特徴と同一区分の特徴に関連付けられた制御対象機器の設定内容を示す音声コマンドを示す制御信号が制御対象機器へ送出される。すなわち、ユーザが過去に発声した音声コマンドの特徴と、そのときに設定された制御対象機器の設定内容とを関連付けしたデータベースが自動的に作成され、このデータベースを参照して、音声特徴特定手段により特定された音声の特徴と同一区分の特徴に関連付けられた制御対象機器の設定内容を示す音声コマンドを示す制御信号を制御対象機器へ送出することができる。 In addition, a database is created in which the audio features specified by the audio feature specifying means are associated with the setting contents of the control target device. The database is referred to, and the same classification as the audio features specified by the audio feature specifying means is made. A control signal indicating a voice command indicating the setting content of the control target device associated with the feature is transmitted to the control target device. That is, a database that automatically associates the features of the voice command that the user has uttered in the past with the settings of the control target device set at that time is automatically created. It is possible to send a control signal indicating a voice command indicating the setting contents of the control target device associated with the feature of the same category as the specified voice feature to the control target device.

また、音声コマンドを発声する乗員が別の乗員に変わっても、上記した構成によれば、乗員毎にデータベースが作成され、音声コマンドを発声する乗員に関する情報をデータベースより読み出して、解析された音声の特徴と同じ分類に関連付けられた制御対象機器の設定内容を示す音声コマンドを示す制御信号が制御対象機器へ送出されるので、乗員の音声を別の乗員の音声と混同して誤動作を引き起こすようなことを防止することが可能である。 Moreover, even if the occupant who utters a voice command changes to another occupant, according to the configuration described above, a database is created for each occupant, and information on the occupant who utters a voice command is read from the database and analyzed. Since a control signal indicating a voice command indicating the setting contents of the control target device associated with the same classification as that of the feature is sent to the control target device, the voice of the occupant is confused with the voice of another occupant so as to cause a malfunction. It is possible to prevent this.

なお、本発明は上記実施形態に限定されるものではなく、本発明の趣旨に基づいて種々なる形態で実施することができる。 In addition, this invention is not limited to the said embodiment, Based on the meaning of this invention, it can implement with a various form.

例えば、上記実施形態では、エアコンＥＣＵ２、オーディオＥＣＵ３を制御対象機器として説明したが、これらのＥＣＵの限定されるものではなく、例えば、他のＥＣＵや携帯電話４を制御対象機器としてもよい。 For example, in the above-described embodiment, the air conditioner ECU 2 and the audio ECU 3 have been described as control target devices. However, these ECUs are not limited, and for example, another ECU or the mobile phone 4 may be the control target device.

また、上記第１実施形態では、音声信号入力手段の１つを通信インタフェースを介して接続された携帯電話のマイクとした構成を示したが、携帯電話に限定されるものではなく、例えば、携帯型通信装置とすることもできる。 In the first embodiment, the configuration is shown in which one of the audio signal input means is a microphone of a mobile phone connected via a communication interface. However, the present invention is not limited to a mobile phone. Type communication device.

また、上記第実施形態では、無線通信網を介して携帯電話４と通信する構成を示したが、例えば、ブルートゥース等のインタフェースを介して携帯電話４と通信するようにしてもよい。 Moreover, although the structure which communicates with the mobile telephone 4 via a wireless communication network was shown in the said 1st Embodiment, you may make it communicate with the mobile telephone 4 via interfaces, such as Bluetooth, for example.

上記第２実施形態では、入力される音声信号を、音声のトーンに従って区別したが、音声のトーンに限定されるものではなく、音声の大きさ、音声の大きさとトーンの組合せなど、各種音声の特徴に従って区別するようにしてもよい。 In the second embodiment, the input audio signal is distinguished according to the audio tone. However, the input audio signal is not limited to the audio tone, and various audio signals such as an audio volume, a combination of audio volume and tone, and the like are used. You may make it distinguish according to the characteristic.

なお、上記実施形態における構成と特許請求の範囲の構成との対応関係について説明すると、マイク１０、携帯電話４のマイクが音声信号入力手段に相当し、エアコンＥＣＵ２、オーディオＥＣＵ３が制御対象機器に相当し、Ｓ１００〜Ｓ１０８、Ｓ２１８が制御信号送出手段に相当し、記憶部１６ｂが記憶手段に相当し、Ｓ１１０が乗員有無判定手段に相当し、Ｓ１１４が制御信号送出禁止手段に相当し、Ｓ２１０、Ｓ２２６が音声特徴特定手段に相当し、Ｓ２２２〜Ｓ２３０がデータベース作成処理手段に相当し、Ｓ２００〜Ｓ２０４が乗員特定手段に相当する。 The correspondence relationship between the configuration of the above embodiment and the configuration of the claims will be described. The microphone 10 and the microphone of the mobile phone 4 correspond to audio signal input means, and the air conditioner ECU 2 and the audio ECU 3 correspond to control target devices. S100 to S108 and S218 correspond to control signal transmission means, the storage unit 16b corresponds to storage means, S110 corresponds to passenger presence determination means, S114 corresponds to control signal transmission prohibition means, and S210 and S226. Corresponds to voice feature specifying means, S222 to S230 correspond to database creation processing means, and S200 to S204 correspond to occupant specifying means.

１車載音声認識装置
２エアコンＥＣＵ
３オーディオＥＣＵ
４携帯電話
１０マイク
１２通信部
１３車内通信制御部
１４ディスプレイ
１５スピーカ
１６制御部
１６ａ音声認識エンジン
１６ｂ記憶部 1 On-vehicle speech recognition device 2 Air conditioner ECU
3 Audio ECU
4 Mobile phone 10 Microphone 12 Communication unit 13 In-vehicle communication control unit 14 Display 15 Speaker 16 Control unit 16a Speech recognition engine 16b Storage unit

Claims

An in-vehicle voice recognition device that recognizes a voice command from a voice signal input via a voice signal input unit and sends a control signal corresponding to the voice command to a control target device,
One of the audio signal input means is a vehicle interior microphone, and one of the audio signal input means is a portable communication device connected via a communication interface,
Control for distinguishing the voice signal input through the voice signal input means according to a predetermined condition, and sending a control signal according to the voice command to the control target device according to the distinction of the voice signal A signal transmission means ;
Storage means for storing an information table defining different control signals according to the voice command according to the distinction of the voice signal input means to which the voice signal is input,
The control signal sending means refers to the information table stored in the storage means, and changes the control signal according to the voice command according to the distinction of the voice signal input means to which the voice signal is inputted. A vehicle-mounted speech recognition apparatus characterized by being sent to a control target device.

Occupant presence / absence determining means for determining the presence / absence of an occupant in the passenger compartment;
When it is determined by the occupant presence / absence determination means that there is an occupant in the passenger compartment, a control signal corresponding to the voice command recognized by the voice signal input from the microphone signal of the portable communication device is transmitted. The vehicle-mounted speech recognition apparatus according to claim 1 , further comprising a control signal transmission prohibiting unit that prohibits the signal.

The audio signals are input separately via a plurality of the audio signal input means,
Said control signal transmitting means, said voice signal inputted through the sound signal input means, to claim 1 or 2, characterized in that to distinguish for each of the audio signal input means for the audio signal is input The vehicle-mounted speech recognition apparatus described.