CN105513594A - Voice control system - Google Patents
Voice control system Download PDFInfo
- Publication number
- CN105513594A CN105513594A CN201510835844.2A CN201510835844A CN105513594A CN 105513594 A CN105513594 A CN 105513594A CN 201510835844 A CN201510835844 A CN 201510835844A CN 105513594 A CN105513594 A CN 105513594A
- Authority
- CN
- China
- Prior art keywords
- voice
- control system
- speech
- command
- speech control
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000009471 action Effects 0.000 claims description 9
- 238000004088 simulation Methods 0.000 claims description 4
- 230000005540 biological transmission Effects 0.000 claims description 3
- 238000005070 sampling Methods 0.000 claims description 3
- 238000007650 screen-printing Methods 0.000 claims description 3
- 230000002452 interceptive effect Effects 0.000 claims 1
- 238000000034 method Methods 0.000 abstract description 4
- 230000003993 interaction Effects 0.000 abstract 1
- 238000005516 engineering process Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- User Interface Of Digital Computer (AREA)
- Toys (AREA)
Abstract
The invention provides a voice control system capable of improving the operation convenience of vehicle terminals, mobile phones, smart homes, mobile internet and other various kinds of applications. Meanwhile, the system makes the control and the interaction to be simpler and more humanized. The technical scheme of the voice control system is as the following: the system customizing commands and the system intercepting a screen and identifying an operable region in the picture, the operable region including icons, buttons, and text labels and other interface controls, then generating a corresponding command set library; the system acquiring voice, identifying the voice and converting the voice to characters in an online or offline method; the system matching the characters with the command set library, identifying voice control commands; the system simulating the control operation corresponding to a control command or notifying an application to operate, so as to realize a voice control target.
Description
Technical field
The present invention relates to speech recognition technology, image recognition technology, text recognition technique, action simulation technology and Bluetooth technology.
Background technology
Very large by hand manipulation vehicle entertainment system danger during driving.
Current Smart Home, intelligence are dressed, intelligent toy is more and more stronger for the demand of speech control.
In fields such as medical treatment, the inconvenient operating terminal of a lot of personage's hand, needs to manipulate some smart machines extremely not convenient.
How effectively to manipulate various smart machine by voice is current urgent problem.
Summary of the invention
The object of the invention is to solve the problem, provide a kind of speech control system, improve the convenience of the types of applications operations such as car-mounted terminal, mobile phone, Smart Home, mobile interchange, make manipulation mutual simpler, more humane.
Speech control system technical scheme describes as follows.
System custom command and system screen printing also identify operable area in picture, and operable area comprises the interface controls such as icon, button, text label, then generate corresponding command set storehouse.
System acquisition voice, are identified as word by mode that is online or off-line to voice.
System matches word and command set storehouse, identify voice control command.
The control action that system simulation control command is corresponding or notice application operate, and realize speech control target.
According to an embodiment of speech control system of the present invention, speech recognition and command recognition are realized by the voice command identification layer of system.
According to an embodiment of speech control system of the present invention, the operation of application is completed by the operation execution level of system.
According to an embodiment of speech control system of the present invention, speech control system state comprises the free time at initial stage, starts to point out, gather voice, speech recognition, the prompting of commands match result, operation execution.
User side telepilot comprises blue Tooth remote controller, for receiving user key-press event transmission to intelligent terminal.
Intelligent terminal comprises figure and text identification module, voice recognition commands module and operation executing module, and each functions of modules is as follows.
Figure and text identification module, converge the operable area order of self-defining order and sectional drawing identification with generation command set storehouse.
Voice recognition commands module, the speech recognition with regard to sampling is word, then with command set storehouse match cognization voice control command.
Operation executing module, the control action that analogue enlargement order is corresponding or notice application operate, and realize speech control target.
The present invention contrasts prior art following beneficial effect: the solution of the present invention be to existing interface should be had to identify its operable area dynamically generates command set storehouse, after speech recognition, carry out the identification of voice command, then simulate corresponding operational motion or notice application execution.Compared to conventional art, the present invention can perform any order of application, and traditional technology can only support several conventional order, and each interpolation order all needs the bottom degree of depth to customize; The present invention can be generalized to the various intelligent terminals accepting speech control, includes but not limited to smart mobile phone, the user terminal that intelligent vehicle-carried, intelligence wearing, Smart Home, intelligent medical, intelligent toy etc. can accept phonetic entry.
Accompanying drawing explanation
Fig. 1 shows the process flow diagram of the preferred embodiment of speech control system of the present invention.
Fig. 2 shows the service logic figure of the preferred embodiment of speech control system of the present invention.Embodiment
Below in conjunction with drawings and Examples, the invention will be further described.
Fig. 1 shows the flow process of the preferred embodiment of speech control system of the present invention.Refer to Fig. 1, details are as follows for the implementation step of the speech control system of the present embodiment.
The custom command of step 100 system and system screen printing also identify operable area in picture, and operable area comprises the interface controls such as icon, button, text label, then generate corresponding command set storehouse.
Step 102 system acquisition voice, are identified as word by mode that is online or off-line to voice.
Step 104 system matches word and command set storehouse, identify voice control command.
Fig. 2 shows the service logic figure of the preferred embodiment of speech control system of the present invention, refers to Fig. 2, and the speech control system of the present embodiment comprises user side telepilot 20 and intelligent terminal 60.
User side telepilot 20 comprises blue Tooth remote controller, for receiving user key-press event transmission to intelligent terminal.Intelligent terminal 60 comprises figure and text identification module 602, voice command recognition module 604 and operation executing module 606.Change into word after the voice of voice command recognition module 604 to sampling identify and command recognition is carried out to the word after transforming.The control action that the order of operation executing module 606 analogue enlargement is corresponding or notice application operate, and realize speech control target.
Such as, user opens vehicle mounted guidance, and user clicks blue Tooth remote controller voice and starts key, says " searching place " order.
From the angle of user, control command and the operable area of user are searching place text box and can operate and try to please in Corresponding matching corresponding interface of " searching place ", " sight spot " orders the label operable area in corresponding interface, broadcasting icon in the corresponding player interface of the Play command, operable area in interface, it is all voice command that operable area comprises the interface controls such as icon, button, text label, also have the self-defining voice command of system in addition, such as " Home " returns interface of main menu etc.
System intercepts current and On-Screen Identification operable area coupling system self-defining order generation command set storehouse.
System identification goes out " searching place " speech text, then goes out " searching place " order with command set storehouse match cognization.
Control action or the notice application of correspondence that the order of step 106 analogue enlargement " searches place " operate, and jump to next operation interface, realize speech control target.
Above-described embodiment is available to persons skilled in the art to realize and uses of the present invention; persons skilled in the art can when not departing from thought of the present invention; various modifications or change are made to above-described embodiment; thus protection scope of the present invention not limit by above-described embodiment, and should be the maximum magnitude meeting the inventive features that claims are mentioned.
Claims (8)
1. a speech control system, comprising: system custom command and system screen printing also identify operable area in picture, and operable area comprises the interface controls such as icon, button, text label, then generate corresponding command set storehouse; System acquisition voice, are identified as word by mode that is online or off-line to voice; System matches word and command set storehouse, identify voice control command; The control action that system simulation control command is corresponding or notice application operate, and realize speech control target; User side telepilot comprises blue Tooth remote controller, for receiving user key-press event transmission to intelligent terminal; Intelligent terminal comprises figure and text identification module, voice recognition commands module and operation executing module, and each functions of modules is as follows: figure and text identification module, and the operable area order of self-defining order and sectional drawing identification is converged generation command set storehouse; Voice recognition commands module, the speech recognition with regard to sampling is word, then with command set storehouse match cognization voice control command; Operation executing module, the control action that analogue enlargement order is corresponding or notice application operate, and realize speech control target.
2. speech control system according to claim 1, is characterized in that, command set storehouse is that figure and text identification layer realize.
3. speech control system according to claim 1, is characterized in that, speech recognition and voice control command identification are realized by the voice command identification layer of system.
4. speech control system according to claim 1, is characterized in that, the operation of application is realized by the operation execution level analog subscriber operational motion of system or notice application execution.
5. speech control system according to claim 1, is characterized in that, speech control system state comprises the free time at initial stage, starts to point out, gather voice, speech recognition, the prompting of commands match result, operation execution.
6. speech control system according to claim 2, is characterized in that, the screen interface operable area of identification comprises the region that all users such as icon, button, text label, Text Entry, word navigation can carry out motion action.
7. speech control system according to claim 3, is characterized in that, speech recognition library two kinds of modes that speech recognition comprises online cloud platform and off-line realize.
8. speech control system according to claim 4, is characterized in that, the user operation action of simulation comprise click, double-click, pull, multiple point touching, the interactive action such as horizontal stroke.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510835844.2A CN105513594A (en) | 2015-11-26 | 2015-11-26 | Voice control system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510835844.2A CN105513594A (en) | 2015-11-26 | 2015-11-26 | Voice control system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105513594A true CN105513594A (en) | 2016-04-20 |
Family
ID=55721522
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510835844.2A Pending CN105513594A (en) | 2015-11-26 | 2015-11-26 | Voice control system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105513594A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105913841A (en) * | 2016-06-30 | 2016-08-31 | 北京小米移动软件有限公司 | Voice recognition method, voice recognition device and terminal |
CN106201177A (en) * | 2016-06-24 | 2016-12-07 | 维沃移动通信有限公司 | A kind of operation execution method and mobile terminal |
CN106328136A (en) * | 2016-08-19 | 2017-01-11 | 黄广明 | Voice-controllable intelligent device |
CN106683675A (en) * | 2017-02-08 | 2017-05-17 | 张建华 | Control method and voice operating system |
CN106782493A (en) * | 2016-11-28 | 2017-05-31 | 湖北第二师范学院 | A kind of children private tutor's machine personalized speech control and VOD system |
CN108763068A (en) * | 2018-05-15 | 2018-11-06 | 福建天泉教育科技有限公司 | A kind of automated testing method and terminal based on machine learning |
CN109326290A (en) * | 2018-12-10 | 2019-02-12 | 苏州思必驰信息科技有限公司 | Audio recognition method and device |
CN110795175A (en) * | 2018-08-02 | 2020-02-14 | Tcl集团股份有限公司 | Method and device for analog control of intelligent terminal and intelligent terminal |
CN111722826A (en) * | 2020-06-28 | 2020-09-29 | 广州小鹏车联网科技有限公司 | Construction method of voice interaction information, vehicle and storage medium |
CN112511882A (en) * | 2020-11-13 | 2021-03-16 | 海信视像科技股份有限公司 | Display device and voice call-up method |
CN114005445A (en) * | 2020-06-28 | 2022-02-01 | 广州小鹏汽车科技有限公司 | Information processing method, server, and computer-readable storage medium |
CN115248650A (en) * | 2022-06-24 | 2022-10-28 | 南京伟柏软件技术有限公司 | Screen reading method and device |
-
2015
- 2015-11-26 CN CN201510835844.2A patent/CN105513594A/en active Pending
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106201177B (en) * | 2016-06-24 | 2019-10-15 | 维沃移动通信有限公司 | A kind of operation execution method and mobile terminal |
CN106201177A (en) * | 2016-06-24 | 2016-12-07 | 维沃移动通信有限公司 | A kind of operation execution method and mobile terminal |
CN105913841A (en) * | 2016-06-30 | 2016-08-31 | 北京小米移动软件有限公司 | Voice recognition method, voice recognition device and terminal |
CN105913841B (en) * | 2016-06-30 | 2020-04-03 | 北京小米移动软件有限公司 | Voice recognition method, device and terminal |
CN106328136A (en) * | 2016-08-19 | 2017-01-11 | 黄广明 | Voice-controllable intelligent device |
CN106782493A (en) * | 2016-11-28 | 2017-05-31 | 湖北第二师范学院 | A kind of children private tutor's machine personalized speech control and VOD system |
CN106683675A (en) * | 2017-02-08 | 2017-05-17 | 张建华 | Control method and voice operating system |
CN108763068B (en) * | 2018-05-15 | 2021-12-28 | 福建天泉教育科技有限公司 | Automatic testing method and terminal based on machine learning |
CN108763068A (en) * | 2018-05-15 | 2018-11-06 | 福建天泉教育科技有限公司 | A kind of automated testing method and terminal based on machine learning |
CN110795175A (en) * | 2018-08-02 | 2020-02-14 | Tcl集团股份有限公司 | Method and device for analog control of intelligent terminal and intelligent terminal |
CN109326290A (en) * | 2018-12-10 | 2019-02-12 | 苏州思必驰信息科技有限公司 | Audio recognition method and device |
CN111722826A (en) * | 2020-06-28 | 2020-09-29 | 广州小鹏车联网科技有限公司 | Construction method of voice interaction information, vehicle and storage medium |
WO2022000863A1 (en) * | 2020-06-28 | 2022-01-06 | 广东小鹏汽车科技有限公司 | Speech interaction information construction method, vehicle, and storage medium |
CN114005445A (en) * | 2020-06-28 | 2022-02-01 | 广州小鹏汽车科技有限公司 | Information processing method, server, and computer-readable storage medium |
EP3955098A4 (en) * | 2020-06-28 | 2022-05-25 | Guangdong Xiaopeng Motors Technology Co., Ltd. | CONSTRUCTION PROCEDURES FOR VOICE INTERACTION INFORMATION, VEHICLE AND STORAGE MEDIA |
CN112511882A (en) * | 2020-11-13 | 2021-03-16 | 海信视像科技股份有限公司 | Display device and voice call-up method |
CN112511882B (en) * | 2020-11-13 | 2022-08-30 | 海信视像科技股份有限公司 | Display device and voice call-out method |
CN115248650A (en) * | 2022-06-24 | 2022-10-28 | 南京伟柏软件技术有限公司 | Screen reading method and device |
CN115248650B (en) * | 2022-06-24 | 2024-05-24 | 南京伟柏软件技术有限公司 | Screen reading method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104965596A (en) | Voice control system | |
CN105513594A (en) | Voice control system | |
CN103415835B (en) | The processing method of a kind of touch screen-device user interface and touch screen-device | |
CN103197756B (en) | A kind of operation information inputting method of electronic equipment and electronic equipment | |
CN102184014A (en) | Intelligent appliance interaction control method and device based on mobile equipment orientation | |
CN105335048A (en) | Electron equipment with concealed application icon and application icon conceal method | |
CN106201219A (en) | The quick call method of function of application and system | |
CN102306059A (en) | Universal remote controller based on touch control input for handheld digital television | |
CN110557699B (en) | Intelligent sound box interaction method, device, equipment and storage medium | |
CN112581946B (en) | Voice control method, voice control device, electronic equipment and readable storage medium | |
CN104238741A (en) | User interface comprising radial layout soft keypad | |
CN104750498A (en) | Method for controlling mouse module and electronic device | |
CN102929385A (en) | Method for controlling application program by voice | |
CN110768877B (en) | Voice control instruction processing method and device, electronic equipment and readable storage medium | |
CN104184890A (en) | Information processing method and electronic device | |
CN108762489A (en) | Control method, data glove, system based on data glove and storage medium | |
CN104252287A (en) | Interaction device and method for improving expression capability based on interaction device | |
CN202395925U (en) | Television system | |
Watanabe et al. | Remote touch pointing for smart TV interaction | |
CN113126875B (en) | Virtual gift interaction method and device, computer equipment and storage medium | |
CN105094344B (en) | Fixed terminal control method and device | |
CN202486951U (en) | Touching remote control | |
US20210098012A1 (en) | Voice Skill Recommendation Method, Apparatus, Device and Storage Medium | |
CN202281975U (en) | Equipment for remote control | |
CN110675188A (en) | Method and device for acquiring feedback information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20160420 |
|
WD01 | Invention patent application deemed withdrawn after publication |