[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN105513594A - Voice control system - Google Patents

Voice control system Download PDF

Info

Publication number
CN105513594A
CN105513594A CN201510835844.2A CN201510835844A CN105513594A CN 105513594 A CN105513594 A CN 105513594A CN 201510835844 A CN201510835844 A CN 201510835844A CN 105513594 A CN105513594 A CN 105513594A
Authority
CN
China
Prior art keywords
voice
control system
speech
command
speech control
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510835844.2A
Other languages
Chinese (zh)
Inventor
不公告发明人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201510835844.2A priority Critical patent/CN105513594A/en
Publication of CN105513594A publication Critical patent/CN105513594A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Toys (AREA)

Abstract

The invention provides a voice control system capable of improving the operation convenience of vehicle terminals, mobile phones, smart homes, mobile internet and other various kinds of applications. Meanwhile, the system makes the control and the interaction to be simpler and more humanized. The technical scheme of the voice control system is as the following: the system customizing commands and the system intercepting a screen and identifying an operable region in the picture, the operable region including icons, buttons, and text labels and other interface controls, then generating a corresponding command set library; the system acquiring voice, identifying the voice and converting the voice to characters in an online or offline method; the system matching the characters with the command set library, identifying voice control commands; the system simulating the control operation corresponding to a control command or notifying an application to operate, so as to realize a voice control target.

Description

Speech control system
Technical field
The present invention relates to speech recognition technology, image recognition technology, text recognition technique, action simulation technology and Bluetooth technology.
Background technology
Very large by hand manipulation vehicle entertainment system danger during driving.
Current Smart Home, intelligence are dressed, intelligent toy is more and more stronger for the demand of speech control.
In fields such as medical treatment, the inconvenient operating terminal of a lot of personage's hand, needs to manipulate some smart machines extremely not convenient.
How effectively to manipulate various smart machine by voice is current urgent problem.
Summary of the invention
The object of the invention is to solve the problem, provide a kind of speech control system, improve the convenience of the types of applications operations such as car-mounted terminal, mobile phone, Smart Home, mobile interchange, make manipulation mutual simpler, more humane.
Speech control system technical scheme describes as follows.
System custom command and system screen printing also identify operable area in picture, and operable area comprises the interface controls such as icon, button, text label, then generate corresponding command set storehouse.
System acquisition voice, are identified as word by mode that is online or off-line to voice.
System matches word and command set storehouse, identify voice control command.
The control action that system simulation control command is corresponding or notice application operate, and realize speech control target.
According to an embodiment of speech control system of the present invention, speech recognition and command recognition are realized by the voice command identification layer of system.
According to an embodiment of speech control system of the present invention, the operation of application is completed by the operation execution level of system.
According to an embodiment of speech control system of the present invention, speech control system state comprises the free time at initial stage, starts to point out, gather voice, speech recognition, the prompting of commands match result, operation execution.
User side telepilot comprises blue Tooth remote controller, for receiving user key-press event transmission to intelligent terminal.
Intelligent terminal comprises figure and text identification module, voice recognition commands module and operation executing module, and each functions of modules is as follows.
Figure and text identification module, converge the operable area order of self-defining order and sectional drawing identification with generation command set storehouse.
Voice recognition commands module, the speech recognition with regard to sampling is word, then with command set storehouse match cognization voice control command.
Operation executing module, the control action that analogue enlargement order is corresponding or notice application operate, and realize speech control target.
The present invention contrasts prior art following beneficial effect: the solution of the present invention be to existing interface should be had to identify its operable area dynamically generates command set storehouse, after speech recognition, carry out the identification of voice command, then simulate corresponding operational motion or notice application execution.Compared to conventional art, the present invention can perform any order of application, and traditional technology can only support several conventional order, and each interpolation order all needs the bottom degree of depth to customize; The present invention can be generalized to the various intelligent terminals accepting speech control, includes but not limited to smart mobile phone, the user terminal that intelligent vehicle-carried, intelligence wearing, Smart Home, intelligent medical, intelligent toy etc. can accept phonetic entry.
Accompanying drawing explanation
Fig. 1 shows the process flow diagram of the preferred embodiment of speech control system of the present invention.
Fig. 2 shows the service logic figure of the preferred embodiment of speech control system of the present invention.Embodiment
Below in conjunction with drawings and Examples, the invention will be further described.
Fig. 1 shows the flow process of the preferred embodiment of speech control system of the present invention.Refer to Fig. 1, details are as follows for the implementation step of the speech control system of the present embodiment.
The custom command of step 100 system and system screen printing also identify operable area in picture, and operable area comprises the interface controls such as icon, button, text label, then generate corresponding command set storehouse.
Step 102 system acquisition voice, are identified as word by mode that is online or off-line to voice.
Step 104 system matches word and command set storehouse, identify voice control command.
Fig. 2 shows the service logic figure of the preferred embodiment of speech control system of the present invention, refers to Fig. 2, and the speech control system of the present embodiment comprises user side telepilot 20 and intelligent terminal 60.
User side telepilot 20 comprises blue Tooth remote controller, for receiving user key-press event transmission to intelligent terminal.Intelligent terminal 60 comprises figure and text identification module 602, voice command recognition module 604 and operation executing module 606.Change into word after the voice of voice command recognition module 604 to sampling identify and command recognition is carried out to the word after transforming.The control action that the order of operation executing module 606 analogue enlargement is corresponding or notice application operate, and realize speech control target.
Such as, user opens vehicle mounted guidance, and user clicks blue Tooth remote controller voice and starts key, says " searching place " order.
From the angle of user, control command and the operable area of user are searching place text box and can operate and try to please in Corresponding matching corresponding interface of " searching place ", " sight spot " orders the label operable area in corresponding interface, broadcasting icon in the corresponding player interface of the Play command, operable area in interface, it is all voice command that operable area comprises the interface controls such as icon, button, text label, also have the self-defining voice command of system in addition, such as " Home " returns interface of main menu etc.
System intercepts current and On-Screen Identification operable area coupling system self-defining order generation command set storehouse.
System identification goes out " searching place " speech text, then goes out " searching place " order with command set storehouse match cognization.
Control action or the notice application of correspondence that the order of step 106 analogue enlargement " searches place " operate, and jump to next operation interface, realize speech control target.
Above-described embodiment is available to persons skilled in the art to realize and uses of the present invention; persons skilled in the art can when not departing from thought of the present invention; various modifications or change are made to above-described embodiment; thus protection scope of the present invention not limit by above-described embodiment, and should be the maximum magnitude meeting the inventive features that claims are mentioned.

Claims (8)

1. a speech control system, comprising: system custom command and system screen printing also identify operable area in picture, and operable area comprises the interface controls such as icon, button, text label, then generate corresponding command set storehouse; System acquisition voice, are identified as word by mode that is online or off-line to voice; System matches word and command set storehouse, identify voice control command; The control action that system simulation control command is corresponding or notice application operate, and realize speech control target; User side telepilot comprises blue Tooth remote controller, for receiving user key-press event transmission to intelligent terminal; Intelligent terminal comprises figure and text identification module, voice recognition commands module and operation executing module, and each functions of modules is as follows: figure and text identification module, and the operable area order of self-defining order and sectional drawing identification is converged generation command set storehouse; Voice recognition commands module, the speech recognition with regard to sampling is word, then with command set storehouse match cognization voice control command; Operation executing module, the control action that analogue enlargement order is corresponding or notice application operate, and realize speech control target.
2. speech control system according to claim 1, is characterized in that, command set storehouse is that figure and text identification layer realize.
3. speech control system according to claim 1, is characterized in that, speech recognition and voice control command identification are realized by the voice command identification layer of system.
4. speech control system according to claim 1, is characterized in that, the operation of application is realized by the operation execution level analog subscriber operational motion of system or notice application execution.
5. speech control system according to claim 1, is characterized in that, speech control system state comprises the free time at initial stage, starts to point out, gather voice, speech recognition, the prompting of commands match result, operation execution.
6. speech control system according to claim 2, is characterized in that, the screen interface operable area of identification comprises the region that all users such as icon, button, text label, Text Entry, word navigation can carry out motion action.
7. speech control system according to claim 3, is characterized in that, speech recognition library two kinds of modes that speech recognition comprises online cloud platform and off-line realize.
8. speech control system according to claim 4, is characterized in that, the user operation action of simulation comprise click, double-click, pull, multiple point touching, the interactive action such as horizontal stroke.
CN201510835844.2A 2015-11-26 2015-11-26 Voice control system Pending CN105513594A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510835844.2A CN105513594A (en) 2015-11-26 2015-11-26 Voice control system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510835844.2A CN105513594A (en) 2015-11-26 2015-11-26 Voice control system

Publications (1)

Publication Number Publication Date
CN105513594A true CN105513594A (en) 2016-04-20

Family

ID=55721522

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510835844.2A Pending CN105513594A (en) 2015-11-26 2015-11-26 Voice control system

Country Status (1)

Country Link
CN (1) CN105513594A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105913841A (en) * 2016-06-30 2016-08-31 北京小米移动软件有限公司 Voice recognition method, voice recognition device and terminal
CN106201177A (en) * 2016-06-24 2016-12-07 维沃移动通信有限公司 A kind of operation execution method and mobile terminal
CN106328136A (en) * 2016-08-19 2017-01-11 黄广明 Voice-controllable intelligent device
CN106683675A (en) * 2017-02-08 2017-05-17 张建华 Control method and voice operating system
CN106782493A (en) * 2016-11-28 2017-05-31 湖北第二师范学院 A kind of children private tutor's machine personalized speech control and VOD system
CN108763068A (en) * 2018-05-15 2018-11-06 福建天泉教育科技有限公司 A kind of automated testing method and terminal based on machine learning
CN109326290A (en) * 2018-12-10 2019-02-12 苏州思必驰信息科技有限公司 Audio recognition method and device
CN110795175A (en) * 2018-08-02 2020-02-14 Tcl集团股份有限公司 Method and device for analog control of intelligent terminal and intelligent terminal
CN111722826A (en) * 2020-06-28 2020-09-29 广州小鹏车联网科技有限公司 Construction method of voice interaction information, vehicle and storage medium
CN112511882A (en) * 2020-11-13 2021-03-16 海信视像科技股份有限公司 Display device and voice call-up method
CN114005445A (en) * 2020-06-28 2022-02-01 广州小鹏汽车科技有限公司 Information processing method, server, and computer-readable storage medium
CN115248650A (en) * 2022-06-24 2022-10-28 南京伟柏软件技术有限公司 Screen reading method and device

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106201177B (en) * 2016-06-24 2019-10-15 维沃移动通信有限公司 A kind of operation execution method and mobile terminal
CN106201177A (en) * 2016-06-24 2016-12-07 维沃移动通信有限公司 A kind of operation execution method and mobile terminal
CN105913841A (en) * 2016-06-30 2016-08-31 北京小米移动软件有限公司 Voice recognition method, voice recognition device and terminal
CN105913841B (en) * 2016-06-30 2020-04-03 北京小米移动软件有限公司 Voice recognition method, device and terminal
CN106328136A (en) * 2016-08-19 2017-01-11 黄广明 Voice-controllable intelligent device
CN106782493A (en) * 2016-11-28 2017-05-31 湖北第二师范学院 A kind of children private tutor's machine personalized speech control and VOD system
CN106683675A (en) * 2017-02-08 2017-05-17 张建华 Control method and voice operating system
CN108763068B (en) * 2018-05-15 2021-12-28 福建天泉教育科技有限公司 Automatic testing method and terminal based on machine learning
CN108763068A (en) * 2018-05-15 2018-11-06 福建天泉教育科技有限公司 A kind of automated testing method and terminal based on machine learning
CN110795175A (en) * 2018-08-02 2020-02-14 Tcl集团股份有限公司 Method and device for analog control of intelligent terminal and intelligent terminal
CN109326290A (en) * 2018-12-10 2019-02-12 苏州思必驰信息科技有限公司 Audio recognition method and device
CN111722826A (en) * 2020-06-28 2020-09-29 广州小鹏车联网科技有限公司 Construction method of voice interaction information, vehicle and storage medium
WO2022000863A1 (en) * 2020-06-28 2022-01-06 广东小鹏汽车科技有限公司 Speech interaction information construction method, vehicle, and storage medium
CN114005445A (en) * 2020-06-28 2022-02-01 广州小鹏汽车科技有限公司 Information processing method, server, and computer-readable storage medium
EP3955098A4 (en) * 2020-06-28 2022-05-25 Guangdong Xiaopeng Motors Technology Co., Ltd. CONSTRUCTION PROCEDURES FOR VOICE INTERACTION INFORMATION, VEHICLE AND STORAGE MEDIA
CN112511882A (en) * 2020-11-13 2021-03-16 海信视像科技股份有限公司 Display device and voice call-up method
CN112511882B (en) * 2020-11-13 2022-08-30 海信视像科技股份有限公司 Display device and voice call-out method
CN115248650A (en) * 2022-06-24 2022-10-28 南京伟柏软件技术有限公司 Screen reading method and device
CN115248650B (en) * 2022-06-24 2024-05-24 南京伟柏软件技术有限公司 Screen reading method and device

Similar Documents

Publication Publication Date Title
CN104965596A (en) Voice control system
CN105513594A (en) Voice control system
CN103415835B (en) The processing method of a kind of touch screen-device user interface and touch screen-device
CN103197756B (en) A kind of operation information inputting method of electronic equipment and electronic equipment
CN102184014A (en) Intelligent appliance interaction control method and device based on mobile equipment orientation
CN105335048A (en) Electron equipment with concealed application icon and application icon conceal method
CN106201219A (en) The quick call method of function of application and system
CN102306059A (en) Universal remote controller based on touch control input for handheld digital television
CN110557699B (en) Intelligent sound box interaction method, device, equipment and storage medium
CN112581946B (en) Voice control method, voice control device, electronic equipment and readable storage medium
CN104238741A (en) User interface comprising radial layout soft keypad
CN104750498A (en) Method for controlling mouse module and electronic device
CN102929385A (en) Method for controlling application program by voice
CN110768877B (en) Voice control instruction processing method and device, electronic equipment and readable storage medium
CN104184890A (en) Information processing method and electronic device
CN108762489A (en) Control method, data glove, system based on data glove and storage medium
CN104252287A (en) Interaction device and method for improving expression capability based on interaction device
CN202395925U (en) Television system
Watanabe et al. Remote touch pointing for smart TV interaction
CN113126875B (en) Virtual gift interaction method and device, computer equipment and storage medium
CN105094344B (en) Fixed terminal control method and device
CN202486951U (en) Touching remote control
US20210098012A1 (en) Voice Skill Recommendation Method, Apparatus, Device and Storage Medium
CN202281975U (en) Equipment for remote control
CN110675188A (en) Method and device for acquiring feedback information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160420

WD01 Invention patent application deemed withdrawn after publication