
CN109669662A - Voice input method, apparatus, storage medium and mobile terminal - Google Patents

Voice input method, apparatus, storage medium and mobile terminal Download PDF

Info

Publication number
CN109669662A
CN109669662A CN201811575128.5A CN201811575128A CN109669662A CN 109669662 A CN109669662 A CN 109669662A CN 201811575128 A CN201811575128 A CN 201811575128A CN 109669662 A CN109669662 A CN 109669662A
Authority
CN
China
Prior art keywords
voice
input
press event
content
mobile terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811575128.5A
Other languages
Chinese (zh)
Inventor
洪旭杰
洪俞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huizhou TCL Mobile Communication Co Ltd
Original Assignee
Huizhou TCL Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huizhou TCL Mobile Communication Co Ltd filed Critical Huizhou TCL Mobile Communication Co Ltd
Priority to CN201811575128.5A priority Critical patent/CN109669662A/en
Publication of CN109669662A publication Critical patent/CN109669662A/en
Pending legal-status Critical Current

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 3/00 - Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F 3/16 - Sound input; Sound output
    • G06F 3/167 - Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 3/00 - Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
    • G06F 3/0484 - Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 3/00 - Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
    • G06F 3/0487 - Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F 3/0488 - Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Telephone Function (AREA)

Abstract

Embodiments of the present application provide a voice input method, apparatus, storage medium and mobile terminal. The method comprises: detecting a screen press event; if the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive voice content input by the user; when the screen press event ends, performing speech recognition on the voice content to obtain text content corresponding to the voice content; and inserting the text content into the input control. By applying pressure-touch technology to voice input, the voice input method provided by the present application allows a press operation on the mobile terminal screen to trigger voice input for an input control in the user interface. This solves the problem that the user needs to switch back and forth between a voice input method and a keyboard input method when modifying voice-input content, effectively improves the efficiency of voice input, and improves the user experience of the mobile terminal.

Description

Voice input method, apparatus, storage medium and mobile terminal
Technical field
The present application relates to the field of speech processing technology, and in particular to a voice input method, apparatus, storage medium and mobile terminal.
Background
With the rapid development of mobile terminal technology, voice input methods have become increasingly popular on mobile terminals, especially on smartphones and tablet computers. Because voice input is fast, convenient, and supports natural interaction, it has become the input method of choice for more and more users.
However, existing voice input methods still suffer from an experience problem. When inputting by voice, the user often needs to modify the input content because the accuracy of speech recognition is insufficient, which requires switching to keyboard input to make the modification and then switching back to voice input. The inconvenience of switching back and forth affects input efficiency and the user experience of the mobile terminal.
Summary of the invention
Embodiments of the present application provide a voice input method that solves the problem that the user needs to switch back and forth between a voice input method and a keyboard input method when modifying voice-input content, thereby improving the input efficiency of the voice input method and the user experience of the mobile terminal.
In a first aspect, an embodiment of the present application provides a voice input method applied to a mobile terminal, comprising:
detecting a screen press event;
if the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive voice content input by the user;
when the screen press event ends, performing speech recognition on the voice content to obtain text content corresponding to the voice content; and
inserting the text content into the input control.
In a second aspect, an embodiment of the present application provides a voice input apparatus applied to a mobile terminal, comprising:
a detection module, configured to detect a screen press event;
a voice input module, configured to start voice listening to receive voice content input by the user when the user interface component corresponding to the position where the screen press event occurs is an input control;
a speech recognition module, configured to perform speech recognition on the voice content after the screen press event ends, to obtain text content corresponding to the voice content; and
an execution module, configured to insert the text content into the input control.
In a third aspect, an embodiment of the present application provides a storage medium on which a computer program is stored, the computer program, when executed on a computer, causing the computer to perform the voice input method provided in the first aspect of the embodiments of the present application.
In a fourth aspect, an embodiment of the present application provides a mobile terminal comprising a processor and a memory, the memory being configured to store executable program code, and the processor performing the following steps by reading the executable program code stored in the memory:
detecting a screen press event;
if the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive voice content input by the user;
when the screen press event ends, performing speech recognition on the voice content to obtain text content corresponding to the voice content; and
inserting the text content into the input control.
By applying pressure-touch technology to voice input, the voice input method provided by the present application allows a press operation on the mobile terminal screen to trigger voice input for an input control in the user interface. This solves the problem that the user needs to switch back and forth between a voice input method and a keyboard input method when modifying voice-input content, effectively improves the efficiency of voice input, and improves the user experience of the mobile terminal.
Brief description of the drawings
To explain the technical solutions in the embodiments of the present application more clearly, the accompanying drawings required for describing the embodiments are briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present application, and those skilled in the art can obtain other drawings from these drawings without creative effort.
Fig. 1 is a schematic flowchart of a voice input method provided by an embodiment of the present application;
Fig. 2 is a schematic diagram of a usage scenario of the voice input method provided by an embodiment of the present application;
Fig. 3 is a schematic diagram of another usage scenario of the voice input method provided by an embodiment of the present application;
Fig. 4 is another schematic flowchart of a voice input method provided by an embodiment of the present application;
Fig. 5 is a schematic structural diagram of a voice input apparatus provided by an embodiment of the present application;
Fig. 6 is a schematic structural diagram of a mobile terminal provided by an embodiment of the present application.
Detailed description of the embodiments
The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by those skilled in the art based on the embodiments of the present application without creative effort fall within the protection scope of the present application.
Embodiments of the present application provide a voice input method, apparatus, storage medium and mobile terminal, each of which is described in detail below.
Referring to Fig. 1, Fig. 1 is a schematic flowchart of a voice input method provided by an embodiment of the present application. The voice input method can be applied to a mobile terminal, and the flow of the method may include the following steps.
Step 101: detect a screen press event.
In this embodiment of the present application, voice input on the mobile terminal is triggered by the user pressing an element on the screen, so it is necessary to detect whether there is a press operation on the screen of the mobile terminal. This step is mainly performed by a pressure-touch module (Force Touch).
In one embodiment, the pressure-touch module may consist of a touch screen (Touch Panel) and a pressure sensor (Force Touch Sensor). When the user presses an element on the touch screen, the pressure sensor obtains the intensity of the user's press, and the position of the press operation and the press intensity are reported to the mobile terminal through the event system of the mobile terminal.
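For illustration only (this sketch is not part of the patent disclosure), the reported press position and intensity map naturally onto Android's MotionEvent; the PressEvent data class and the onPressDetected callback below are hypothetical names introduced for the example.

```kotlin
import android.view.MotionEvent
import android.view.View

// Hypothetical event object carrying what the pressure-touch module reports:
// the press position and the press intensity.
data class PressEvent(val x: Float, val y: Float, val pressure: Float, val downTimeMs: Long)

// Minimal sketch: read position and pressure from the touch event stream.
fun attachPressDetector(root: View, onPressDetected: (PressEvent) -> Unit) {
    root.setOnTouchListener { _, event ->
        if (event.actionMasked == MotionEvent.ACTION_DOWN) {
            // getPressure() returns a normalized value; on devices with a
            // dedicated pressure sensor it reflects the press intensity.
            onPressDetected(
                PressEvent(event.rawX, event.rawY, event.pressure, event.downTime)
            )
        }
        false // do not consume; let normal touch handling continue
    }
}
```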
Step 102: if the user interface component corresponding to the position where the screen press event occurs is an input control, start voice listening to receive voice content input by the user.
In this embodiment of the present application, the user may press any position on the mobile terminal screen, and each position has a corresponding user interface (UI, User Interface) component, but not every user interface component supports voice input. For example, if the user is having a WeChat conversation on a tablet computer, then on the chat page nothing supports voice input except the input box at the bottom of the page. If the position pressed by the user on the screen of the mobile terminal is the input box, that is, the user interface component corresponding to the position where the screen press event occurs is an input control, voice input on the mobile terminal is triggered: the mobile terminal starts voice listening to receive the voice content input by the user.
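A hedged sketch of this step on Android, assuming the input control is represented by an EditText and listening is done with the platform SpeechRecognizer; the findViewAt hit-test helper is an assumption introduced for the example, not something described in the patent.

```kotlin
import android.content.Intent
import android.speech.RecognizerIntent
import android.speech.SpeechRecognizer
import android.view.View
import android.view.ViewGroup
import android.widget.EditText

// Hypothetical hit test: walk the view tree and return the child under (x, y).
fun findViewAt(root: View, x: Int, y: Int): View? {
    if (root is ViewGroup) {
        for (i in root.childCount - 1 downTo 0) {
            val child = root.getChildAt(i)
            val loc = IntArray(2).also { child.getLocationOnScreen(it) }
            if (x in loc[0] until loc[0] + child.width &&
                y in loc[1] until loc[1] + child.height
            ) {
                return findViewAt(child, x, y) ?: child
            }
        }
    }
    return if (root.isShown) root else null
}

// If the pressed component is an input control, start voice listening.
// The recognizer is assumed to be created elsewhere with
// SpeechRecognizer.createSpeechRecognizer() and RECORD_AUDIO permission granted.
fun maybeStartListening(root: View, x: Int, y: Int, recognizer: SpeechRecognizer): Boolean {
    val pressed = findViewAt(root, x, y)
    if (pressed is EditText) {
        val intent = Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH).apply {
            putExtra(RecognizerIntent.EXTRA_PARTIAL_RESULTS, true)
        }
        recognizer.startListening(intent)
        return true
    }
    return false
}
```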
Step 103: when the screen press event ends, perform speech recognition on the voice content to obtain text content corresponding to the voice content.
In this embodiment of the present application, if the user keeps pressing the screen of the mobile terminal after triggering voice input, the mobile terminal assumes by default that the user has not yet finished the voice input and keeps voice listening on. When the mobile terminal detects that the screen press event has ended, it determines that the user has completed the voice input, turns off voice listening, and performs the next recognition operation on the voice content already input by the user to obtain a recognition result.
In one embodiment, to save power on the mobile terminal, an upper threshold may be set for the duration of a screen press event, for example two minutes. When the duration of the screen press event exceeds the upper threshold, the screen press event is considered an accidental touch rather than an operation the user intends to perform to trigger voice input on the mobile terminal.
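A minimal sketch of the press-end handling together with the two-minute upper threshold mentioned above; the PressEndWatcher class and its callback names are hypothetical.

```kotlin
import android.os.Handler
import android.os.Looper
import android.speech.SpeechRecognizer
import android.view.MotionEvent

// Assumed upper threshold: presses longer than this are treated as accidental touches.
const val MAX_PRESS_DURATION_MS = 2 * 60 * 1000L

class PressEndWatcher(private val recognizer: SpeechRecognizer) {
    private val handler = Handler(Looper.getMainLooper())
    private val cancelOnTimeout = Runnable { recognizer.cancel() } // discard, do not recognize

    fun onListeningStarted() {
        // If the press lasts past the upper threshold, treat it as an accidental touch.
        handler.postDelayed(cancelOnTimeout, MAX_PRESS_DURATION_MS)
    }

    fun onTouch(event: MotionEvent) {
        if (event.actionMasked == MotionEvent.ACTION_UP ||
            event.actionMasked == MotionEvent.ACTION_CANCEL
        ) {
            handler.removeCallbacks(cancelOnTimeout)
            // Press ended within the limit: stop listening and let recognition
            // run on the audio captured so far.
            recognizer.stopListening()
        }
    }
}
```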
Step 104: insert the text content into the input control.
In this embodiment of the present application, the speech recognition process of the mobile terminal recognizes the voice content input by the user and obtains the text content corresponding to the voice content. This text content is exactly the content the user wishes to enter into the input box to be sent. Therefore, after the mobile terminal recognizes the voice content and obtains the corresponding text content, it inserts the text content into the corresponding input control, thereby completing the text input for the input box corresponding to the input control.
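A small sketch of this step, assuming the input control is an EditText; inserting at the current cursor position is an implementation choice made for the example, not something the patent specifies.

```kotlin
import android.widget.EditText

// Insert the recognized text into the input control that triggered the voice input.
fun insertRecognizedText(target: EditText, recognizedText: String) {
    val start = target.selectionStart.coerceAtLeast(0)
    // Insert at the cursor so the same gesture can also be used to amend existing text.
    target.text.insert(start, recognizedText)
    target.setSelection(start + recognizedText.length)
}
```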
Please refer to Fig. 2 and Fig. 3, which are schematic diagrams of a usage scenario of the voice input method provided by an embodiment of the present application.
Suppose the user wants to use a smartphone to send a text message inviting a friend, Zhang San, home to watch a ball game. The user presses the text input box on the message-sending page for Zhang San, as shown in Fig. 2. The smartphone detects a screen press event, and the user interface component corresponding to the position where the screen press event occurs is an input control, so the smartphone turns on voice listening. The user says "come to my place tonight to watch the game together"; the smartphone listens to this voice content, recognizes and converts it into text content, and fills it into the text input box, as shown in Fig. 3. The user then only needs to tap the send button on the message-sending page to confirm.
By applying pressure-touch technology to voice input, the voice input method provided by the present application allows a press operation on the mobile terminal screen to trigger voice input for an input control in the user interface. This solves the problem that the user needs to switch back and forth between a voice input method and a keyboard input method when modifying voice-input content, effectively improves the efficiency of voice input, and improves the user experience of the mobile terminal.
In one embodiment, performing speech recognition on the voice content comprises: determining whether an offline speech recognition model exists locally on the mobile terminal; and if so, invoking the offline speech recognition model to perform speech recognition on the voice content.
In one embodiment, after determining whether an offline speech recognition model exists locally on the mobile terminal, the method further comprises: if not, connecting to a cloud speech recognition server to perform speech recognition on the voice content.
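A hedged sketch of this offline-first decision; OfflineAsrModel, CloudAsrClient and the model-directory check are hypothetical stand-ins, since the patent does not name a concrete recognition engine or storage layout. Preferring the local model keeps recognition available without a network connection, which fits the power- and latency-conscious framing above.

```kotlin
import java.io.File

// Hypothetical engines standing in for whatever recognizer the terminal ships with.
interface OfflineAsrModel { fun recognize(pcm: ByteArray): String }
interface CloudAsrClient { fun recognize(pcm: ByteArray): String }

class Recognizer(
    private val offlineModelDir: File,               // assumed location of a local model
    private val loadOffline: (File) -> OfflineAsrModel,
    private val cloud: CloudAsrClient
) {
    // Prefer a local model if one exists; otherwise fall back to the cloud server.
    fun recognize(pcm: ByteArray): String {
        return if (offlineModelDir.isDirectory && offlineModelDir.list()?.isNotEmpty() == true) {
            loadOffline(offlineModelDir).recognize(pcm)
        } else {
            cloud.recognize(pcm)
        }
    }
}
```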
In one embodiment, after detecting the screen press event, the method further comprises: determining whether the screen press event meets a preset condition; and if the screen press event meets the preset condition and the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive the voice content input by the user.
In one embodiment, the preset condition comprises: the duration of the screen press event reaches a first preset value.
In one embodiment, the preset condition comprises: the pressure value of the screen press event reaches a second preset value.
In one embodiment, the preset condition comprises: the duration of the screen press event reaches the first preset value and the pressure value of the screen press event reaches the second preset value.
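The three preset-condition variants above can be expressed as one small check; the concrete threshold values below are placeholders chosen for the example, not values taken from the patent.

```kotlin
// Placeholder thresholds; the patent only requires that some first/second preset value exists.
const val FIRST_PRESET_DURATION_MS = 500L   // assumed lower bound on press duration
const val SECOND_PRESET_PRESSURE = 0.6f     // assumed lower bound on normalized pressure

enum class PresetCondition { DURATION_ONLY, PRESSURE_ONLY, DURATION_AND_PRESSURE }

fun meetsPresetCondition(
    durationMs: Long,
    pressure: Float,
    condition: PresetCondition
): Boolean = when (condition) {
    PresetCondition.DURATION_ONLY -> durationMs >= FIRST_PRESET_DURATION_MS
    PresetCondition.PRESSURE_ONLY -> pressure >= SECOND_PRESET_PRESSURE
    PresetCondition.DURATION_AND_PRESSURE ->
        durationMs >= FIRST_PRESET_DURATION_MS && pressure >= SECOND_PRESET_PRESSURE
}
```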
Referring to Fig. 4, Fig. 4 is another schematic flowchart of a voice input method provided by an embodiment of the present application. The voice input method can be applied to a mobile terminal, and the flow of the method may include the following steps.
Step 401: detect a screen press event.
In this embodiment of the present application, voice input on the mobile terminal is triggered by the user pressing an element on the screen, so it is necessary to detect whether there is a press operation on the screen of the mobile terminal. This step is mainly performed by a pressure-touch module (Force Touch).
In one embodiment, the pressure-touch module may consist of a touch screen (Touch Panel) and a pressure sensor (Force Touch Sensor). When the user presses an element on the touch screen, the pressure sensor obtains the intensity of the user's press, and the position of the press operation and the press intensity are reported to the mobile terminal through the event system of the mobile terminal.
Step 402: determine whether the screen press event meets a preset condition. If so, go to step 403; otherwise, return to step 401.
In this embodiment of the present application, to prevent the mobile terminal from mistaking an ordinary tap on the touch screen or an accidental touch by the user for the trigger signal of voice input, a preset condition may be set. If the screen press event meets the preset condition, the mobile terminal identifies the screen press event as a trigger signal for voice input and proceeds to the next operation; if the screen press event does not meet the preset condition, the mobile terminal does not identify the screen press event as a trigger signal for voice input and continues to detect the next screen press event.
In one embodiment, the preset condition may be set as the duration of the screen press event reaching a first preset value, that is, a first lower threshold is set for the duration of the screen press event. When the time for which the user presses the screen of the mobile terminal exceeds the first lower threshold, the mobile terminal confirms that the user needs to input by voice.
In one embodiment, the preset condition may be set as the pressure value of the screen press event reaching a second preset value, that is, a second lower threshold is set for the pressure value of the screen press event. When the force with which the user presses the screen of the mobile terminal exceeds the second lower threshold, the mobile terminal confirms that the user needs to input by voice.
It should be noted that the preset condition in this embodiment of the present application is not limited to the above; other modes are also possible. For example, the preset condition may be set as the duration of the screen press event reaching the first preset value and the pressure value of the screen press event reaching the second preset value. When the time for which the user presses the screen of the mobile terminal exceeds the first lower threshold (the first preset value) and the force with which the user presses the screen exceeds the second lower threshold (the second preset value), the mobile terminal confirms that the user needs to input by voice.
Step 403: determine whether the user interface component corresponding to the position where the screen press event occurs is an input control. If so, go to step 404; otherwise, return to step 401.
In this embodiment of the present application, the user may press any position on the mobile terminal screen, and each position has a corresponding user interface (UI, User Interface) component, but not every user interface component supports voice input. For example, if the user is having a WeChat conversation on a tablet computer, then on the chat page nothing supports voice input except the input box at the bottom of the page. If the position pressed by the user on the screen of the mobile terminal is the input box, that is, the user interface component corresponding to the position where the screen press event occurs is an input control, voice input on the mobile terminal can be triggered and the mobile terminal proceeds to the next operation; if the position pressed by the user is not the input box, that is, the user interface component corresponding to the position where the screen press event occurs is not an input control, voice input on the mobile terminal is not triggered and the mobile terminal continues to detect the next screen press event.
Step 404: start voice listening to receive voice content input by the user.
In this embodiment of the present application, if the screen press event meets the preset condition and the user interface component corresponding to the position where the screen press event occurs is an input control, the mobile terminal starts voice listening and receives the voice content input by the user.
Step 405: determine whether the screen press event has ended. If so, go to step 406; otherwise, return to step 404.
In this embodiment of the present application, if the user keeps pressing the screen of the mobile terminal after triggering voice input, the mobile terminal assumes by default that the user has not yet finished the voice input and keeps voice listening on. When the mobile terminal detects that the screen press event has ended, it determines that the user has completed the voice input, turns off voice listening, and performs the next operation on the voice content already input by the user.
In one embodiment, to save power on the mobile terminal, an upper threshold may be set for the duration of a screen press event, for example two minutes. When the duration of the screen press event exceeds the upper threshold, the screen press event is considered an accidental touch rather than an operation the user intends to perform to trigger voice input on the mobile terminal.
Step 406: perform speech recognition on the voice content input by the user to obtain text content corresponding to the voice content.
In this embodiment of the present application, after the screen press event ends, the mobile terminal assumes that the user has completed the voice input, turns off voice listening, and performs speech recognition on the voice content input by the user to obtain the text content corresponding to the voice content.
In one embodiment, the speech recognition may be performed by an offline speech recognition model local to the mobile terminal; in another embodiment, the speech recognition may also be performed by a cloud speech recognition server.
Step 407: insert the text content into the input control.
In this embodiment of the present application, the speech recognition process of the mobile terminal recognizes the voice content input by the user and obtains the text content corresponding to the voice content. This text content is exactly the content the user wishes to enter into the input box to be sent. Therefore, after the mobile terminal recognizes the voice content and obtains the corresponding text content, it inserts the text content into the corresponding input control, thereby completing the text input for the input box corresponding to the input control.
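Tying steps 401 to 407 together, the following sketch shows the Fig. 4 flow as a single touch handler; it reuses the hypothetical helpers from the earlier sketches (FIRST_PRESET_DURATION_MS, SECOND_PRESET_PRESSURE, findViewAt, insertRecognizedText) and leaves the actual listening and recognition to caller-supplied functions, so it is an illustration of the flow rather than the patented implementation.

```kotlin
import android.os.Handler
import android.os.Looper
import android.view.MotionEvent
import android.view.View
import android.widget.EditText

class VoiceInputFlow(
    private val root: View,
    private val startListening: () -> Unit,
    private val stopListeningAndRecognize: () -> String
) {
    private val handler = Handler(Looper.getMainLooper())
    private var target: EditText? = null
    private var listening = false
    private var pendingStart: Runnable? = null

    fun onTouch(event: MotionEvent) {
        when (event.actionMasked) {
            MotionEvent.ACTION_DOWN -> {                                  // step 401
                val pressure = event.pressure
                val x = event.rawX.toInt()
                val y = event.rawY.toInt()
                val start = Runnable {
                    // Step 402: press lasted the first preset duration; also
                    // require the second preset pressure value in this variant.
                    if (pressure < SECOND_PRESET_PRESSURE) return@Runnable
                    val pressed = findViewAt(root, x, y)                  // step 403
                    if (pressed is EditText) {
                        target = pressed
                        startListening()                                  // step 404
                        listening = true
                    }
                }
                pendingStart = start
                handler.postDelayed(start, FIRST_PRESET_DURATION_MS)
            }
            MotionEvent.ACTION_UP, MotionEvent.ACTION_CANCEL -> {         // step 405
                pendingStart?.let { handler.removeCallbacks(it) }
                if (listening) {
                    listening = false
                    val text = stopListeningAndRecognize()                // step 406
                    target?.let { insertRecognizedText(it, text) }        // step 407
                    target = null
                }
            }
        }
    }
}
```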
Referring to Fig. 5, Fig. 5 is a schematic structural diagram of a voice input apparatus provided by an embodiment of the present application. The voice input apparatus can be applied to a mobile terminal and may include: a detection module 501, a voice input module 502, a speech recognition module 503, and an execution module 504.
The detection module 501 is configured to detect a screen press event.
The voice input module 502 is configured to start voice listening to receive voice content input by the user when the user interface component corresponding to the position where the screen press event occurs is an input control.
The speech recognition module 503 is configured to perform speech recognition on the voice content after the screen press event ends, to obtain text content corresponding to the voice content.
The execution module 504 is configured to insert the text content into the input control.
In one embodiment, when performing speech recognition on the voice content, the speech recognition module 503 may be configured to: determine whether an offline speech recognition model exists locally on the mobile terminal; and if so, invoke the offline speech recognition model to perform speech recognition on the voice content.
In one embodiment, after determining whether an offline speech recognition model exists locally on the mobile terminal, the speech recognition module 503 may be further configured to: if not, connect to a cloud speech recognition server to perform speech recognition on the voice content.
In one embodiment, after the screen press event is detected, the detection module 501 may be further configured to determine whether the screen press event meets a preset condition; if the screen press event meets the preset condition and the user interface component corresponding to the position where the screen press event occurs is an input control, the voice input module 502 starts voice listening to receive the voice content input by the user.
In one embodiment, the preset condition comprises: the duration of the screen press event reaches a first preset value.
In one embodiment, the preset condition comprises: the pressure value of the screen press event reaches a second preset value.
In one embodiment, the preset condition comprises: the duration of the screen press event reaches the first preset value and the pressure value of the screen press event reaches the second preset value.
In a specific implementation, each of the above modules may be implemented as an independent entity, or the modules may be combined in any manner and implemented as one or several entities. For the specific implementation of the above modules, reference may be made to the foregoing method embodiments; details are not repeated here.
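For illustration only, the four modules of Fig. 5 could be expressed as small interfaces; the names mirror the module numbers in the text and are assumptions, not taken from any real library.

```kotlin
// Hypothetical interfaces mirroring the apparatus of Fig. 5 (modules 501 to 504).
interface DetectionModule {                 // 501: detect screen press events
    fun detectScreenPressEvent(onPress: (x: Int, y: Int, pressure: Float) -> Unit)
}

interface VoiceInputModule {                // 502: listen only when an input control is pressed
    fun startListeningIfInputControl(x: Int, y: Int): Boolean
}

interface SpeechRecognitionModule {         // 503: recognize after the press event ends
    fun recognize(voiceContent: ByteArray): String
}

interface ExecutionModule {                 // 504: insert the text into the input control
    fun insertText(textContent: String)
}
```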
An embodiment of the present application provides a computer-readable storage medium on which a computer program is stored; when the computer program is executed on a computer, it causes the computer to perform the steps of the voice input method provided in the first aspect of the embodiments of the present application.
An embodiment of the present application also provides a mobile terminal including a memory and a processor, the processor invoking a computer program stored in the memory to perform the steps of the voice input method provided by the embodiments of the present application.
Referring to Fig. 6, Fig. 6 is a schematic structural diagram of a mobile terminal provided by an embodiment of the present application. The mobile terminal may include a processor 601 with one or more processing cores, a memory 602 with one or more computer-readable storage media, an input unit 603, an output unit 604, a sensor 605, an audio circuit 606, a Wireless Fidelity (WiFi) module 607, a radio frequency (RF, Radio Frequency) circuit 608, a power supply 609, and other components. Those skilled in the art will understand that the mobile terminal structure shown in Fig. 6 does not limit the mobile terminal, which may include more or fewer components than shown, combine certain components, or use a different arrangement of components.
The processor 601 is the control center of the mobile terminal. It connects the various parts of the entire mobile terminal through various interfaces and lines, and performs the various functions of the mobile terminal and processes data by running or executing the application programs and/or modules stored in the memory 602 and invoking the data stored in the memory 602, thereby monitoring the mobile terminal as a whole. Optionally, the processor 601 may include one or more processing cores; preferably, the processor 601 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interfaces, application programs, and the like, and the modem processor mainly handles wireless communication. It will be understood that the modem processor may also not be integrated into the processor 601.
The memory 602 may be used to store application programs and data. The application programs stored in the memory 602 contain executable code and may form various functional modules. The processor 601 runs the application programs stored in the memory 602 to perform various functional applications and data processing. The memory 602 may mainly include a program storage area and a data storage area, where the program storage area may store the operating system and the application programs required by at least one function (such as a sound playback function or an image playback function), and the data storage area may store data created according to the use of the mobile terminal (such as audio data or a phone book). In addition, the memory 602 may include a high-speed random access memory and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage component. Accordingly, the memory 602 may also include a memory controller to provide the processor 601 and the input unit 603 with access to the memory 602.
The input unit 603 may be used to receive input numbers, character information, or user characteristic information (such as a fingerprint, voiceprint, face information, or iris information), and to generate keyboard, mouse, joystick, optical, or trackball signal inputs related to user settings and function control. Specifically, in one embodiment, the input unit 603 may include a touch-sensitive surface and other input devices. The touch-sensitive surface, also referred to as a touch display screen or touchpad, collects touch operations performed by the user on or near it (for example, operations performed by the user on or near the touch-sensitive surface using a finger, a stylus, or any other suitable object or accessory) and drives the corresponding connection apparatus according to a preset program. Optionally, the touch-sensitive surface may include two parts: a touch detection apparatus and a touch controller. The touch detection apparatus detects the touch position of the user, detects the signal generated by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection apparatus, converts it into contact coordinates, sends them to the processor 601, and can receive and execute commands sent by the processor 601. In addition, the touch-sensitive surface may be implemented as a resistive, capacitive, infrared, or surface acoustic wave type, among others. Besides the touch-sensitive surface, the input unit 603 may also include other input devices. Specifically, the other input devices may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys or a power key), a trackball, a mouse, and a joystick.
The output unit 604 may be used to display information input by the user or information provided to the user, as well as the various graphical user interfaces of the terminal, which may be composed of graphics, text, icons, video, and any combination thereof. The output unit 604 may include a speaker and a display panel. Optionally, the display panel may be configured in the form of a liquid crystal display (LCD, Liquid Crystal Display), an organic light-emitting diode (OLED, Organic Light-Emitting Diode) display, or the like. Further, the touch-sensitive surface may cover the display panel; after detecting a touch operation on or near it, the touch-sensitive surface transmits the operation to the processor 601 to determine the type of the touch event, and the processor 601 then provides a corresponding visual output on the display panel according to the type of the touch event. Although in Fig. 6 the touch-sensitive surface and the display panel implement the input and output functions as two independent components, in some embodiments the touch-sensitive surface and the display panel may be integrated to implement both the input and output functions.
The terminal may also include at least one sensor 605, such as an optical sensor, a motion sensor, or another sensor. Specifically, the optical sensor may include an ambient light sensor and a proximity sensor, where the ambient light sensor may adjust the brightness of the display panel according to the brightness of the ambient light, and the proximity sensor may turn off the display panel and/or the backlight when the terminal is moved to the ear. As one kind of motion sensor, a gravity acceleration sensor may detect the magnitude of acceleration in each direction (generally three axes), may detect the magnitude and direction of gravity when stationary, and may be used in applications that recognize the posture of the phone (such as landscape/portrait switching, related games, and magnetometer posture calibration) and in vibration recognition functions (such as a pedometer or tapping). The terminal may also be provided with other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor; details are not described here.
The audio circuit 606 may include a speaker and a microphone and may provide an audio interface between the user and the terminal. The audio circuit 606 may convert received audio data into an electrical signal and transmit it to the speaker, which converts it into a sound signal for output; on the other hand, the microphone converts a collected sound signal into an electrical signal, which is received by the audio circuit 606 and converted into audio data. The audio data is then output to the processor 601 for processing and, for example, sent to another terminal via the RF circuit 608, or output to the memory 602 for further processing. The audio circuit 606 may also include an earphone jack to provide communication between an external earphone and the terminal.
WiFi is a short-range wireless transmission technology. Through the WiFi module 607, the mobile terminal can help the user send and receive e-mail, browse web pages, access streaming media, and the like, providing the user with wireless broadband Internet access. Although Fig. 6 shows the WiFi module 607, it will be understood that it is not an essential part of the mobile terminal and may be omitted as needed without changing the essence of the invention.
The RF circuit 608 may be used to send and receive signals during information transmission and reception or during a call. In particular, after receiving downlink information from a base station, it hands the information to one or more processors 601 for processing; in addition, it sends uplink data to the base station. In general, the RF circuit 608 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a subscriber identity module (SIM, Subscriber Identity Module) card, a transceiver, a coupler, a low noise amplifier (LNA, Low Noise Amplifier), a duplexer, and the like. In addition, the RF circuit 608 may also communicate with networks and other devices through wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to Global System for Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), e-mail, Short Messaging Service (SMS), and the like.
The mobile terminal also includes a power supply 609 (such as a battery) that supplies power to the various components. Preferably, the power supply 609 may be logically connected to the processor 601 through a power management system, so that functions such as charging, discharging, and power consumption management are implemented through the power management system. The power supply 609 may also include one or more direct-current or alternating-current power sources, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and other arbitrary components.
Although not shown, the mobile terminal may also include a camera, a Bluetooth module, and the like; details are not described here.
In this embodiment of the present application, the processor 601 in the mobile terminal loads the executable code corresponding to the processes of one or more application programs into the memory 602 according to the following instructions, and the processor 601 runs the application programs stored in the memory 602, thereby performing:
detecting a screen press event;
if the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive voice content input by the user;
when the screen press event ends, performing speech recognition on the voice content to obtain text content corresponding to the voice content; and
inserting the text content into the input control.
In one embodiment, when performing speech recognition on the voice content, the processor 601 may perform: determining whether an offline speech recognition model exists locally on the mobile terminal; and if so, invoking the offline speech recognition model to perform speech recognition on the voice content.
In one embodiment, after determining whether an offline speech recognition model exists locally on the mobile terminal, the processor 601 may further perform: if not, connecting to a cloud speech recognition server to perform speech recognition on the voice content.
In one embodiment, after the screen press event is detected, the processor 601 may further perform: determining whether the screen press event meets a preset condition; and if the screen press event meets the preset condition and the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive the voice content input by the user.
In one embodiment, the preset condition comprises: the duration of the screen press event reaches a first preset value.
In one embodiment, the preset condition comprises: the pressure value of the screen press event reaches a second preset value.
In one embodiment, the preset condition comprises: the duration of the screen press event reaches the first preset value and the pressure value of the screen press event reaches the second preset value.
The mobile terminal provided by the embodiments of the present application and the voice input method in the foregoing embodiments belong to the same concept. Any of the methods provided in the voice input method embodiments may run on the mobile terminal; for the specific implementation process, refer to the voice input method embodiments, and details are not repeated here.
In the above embodiments, the description of each embodiment has its own emphasis. For parts not described in detail in a particular embodiment, refer to the detailed description of the voice input method above; details are not repeated here.
It should be noted that, for the voice input method provided by the embodiments of the present application, those of ordinary skill in the art will understand that all or part of the flow of the voice input method may be completed by a computer program controlling the relevant hardware. The computer program may be stored in a computer-readable storage medium, for example in a memory, and executed by at least one processor; during execution, the flow may include the flow of the embodiments of the voice input method. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM, Read Only Memory), a random access memory (RAM, Random Access Memory), or the like.
For the voice input apparatus of the embodiments of the present application, the functional modules may be integrated in one processing chip, or each module may exist physically on its own, or two or more modules may be integrated in one module. The above integrated modules may be implemented in the form of hardware or in the form of software functional modules. If an integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium, such as a read-only memory, a magnetic disk, or an optical disc.
The voice input method, apparatus, storage medium and mobile terminal provided by the embodiments of the present application have been described in detail above. Specific examples have been used herein to illustrate the principles and implementations of the present application, and the description of the above embodiments is only intended to help understand the method of the present application and its core concept. Meanwhile, those skilled in the art may make changes to the specific implementations and the scope of application according to the ideas of the present application. In summary, the content of this specification should not be construed as limiting the present application.

Claims (10)

1. A voice input method, applied to a mobile terminal, characterized by comprising:
detecting a screen press event;
if the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive voice content input by the user;
when the screen press event ends, performing speech recognition on the voice content to obtain text content corresponding to the voice content; and
inserting the text content into the input control.
2. The method according to claim 1, wherein performing speech recognition on the voice content comprises:
determining whether an offline speech recognition model exists locally on the mobile terminal; and
if so, invoking the offline speech recognition model to perform speech recognition on the voice content.
3. The method according to claim 2, wherein after determining whether an offline speech recognition model exists locally on the mobile terminal, the method further comprises:
if not, connecting to a cloud speech recognition server to perform speech recognition on the voice content.
4. The method according to claim 1, wherein after detecting the screen press event, the method further comprises:
determining whether the screen press event meets a preset condition; and
if the screen press event meets the preset condition and the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive the voice content input by the user.
5. The method according to claim 4, wherein the preset condition comprises:
the duration of the screen press event reaches a first preset value.
6. The method according to claim 4, wherein the preset condition comprises:
the pressure value of the screen press event reaches a second preset value.
7. The method according to claim 4, wherein the preset condition comprises:
the duration of the screen press event reaches a first preset value and the pressure value of the screen press event reaches a second preset value.
8. A voice input apparatus, applied to a mobile terminal, characterized by comprising:
a detection module, configured to detect a screen press event;
a voice input module, configured to start voice listening to receive voice content input by the user when the user interface component corresponding to the position where the screen press event occurs is an input control;
a speech recognition module, configured to perform speech recognition on the voice content after the screen press event ends, to obtain text content corresponding to the voice content; and
an execution module, configured to insert the text content into the input control.
9. A storage medium, wherein the storage medium stores a plurality of instructions suitable for being loaded by a processor to perform the steps in the voice input method according to any one of claims 1 to 7.
10. A mobile terminal, the mobile terminal comprising a processor and a memory, wherein the memory is configured to store executable program code, and the processor runs a program corresponding to the executable program code by reading the executable program code stored in the memory, to perform the following steps:
detecting a screen press event;
if the user interface component corresponding to the position where the screen press event occurs is an input control, starting voice listening to receive voice content input by the user;
when the screen press event ends, performing speech recognition on the voice content to obtain text content corresponding to the voice content; and
inserting the text content into the input control.
CN201811575128.5A 2018-12-21 2018-12-21 Voice input method, apparatus, storage medium and mobile terminal Pending CN109669662A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811575128.5A CN109669662A (en) 2018-12-21 2018-12-21 Voice input method, apparatus, storage medium and mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811575128.5A CN109669662A (en) 2018-12-21 2018-12-21 Voice input method, apparatus, storage medium and mobile terminal

Publications (1)

Publication Number Publication Date
CN109669662A true CN109669662A (en) 2019-04-23

Family

ID=66145916

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811575128.5A Pending CN109669662A (en) Voice input method, apparatus, storage medium and mobile terminal

Country Status (1)

Country Link
CN (1) CN109669662A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110069200A (en) * 2019-04-24 2019-07-30 努比亚技术有限公司 Wearable device input control method, wearable device and storage medium
CN111459288A (en) * 2020-04-23 2020-07-28 捷开通讯(深圳)有限公司 Method and device for realizing voice input by using head control
CN112199033A (en) * 2020-09-30 2021-01-08 北京搜狗科技发展有限公司 Voice input method and device and electronic equipment
CN113946229A (en) * 2021-08-31 2022-01-18 安徽咪鼠科技有限公司 Non-input method type cross-application voice typing method facing android operating system
CN114281182A (en) * 2020-09-17 2022-04-05 华为技术有限公司 Man-machine interaction method, device and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103257824A (en) * 2013-04-02 2013-08-21 东莞宇龙通信科技有限公司 Method, system and mobile terminal for selecting operation object displayed on screen
CN103399709A (en) * 2013-08-02 2013-11-20 北京小米科技有限责任公司 Screen unlocking method and device
CN105487809A (en) * 2015-12-28 2016-04-13 魅族科技(中国)有限公司 Terminal control method and device
CN106775349A (en) * 2016-11-29 2017-05-31 珠海市魅族科技有限公司 A kind of speech modification method and device of word content
CN106990887A (en) * 2016-01-21 2017-07-28 阿里巴巴集团控股有限公司 Data inputting method, device and terminal device
CN107645598A (en) * 2017-09-20 2018-01-30 联想(北京)有限公司 A kind of message display method and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103257824A (en) * 2013-04-02 2013-08-21 东莞宇龙通信科技有限公司 Method, system and mobile terminal for selecting operation object displayed on screen
CN103399709A (en) * 2013-08-02 2013-11-20 北京小米科技有限责任公司 Screen unlocking method and device
CN105487809A (en) * 2015-12-28 2016-04-13 魅族科技(中国)有限公司 Terminal control method and device
CN106990887A (en) * 2016-01-21 2017-07-28 阿里巴巴集团控股有限公司 Data inputting method, device and terminal device
CN106775349A (en) * 2016-11-29 2017-05-31 珠海市魅族科技有限公司 A kind of speech modification method and device of word content
CN107645598A (en) * 2017-09-20 2018-01-30 联想(北京)有限公司 A kind of message display method and electronic equipment

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110069200A (en) * 2019-04-24 2019-07-30 努比亚技术有限公司 Wearable device input control method, wearable device and storage medium
CN110069200B (en) * 2019-04-24 2024-02-09 努比亚技术有限公司 Wearable device input control method, wearable device and storage medium
CN111459288A (en) * 2020-04-23 2020-07-28 捷开通讯(深圳)有限公司 Method and device for realizing voice input by using head control
CN114281182A (en) * 2020-09-17 2022-04-05 华为技术有限公司 Man-machine interaction method, device and system
CN112199033A (en) * 2020-09-30 2021-01-08 北京搜狗科技发展有限公司 Voice input method and device and electronic equipment
CN113946229A (en) * 2021-08-31 2022-01-18 安徽咪鼠科技有限公司 Non-input method type cross-application voice typing method facing android operating system

Similar Documents

Publication Publication Date Title
CN104243671B (en) Volume adjusting method, device and electronic equipment
CN104636047B (en) The method, apparatus and touch screen terminal operated to the object in list
CN108549519B (en) Split screen processing method and device, storage medium and electronic equipment
CN104978176B (en) Application programming interfaces call method, device and computer readable storage medium
CN109669662A (en) Voice input method, apparatus, storage medium and mobile terminal
CN105471704B (en) A kind of method, apparatus and system for realizing more people's calls
CN105022552B (en) A kind of method and apparatus for showing messaging list
CN104252341B (en) The client device of the skin change method of application program, device and application program
CN103473011A (en) Mobile terminal, and performance detecting method and performance detecting device for mobile terminal
CN107168609A (en) status bar display method, device and terminal
CN106200970B (en) The method and terminal of a kind of split screen display available
CN104077184B (en) A kind of course control method and computer system of application program
CN103312597B (en) A kind of speech message generation method and device
CN103197944B (en) Method for determining performance of process of application program and computer system
CN104660769B (en) A kind of methods, devices and systems for adding associated person information
CN105162930B (en) The way of recording and device of mobile terminal
CN103488421A (en) Method and device for processing call record and mobile terminal
CN106708501B (en) Activity monitoring method and device
CN105700620A (en) Control instruction execution method and apparatus
CN105991405B (en) A kind of method for building up and device of instant messaging
CN106371896B (en) Virutal machine memory configuration method and device, mobile terminal
CN106255146B (en) A kind of electricity-saving control method of terminal, device and terminal
CN109062643A (en) A kind of display interface method of adjustment, device and terminal
WO2015043239A1 (en) Method and device for playing media data on a terminal
CN111897916A (en) Voice instruction recognition method and device, terminal equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190423