Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that is obtained under the creative work prerequisite.
Embodiment one
As shown in Figure 1, present embodiment provides a kind of information interaction system, and this system comprises: audio-video terminal 10, server 11.
Audio-video terminal 10 supports the encoding and decoding of standard pronunciation video flowing to handle and the transmission reception, can receive the common format video data, it is reduced to video image is presented on the video display; When being pressed by the user, the keyboard of audio-video terminal 10 produces DTMF/FSK (Double Tone Multiple Frequency/Frequency-Shift Keying, dual-tone multifrequency/frequency shift keying) signal, be sent to server 11 by voice-grade channel, audio-video terminal 10 is connected with server 11 by the IP broadband network, communicates.
Server 11, can receive the DTMF/FSK signal that audio-video terminal 10 sends by voice-grade channel, the DTMF/FSK conversion of signals is become press key message sent by user, then according to the input method information retrieval dictionary of key information and current use, determine described key information corresponding character result, generate text results, and composite video image during with this fructufy, at last with synthetic video image after encoding and decoding conversion is view data, be sent to audio-video terminal 10 by video channel; Wherein, the real-time synthetic video image can be: tabling look-up by character library obtains the view data of text results, for example message bit pattern in the dot matrix word library or the contour vector information in the vector font library, be added on the output image assigned address according to the view data that obtains, finish of the conversion of the ISN of a literal to image.
Wherein, audio-video terminal 10 comprises: video display 101, standard telephone keypad 102, telephone receiver 103, and dial module 104, audio-frequency module 105 and video module 106; Video display 101 is used for display video image; Standard telephone keypad 102 is used to receive user's clicking operation; Telephone receiver 103 is used to gather user's voice, the audio-frequency information of receiving to user's playback terminal by loud speaker; Dial module 104 is used for producing corresponding DTMF/FSK signal according to the input of keyboard 102; Audio-frequency module 105 is used for communicating by voice-grade channel and server 11, send the speech data of receiver 103 collections and the DTMF/FSK signal that dial module 104 produces to server 11, the voice data that reception server 11 sends sends telephone receiver 103 to after the decoding; Video module 106 is used for communicating by video channel and server 11, and the video data that reception server 11 sends sends video display 101 to after the decoding.
The present embodiment technical scheme, server is by carrying out dissection process according to input method information to the key information that audio-video terminal sends, generate the corresponding character result, afterwards this literal result is sent to audio-video terminal with the form of video image, thereby on the basis of existing common audio frequency and video telephone terminal that do not need to upgrade, make the user utilize the general audio-video terminal equipment just can to finish complicated literal mutual with server, send complicated literal to server, obtain to get more information and service.
Embodiment two
As shown in Figure 2, present embodiment provides a kind of information interactive method, and this method comprises:
Step 201: the key information that receiving terminal sends;
Step 202: the key information of receiving according to the input method information butt joint carries out dissection process, determines this key information corresponding character result, generates text results; Wherein, described dissection process is as input, to inquire about the dictionary of current input with key information, the key information corresponding character result of acquisition input;
Step 203: with the text results composite video image that generates; Wherein, with the text results composite video image that generates can be: tabling look-up by character library obtains the view data of text results, for example message bit pattern in the dot matrix word library or the contour vector information in the vector font library, be added on the output image assigned address according to the view data that obtains, finish of the conversion of the ISN of a literal to image;
Step 204: synthetic video image is sent to terminal.
The present embodiment technical scheme, by the key information that audio-video terminal sends being carried out dissection process according to input method information, generate the corresponding character result, afterwards this literal result is sent to audio-video terminal with the form of video image, thereby on the basis of existing common audio frequency and video telephone terminal that do not need to upgrade, it is mutual to make the user utilize general audio-video terminal equipment just can finish complicated literal, sends complicated literal to server, obtains to get more information and service.
Embodiment three
As shown in Figure 3, present embodiment provides a kind of information interactive method, and this method comprises:
Step 300: audio-video terminal and server connect, and select input method; Wherein, this connection comprises voice-grade channel and video channel, can support input methods in the server, for example phonetic, stroke, English, symbol, Japanese, Korean etc., after the user selectes a kind of input method by audio-video terminal, with the input method of this selected input method as current use;
Step 301: server receives the key information that audio-video terminal sends by voice-grade channel, and this key information is the DTMF/FSK signal that audio-video terminal produces;
Step 302: the key information that server is received according to the input method information butt joint of current use carries out dissection process, as input, utilizes the input rule of the input method defined of current use with the key information that receives, generates text results;
Step 303: server is with the text results composite video image that generates;
Step 304: server sends to audio-video terminal with synthetic vedio data by video channel.
The present embodiment technical scheme, server carries out dissection process according to input method information to the key information that audio-video terminal sends, generate the corresponding character result, server sends to audio-video terminal with this literal result with the form of video image afterwards, thereby on the basis of existing common audio frequency and video telephone terminal that do not need to upgrade, make the user utilize general audio-video terminal equipment just can and server between to finish complicated literal mutual, send complicated literal to server, and obtain to get more information and service from server; Support input methods on the server simultaneously, make the literal of user terminal and server more flexible alternately, more can satisfy requirements of different users.
Embodiment four
Be a kind of information interactive method that example explanation present embodiment provides to import " " word below.
Shown in Fig. 4 a, this information interactive method phonetic input process comprises the steps:
Step 400: terminal is connected with server by dialing, sets up voice-grade channel and video channel;
Step 401: server defines the synthetic keyboard structural images of descriptor with keyboard;
Step 402: server is sent to terminal by video channel with this image; Wherein, the keyboard structure image can be as shown in Figure 5, uses spelling input method this moment;
Step 403: terminal is the display keyboard structural images on its video display;
Step 404: the user clicks phonetic alphabet " a " corresponding key of " " word, i.e. " 2 " key of terminal keyboard;
Step 405: terminal is converted to the DTMF/FSK signal with button " 2 ", is sent to server by voice-grade channel;
Step 406: server detects the DTMF/FSK signal in the voice-grade channel, and conversion DTMF/FSK signal is button " 2 ";
Step 407: according to the input rule of spelling input method, button " 2 " is converted to option " a ", " b " and " c ", and determines the corresponding character result according to the dictionary of spelling input method;
Step 408:, generate video data with option " a ", " b " and " c " and corresponding character composograph as a result; Wherein, with option " a ", " b " and " c " and corresponding character as a result composograph can be: tabling look-up by character library obtains option " a ", " b " and " c " and corresponding character result's view data, for example message bit pattern in the dot matrix word library or the contour vector information in the vector font library, be added on the output image assigned address according to the view data that obtains, finish of the conversion of the ISN of a literal to image, below other composograph and similar repeating no more herein;
Step 409: server is sent to terminal with the video data that generates by video channel;
Step 410: terminal is reduced into image with video data, is shown to the user by video display; Wherein, user's click keys " 2 " back terminal video display prompts displayed image as shown in Figure 6.
Certainly, above-mentioned phonetic input process can be come input Pinyin by a plurality of buttons of one click, and for example click keys " 5 ", button " 3 " obtain option " le " " ke " and corresponding character result.
After input Pinyin, can further confirm that to the phonetic of input shown in Fig. 4 b, this information interactive method phonetic affirmation process comprises the steps:
Step 411: the user puts " 1 " button of beating keyboard and confirms phonetic;
Step 412: terminal is converted to the DTMF/FSK signal with button " 1 ", is sent to server by voice-grade channel;
Step 413: server detects the DTMF/FSK signal in the voice-grade channel, and conversion DTMF/FSK signal is button " 1 ";
Step 414: according to the input rule of spelling input method, button " 1 " is converted to affirmation, and provides literal information to be selected;
Step 415:, generate video data with literal information composograph to be selected;
Step 416: server is sent to terminal with the video data that generates by video channel;
Step 417: terminal is reduced into image with video data, is shown to the user by video display; Wherein, user's click keys " 1 " back terminal video display prompts displayed image as shown in Figure 7.
After determining phonetic, therefore the corresponding a plurality of literal of phonetic possibility can further be selected a plurality of literal, and shown in Fig. 4 c, this information interactive method literal selection course comprises the steps:
Step 418: the user puts the selected literal of " 6 " button of beating keyboard;
Step 419: terminal is converted to the DTMF/FSK signal with button " 6 ", is sent to server by voice-grade channel;
Step 420: server detects the DTMF/FSK signal in the voice-grade channel, and conversion DTMF/FSK signal is button " 6 ";
Step 421:, button " 6 " is converted to selected literal " " according to the input rule of spelling input method;
Step 422: the literal " " that server record is selected;
Step 423:, generate video data with the selected information composograph of literal;
Step 424: server is sent to terminal with the video data that generates by video channel;
Step 425: terminal is reduced into image with video data, is shown to the user by video display; Wherein, user's click keys " 6 " back terminal video display prompts displayed image as shown in Figure 8.
With reference to the accompanying drawings 5, server can be given the different different implications of key definition of pressing, and provide prompting, except definition enter key (numerical key " 2 " is to " 9 "), acknowledgement key (numerical key " 1 "), can also define switch key (" * " key), delete key (numerical key " 0 "), ESC Escape (" # " key) etc. simultaneously.The function of each function key is carried out as giving a definition, for example, in step 417, not only there is phonetic " a " to also have " b " and " c " in the prompts displayed image, switch key is used for switching to another phonetic from a phonetic and switches, if this moment, the user wished to check phonetic " b " corresponding character result, then click a switch key and switch to phonetic " b " corresponding character result, click a switch key again and switch to phonetic " c " corresponding character result, when showing that last phonetic (being " c ") corresponding character as a result herein, if click one time switch key again, then return and show first phonetic (being " a ") corresponding character result herein, certainly before confirming, also can switch a plurality of phonetics at any time, not limit here by clicking switch key; In superincumbent each step of user, carried out the input of a mistake, for example, imported the phonetic of a mistake, perhaps Cuo Wu click acknowledgement key, then can click the phonetic of the mistake of delete key deletion input, the input state before perhaps recovering to confirm; When user's input characters finishes or need withdraw from input method because of other reasons midway, then can click ESC Escape and withdraw from.
The present embodiment technical scheme, server carries out dissection process according to input method information to the key information that audio-video terminal sends, generate the corresponding character result, server sends to audio-video terminal with this literal result with the form of video image afterwards, thereby on the basis of existing common audio frequency and video telephone terminal that do not need to upgrade, make the user utilize general audio-video terminal equipment just can and server between to finish complicated literal mutual, send complicated literal to server, and obtain to get more information and service from server; In addition, in input process, affirmation process and selection course, can be at different operation of user and current state, server is made different corresponding, and the function that provide switching, delete, withdraws from, thereby adopt the information interacting method of present embodiment, can satisfy the various information interaction demands of user, have good versatility.
Embodiment five
As shown in Figure 9, present embodiment provides a kind of device of information interaction, and this device comprises:
Modular converter 91 is used for the audio-frequency information that terminal sends is converted to key information;
Processing module 92 is connected with modular converter 91, is used for according to input method information, determines the key information corresponding character result of described modular converter output;
Image module 93 is connected with processing module 92, is used for the text results that described processing module is determined is changed into view data; Wherein, text results changes into view data: tabling look-up by character library obtains the view data of literal, for example message bit pattern in the dot matrix word library or the contour vector information in the vector font library, be added on the output image assigned address according to the view data that obtains, finish of the conversion of the ISN of a literal to image;
Sending module 94 is connected with image module 93, is used for the view data that described image module transforms is sent to described terminal.
Wherein, modular converter 91 can comprise: detection module 911 is used for detecting the DTMF/FSK signal that terminal that voice-grade channel receives sends; Decoder module 912 is connected with detection module 911, is used for detection module 911 detected DTMF/FSK signals are decoded, and this DTMF/FSK conversion of signals is become key information.
Wherein, processing module 92 can comprise: determination module 921, be connected with modular converter 91 or decoder module 912, be used for according to input method information, described key information is converted to input method coding, according to the input method coding of described input method coding inquiry preservation and the corresponding relation of text results, determine described key information corresponding character result; Dictionary module 922 is connected with determination module 921, and the input method coding that is used to preserve and the corresponding relation of text results are for described determination module 921 inquiries.
Wherein, image module 93 can comprise: synthesis module 931, be connected with processing module 92 or determination module 921, and be used to obtain the character pattern information of the text results that described processing module 92 determines, utilize described character pattern information composograph data; Character base module 932, the character pattern information that is used to preserve literal is obtained for described synthesis module 931.
In other embodiments, can comprise above each module simultaneously in the device of same information interaction.
The device of above-mentioned each information interaction can be a kind of server, communicates by IP broadband network and audio-video terminal, for audio-video terminal provides the audio frequency and video service.Wherein, this server can be the server in the information interaction system of embodiment one.
The present embodiment technical scheme, server is by carrying out dissection process according to input method information to the key information that audio-video terminal sends, generate the corresponding character result, afterwards this literal result is sent to audio-video terminal with the form of video image, thereby on the basis of existing common audio frequency and video telephone terminal that do not need to upgrade, make the user utilize the general audio-video terminal equipment just can to finish complicated literal mutual with server, send complicated literal to server, obtain to get more information and service.
In a word, the above is preferred embodiment of the present invention only, is not to be used to limit protection scope of the present invention.Within the spirit and principles in the present invention all, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.