[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN107330120B - Inquire answer method, inquiry answering device and computer readable storage medium - Google Patents

Inquire answer method, inquiry answering device and computer readable storage medium Download PDF

Info

Publication number
CN107330120B
CN107330120B CN201710575024.3A CN201710575024A CN107330120B CN 107330120 B CN107330120 B CN 107330120B CN 201710575024 A CN201710575024 A CN 201710575024A CN 107330120 B CN107330120 B CN 107330120B
Authority
CN
China
Prior art keywords
inquiry
correlation
degree
inquiry message
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710575024.3A
Other languages
Chinese (zh)
Other versions
CN107330120A (en
Inventor
陈华荣
亓超
王卓然
马宇驰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Triangle Animal (beijing) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Triangle Animal (beijing) Technology Co Ltd filed Critical Triangle Animal (beijing) Technology Co Ltd
Priority to CN201710575024.3A priority Critical patent/CN107330120B/en
Publication of CN107330120A publication Critical patent/CN107330120A/en
Application granted granted Critical
Publication of CN107330120B publication Critical patent/CN107330120B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A kind of inquiry answer method of present invention offer, inquiry answering device and computer readable storage medium, the inquiry answer method include:Semantic processes step (S101) carries out semantic processes, with the user view of the inquiry purpose of reaction of formation inquiry message and for carrying out retrieving retrieval information used according to inquiry message to inquiry message input by user;Searching step (S102) is based on the retrieval information, and the data retrieval based on participle is carried out from database, obtains the list of candidate solid data;Sequence step (S103) is ranked up processing based on the degree of correlation between candidate solid data and user view to candidate solid data;And first result determine step (S104), will the candidate solid data with the highest degree of correlation in list, be determined as the response result for user's inquiry message.

Description

Inquire answer method, inquiry answering device and computer readable storage medium
Technical field
The present invention relates to inquiry answer method, inquiry answering device and computer-readable storage mediums based on semantic understanding Matter.
Background technology
Currently, fuzzy semantics understanding is a very universal problem in information retrieval and semantic analysis, if cannot be very Good carries out it semantic identification, and the result of return may not be very much the result that user wants greatly.Voice input is just becoming more Carry out more universal interactive mode, although having benefited from the promotion of computing capability and the accumulation of mass data, the use of deep learning is big Width reduces identification error rate, but still has the error rate of 4%-5%, is particularly acute in the field that certain neologisms occur frequently, this is just So that fuzzy semantics understanding seems critically important.Still further aspect, due to information huge explosion, man memory power is limited, when very much Waiting possibly can not accurately say whole information, this is but also fuzzy semantics understand a necessary part as system.
In view of the above-mentioned problems, the Chinese patent application that application publication number is CN106294875A proposes a kind of fuzzy inspection of entity Rope method and system, but this method is relatively simple, does not account for the factor of phonetic error correction etc, it is difficult to solve current Vague language The problem of reason and good sense solution.
Separately have, it is crucial in a kind of network search procedure of Chinese patent application proposition that application publication number is CN101206673A The intelligent correction system and method for word.The system is applied on internet platform, establish language model, corresponding dictionary and Data directory database, calculates sound character error and fuzzy matching calculates morphological pattern error correction, and degree of correlation filtering and sequence are carried out to result, Obtain immediate several results.This method is to be used for web search, is not applied for taking turns the fuzzy search in dialogue more, cannot Solve the problem of that the error correction of fuzzy phoneme cannot solve state transition in more wheel dialogues, cannot solving retrieval result, there is no optimal In the case of error correction, also not to coming to nothing when how to deal with and be defined, also influence of the error correction result to display, such as Prompt message etc..
Invention content
The present invention is had developed in view of the above problem in the prior art.The present invention is intended to provide one kind can carry out Vague language The system and method for reason and good sense solution are sent out inaccurate the problems such as user is because of the fuzzy expression of speech intonation, mistake or does not remember clearly When true instruction, system remains to make correct semantic understanding and smoothly completes information retrieval on this basis.It is suitable for institute Some need obscures the scene of error correction, includes the fuzzy semantics error correction in the semantic error correction of web search, and more wheel dialogues.
The first aspect of the present invention provides a kind of inquiry answer method based on semantic understanding, the inquiry answer method packet It includes:Semantic processes step (S101) carries out semantic processes, with the inquiry of reaction of formation inquiry message to inquiry message input by user Ask the user view of purpose and for carrying out retrieving retrieval information used according to inquiry message;Searching step (S102), is based on The retrieval information carries out the data retrieval based on participle from database, obtains the list of candidate solid data;Sequence step (S103), based on the degree of correlation between candidate solid data and user view, processing is ranked up to candidate solid data;And First result determines step (S104), and the candidate solid data with the highest degree of correlation in list is determined as asking for user Ask the response result of information.
The second aspect of the present invention provides a kind of inquiry answering device based on semantic understanding, the inquiry answering device packet It includes:Semantic processing unit (1101) carries out semantic processes, with the inquiry of reaction of formation inquiry message to inquiry message input by user Ask the user view of purpose and for carrying out retrieving retrieval information used according to inquiry message;Retrieval unit (1102), is based on The retrieval information carries out the data retrieval based on participle from database, obtains the list of candidate solid data;Sequencing unit (1103), based on the degree of correlation between candidate solid data and user view, processing is ranked up to candidate solid data;And Candidate solid data with the highest degree of correlation in list is determined as asking for user by the first result determination unit (1104) Ask the response result of information.
The third aspect of the present invention provides a kind of inquiry response system (100) based on semantic understanding, the system comprises User terminal (1001) and the server (1002) being connect with user terminal, the user terminal include:Input receiving unit (10011), receive inquiry message input by user;Semantic processing unit (10012) carries out semantic processes to inquiry message, with The user view of the inquiry purpose of reaction of formation inquiry message and for being carried out retrieving retrieval information used according to inquiry message; Inquiry message, the user view of the inquiry message and retrieval information are sent to by transmission unit (10013) in a manner of associated Server, and the response result for inquiry message is received from server, the server includes:Receiving unit (10021), from User terminal receives inquiry message and user view associated with the inquiry message and retrieval information;Retrieval unit (10022), Based on the retrieval information, the data retrieval based on participle is carried out from database, obtains the list of candidate solid data;Sequence Unit (10023) is ranked up candidate solid data based on the degree of correlation between candidate solid data and user view;With And the candidate solid data with the highest degree of correlation in list is determined as inquiring for user by result determination unit (10024) The response result of information, and response result is sent to user terminal.
The fourth aspect of the present invention provides a kind of computer readable storage medium, stores computer program, the calculating When being executed by processor, realization includes the steps that machine program according in above-mentioned inquiry answer method.
According to the present invention, even if can be smooth if the inquiry message of input error the problems such as due to user's vagueness in memory Completion information retrieval so that user can obtain and the closer retrieval result of the intention of user.In addition, even if there is no In the case of optimal retrieval result, error correction can be also carried out, and provide a user the result after error correction.
Description of the drawings
In order to more clearly explain the technical solutions in the embodiments of the present application, make required in being described below to embodiment Attached drawing is briefly described, it should be apparent that, the accompanying drawings in the following description is only some implementations described in the application Example, without creative efforts, can also be according to these attached drawings for this field or those of ordinary skill Obtain other attached drawings.
Fig. 1 is the figure for the hardware construction for showing the inquiry answering device in the present invention.
Fig. 2 is the flow chart for illustrating inquiry answer method according to a first embodiment of the present invention.
Fig. 3 is the flow chart for the semantic processes step for illustrating inquiry answer method according to the present invention.
Fig. 4 is the flow chart for the sequence step for illustrating inquiry answer method according to the present invention.
Fig. 5 is the block diagram for the software configuration for illustrating inquiry answering device according to first embodiment.
Fig. 6 is the flow chart for illustrating inquiry answer method according to a second embodiment of the present invention.
Fig. 7 is the block diagram for the software configuration for illustrating inquiry answering device according to second embodiment.
Fig. 8 is the flow chart for illustrating inquiry answer method according to the preferred embodiment of the invention.
Fig. 9 is the block diagram for the software configuration for illustrating the inquiry answering device according to preferred embodiment.
Figure 10 is the schematic diagram for illustrating the inquiry response system of the present invention.
Specific implementation mode
Hereinafter describe the embodiment of the present invention in detail with reference to the accompanying drawings.It should be appreciated that following embodiments and unawareness The figure limitation present invention, also, about the means according to the present invention solved the problems, such as, it is not absolutely required to be retouched according to following embodiments The whole combinations for the various aspects stated.For simplicity, to identical structure division or step, identical label or mark have been used Number, and the description thereof will be omitted.
[hardware configuration of inquiry answering device]
Fig. 1 is the figure for the hardware construction for showing the inquiry answering device in the present invention.In the present embodiment, with smart phone Example as inquiry answering device provides description.Although it is noted that illustrating smart phone in the present embodiment as inquiry Ask answering device 1000, but it is clear that without being limited thereto, inquiry answering device of the invention can be mobile terminal (smart mobile phone, Smartwatch, Intelligent bracelet, music player devices), laptop, tablet computer, PDA (personal digital assistant), fax dress Set, printer or be with inquiry answering internet device (such as digital camera, refrigerator, television set etc.) Etc. various devices.
First, the hardware configuration of block diagram description inquiry answering device 1000 (2000,3000) referring to Fig.1.In addition, at this Following construction is described as example in embodiment, but the inquiry answering device of the present invention is not limited to construction shown in FIG. 1.
Inquire that answering device 1000 includes input interface 101, CPU 102, the ROM being connected to each other via system bus 103, RAM 105, storage device 106, output interface 104, communication unit 107 and short-range wireless communication unit 108 and display Unit 109.Input interface 101 is the interface executed instruction for receiving data and function that user is inputted, and is For being received from data input by user via the operating unit (not shown) of such as microphone, button, button or touch screen and The interface of operational order.It note that the display unit 109 being described later on and operating unit can be at least partly integrated, also, For example, it may be carrying out picture output in same picture and receiving the construction of user's operation.
CPU 102 is system control unit, and generally comprehensively answering device 1000 is inquired in control.In addition, for example, CPU 102 carries out the display control of the display unit 109 of inquiry answering device 1000.ROM 103 stores all of the execution of CPU 102 Such as fixed data of tables of data and control program and operating system (OS) program.In the present embodiment, it is stored in ROM 103 Each control program carry out at such as scheduling, task switching and interruption for example, under the management of the OS stored in ROM 103 The software of reason etc. executes control.
The constructions such as SRAM (static RAM), DRAM by needing stand-by power supply of RAM 105.This In the case of, RAM 105 can store the significant data of control variable of program etc. in a non-volatile manner.In addition, RAM 105 Working storage as CPU 102 and main memory.
Storage device 106 stores model trained in advance (for example, word error correction mode, physical model, Rank models, semanteme Model etc.), the database for being retrieved and for execute it is according to the present invention inquiry answer method application program etc.. It note that database here can also be stored in the external device (ED) of such as server.In addition, storage device 106 store it is all It is such as used to carry out the information for sending/receiving transmission/receiving control program via communication unit 107 and communication device (not shown) Various programs and the various information that use of these programs.In addition, storage device 106 also stores inquiry answering device 1000 Setting information, inquire answering device 1000 management data etc..
Output interface 104 is for being controlled display unit 109 to show the display picture of information and application program The interface in face.Display unit 109 is for example constructed by LCD (liquid crystal display).Have such as by being arranged on display unit 109 The soft keyboard of the key of numerical value enter key, mode setting button, decision key, cancel key and power key etc. can receive single via display The input from the user of member 109.
Inquire answering device 100 via communication unit 107 for example, by wireless communications such as Wi-Fi (Wireless Fidelity) or bluetooth Method holds row data communication with external device (ED) (not shown).
In addition, inquiry answering device 1000 can also via short-range wireless communication unit 108, in short-range with External device (ED) etc. is wirelessly connected and holds row data communication.And short-range wireless communication unit 108 by with communication unit 107 different communication means are communicated.It is, for example, possible to use its communication range is shorter than the communication means of communication unit 107 Communication means of the Bluetooth Low Energy (BLE) as short-range wireless communication unit 108.In addition, as short-distance wireless communication list The communication means of member 108, for example, it is also possible to perceive (Wi-Fi Aware) using NFC (near-field communication) or Wi-Fi.
[first embodiment]
[inquiry answer method according to first embodiment]
Inquiry answer method according to the present invention can be by inquiring that the readings of CPU 102 of answering device 1000 are stored in ROM 103 or control program on storage device 106 or via communication unit 107 from passing through network and inquiry answering device The network server (not shown) of 1000 connections and the control program downloaded are realized.
Before carrying out inquiry answer method according to the present invention, first preparation model and database are needed.Detailed process is such as Under:
(1) crawl of related data:The crawl of the crawl and associated data such as label etc. of solid data, In, solid data refers to the entity in certain field (such as video field), such as film " private savings of husbands ", " the Mi months pass ", and Label is exactly the word for describing the entity:Such as " social forest ", " love ".
(2) training of model:Word error correcting model:The mapping for establishing the pinyin table and fuzzy phoneme of word, passes through the instruction to language material Practice, calculates the transition probability model between the probabilistic model of word and word;Physical model:Using language model to including solid data Language material be trained, it is established that identify the model of entity;Rank models:Pass through ready data and feature extraction, training At the decision-tree model of GBDT;Semantic model:By language model and training corpus, semantic model can be extracted by being trained to. Sample needed for model above training process can be crawled from public network.
(3) the index storage of data:To the field modeling, be based on existing data and model, be processed into for retrieval and The structural data and storage of semantic understanding.
Next, being illustrated to inquiry answer method according to a first embodiment of the present invention in conjunction with Fig. 2 to Fig. 4.Wherein, Fig. 2 is the flow chart for illustrating inquiry answer method according to a first embodiment of the present invention;Fig. 3 is to illustrate inquiry according to the present invention The flow chart of the semantic processes step of answer method;Fig. 4 is the sequence step for illustrating inquiry answer method according to the present invention Flow chart.
As shown in Fig. 2, first, in semantic processes step S101, language is carried out to inquiry message input by user (query) Justice processing, it is used with the user view of the inquiry purpose of reaction of formation inquiry message and for according to inquiry message retrieve Retrieve information.Preferably, as shown in figure 3, semantic processes step S101 further comprises:User view identification step S1011 is right Inquiry message carries out user view identification, obtains the user view corresponding to inquiry message;Entity recognition step S1012, passes through Trained physical model in advance, identifies solid data from inquiry message;And semantic understanding step S1013, by advance Trained semantic model carries out semantic understanding, to obtain retrieval information to inquiry message.Here, inquiry message be user for example Pass through the text message of keyboard input, the text envelope for example generated by the voice messaging of microphone input by converting user One in text message made of breath and text message input by user and the text combination for being converted into user speech information Kind.For example, user can input inquiry message " I will see The Shawshank Redemption ", at this point, by Entity recognition step, Ke Yicong Entity " The Shawshank Redemption " is identified in the inquiry message, and by semantic understanding step, semantic reason is carried out to the inquiry message Solution, can obtain retrieval information, retrieval information here using the intelligible slot value pair of computer form, such as " title= The Shawshank Redemption ".
Next, in searching step S102, it is based on the retrieval information, the data based on participle are carried out from database Retrieval obtains the list of candidate solid data.Here, first by the slot value obtained in semantic understanding step S1013 to conversion At the sentence that can be retrieved (for example, " title=The Shawshank Redemption " is converted into " film title:The Shawshank Redemption "), Then retrieval request is sent to return the result list to database.It, can be according to pre-prepd participle mould in retrieving Type segments the value (such as " The Shawshank Redemption ") in retrieval information, and in the preparation of database, it also can be in library Each solid data carries out participle and is indexed with falling to sort, and this makes it possible to matched result is found out by the result of participle Come.The advantages of being retrieved based on participle be, though user due to vagueness in memory and the inquiry message of input error (such as " cucurbit baby brother "), it is " cucurbit baby " and " brother " to be segmented inquiry message by participle model, can also be examined from database Rope goes out desired result (such as " Calabash Brothers ");Alternatively, user may input incomplete inquiry message (such as " Xiao Shenke Redeem "), it is " Xiao Shenke " and " redeeming " to be segmented inquiry message by participle model, and the phase can be also retrieved from database The result (such as " The Shawshank Redemption ") of prestige.
Next, in sequence step S103, based on the degree of correlation between candidate solid data and user view, to candidate Solid data is ranked up processing.Preferably, as shown in figure 4, sequence step S103 further comprises:Relatedness computation step S1031 calculates the degree of correlation between candidate solid data and user view according to GBDT models;And relevancy ranking step S1032 is based on the calculated degree of correlation, is ranked up to candidate solid data using Rank models.Here, candidate is being calculated in fact When the degree of correlation between volume data and user view, first by context state, entity static information (such as label, name, Classification etc.), the multidate information (such as distance of temperature, marking and current time) of entity calculate characteristic value, then will own Characteristic value calculate the last degree of correlation by pre-prepd GBDT models.Here, the characteristic value of static information is logical Information is crossed with the matching degree of inquiry message input by user come what is calculated, this matching degree can pass through phonetic (including fuzzy phoneme) Editing distance, the editing distance of word, semantic editing distance etc. determine that and the characteristic value of multidate information can be by certain Formula calculate.
Finally, in the first result determines step S104, by the candidate solid data with the highest degree of correlation in list, really It is set to the response result for user's inquiry message.Here it is possible to by display unit 109, by the time with the highest degree of correlation Solid data is selected to return to user as optimal result.
Inquiry answer method according to a first embodiment of the present invention carries out base by being based on retrieval information from database In the data retrieval of participle, the list of candidate solid data is obtained, and based on the phase between candidate solid data and user view Guan Du is ranked up processing to candidate solid data, can obtain following technique effect:Even if a. due to user's vagueness in memory or Input error and input incomplete inquiry message, can also retrieve ideal result;B. it allows users to obtain and use The closer retrieval result of intention at family.
[software configuration of inquiry answering device according to first embodiment]
Fig. 5 is the block diagram for the software configuration for illustrating inquiry answering device according to first embodiment.As shown in figure 5, inquiry Answering device 1000 includes that semantic processing unit 1101, retrieval unit 1102, sequencing unit 1103 and the first result determine list Member 1104.
Specifically, semantic processing unit 1101 includes:User view recognition unit 11011, uses inquiry message Family intention assessment obtains the user view corresponding to inquiry message;Entity recognition unit 11012 passes through entity trained in advance Model identifies solid data from inquiry message;And semantic understanding unit 11013, by semantic model trained in advance, Semantic understanding is carried out to inquiry message, to obtain retrieval information.Retrieval unit 1102 is based on the retrieval information, from database The data retrieval based on participle is carried out, the list of candidate solid data is obtained.Sequencing unit 1103 includes:Correlation calculating unit 11031, the degree of correlation between candidate solid data and user view is calculated according to GBDT models;And relevancy ranking unit 11032, it is based on the calculated degree of correlation, candidate solid data is ranked up.First result determination unit 1104, will be in list Candidate solid data with the highest degree of correlation is determined as the response result for user's inquiry message.
[second embodiment]
[inquiry answer method according to second embodiment]
Inquiry answer method according to a second embodiment of the present invention is illustrated with reference to Fig. 6.Wherein, Fig. 6 is example Show the flow chart of inquiry answer method according to a second embodiment of the present invention.
As shown in fig. 6, inquiry answer method according to second embodiment and the inquiry answer method according to first embodiment Difference lies in increase the first judgment step S204, the second judgment step S205 and the second result and determine step S206.
Specifically, in the first judgment step S204, according to similarity distance, the list obtained in step s 103 is calculated In first degree of correlation between candidate solid data and inquiry message with the highest degree of correlation, and whether judge first degree of correlation Less than first threshold.Here, the different attribute of solid data is equivalent to the different slots of semantic understanding, and the inquiry of attribute and user Ask that the degree of correlation of information is determined by similarity distance, similarity distance here include phonetic (including fuzzy phoneme) editor away from The editing distance etc. of editing distance and semanteme from, word, wherein the editing distance of word is for example since font is close, unisonance is different Word lacks situations such as word multiword and generates.If first degree of correlation is less than first threshold (being "Yes" in step S204), then it represents that from Differing greatly between the desired result of optimal result and user retrieved in database, at this moment, processing proceed to the second knot Fruit determines step S206, and the solid data identified in Entity recognition step S1012, which is determined as response result, to be returned to User so that even if in the database it is not anticipated that ideal result can be obtained in the case of result if user.For example, with Family inputs inquiry message " I will see dear Interpreter Officer " in step S101, and in the database without the film, at this moment The entity " dear Interpreter Officer " identified in Entity recognition step S1012 can be returned to user.
On the other hand, if first degree of correlation is greater than or equal to first threshold (in step S204 be "No"), handle into Row is to the second judgment step S205, to judge whether first degree of correlation is more than second threshold.If first degree of correlation is more than second Threshold value (being "Yes" in step S205), then it represents that the optimal result retrieved from database is consistent with the desired result of user, And it handles and proceeds to step S104, by the optimal result, be determined as the response result for user's inquiry message.So that User can obtain satisfied response result.
On the other hand, if first degree of correlation is no more than second threshold (being "No" in step S205), then it represents that from data There are still differences between the desired result of optimal result and user retrieved in library, and at this moment, processing proceeds to step S206, with The solid data identified in Entity recognition step S1012 is determined as response result and returns to user.
It note that the above first threshold and second threshold are advance in the performance of training, verification and test according to model Determining, to ensure in the performance recalled with had in accuracy rate.
In addition, in above-mentioned second judgment step S205, if the candidate solid data with the highest degree of correlation in list First degree of correlation between inquiry message is not more than second threshold, can also further judge there is the second high correlation in list Whether the degree of correlation between the candidate solid data and inquiry message of degree is more than second threshold, and the case where being judged as "Yes" Under, proceed to step S104.It can avoid leading to miss optimal response knot due to sequencing errors in step s 103 in this way Fruit.In the case where not appreciably affecting processing speed, can in step S205 successively the preceding N in calculations list (for example, N= 3) degree of correlation between the candidate solid data and inquiry message of position.
Inquiry answer method according to a second embodiment of the present invention, by calculating the phase between optimal result and inquiry message Guan Du carrys out the response result that certainly directional user returns, can obtain following technique effect:Even if making in the database without pre- In the case of phase result, user can also obtain ideal result.
[software configuration of inquiry answering device according to second embodiment]
Fig. 7 is the block diagram for the software configuration for illustrating inquiry answering device according to second embodiment.As shown in fig. 7, according to Difference lies in increase the inquiry answering device 2000 of second embodiment with inquiry answering device 1000 according to first embodiment First judging unit 1204, the second result determination unit 1206 and second judgment unit 1205.
Specifically, the first judging unit is according to the candidate entity number with the highest degree of correlation in similarity distance calculations list According to first degree of correlation between inquiry message, and judge whether first degree of correlation is less than first threshold.Second result determines single Member identifies the Entity recognition unit in the case where first judging unit judges that first degree of correlation is less than first threshold The solid data gone out, is determined as response result.Second judgment unit, judges whether first degree of correlation is more than second threshold, wherein In the case where the second judgment unit judges that first degree of correlation is more than second threshold, the first result determination unit will have The candidate solid data for having the highest degree of correlation, is determined as response result, and wherein, and the similarity distance includes the editor of phonetic At least one of the editing distance of distance, the editing distance of word and semanteme.
[preferred embodiment]
[according to the inquiry answer method of preferred embodiment]
Inquiry answer method according to the preferred embodiment of the invention is illustrated with reference to Fig. 8.Fig. 8 is to illustrate basis The flow chart of the inquiry answer method of the preferred embodiment of the present invention.
As shown in figure 8, according to the inquiry answer method of preferred embodiment and inquiry answer method according to first embodiment Difference lies in increase pretreatment and error correction step S301.
Specifically, in pretreatment and error correction step S301, inquiry message is pre-processed, and by instructing in advance Experienced word error correcting model, to carrying out correction process by pretreated inquiry message.Here, the pretreatment includes believing inquiry The deletion of the stop words and spoken word that include in breath and the capital and small letter conversion of the letter and number to including in inquiry message Deng.For example, when in inquiry message input by user include some colloquial words when, carry out semantic processes step S101 it Before, it needs to remove these colloquial words.For example, in the feelings that inquiry message input by user is " I will see dear diplomat " Under condition, colloquial word " I will see " can be deleted by pretreatment first.It then, will be pre- by word error correcting model trained in advance Treated, and inquiry message " dear diplomat " is corrected as " dear Interpreter Officer ".Next, to by pretreatment and error correction Treated, and inquiry message carries out subsequent processing.In addition, user is also possible to due to pronunciation mistake and the inquiry message of input error, Such as it in the case where inquiry message input by user is " Xiao Shengke's redeems ", is entangled by the fuzzy phoneme in word correction process It is wrong, additionally it is possible to be corrected as " The Shawshank Redemption ".
According to the inquiry answer method of preferred embodiment by being carried out before carrying out semantic processes at pretreatment and error correction Reason, can be corrected inquiry message input by user, to improve the accuracy of later retrieval.
[according to the software configuration of the inquiry answering device of preferred embodiment]
Fig. 9 is the block diagram for the software configuration for illustrating the inquiry answering device according to preferred embodiment.As shown in figure 9, according to Difference lies in increase the inquiry answering device 3000 of preferred embodiment with inquiry answering device 1000 according to first embodiment Pretreatment and error correction unit 1301.
Specifically, pretreatment and error correction unit 1301 pre-process inquiry message, and pass through training in advance Word error correcting model, to carrying out correction process by pretreated inquiry message.
In addition, the present invention also provides a kind of inquiry response systems based on semantic understanding.Figure 10 is to illustrate the present invention The schematic diagram of inquiry response system.As shown in Figure 10, inquiry response system 100 includes user terminal 1001 and server 1002, User terminal 1001 is connect with server 1002 via network 1003, and network 1003 can be cable network or wireless network.
User terminal 1001 includes input receiving unit 10011, semantic processing unit 10012 and transmission unit 10013.Clothes Business device 1002 includes receiving unit 10021, retrieval unit 10022, sequencing unit 10023 and result determination unit 10024.
Specifically, in user terminal 1001, input receiving unit 10011 receives inquiry message input by user;Language Adopted processing unit 10012 carries out semantic processes to inquiry message, with the user view of the inquiry purpose of reaction of formation inquiry message With for being carried out retrieving retrieval information used according to inquiry message;Transmission unit 10013 is by inquiry message, the inquiry message User view and retrieval information are sent to server in a manner of associated, and receive the response for inquiry message from server As a result.
On the other hand, in server 1002, receiving unit 10021 from user terminal receive inquiry message and with the inquiry The associated user view of information and retrieval information;Retrieval unit 10022 is based on the retrieval information, and base is carried out from database In the data retrieval of participle, the list of candidate solid data is obtained;Sequencing unit 10023 is based on candidate solid data and anticipates with user The degree of correlation between figure is ranked up candidate solid data;As a result determination unit 10024 will have the highest degree of correlation in list Candidate solid data, be determined as the response result for user's inquiry message, and response result is sent to user terminal.
Although exemplary embodiments describe the present invention for reference above, above-described embodiment is only to illustrate this hair Bright technical concepts and features, it is not intended to limit the scope of the present invention.It is all to be done according to spirit of the invention Any equivalent variations or modification, should be covered by the protection scope of the present invention.

Claims (16)

1. a kind of inquiry answer method based on semantic understanding, the inquiry answer method include:
Semantic processes step (S101) carries out semantic processes, with reaction of formation inquiry message to inquiry message input by user It inquires the user view of purpose and for being carried out retrieving retrieval information used according to inquiry message, and is identified from inquiry message Go out solid data;
Searching step (S102) is based on the retrieval information, and the data retrieval based on participle is carried out from database, obtains candidate The list of solid data;
Sequence step (S103) carries out candidate solid data based on the degree of correlation between candidate solid data and user view Sequence is handled;
First judgment step (S204), according to the candidate solid data in similarity distance calculations list with the highest degree of correlation and inquiry It asks first degree of correlation between information, and judges whether first degree of correlation is less than first threshold;
Second result determines step (S206), judges that first degree of correlation is less than the feelings of first threshold in first judgment step Under condition, the solid data that will be identified in the semantic processes step is determined as response result;
Second judgment step (S205) judges the case where first degree of correlation is not less than first threshold in first judgment step Under, judge whether first degree of correlation is more than second threshold;And
First result determines step (S104), judges that first degree of correlation is more than the feelings of second threshold in second judgment step Under condition, by the candidate solid data with the highest degree of correlation in list, it is determined as the response result for user's inquiry message, and It, will be in the semantic processes step in the case of judging that first degree of correlation is not more than second threshold in second judgment step In the solid data that identifies, be determined as response result.
2. inquiry answer method according to claim 1, wherein the semantic processes step (S101) includes:
User view identification step (S1011) carries out user view identification to inquiry message, obtains the use corresponding to inquiry message Family is intended to;
Entity recognition step (S1012) identifies solid data by physical model trained in advance from inquiry message;With And
Semantic understanding step (S1013) carries out semantic understanding by semantic model trained in advance to inquiry message, to obtain Retrieve information.
3. inquiry answer method according to claim 1,
Wherein, the similarity distance include the editing distance of phonetic, word editing distance and semanteme editing distance at least One.
4. inquiry answer method according to any one of claim 1 to 3, wherein the sequence step (S103) includes:
Relatedness computation step (S1031) calculates the degree of correlation between candidate solid data and user view according to GBDT models; And
Relevancy ranking step (S1032) is based on the calculated degree of correlation, is ranked up to candidate solid data.
5. inquiry answer method according to any one of claim 1 to 3, the inquiry answer method is at the semanteme Further include before reason step (S101):
Pretreatment and error correction step (S301), pre-process inquiry message, and the word error correcting model by training in advance, To carrying out correction process by pretreated inquiry message.
6. inquiry answer method according to claim 5, the pretreatment includes the stop words to including in inquiry message The capital and small letter of deletion with spoken word and the letter and number to including in inquiry message is converted.
7. inquiry answer method according to any one of claim 1 to 3, the inquiry message is text input by user Information, the text message generated by converting voice messaging input by user and text message input by user with will use One kind in text message made of the text combination that family voice messaging is converted into.
8. a kind of inquiry answering device based on semantic understanding, the inquiry answering device include:
Semantic processing unit (1101) carries out semantic processes, with reaction of formation inquiry message to inquiry message input by user It inquires the user view of purpose and for being carried out retrieving retrieval information used according to inquiry message, and is identified from inquiry message Go out solid data;
Retrieval unit (1102) is based on the retrieval information, and the data retrieval based on participle is carried out from database, obtains candidate The list of solid data;
Sequencing unit (1103) carries out candidate solid data based on the degree of correlation between candidate solid data and user view Sequence is handled;
First judging unit (1204), according to the candidate solid data in similarity distance calculations list with the highest degree of correlation and inquiry It asks first degree of correlation between information, and judges whether first degree of correlation is less than first threshold;
Second result determination unit (1206) judges the case where first degree of correlation is less than first threshold in first judging unit Under, the solid data that the semantic processing unit is identified is determined as response result;
Second judgment unit (1205) judges the case where first degree of correlation is not less than first threshold in first judging unit Under, judge whether first degree of correlation is more than second threshold;And
First result determination unit (1104) judges the case where first degree of correlation is more than second threshold in the second judgment unit Under, by the candidate solid data with the highest degree of correlation in list, it is determined as the response result for user's inquiry message;And The second judgment unit judges that no more than in the case of second threshold, the semantic processing unit is identified for first degree of correlation Solid data, be determined as response result.
9. inquiry answering device according to claim 8, wherein the semantic processing unit includes:
User view recognition unit (11011) carries out user view identification to inquiry message, obtains the use corresponding to inquiry message Family is intended to;
Entity recognition unit (11012) identifies solid data by physical model trained in advance from inquiry message;With And
Semantic understanding unit (11013) carries out semantic understanding by semantic model trained in advance to inquiry message, to obtain Retrieve information.
10. inquiry answering device according to claim 8, wherein the similarity distance includes the editing distance of phonetic, word Editing distance and semantic at least one of editing distance.
11. the inquiry answering device according to any one of claim 8 to 10, wherein the sequencing unit includes:
Correlation calculating unit (11031) calculates the degree of correlation between candidate solid data and user view according to GBDT models; And
Relevancy ranking unit (11032) is based on the calculated degree of correlation, is ranked up to candidate solid data.
12. the inquiry answering device according to any one of claim 8 to 10, the inquiry answering device further include:
Pretreatment and error correction unit (1301), pre-process inquiry message, and the word error correcting model by training in advance, To carrying out correction process by pretreated inquiry message.
13. inquiry answering device according to claim 12, the pretreatment includes being deactivated to include in inquiry message The capital and small letter of the deletion of word and spoken word and the letter and number to including in inquiry message is converted.
14. the inquiry answering device according to any one of claim 8 to 10, the inquiry message is text input by user This information, the text message generated by converting voice messaging input by user and text message input by user and general One kind in text message made of the text combination that user speech information is converted into.
15. a kind of inquiry response system (100) based on semantic understanding, the system comprises user terminal (1001) and and users The server (1002) of terminal connection,
The user terminal includes:
Receiving unit (10011) is inputted, inquiry message input by user is received;
Semantic processing unit (10012) carries out semantic processes, with the inquiry purpose of reaction of formation inquiry message to inquiry message User view and for being carried out retrieving retrieval information used according to inquiry message, and entity number is identified from inquiry message According to;
Transmission unit (10013) sends out inquiry message, the user view of the inquiry message and retrieval information in a manner of associated Server is given, and the response result for inquiry message is received from server,
The server includes:
Receiving unit (10021) receives inquiry message and user view associated with the inquiry message and inspection from user terminal Rope information;
Retrieval unit (10022) is based on the retrieval information, the data retrieval based on participle is carried out from database, is waited Select the list of solid data;
Sequencing unit (10023) carries out candidate solid data based on the degree of correlation between candidate solid data and user view Sequence;
First judging unit (1204), according to the candidate solid data in similarity distance calculations list with the highest degree of correlation and inquiry It asks first degree of correlation between information, and judges whether first degree of correlation is less than first threshold;
Second result determination unit (1206) judges the case where first degree of correlation is less than first threshold in first judging unit Under, the solid data that the semantic processing unit is identified is determined as response result, and response result is sent to user's end End;
Second judgment unit judges in the case where first judging unit judges that first degree of correlation is not less than first threshold Whether first degree of correlation is more than second threshold;And
As a result determination unit (10024), in the case where the second judgment unit judges that first degree of correlation is more than second threshold, By the candidate solid data with the highest degree of correlation in list, it is determined as the response result for user's inquiry message, and will answer It answers result and is sent to user terminal;And judge the case where first degree of correlation is not more than second threshold in the second judgment unit Under, the solid data that the semantic processing unit is identified is determined as response result, and response result is sent to user's end End.
16. a kind of computer readable storage medium, stores computer program, the computer program is being executed by processor When, it realizes in inquiry answer method according to any one of claim 1 to 7 and includes the steps that.
CN201710575024.3A 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium Active CN107330120B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710575024.3A CN107330120B (en) 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710575024.3A CN107330120B (en) 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN107330120A CN107330120A (en) 2017-11-07
CN107330120B true CN107330120B (en) 2018-09-18

Family

ID=60226783

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710575024.3A Active CN107330120B (en) 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN107330120B (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108021554A (en) * 2017-11-14 2018-05-11 无锡小天鹅股份有限公司 Audio recognition method, device and washing machine
CN109800407B (en) * 2017-11-15 2021-11-16 腾讯科技(深圳)有限公司 Intention recognition method and device, computer equipment and storage medium
KR102517219B1 (en) * 2017-11-23 2023-04-03 삼성전자주식회사 Electronic apparatus and the control method thereof
KR102561712B1 (en) * 2017-12-07 2023-08-02 삼성전자주식회사 Apparatus for Voice Recognition and operation method thereof
CN110472058B (en) * 2018-05-09 2023-03-03 华为技术有限公司 Entity searching method, related equipment and computer storage medium
CN110737756B (en) * 2018-07-03 2023-06-23 百度在线网络技术(北京)有限公司 Method, apparatus, device and medium for determining answer to user input data
JP7068962B2 (en) * 2018-08-13 2022-05-17 株式会社日立製作所 Dialogue methods, dialogue systems and programs
CN109597993B (en) * 2018-11-30 2021-11-05 深圳前海微众银行股份有限公司 Statement analysis processing method, device, equipment and computer readable storage medium
CN110457423A (en) * 2019-06-24 2019-11-15 平安科技(深圳)有限公司 A kind of knowledge mapping entity link method, apparatus, computer equipment and storage medium
CN110334347B (en) * 2019-06-27 2024-06-28 腾讯科技(深圳)有限公司 Information processing method based on natural language recognition, related equipment and storage medium
CN110456339B (en) * 2019-08-12 2021-09-14 四川九洲电器集团有限责任公司 Inquiring and responding method and device, computer storage medium and electronic equipment
CN112396481A (en) * 2019-08-13 2021-02-23 北京京东尚科信息技术有限公司 Offline product information transmission method, system, electronic device, and storage medium
CN110647987B (en) * 2019-08-22 2024-09-13 腾讯科技(深圳)有限公司 Method and device for processing data in application program, electronic equipment and storage medium
CN110765342A (en) * 2019-09-12 2020-02-07 竹间智能科技(上海)有限公司 Information query method and device, storage medium and intelligent terminal
CN111597808B (en) * 2020-04-24 2023-07-25 北京百度网讯科技有限公司 Instrument panel drawing processing method and device, electronic equipment and storage medium
CN111538894B (en) * 2020-06-19 2020-10-23 腾讯科技(深圳)有限公司 Query feedback method and device, computer equipment and storage medium
CN112527819B (en) * 2020-12-08 2024-06-04 北京百度网讯科技有限公司 Address book information retrieval method and device, electronic equipment and storage medium
CN114328655B (en) * 2021-12-14 2024-11-01 上海金仕达软件科技股份有限公司 Intelligent business processing method and system based on deep learning

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8301633B2 (en) * 2007-10-01 2012-10-30 Palo Alto Research Center Incorporated System and method for semantic search
CN104598445B (en) * 2013-11-01 2019-05-10 腾讯科技(深圳)有限公司 Automatically request-answering system and method
CN104050256B (en) * 2014-06-13 2017-05-24 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method

Also Published As

Publication number Publication date
CN107330120A (en) 2017-11-07

Similar Documents

Publication Publication Date Title
CN107330120B (en) Inquire answer method, inquiry answering device and computer readable storage medium
US10909969B2 (en) Generation of language understanding systems and methods
CN110765244B (en) Method, device, computer equipment and storage medium for obtaining answering operation
US20210311975A1 (en) Language agnostic command-understanding digital assistant
Schalkwyk et al. “Your word is my command”: Google search by voice: A case study
CN110111780B (en) Data processing method and server
JP5926378B2 (en) Text input method, apparatus, and system
US9582757B1 (en) Scalable curation system
CN107430616B (en) Interactive reformulation of voice queries
JP2022547631A (en) Stopword data augmentation for natural language processing
US9336298B2 (en) Dialog-enhanced contextual search query analysis
CN110415679B (en) Voice error correction method, device, equipment and storage medium
KR101751113B1 (en) Method for dialog management based on multi-user using memory capacity and apparatus for performing the method
US20230186161A1 (en) Data manufacturing frameworks for synthesizing synthetic training data to facilitate training a natural language to logical form model
CN112948534A (en) Interaction method and system for intelligent man-machine conversation and electronic equipment
CN112154465A (en) Method, device and equipment for learning intention recognition model
KR101677859B1 (en) Method for generating system response using knowledgy base and apparatus for performing the method
CN110147494B (en) Information searching method and device, storage medium and electronic equipment
CN116501960A (en) Content retrieval method, device, equipment and medium
CN107424612A (en) Processing method, device and machine readable media
KR20220109238A (en) Device and method for providing recommended sentence related to utterance input of user
CN110809796A (en) Speech recognition system and method with decoupled wake phrases
US20230376700A1 (en) Training data generation to facilitate fine-tuning embedding models
US12099908B2 (en) Updating training examples for artificial intelligence
RU2759090C1 (en) Method for controlling a dialogue and natural language recognition system in a platform of virtual assistants

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200727

Address after: 518000 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer

Patentee after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.

Address before: 100029, Beijing, Chaoyang District new East Street, building No. 2, -3 to 25, 101, 8, 804 rooms

Patentee before: Tricorn (Beijing) Technology Co.,Ltd.

TR01 Transfer of patent right