CN110211573A

CN110211573A - A kind of task-driven type dialogue decision-making technique based on neural network model

Info

Publication number: CN110211573A
Application number: CN201910450074.8A
Authority: CN
Inventors: 许清; 方昳凡; 滕飞宇; 陆相羽; 黄子恒; 易和阳; 戈洋; 杨颜睿
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2019-05-28
Filing date: 2019-05-28
Publication date: 2019-09-06

Abstract

The present invention provides a kind of, and the task-driven type based on neural network model talks with decision-making technique, pass through the confidence state tracker and other partial parameters in training pattern, the coded vector generated by intention assessment model is as the confidence probability distribution over states for being intended to distribution and the generation of confidence state tracker, database is transferred to be inquired using the corpus being collected, use database search result, it is intended to distribution and probability distribution transfers to tactful combination of network to form system acting, it passes to and generates the response of combination of network output system, complete dialogue function, possess the features such as robustness is preferable, it is poor to solve model tormulation ability existing for task-driven type conversational system instantly, training difficulty is big, the limitation of model learnability is more, model training data volume is huge, model training reward mechanism is imperfect, practicability is poor etc. in specific field Problem.

Description

Task-driven type conversation decision-making method based on neural network model

Technical Field

The invention relates to the technical field of artificial intelligence, in particular to a task-driven type conversation decision-making method based on a neural network model.

Background

In recent years, artificial intelligence technology is gradually applied widely, human-computer interaction is applied to various fields such as smart home, smart medical treatment, public service, intelligent networked automobiles and the like, and human beings are pursuing more convenient life, so that robots or systems which can interpret human languages and autonomously complete responses are gradually an active development direction. The task-driven multi-turn dialogue is one of the main forms of the current development of human-computer interaction, and the research on the task-driven multi-turn dialogue system is widely developed at home and abroad at present, mainly aims at the human-computer interaction in the field of customer service, and is mainly applied to the service industry. However, in vehicles, human-computer interaction is not rapidly developed due to small space, high cost control requirements and the like, and emerging technologies are not applicable to services for drivers and passengers. Human-vehicle dialogue interaction involves a variety of proprietary knowledge in the field of driving, which makes current service-class dialogue systems unable to meet the needs of human-vehicle interaction.

In the prior art, a man-vehicle interaction system mainly adopts a keyword and key sentence matching mode to search the answer of a client problem in an existing database. The mechanized answering mode severely limits the expression mode of the driver and passengers, and causes the conversation interaction to be too programmed. In a single-round conversation system, when the user has situations of unclear semantic expression, language sickness, spoken language expression and the like, the time for obtaining correct response is greatly increased, and great influence is caused on the user experience.

For various problems existing in the prior art and the market, an application of a multi-turn dialogue interaction technology aiming at a specific field and a specific group is urgently needed, and in recent years, the remarkable progress of big data collection and deep learning in the natural language processing direction provides a new idea for realizing the design of a vehicle-mounted multi-turn dialogue system facing the requirements of drivers and passengers.

Disclosure of Invention

The invention aims to provide a task-driven conversation decision method based on a neural network model, so as to improve the experience of drivers and passengers on a vehicle-mounted system and the rapidness and convenience for obtaining answers to questions.

In order to achieve the above object, the present invention provides a task-driven dialog decision method based on a neural network model, comprising:

collecting a multi-round dialogue training text in a set field, constructing an intention recognition model based on an LSTM network, and training and cross-verifying the intention recognition model;

constructing a confidence state tracker for each information slot, and constructing an updating rule of a convolutional neural network and a cyclic neural network so as to train the confidence state tracker;

designing a database and a query mode of the database;

generating an output single vector according to the graph recognition model, the confidence state tracker and the output result of the database by three-phase matrix transformation, and adjusting the output single vector based on a language model;

training the wheel dialog data is used, and all possible parallel dialog cases are distributed according to the global confidence probability of the dialog state, so that the dialog action proposed at the next moment is generated.

Optionally, the step of constructing the intention recognition model based on the LSTM network includes:

coding each turn of dialog in a plurality of turns of dialogs in an intention recognition model to obtain a coding vector t of each turn of dialog;

the LSTM network is constructed according to the following formula:

wherein z is_tFor input encoded by a sequence as tIs represented by the distribution of_tDenotes an input gate, f_tIndicating forgetting to leave door o_tRepresents an output gate, c_i-1,c_iRepresenting a short-term memory state. W_xc,W_hcIs a trainable parameter, h_i-1Representing a hidden layer.

Optionally, the confidence state tracker comprises a feature extractor with a convolutional neural network and a Jordan type recurrent neural network.

Optionally, the step of building an update rule of the convolutional neural network and the cyclic neural network includes:

building the convolutional neural network, extracting word intermediate characteristics and sentence representation characteristics, and designing a characteristic vectorFor the concatenation of two convolutional neural network derived features, the input u of round t-1 is processed according to the following formula_tProcessing the response s of the t-1 round_t-1：

Both sides of the sentence are filled with sentences before each convolution operation according to the following formula:

wherein the vector w_sThe matrix W_sBias term b_sAnd b'_sAnd a scalar g_φ,sIs a parameter that is a function of,is the probability that the value is not mentioned up to the t round, the recurrent neural network weights of each value v are combined together, and each activation function is updatedTime varying feature

Optionally, the query content q of the database_tIs realized by the following formula:

where S' is the information entered, S_IIs a set of information slots, and the information slots,is the output of the confidence state tracker.

Optionally, identifying the output z of the model from said graph_tOutput of confidence state trackerAnd true value vector x obtained from output result of database_tGenerating an output single vector O by performing three-phase matrix transformation according to the following formula_t：

Wherein, W_zo、W_po、W_xoIs a parameter that is a function of,is a concatenation of all the aggregated confidence vectors.

The task-driven dialog decision method based on the neural network model provided by the invention takes the coding vector generated by the intention recognition model as the intention distribution and the confidence state probability distribution generated by the confidence state tracker through the confidence state tracker and other partial parameters in the training model, sends the results to the database to be inquired by using the collected corpus, uses the database to search the results, the intention distribution and the probability distribution are combined by a strategy network to form system actions, the system actions are transmitted to a generation network combination output system to respond, a conversation function is completed, the characteristics of better robustness and the like are achieved, and the problems that the current task driving type conversation system is poor in model expression capability, large in training difficulty, more in model learnability limitation, huge in model training data volume, incomplete in model training reward mechanism, poor in practicability in a specific field and the like are solved.

Drawings

FIG. 1 is a flow chart of a task-driven conversational decision-making method based on a neural network model according to the present invention;

FIG. 2 is a general framework diagram of the vehicle-mounted multi-turn dialogue oriented by the present invention;

FIG. 3 is a Recurrent Neural Network (RNN) confidence tracking model of the bound Convolutional Neural Network (CNN) of the present invention.

Detailed Description

The following describes in more detail embodiments of the present invention with reference to the schematic drawings. Advantages and features of the present invention will become apparent from the following description and claims. It is to be noted that the drawings are in a very simplified form and are not to precise scale, which is merely for the purpose of facilitating and distinctly claiming the embodiments of the present invention.

As shown in fig. 1 to fig. 3, the present embodiment provides a task-driven dialog decision method based on a neural network model, which includes the following steps:

step 1: intent recognition, comprising the following 3 sub-steps:

(1) collecting dialogue training texts in the field of driver and passenger requirements for training a subsequent intention recognition model and a confidence state tracker;

(2) coding each dialog in multiple dialogs in the intention recognition model to obtain a coding vector t, z of the dialog_tInput coded by sequence as tThe distribution of (2) is expressed, an LSTM network is constructed, and the output layer of the last step isThe method is used as probability representation of different intentions in the field of driver and passenger requirements, and the intention recognition model design is completed; wherein,

z_tfor input encoded by a sequence as tIs represented by the distribution of_tDenotes an input gate, f_tIndicating forgetting to leave door o_tRepresents an output gate, c_i-1,c_iTo representShort term memory states. W_xc,W_hcIs a trainable parameter, h_i-1Representing a hidden layer.

(3) And for a given intention recognition model training data set, training and cross-verifying the intention recognition model, and finally generating an analysis result of the user intention.

Step 2: confidence state tracking, comprising the following 4 sub-steps:

(1) constructing a special confidence state tracker for each information slot, wherein each confidence state tracker is composed of a convolutional neural network feature extractor and a Jordan type cyclic neural network;

(2) modeling each turn of utterance context background, feature vectorProcessing t rounds of user input u for concatenation of derived features of two convolutional neural networks_tProcessing the system response s of the t-1 round_t-1The calculation formula is as follows:

convolution neural network operation special for slot valueNot only sentence representation is extracted, but also the position of the lexical mark is extracted, and the embedding of the intermediate class n-gram model is determined. In each dialog, if multiple matches are observed, the corresponding embeddings are summed. On the other hand, if the particular slots or values do not match, null n-gram embedding is padded with zeros. To track the location of the de-lexical tokens, the two edges of the sentence are filled with sentences before each convolution operation, the number of vectors being determined by the filter size of each layer.

(3) Designing an updating rule of a recurrent neural network, and iteratively calculating the confidence state probability of the content expressed by each round of users;

The confidence state tracker maintains a polynomial distribution p for each information slot (a slot that can be used to constrain a search, such as parking) and a binary distribution for each request slot (a slot of interrogatable values, such as an address).

(4) Training confidence state tracker, and inquiring content q of database through output of confidence state tracker_tThe method is realized by the following algorithm:

s' is the information entered, S_IIs a set of information slots, and the information slots,is the output of the confidence state tracker. Applying the content to a database, creating a binary truth vector x on the database_tWhere 1 indicates that the corresponding entity is consistent with the query, it can be concluded that it is consistent with the most likely confidence state. If x is not completely empty, the associated entity pointer keeps identifying a matching entity that was randomly selected, and the entity referenced by the entity pointer is used to form the final system response.

And step 3: dialog generation, comprising the following 4 sub-steps:

(1) z of output of intention recognition model_tOutput of confidence state trackerAnd true value vector x obtained from database search result_tInput and output of an output unit vector O representing the system operation_tGenerating the appropriate sentence form, the individual probabilities of the classification values in the informative confidence states are irrelevant and are added together to form a summarized confidence vector for each information slot. Represented by three parts: the sum value probabilities, the probability that the user indicates that they "don't care" this slot and the probability that the slot is not mentioned. Finally, the output is generated by a three-way matrix transform:

wherein, W_zo、W_po、W_xoIs a parameter that is a function of,is the concatenation of all summary confidence vectors;

(2) generating sentence tokens similar to the templates based on the language model, and adjusting according to the output one-way quantity to generate system response; the token generation process is enhanced by a set of pointer networks to transfer entity specific information into a response by randomly sampling from a surface form list to replace the non-linearized tags, e.g., < s.place > to a place or region, replacing the de-lexical values with the actual attribute values of the entity currently selected by the database pointer.

(3) Training using multiple rounds of session data;

(4) and generating the dialog action proposed at the next moment according to all possible parallel dialog cases of the global confidence probability distribution of the dialog state.

The above description is only a preferred embodiment of the present invention, and does not limit the present invention in any way. It will be understood by those skilled in the art that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A task-driven conversation decision method based on a neural network model is characterized by comprising the following steps:

designing a database and a query mode of the database;

2. The neural network model-based task-driven dialog decision method of claim 1, wherein the step of building an intent recognition model based on an LSTM network comprises:

the LSTM network is constructed according to the following formula:

3. The neural network model-based task-driven dialog decision method of claim 2 in which the confidence state tracker consists of a recurrent neural network with convolutional neural network feature extractor and Jordan-type recurrent neural network.

4. The task-driven dialog decision method based on the neural network model as claimed in claim 3, wherein the step of building the update rules of the convolutional neural network and the cyclic neural network comprises:

wherein,vector w_sThe matrix W_sBias term b_sAnd b'_sAnd a scalar g_φ,sIs a parameter that is a function of,is the probability that the value is not mentioned up to the t round, the recurrent neural network weights of each value v are combined together, and each activation function is updatedTime varying feature

5. The neural network model-based task-driven conversational decision making method of claim 4, wherein query content q of the database_tIs realized by the following formula:

6. The neural network model-based task-driven dialog decision method of claim 4, characterized in that the output z of the model is recognized from the graph_tOutput of confidence state trackerAnd true value vector x obtained from output result of database_tGenerating a three-phase matrix transformation according to the following formulaOutput of a single vector O_t：