JP7028746B2

JP7028746B2 - Question generator and question generation method

Info

Publication number: JP7028746B2
Application number: JP2018189979A
Authority: JP
Inventors: 淳細矢; 弘樹山田; 剛白井
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2018-10-05
Filing date: 2018-10-05
Publication date: 2022-03-02
Anticipated expiration: 2038-10-05
Also published as: JP2020060846A; WO2020071398A1

Description

本発明は、質問生成装置および質問生成方法に関するものである。 The present invention relates to a question generator and a question generation method.

商品、サービスに関して不具合等が生じた場合、そのユーザがベンダーに問い合わせを行うことは一般的である。そのためベンダー側ではお客様対応窓口やコールセンターを運用し、そうした問い合わせに対応している。 When a problem occurs with a product or service, it is common for the user to make an inquiry to the vendor. Therefore, the vendor operates a customer service center and call center to respond to such inquiries.

しかしながら、そうした顧客対応に要するコストは小さくないため、近年ではチャットボットやＩＶＲ（ＩｎｔｅｒａｃｔｉｖｅＶｏｉｃｅＲｅｓｐｏｎｓｅ）等の自動応答システムを適宜採用し、該当業務の効率化が図られつつある。 However, since the cost required for dealing with such customers is not small, in recent years, automatic response systems such as chatbots and IVR (Interactive Voice Response) have been appropriately adopted to improve the efficiency of the relevant business.

このような自動応答システムでは、問い合わせの内容を認識し、その内容に応じた応答をコンピュータが自動実行する。この自動応答の仕組みには、いわゆるシナリオ型や一問一答型など種々のタイプが存在する。 In such an automatic response system, the computer recognizes the content of the inquiry and automatically executes a response according to the content. There are various types of this automatic response mechanism, such as a so-called scenario type and a question-and-answer type.

このうちシナリオ型の自動応答システムは、ユーザからの問い合わせを契機に、最終的な回答に至るまで、複数回のやり取りをユーザと行って回答を絞り込む。いわゆるＦＡＱの内容を単純にシステム化した一問一答型とは異なり、多少の手間と引き換えに、ユーザの望む回答を導き出せる確率が高くなりやすい特徴がある。 Of these, the scenario-type automatic response system narrows down the answers by exchanging multiple times with the user until the final answer is reached, triggered by an inquiry from the user. Unlike the one-question-one-answer type, in which the contents of the so-called FAQ are simply systematized, there is a characteristic that the probability of being able to derive the answer desired by the user is likely to increase in exchange for some effort.

こうした自動応答技術に関連する従来技術としては、例えば、予め登録されている検索条件と、検索対象である検索データからなるデータベースとによって構成され、上記検索条件についてＹｅｓ／Ｎｏで答えることによって上記検索データを絞り込むＹｅｓ／Ｎｏ型質問木を作成するＹｅｓ／Ｎｏ型質問木作成装置において、所定の検索条件によって検索データ群を抽出し、上記抽出された検索データ群を、任意の検索条件で検索できる検索データ群と、上記任意の検索条件では検索できない検索データ群とに分割するデータ分割手段と；上記登録されている検索条件のうちで、上記検索データ群について検索結果が在在する検索条件だけを、上記質問木を構成する検索条件として作成し、上記検索データ群を再帰的に分割する再帰的分割手段と；上記再帰的に検索データ群を分割する場合における当該検索データ群の評価値を、当該検索データ群における最大評価値という関数に代入する関数代入手段と；所定の検索条件で検索できる検索データ群の最大評価値と、上記所定の検索条件で検索できない検索データ群の最大評価値とを足し合わせた値を、当該検索条件における分割の最大評価値として決定する最大評価値決定手段と；上記最大評価値が最も大きくなるように、検索条件で検索データを分割する質問木、または、分割の対象となる検索データ群を質問木として設定する質問木設定手段と；を有することを特徴とするＹｅｓ／Ｎｏ型質問木作成装置（特許文献１参照）などが提案されている。 As a conventional technique related to such an automatic response technique, for example, it is composed of a search condition registered in advance and a database consisting of search data to be searched, and the search is performed by answering Yes / No about the search condition. In the Yes / No type question tree creating device that narrows down the data, the search data group can be extracted according to a predetermined search condition, and the extracted search data group can be searched by any search condition. A data dividing means for dividing into a search data group and a search data group that cannot be searched by any of the above search conditions; among the above registered search conditions, only the search conditions in which the search results exist for the above search data group. Is created as a search condition constituting the question tree, and a recursive division means for recursively dividing the search data group; and an evaluation value of the search data group in the case of recursively dividing the search data group. , A function assignment means to be assigned to a function called the maximum evaluation value in the search data group; With a maximum evaluation value determining means that determines the sum of the above values as the maximum evaluation value for division in the search condition; a question tree that divides the search data according to the search condition so that the above maximum evaluation value is the largest, or , A Yes / No type question tree creating device (see Patent Document 1) characterized by having a question tree setting means for setting a search data group to be divided as a question tree; and the like have been proposed.

また、対話型検索システム及びプログラム、並びに対話シナリオ生成システム及びプログラム（特許文献２参照）や、対話シナリオに対する動的対話ノードの挿入行動を制御する対話プログラム、サーバ及び方法（特許文献３参照）なども提案されている。 Further, an interactive search system and a program, a dialogue scenario generation system and a program (see Patent Document 2), a dialogue program, a server and a method for controlling the insertion behavior of a dynamic dialogue node for a dialogue scenario (see Patent Document 3), and the like. Has also been proposed.

特開２００３－２６３４３８号公報Japanese Patent Application Laid-Open No. 2003-263438 特開２０１２－２４８１６１号公報Japanese Unexamined Patent Publication No. 2012-248161 特開２０１５－１２５１９８号公報JP-A-2015-125198

上述のシナリオ型の自動応答システムは、顧客対応の品質面では確かに相応の利点を有する。しかし一方で、その運用コストが大きいという課題がある。 The scenario-based automated response system described above certainly has considerable advantages in terms of customer service quality. However, on the other hand, there is a problem that the operating cost is high.

すなわち、ユーザからの問い合わせに対する回答ないし追加の問いかけを順次行うためには、そうした問いかけ等を場合分けした質問木すなわちシナリオを作成し、必要に応じてメンテナンスするが必要あり、大きなコストが継続的に発生しうる。
そこで本発明の目的は、質問木を効率的に自動生成する技術を提供することにある。 In other words, in order to answer inquiries from users or to make additional inquiries in sequence, it is necessary to create a question tree, that is, a scenario, in which such inquiries are divided into cases, and maintain them as necessary, resulting in continuous high costs. It can occur.
Therefore, an object of the present invention is to provide a technique for efficiently and automatically generating a question tree.

上記課題を解決する本発明の質問生成装置は、所定事項各々に関する各説明文を保持する記憶部と、前記各説明文の形態素解析結果に基づき、前記各説明文における各単語の有無を記述したマトリクスの生成を行う処理と、所定端末から問い合わせを受信し、当該問い合わせが含むキーワードに基づき、前記各説明文のうち前記キーワードを含むものを候補説明文として特定する処理と、前記マトリクスにおける前記候補説明文に関する記述に基づき決定木分析を実行し、分岐判定項目を質問文とした質問木を生成する処理と、を実行する演算部とを備えることを特徴とする。 The question generator of the present invention that solves the above problems describes the presence or absence of each word in each of the explanations based on the storage unit that holds each explanation for each predetermined item and the morphological analysis result of each of the explanations. A process of generating a matrix, a process of receiving an inquiry from a predetermined terminal, and a process of specifying a candidate explanatory text among the explanatory texts including the keyword based on the keyword included in the query, and the candidate in the matrix. It is characterized by having a process of executing a decision tree analysis based on a description of an explanatory text and generating a question tree using a branch determination item as a question sentence, and a calculation unit for executing the process.

また、本発明の質問生成方法は、所定事項各々に関する各説明文を保持する記憶部を備えた情報処理装置が、前記各説明文の形態素解析結果に基づき、前記各説明文における各単語の有無を記述したマトリクスの生成を行う処理と、所定端末から問い合わせを受信し、当該問い合わせが含むキーワードに基づき、前記各説明文のうち前記キーワードを含むものを候補説明文として特定する処理と、前記マトリクスにおける前記候補説明文に関する記述に基づき決定木分析を実行し、分岐判定項目を質問文とした質問木を生成する処理と、を実行することを特徴とする。 Further, in the question generation method of the present invention, an information processing apparatus provided with a storage unit that holds each explanatory statement for each predetermined item has a presence or absence of each word in each explanatory statement based on the morphological analysis result of each explanatory statement. A process of generating a matrix in which the above is described, a process of receiving an inquiry from a predetermined terminal, and a process of specifying one of the above explanatory texts including the keyword as a candidate explanatory text based on the keywords included in the inquiry, and the said matrix. It is characterized in that the decision tree analysis is executed based on the description regarding the candidate explanatory text in the above, and the process of generating the question tree with the branch determination item as the question text is executed.

本発明によれば、質問木を効率的に自動生成可能となる。 According to the present invention, a question tree can be efficiently and automatically generated.

本実施形態の質問生成装置を含むネットワーク構成図である。It is a network configuration diagram including the question generator of this embodiment. 本実施形態における質問生成装置のハードウェア構成例を示す図である。It is a figure which shows the hardware configuration example of the question generator in this embodiment. 本実施形態のＱＡ情報のデータ構成例を示す図である。It is a figure which shows the data structure example of the QA information of this embodiment. 本実施形態のマトリクスのデータ構成例を示す図である。It is a figure which shows the data structure example of the matrix of this embodiment. 本実施形態の質問生成方法におけるフロー例１を示す図である。It is a figure which shows the flow example 1 in the question generation method of this embodiment. 本実施形態の質問生成方法を示すシーケンス例１を示す図である。It is a figure which shows the sequence example 1 which shows the question generation method of this embodiment. 本実施形態における画面例１を示す図である。It is a figure which shows the screen example 1 in this embodiment. 本実施形態における画面例２を示す図である。It is a figure which shows the screen example 2 in this embodiment. 本実施形態における画面例３を示す図である。It is a figure which shows the screen example 3 in this embodiment. 本実施形態の質問生成方法を示すシーケンス例２を示す図である。It is a figure which shows the sequence example 2 which shows the question generation method of this embodiment. 本実施形態における画面例４を示す図である。It is a figure which shows the screen example 4 in this embodiment. 本実施形態における画面例５を示す図である。It is a figure which shows the screen example 5 in this embodiment. 本実施形態における質問木の生成概念例１を示す図である。It is a figure which shows the generation concept example 1 of the question tree in this embodiment. 本実施形態における質問生成方法のフロー例２を示す図である。It is a figure which shows the flow example 2 of the question generation method in this embodiment. 本実施形態における質問木の生成概念例２を示す図である。It is a figure which shows the generation concept example 2 of the question tree in this embodiment. 本実施形態における質問生成方法のフロー例３を示す図である。It is a figure which shows the flow example 3 of the question generation method in this embodiment. 本実施形態における質問木の生成概念例３を示す図である。It is a figure which shows the generation concept example 3 of the question tree in this embodiment.

－－－ネットワーク構成－－－
以下に本発明の実施形態について図面を用いて詳細に説明する。図１は、本実施形態の質問生成装置１００を含むネットワーク構成図である。－－－ Network configuration －－－
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 is a network configuration diagram including the question generation device 100 of the present embodiment.

図１に示す質問生成装置１００は、質問木を効率的に自動生成するコンピュータ装置である。質問生成装置１００は、一例として、各種の金融サービスを顧客に提供する金融機関が運用するサーバ装置を想定できる。この質問生成装置１００は、チャットボット機能を有し、或いは既存のチャットボットシステムと連動し、顧客からの問い合わせに対して、適宜な応答を自動的に行う顧客対応業務用のシステムとも言える。 The question generation device 100 shown in FIG. 1 is a computer device that efficiently and automatically generates a question tree. As an example, the question generation device 100 can be assumed to be a server device operated by a financial institution that provides various financial services to customers. It can be said that the question generation device 100 has a chatbot function or is linked with an existing chatbot system to automatically respond to inquiries from customers as appropriate for customer-facing business.

従って本実施形態の質問生成装置１００は、こうしたチャットボット機能における自動応答処理に適用する質問木を、顧客からの問い合わせ機会ごとに生成し、当該チャットボット機能に提供するものとなる。 Therefore, the question generation device 100 of the present embodiment generates a question tree applied to the automatic response processing in such a chatbot function for each inquiry opportunity from the customer and provides the chatbot function.

なお、図１で例示するように、質問生成装置１００は、インターネットなどの適宜なネットワーク１０を介して、ユーザ端末２００と通信可能に接続されている。 As illustrated in FIG. 1, the question generation device 100 is communicably connected to the user terminal 200 via an appropriate network 10 such as the Internet.

ユーザ端末２００は、上述の顧客が操作する端末である。当該ユーザ端末２００を操作する顧客は、所定のＷＥＢサイトを閲覧しつつ、自身が行う問い合わせの内容を入力する。 The user terminal 200 is a terminal operated by the above-mentioned customer. A customer who operates the user terminal 200 inputs the content of an inquiry made by himself / herself while browsing a predetermined WEB site.

上述のＷＥＢサイトは、例えば、金融機関が運用する顧客対応業務用のサーバ（質問生成装置１００も含みうる）がインターネット上で公開するＷＥＢサイトである。
一方、ユーザ端末２００は、顧客が入力した問い合わせの情報を、ネットワーク１０を介して質問生成装置１００に送信することとなる。
－－－ハードウェア構成－－－ The above-mentioned WEB site is, for example, a WEB site that is open to the public on the Internet by a server for customer service business operated by a financial institution (which may also include a question generation device 100).
On the other hand, the user terminal 200 transmits the inquiry information input by the customer to the question generation device 100 via the network 10.
--- Hardware configuration ---

また、本実施形態の質問生成装置１００のハードウェア構成は図２に示すものとなる。すなわち本実施形態の質問生成装置１００は、記憶部１０１、メモリ１０３、演算部１０４、入力部１０５、出力部１０６、および、通信部１０７、を備える。
このうち、記憶部１０１は、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）やハードディスクドライブなど適宜な不揮発性記憶素子で構成される。
また、メモリ１０３は、ＲＡＭなど揮発性記憶素子で構成される。 Further, the hardware configuration of the question generation device 100 of the present embodiment is as shown in FIG. That is, the question generation device 100 of the present embodiment includes a storage unit 101, a memory 103, a calculation unit 104, an input unit 105, an output unit 106, and a communication unit 107.
Of these, the storage unit 101 is composed of an appropriate non-volatile storage element such as an SSD (Solid State Drive) or a hard disk drive.
Further, the memory 103 is composed of a volatile storage element such as a RAM.

また、演算部１０４は、記憶部１０１に保持されるプログラム１０２をメモリ１０３に読み出すなどして実行し装置自体の統括制御を行なうとともに各種判定、演算及び制御処理を行なうＣＰＵで構成される。
また、入力部１０５は、ユーザからのキー入力や音声入力を受け付けるキーボード、マウス等で構成される。
また、出力部１０６は、処理データの表示を行うディスプレイ、スピーカー等で構成される。 Further, the calculation unit 104 is composed of a CPU that executes the program 102 held in the storage unit 101 by reading it into the memory 103 to perform integrated control of the apparatus itself and also performs various determinations, calculations, and control processes.
Further, the input unit 105 is composed of a keyboard, a mouse, and the like that receive key input and voice input from the user.
Further, the output unit 106 is composed of a display, a speaker, and the like that display processing data.

また、通信部１０７は、ネットワーク１０と接続して、ユーザ端末２００等の他装置との通信処理を担うＮＦＣ（ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）で構成される。 Further, the communication unit 107 is composed of an NFC (Network Interface Card) that is connected to the network 10 and is responsible for communication processing with other devices such as the user terminal 200.

なお、記憶部１０１内には、本実施形態の質問生成装置として必要な機能を実装する為のプログラム１０２に加えて、ＱＡ情報１２５およびマトリクス１２６が少なくとも記憶されている。このＱＡ情報１２５およびマトリクス１２６の詳細については後述する。
－－－データ構造例－－－
続いて、本実施形態の質問生成装置１００が用いるテーブル類について説明する。図３に、本実施形態におけるＱＡ情報１２５の一例を示す。 In addition to the program 102 for implementing the function required as the question generation device of the present embodiment, at least the QA information 125 and the matrix 126 are stored in the storage unit 101. Details of the QA information 125 and the matrix 126 will be described later.
--- Data structure example ---
Subsequently, the tables used by the question generation device 100 of the present embodiment will be described. FIG. 3 shows an example of QA information 125 in this embodiment.

本実施形態のＱＡ情報１２５は、上述の金融機関において生成した、いわゆるＦＡＱなどの想定問答集である。金融機関において、顧客から問い合わせを受ける対象は、金融サービス（金融商品の概念含む）の不明点や不具合といった事項である。 The QA information 125 of the present embodiment is a collection of hypothetical questions and answers such as so-called FAQs generated by the above-mentioned financial institution. At financial institutions, the targets of inquiries from customers are matters such as unclear points and defects in financial services (including the concept of financial products).

よって、このＱＡ情報１２５においては、こうした事項に関する問い合わせに対して想定される１または複数の説明分を格納している。なお、以降は「問い合わせ」を「質問」、「説明文」を「回答」と称している。 Therefore, in this QA information 125, one or a plurality of explanations assumed for inquiries regarding such matters are stored. Hereinafter, "inquiry" is referred to as "question" and "explanatory text" is referred to as "answer".

そのデータ構造は、質問と回答の組合せを一意に特定するＩＤをキーとして、当該組合せにおける質問、当該質問に対する回答、当該回答に関する形態素解析結果、および、回答実績、といったデータから成るレコードの集合体である。 The data structure is a collection of records consisting of data such as a question in the combination, an answer to the question, a morphological analysis result related to the answer, and an answer record, using an ID that uniquely identifies the combination of the question and the answer as a key. Is.

このうち形態素解析結果は、一般的な形態素解析プログラムにより得られるもので、上述の回答（の文章）において意味を有する最小単位に該当する。こうした形態素解析結果は、質問生成装置１００が所定の形態素解析プログラムを実行した結果であってもよいし、或いは、金融機関における他の所定装置による結果であってもよい。 Of these, the morphological analysis result is obtained by a general morphological analysis program, and corresponds to the smallest unit that has meaning in the above answer (sentence). Such a morphological analysis result may be the result of the question generation device 100 executing a predetermined morphological analysis program, or may be the result of another predetermined device in a financial institution.

また、回答実績は、当該組合せにおける質問に対して当該回答を顧客に返した回数や頻度を示す値である。図３で示すＱＡ情報１２５の例では、一例として累積回数の値を回答実績としている。 In addition, the answer record is a value indicating the number and frequency of returning the answer to the customer for the question in the combination. In the example of QA information 125 shown in FIG. 3, the value of the cumulative number of times is taken as the answer record as an example.

したがって、この回答実績の値が所定の基準より大きい場合、或いは、相対的に他の組合せより大きい場合、当該組合せにおける質問に対して、当該回答が用いられやすい、ことを示している。換言すれば、顧客らが不明点等を感じる機会が多い事項（の質問）に関して、そうした顧客らの条件に合致しやすい回答、であるとも言える。 Therefore, when the value of this answer record is larger than a predetermined standard, or when it is relatively larger than another combination, it indicates that the answer is likely to be used for the question in the combination. In other words, it can be said that it is an answer that easily meets the conditions of such customers regarding matters (questions) that customers often feel unclear points.

続いて図４に、本実施形態のマトリクス１２６の一例を示す。本実施形態のマトリクス１２６は、上述のＱＡ情報１２５における各回答の形態素解析結果に基づき、当該各回答における各単語の有無を記述したテーブルである。 Subsequently, FIG. 4 shows an example of the matrix 126 of the present embodiment. The matrix 126 of the present embodiment is a table describing the presence / absence of each word in each answer based on the morphological analysis result of each answer in the above-mentioned QA information 125.

そのデータ構造は、上述のＱＡ情報１２５における各組合せのＩＤをキーとして、当該組合せにおける回答での所定単語の有無といったデータから成るレコードの集合体である。 The data structure is a collection of records composed of data such as the presence or absence of a predetermined word in the answer in the combination, using the ID of each combination in the above-mentioned QA information 125 as a key.

図４の例では、上述の単語として、単語１「住所変更」、単語２「本人」、単語３「来店」、単語４「法定代理人」、・・・、といったものを示している。また、こうした単語の有無については「１」で「有」、「０」で「無」を示している。 In the example of FIG. 4, as the above-mentioned words, the word 1 “address change”, the word 2 “principal”, the word 3 “visit”, the word 4 “legal agent”, and the like are shown. Regarding the presence or absence of such words, "1" indicates "yes" and "0" indicates "no".

したがって、例えば、ＩＤ「ＴＥＸＴ１」なる組合せにおける回答では、単語１「住所変更」は「１」、単語２「本人」は「１」、単語３「来店」は「１」、単語４「法定代理人」は「０」、となっている。また、例えばＩＤ「ＴＥＸＴ２」なる組合せにおける回答では、単語１「住所変更」は「１」、単語２「本人」は「１」、単語３「来店」は「０」、単語４「法定代理人」は「０」、となっている。また、例えばＩＤ「ＴＥＸＴ３」なる組合せにおける回答では、単語１「住所変更」は「１」、単語２「本人」は「０」、単語３「来店」は「０」、単語４「法定代理人」は「１」、となっている。また、例えばＩＤ「ＴＥＸＴ４」なる組合せにおける回答では、単語１「住所変更」は「１」、単語２「本
人」は「０」、単語３「来店」は「０」、単語４「法定代理人」は「０」、となっている。 Therefore, for example, in the answer in the combination of ID "TEXT1", the word 1 "address change" is "1", the word 2 "person" is "1", the word 3 "visit" is "1", and the word 4 "legal proxy". "People" is "0". In addition, for example, in the answer in the combination of ID "TEXT2", the word 1 "address change" is "1", the word 2 "person" is "1", the word 3 "visit" is "0", and the word 4 "legal agent". Is "0". In addition, for example, in the answer in the combination of ID "TEXT3", the word 1 "address change" is "1", the word 2 "person" is "0", the word 3 "visit" is "0", and the word 4 "legal agent". "Is" 1 ". In addition, for example, in the answer in the combination of ID "TEXT4", the word 1 "address change" is "1", the word 2 "person" is "0", the word 3 "visit" is "0", and the word 4 "legal agent". Is "0".

なお、マトリクス１２６には、複数のマトリクスが含まれうる。例えば、ＱＡ情報１２５における各回答のうち、所定期間における回答実績が所定基準以上の頻出回答であるものに関する形態素解析結果に基づき、当該頻出回答における各単語の有無を記述した頻出回答向けマトリクスが想定できる。また他にも、ＱＡ情報１２５における各回答のうち、上述の頻出回答以外の非頻出回答であるものに関する形態素解析結果に基づき、当該非頻出回答における各単語の有無を記述した非頻出回答向けマトリクスも想定できる。
－－－質問生成方法の手順について－－－ The matrix 126 may include a plurality of matrices. For example, a matrix for frequent answers that describes the presence or absence of each word in the frequent answers is assumed based on the morphological analysis result of each answer in the QA information 125 that has a frequent response record in a predetermined period of a predetermined standard or more. can. In addition, a matrix for non-frequent answers that describes the presence or absence of each word in the non-frequent answers based on the morphological analysis results of the non-frequent answers other than the above-mentioned frequent answers among the answers in the QA information 125. Can also be assumed.
－－－ About the procedure of question generation method －－－

以下、本実施形態における質問生成方法の実際手順について図に基づき説明する。以下で説明する質問生成方法に対応する各種動作は、質問生成装置１００がメモリ等に読み出して実行するプログラムによって実現される。そして、このプログラムは、以下に説明される各種の動作を行うためのコードから構成されている。
ここではまず、マトリクス１２６の生成手順について説明する。図５は本実施形態における質問生成方法のフロー例１を示す図である。 Hereinafter, the actual procedure of the question generation method in the present embodiment will be described with reference to the figure. Various operations corresponding to the question generation method described below are realized by a program read by the question generation device 100 into a memory or the like and executed. The program is composed of codes for performing various operations described below.
Here, first, the procedure for generating the matrix 126 will be described. FIG. 5 is a diagram showing a flow example 1 of the question generation method in the present embodiment.

この場合、質問生成装置１００は、ＱＡ情報１２５における各組合せにおける回答（説明文）欄の値、すなわち文章に対して形態素解析を実行する（ｓ１）。質問生成装置１００は、形態素解析プログラムを予め保持し、適宜に呼び出して利用出来るものとする。 In this case, the question generation device 100 executes morphological analysis on the value in the answer (explanatory text) column in each combination in the QA information 125, that is, the text (s1). The question generation device 100 holds a morphological analysis program in advance and can be appropriately called and used.

上述の形態素解析により、解析対象となる文章において意味を有する最小単位を特定できることとなる。例えば、「本人によるご来店の住所変更・・・」という回答に関して得た形態素解析結果は、「本人」、「来店」、「住所変更」、を少なくとも含む単語群となる。 By the above-mentioned morphological analysis, the smallest unit having meaning in the text to be analyzed can be specified. For example, the morphological analysis result obtained for the answer "change of address of visit by the person" is a word group including at least "person", "visit", and "change of address".

続いて、質問生成装置１００は、ｓ１で得た形態素解析結果である単語群に基づき、上述の各回答（説明文）における各単語の有無に応じた所定値を、所定の雛形マトリクスに設定し、マトリクスを生成する（ｓ２）。 Subsequently, the question generator 100 sets a predetermined value according to the presence or absence of each word in each of the above-mentioned answers (explanatory sentences) in a predetermined template matrix based on the word group which is the morphological analysis result obtained in s1. , Generate a matrix (s2).

上述の雛形マトリクスは、図４で例示したマトリクス１２６におけるテーブル構造に対応したマトリクスを想定できる。また、各単語の有無に応じた所定値は、同じく図４で例示したマトリクス１２６における、該当単語が「有」の場合の「１」、該当単語が「無」の場合の「０」、を想定できる。
また、質問生成装置１００は、ｓ２で生成したマトリクスを記憶部１０１のマトリクス１２６として格納し（ｓ３）、処理を終了する。 As the above-mentioned template matrix, a matrix corresponding to the table structure in the matrix 126 illustrated in FIG. 4 can be assumed. Further, the predetermined value according to the presence or absence of each word is "1" when the corresponding word is "Yes" and "0" when the corresponding word is "None" in the matrix 126 also exemplified in FIG. I can imagine.
Further, the question generation device 100 stores the matrix generated in s2 as the matrix 126 of the storage unit 101 (s3), and ends the process.

続いて図６は、本実施形態の質問生成方法を示すシーケンス例１を示す図である。ここで、或る顧客がユーザ端末２００を操作し、問い合わせ画面（図７Ａの画面６００）にて、「来店し住所変更したい」といった問い合わせ内容を入力し、当該問い合わせを質問生成装置１００に送信してきたとする。 Subsequently, FIG. 6 is a diagram showing a sequence example 1 showing the question generation method of the present embodiment. Here, a customer operates the user terminal 200, inputs an inquiry content such as "I want to visit the store and change the address" on the inquiry screen (screen 600 in FIG. 7A), and sends the inquiry to the question generation device 100. Suppose.

一方、これを受ける質問生成装置１００は、記憶部１０１におけるＱＡ情報１２５に対し、上述の問い合わせ内容が含むキーワード（「来店」、「住所変更」）による検索処理を実行する（ｓ１０）。 On the other hand, the question generation device 100 that receives this executes a search process for the QA information 125 in the storage unit 101 using the keywords (“visit”, “address change”) included in the above inquiry content (s10).

具体的には、質問生成装置１００は、ＱＡ情報１２５における各レコードの「質問」欄の値と、上述の問い合わせ内容が含むキーワード（「来店」、「住所変更」）との照合処理を実行し、該当するレコードを候補説明文に関するレコードとして特定する。 Specifically, the question generation device 100 executes a collation process between the value in the "question" column of each record in the QA information 125 and the keywords ("visit", "address change") included in the above inquiry content. , Identify the relevant record as a record for the candidate description.

続いて、質問生成装置１００は、マトリクス１２６における要素のうち、ｓ１０で特定したレコードに関するものを当該レコードのＩＤに基づき特定し、ここで特定した要素に関して決定木分析を実行する（ｓ１１）。なお、決定木分析の手法自体は、既存のものを適宜に採用すればよい。 Subsequently, the question generator 100 identifies the elements in the matrix 126 related to the record specified in s10 based on the ID of the record, and executes the decision tree analysis for the elements specified here (s11). As the decision tree analysis method itself, an existing method may be appropriately adopted.

これにより、ｓ１０の検索で特定された複数のレコードを、それぞれのレコードが含む単語の有無に基づいて分別するシナリオが生成できる。その結果、上述の単語の有無に応じた問いかけである分岐判定項目、を質問文とした質問木が生成される。なお、図６では、質問木を「決定木」と記している（以下同様）。 As a result, it is possible to generate a scenario in which a plurality of records identified by the search of s10 are sorted based on the presence or absence of words included in each record. As a result, a question tree is generated with the branch determination item, which is a question depending on the presence or absence of the above-mentioned word, as a question sentence. In FIG. 6, the question tree is referred to as a "decision tree" (the same applies hereinafter).

続いて、質問生成装置１００は、ｓ１１の決定木分析で得た質問木の段数が予め定めた基準を越えるか判定する（ｓ１２）。質問木における段数とは、最初の分岐判定項目から最終的に回答を導く最後の分岐判定項目までの、分岐判定項目の数である。 Subsequently, the question generation device 100 determines whether the number of stages of the question tree obtained by the decision tree analysis of s11 exceeds a predetermined standard (s12). The number of stages in the question tree is the number of branch determination items from the first branch determination item to the last branch determination item that finally leads to an answer.

この判定の結果、上述の段数が基準を超える場合、質問生成装置１００は、図８で別途示すシーケンス例に処理を遷移させる。この遷移は、現状の質問木では顧客に対して問いかけの回数が多いため、その回数を低減するとの意図に基づくものである。 As a result of this determination, when the number of stages described above exceeds the standard, the question generation device 100 shifts the process to the sequence example separately shown in FIG. This transition is based on the intention of reducing the number of questions asked to the customer in the current question tree.

一方、上述の判定の結果、段数が基準を超えない場合、質問生成装置１００は、ｓ１１で得た質問木を、例えばメモリ１０３における所定の記憶領域に格納する（ｓ１３）。 On the other hand, if the number of stages does not exceed the standard as a result of the above determination, the question generation device 100 stores the question tree obtained in s11 in a predetermined storage area in, for example, the memory 103 (s13).

また、質問生成装置１００は、当該質問木における最上位の段数における分岐判定項目に対応した質問（以下、問いかけ）を、上述のユーザ端末２００に出力する（ｓ１４）。 Further, the question generation device 100 outputs a question (hereinafter referred to as a question) corresponding to the branch determination item in the highest number of stages in the question tree to the above-mentioned user terminal 200 (s14).

ユーザ端末２００は、上述の問いかけを表示して（例：図７Ｂの画面６１０）、顧客からの回答を取得する。また、ユーザ端末２００は、ここで得た回答を質問生成装置１００に返す。 The user terminal 200 displays the above-mentioned question (example: screen 610 of FIG. 7B) and obtains an answer from the customer. Further, the user terminal 200 returns the answer obtained here to the question generation device 100.

他方、質問生成装置１００は、ユーザ端末２００から送信されてきた回答、すなわち最初の問いかけに対する回答を得て、上述の質問木に適用し、当該質問木における最終段すなわち回答に至るまで、各段の分岐判定項目に応じた問いかけを繰り返し実行する（ｓ１５～ｓ１６）。質問生成装置１００は、回答に至った場合、これをユーザ端末２００に出力し（図７Ｃの画面６２０）、処理を終了する。 On the other hand, the question generator 100 obtains the answer sent from the user terminal 200, that is, the answer to the first question, applies it to the above-mentioned question tree, and reaches the final stage, that is, the answer in the question tree. The question according to the branch determination item of is repeatedly executed (s15 to s16). When the question generation device 100 reaches an answer, it outputs this to the user terminal 200 (screen 620 in FIG. 7C) and ends the process.

なお、上述のｓ１２での判定の結果、質問木における段数が基準を超える場合、質問生成装置１００は、図８のシーケンス例で示すように、ユーザ端末２００に対し、追加キーワード（図中では「質問絞り込み」）の要求メッセージを通知する（ｓ２１）。 As a result of the above-mentioned determination in s12, when the number of stages in the question tree exceeds the standard, the question generation device 100 adds an additional keyword (in the figure, "" to the user terminal 200, as shown in the sequence example of FIG. Notify the request message of "question narrowing down" (s21).

一方、ユーザ端末２００は、上述の追加キーワードの要求メッセージを表示し、顧客から、「代理人」などといった追加キーワードの入力を受け付ける（図９Ａの画面８００）。また、質問生成装置１００は、ここで受け付けた追加キーワードを質問生成装置１００に返信する。 On the other hand, the user terminal 200 displays the above-mentioned request message for the additional keyword and accepts the input of the additional keyword such as "agent" from the customer (screen 800 in FIG. 9A). Further, the question generator 100 returns the additional keyword received here to the question generator 100.

質問生成装置１００は、ユーザ端末２００から追加キーワードを受信し、既にｓ１０の検索で用いたキーワード（例：「住所変更」）に当該追加キーワード（例：「代理人」）を加えたアンド検索を、ＱＡ情報１２５にて実行する（ｓ２２）。 The question generation device 100 receives an additional keyword from the user terminal 200, and performs an and search in which the additional keyword (example: "agent") is added to the keyword already used in the search of s10 (example: "address change"). , QA information 125 is executed (s22).

具体的には、質問生成装置１００は、ＱＡ情報１２５における各レコードの「質問」欄の値と、上述の問い合わせ内容が含むキーワード（「住所変更」および「代理人」）との
照合処理を実行し、該当するレコードを新たな候補説明文に関するレコードとして特定する。 Specifically, the question generation device 100 executes a collation process between the value in the "question" column of each record in the QA information 125 and the keywords ("address change" and "agent") included in the above inquiry content. Then, the corresponding record is specified as a record related to the new candidate description.

続いて、質問生成装置１００は、マトリクス１２６における要素のうち、ｓ２２で特定したレコードに関するものを当該レコードのＩＤに基づき特定し、ここで特定した要素に関して決定木分析を実行する（ｓ２３）。 Subsequently, the question generator 100 identifies the elements in the matrix 126 related to the record specified in s22 based on the ID of the record, and executes the decision tree analysis for the elements specified here (s23).

これにより、ｓ２２の検索で特定された新たな候補説明文に対応する複数のレコードを、それぞれのレコードが含む単語の有無に基づいて分別するシナリオが生成できる。その結果、上述の単語の有無に応じた問いかけである分岐判定項目、を質問文とした新たな質問木が生成される。
続いて、質問生成装置１００は、ｓ２３の決定木分析で得た新たな質問木の段数が予め定めた基準を越えるか判定する（ｓ２４）。 As a result, it is possible to generate a scenario in which a plurality of records corresponding to the new candidate explanations identified by the search of s22 are sorted based on the presence or absence of words included in each record. As a result, a new question tree is generated with the branch determination item, which is a question depending on the presence or absence of the above-mentioned word, as a question sentence.
Subsequently, the question generation device 100 determines whether the number of stages of the new question tree obtained by the decision tree analysis of s23 exceeds a predetermined standard (s24).

この判定の結果、上述の段数が基準を超える場合、質問生成装置１００は、ｓ２１に処理を遷移させる。この遷移は、現状の質問木では顧客に対して問いかけの回数が多いため、その回数を低減するとの意図に基づくものである。 As a result of this determination, when the above-mentioned number of stages exceeds the standard, the question generation device 100 shifts the process to s21. This transition is based on the intention of reducing the number of questions asked to the customer in the current question tree.

一方、上述の判定の結果、段数が基準を超えない場合、質問生成装置１００は、ｓ２３で得た質問木を、例えばメモリ１０３における所定の記憶領域に格納し（ｓ２５）、処理を終了する。 On the other hand, if the number of stages does not exceed the standard as a result of the above determination, the question generation device 100 stores the question tree obtained in s23 in a predetermined storage area in, for example, the memory 103 (s25), and ends the process.

ここで、上述のマトリクスと質問木との関係を踏まえた、質問木の生成概念例を図１０に示す。図１０で示す具体例は、ＱＡ情報１２５におけるレコードのうち「ＴＥＸＴ１」～「ＴＥＸＴ４」に関して「単語１」～「単語４」の各単語の有無を規定したマトリクス７００と、このマトリクス７００を決定木分析の対象とした場合の質問木７５０、を示している。 Here, FIG. 10 shows an example of the concept of generating a question tree based on the relationship between the above matrix and the question tree. Specific examples shown in FIG. 10 are a matrix 700 that defines the presence or absence of each word of "word 1" to "word 4" with respect to "TEXT1" to "TEXT4" among the records in the QA information 125, and a decision tree for this matrix 700. The question tree 750, which is the subject of analysis, is shown.

この場合、顧客からの初回の質問７５１、すなわちユーザ端末２００から受けた問い合わせの内容は、「住所変更」である。また、この「住所変更」なるキーワードを回答（説明文）に含むレコードをＱＡ情報１２５で特定すると、「ＴＥＸＴ１」～「ＴＥＸＴ４」が特定されたものとする。 In this case, the content of the first question 751 from the customer, that is, the inquiry received from the user terminal 200 is "address change". Further, when the record including the keyword "address change" in the answer (explanatory text) is specified by the QA information 125, it is assumed that "TEXT1" to "TEXT4" are specified.

すると、この「ＴＥＸＴ１」～「ＴＥＸＴ４」のうちいずれかを、当該顧客からの問い合わせに対する回答として特定する必要がある。よってこの「ＴＥＸＴ１」～「ＴＥＸＴ４」における「単語１」～「単語４」の各有無に基づく決定木分析により、「ＴＥＸＴ１」～「ＴＥＸＴ４」を切り分ける、質問木７５０が生成される。 Then, it is necessary to specify any one of the "TEXT1" to "TEXT4" as an answer to the inquiry from the customer. Therefore, the decision tree analysis based on the presence or absence of each of "word 1" to "word 4" in "TEXT1" to "TEXT4" generates a question tree 750 that separates "TEXT1" to "TEXT4".

図１０の例では、分岐判定項目７５２～７５７、すなわち顧客への問いかけとして、「本人」か否か、「本人」である場合に「来店」するか否か、を順次問いかけすることで、当該顧客に提示すべき回答７６０～７６３として「ＴＥＸＴ１」と「ＴＥＸＴ２」とを切り分けできる。 In the example of FIG. 10, the branch determination items 752 to 757, that is, whether or not the customer is the "principal" and whether or not the "principal" is "visiting the store" are sequentially asked. "TEXT1" and "TEXT2" can be separated as answers 760 to 763 to be presented to the customer.

また、「本人」ではない場合に、それは「法定代理人」か否か、を順次問いかけすることで、当該顧客に提示すべき回答７６０～７６３として「ＴＥＸＴ３」と「ＴＥＸＴ４」とを切り分けできる。 In addition, if the customer is not the "principal", by sequentially asking whether or not it is a "legal agent", "TEXT3" and "TEXT4" can be separated as answers 760 to 763 to be presented to the customer.

続いて、質問木生成を効率化する工夫について説明する。図１１は本実施形態における質問生成方法のフロー例２を示す図であり、図１２は本実施形態における質問木の生成概念例２を示す図である。 Next, we will explain how to improve the efficiency of question tree generation. FIG. 11 is a diagram showing a flow example 2 of the question generation method in the present embodiment, and FIG. 12 is a diagram showing a question tree generation concept example 2 in the present embodiment.

この場合、質問生成装置１００は、上述のｓ１０で得た候補説明文のうち、各候補説明文を横断して頻出する単語を含んでいるものを優先候補説明文として特定する（ｓ３０）。 In this case, the question generation device 100 specifies, among the candidate explanatory texts obtained in s10 described above, those including words that frequently appear across each candidate explanatory text as priority candidate explanatory texts (s30).

具体的には、質問生成装置１００は、例えば、マトリクス１２６における各候補説明文に関する各単語の有無の値を参照し、最多の候補説明文に存在する単語、２番目に多い候補説明文に存在する単語、および、３番目に多い候補説明文に存在する単語、を特定し、当該単語らを含んでいる候補説明文に関するＩＤをマトリクス１２６にて特定する。 Specifically, the question generator 100 refers to, for example, the value of the presence / absence of each word for each candidate explanation in the matrix 126, and is present in the word existing in the most candidate explanation and in the second most candidate explanation. The word to be used and the word existing in the third most candidate explanation sentence are specified, and the ID related to the candidate explanation sentence including the words is specified by the matrix 126.

なお、上述の最多の候補説明文に存在する単語、２番目に多い候補説明文に存在する単語、および、３番目に多い候補説明文に存在する単語、を特定するに際し、ＱＡ情報１２５における各候補説明文のレコードの「質問」欄の値を参照し、該当単語を特定するとしてもよい。 In addition, in specifying the word existing in the above-mentioned most candidate explanation sentence, the word existing in the second most candidate explanation sentence, and the word existing in the third most candidate explanation sentence, each in the QA information 125. The corresponding word may be specified by referring to the value in the "question" column of the record of the candidate description.

また、質問生成装置１００は、上述のｓ１１における質問木の生成に際し、マトリクス１２６における要素のうち、ｓ３０で特定した優先候補説明文のレコードに関するものを当該レコードのＩＤに基づき特定して頻出単語マトリクス（図１２の頻出単語マトリクス８１０参照）を生成し、この頻出単語マトリクスに関して決定木分析を実行する（ｓ３１）。 Further, when the question tree is generated in s11, the question generation device 100 specifies the record of the priority candidate explanatory text specified in s30 among the elements in the matrix 126 based on the ID of the record, and specifies the frequently-used word matrix. (See Frequent Word Matrix 810 in FIG. 12) is generated and a decision tree analysis is performed on this Frequent Word Matrix (s31).

これにより、ｓ３０で優先候補説明文として特定された複数のレコードを、それぞれのレコードが含む単語の有無に基づいて分別するシナリオが生成できる。その結果、上述の単語の有無に応じた問いかけである分岐判定項目、を質問文とした頻出単語向け質問木（図１２の頻出単語向け質問木８２０参照）が生成される。 As a result, it is possible to generate a scenario in which a plurality of records specified as priority candidate explanations in s30 are separated based on the presence or absence of words included in each record. As a result, a question tree for frequent words (see question tree 820 for frequent words in FIG. 12) is generated with the branch determination item, which is a question depending on the presence or absence of the above words, as a question sentence.

続いて、質問生成装置１００は、ｓ３１の決定木分析で得た頻出単語向け質問木における所定段の分岐判定項目で、候補説明文（ｓ１０で得ているもの）を区分しきれているか判定する（ｓ３２）。具体的には、頻出単語向け質問木において、複数の回答が１つの分岐判定項目（最終段のもの）に紐付けられている構成（図１２の頻出単語向け質問木８２０における構成８３０）が存在するか判定する。 Subsequently, the question generation device 100 determines whether or not the candidate explanatory text (the one obtained in s10) has been completely classified by the branch determination item in the predetermined stage in the question tree for frequently-used words obtained by the decision tree analysis of s31. (S32). Specifically, in the question tree for frequent words, there is a configuration in which a plurality of answers are associated with one branch determination item (the one in the final stage) (configuration 830 in the question tree 820 for frequent words in FIG. 12). Determine if you want to.

上述の判定の結果、候補説明文を区分しきれている場合（ｓ３２：ｙ）、質問生成装置１００は、当該頻出単語向け質問木を、例えばメモリ１０３における所定の記憶領域に格納し（ｓ３３）、処理を終了する。 As a result of the above determination, when the candidate explanatory texts have been completely divided (s32: y), the question generation device 100 stores the question tree for frequently-used words in a predetermined storage area in, for example, the memory 103 (s33). , End the process.

他方、上述の判定の結果、候補説明文を区分しきれていない場合（ｓ３２：ｎ）、質問生成装置１００は、上述の候補説明文のうち、優先候補説明文以外の非優先候補説明文を特定する（ｓ３４）。 On the other hand, as a result of the above-mentioned determination, when the candidate explanations are not completely classified (s32: n), the question generation device 100 uses the non-priority candidate explanations other than the priority candidate explanations among the above-mentioned candidate explanations. Identify (s34).

また、質問生成装置１００は、マトリクス１２６における要素のうち、ｓ３４で特定した非優先候補説明文のレコードに関するものを当該レコードのＩＤに基づき特定して非頻出単語マトリクス（図１２の非頻出単語マトリクス８３０参照）を生成し、この非頻出単語マトリクスに関して決定木分析を実行する（ｓ３５）。 Further, the question generator 100 specifies the elements in the matrix 126 related to the record of the non-priority candidate explanatory text specified in s34 based on the ID of the record, and specifies the non-frequent word matrix (the non-frequent word matrix in FIG. 12). 830) is generated and a decision tree analysis is performed on this infrequent word matrix (s35).

これにより、ｓ３４で非優先候補説明文として特定された複数のレコードを、それぞれのレコードが含む単語の有無に基づいて分別するシナリオが生成できる。その結果、上述の単語の有無に応じた問いかけである分岐判定項目、を質問文とした非頻出単語向け質問木（図１２の非頻出単語向け質問木８４０参照）が生成される。
また、質問生成装置１００は、当該非頻出単語向け質問木を、例えばメモリ１０３にお
ける所定の記憶領域に格納し（ｓ３６）、処理を終了する。 As a result, it is possible to generate a scenario in which a plurality of records specified as non-priority candidate explanations in s34 are separated based on the presence or absence of words included in each record. As a result, a question tree for non-frequent words (see question tree 840 for non-frequent words in FIG. 12) is generated with the branch determination item, which is a question depending on the presence or absence of the above-mentioned words, as a question sentence.
Further, the question generation device 100 stores the question tree for non-frequent words in a predetermined storage area in the memory 103, for example (s36), and ends the process.

また、質問木生成を効率化する別の工夫について説明する。図１３は本実施形態における質問生成方法のフロー例３を示す図であり、図１４は本実施形態における質問木の生成概念例３を示す図である。 In addition, another device to improve the efficiency of question tree generation will be explained. FIG. 13 is a diagram showing a flow example 3 of the question generation method in the present embodiment, and FIG. 14 is a diagram showing a question tree generation concept example 3 in the present embodiment.

この場合、質問生成装置１００は、ＱＡ情報１２５における各レコードのうち、例えば、回答実績の値が所定基準以上の頻出回答であるものを特定し、当該レコードにおける形態素解析結果欄の値を取得する（ｓ４０）。 In this case, the question generator 100 identifies, for example, frequently-used answers whose answer actual values are equal to or higher than a predetermined standard among the records in the QA information 125, and acquires the value in the morphological analysis result column in the record. (S40).

続いて、質問生成装置１００は、上述のｓ４０で得た形態素解析結果欄の値、すなわち単語群に基づき、当該頻出回答における各単語の有無を記述した頻出回答向けマトリクス（図１４の頻出回答向けマトリクス９１０参照）を生成する（ｓ４１）。 Subsequently, the question generator 100 describes a matrix for frequent answers (for frequent answers in FIG. 14) that describes the presence or absence of each word in the frequent answer based on the value in the morphological analysis result column obtained in s40 described above, that is, a word group. Matrix 910) is generated (s41).

続いて、質問生成装置１００は、ユーザ端末２００からの問い合わせを受信し、当該問い合わせが含むキーワードに基づき、ｓ４０で特定している頻出回答のうち当該キーワードを含むものを優先候補説明文として特定する（ｓ４２）。 Subsequently, the question generation device 100 receives the inquiry from the user terminal 200, and based on the keyword included in the inquiry, specifies the frequently-used answer specified in s40 that includes the keyword as the priority candidate explanatory text. (S42).

また、質問生成装置１００は、ｓ４１で生成している頻出回答向けマトリクスにおける、上述の優先候補説明文（ｓ４２で特定したもの）に関するものを当該優先候補説明文のＩＤに基づき特定し、ここで特定した要素に関して決定木分析を実行する（ｓ４３）。 Further, the question generation device 100 specifies the above-mentioned priority candidate explanation (specified in s42) in the matrix for frequent answers generated in s41 based on the ID of the priority candidate explanation, and here. A decision tree analysis is performed on the identified element (s43).

これにより、ｓ４２で優先候補説明文として特定された複数のレコードを、それぞれのレコードが含む単語の有無に基づいて分別するシナリオが生成できる。その結果、頻出回答向け質問木（図１４の頻出回答向け質問木９２０参照）が生成される。 As a result, it is possible to generate a scenario in which a plurality of records specified as priority candidate explanations in s42 are separated based on the presence or absence of words included in each record. As a result, a question tree for frequent answers (see question tree 920 for frequent answers in FIG. 14) is generated.

続いて、質問生成装置１００は、ｓ４３の決定木分析で得た頻出回答向け質問木における所定段の分岐判定項目で、優先候補説明文（ｓ４２で得ているもの）を区分しきれているか判定する（ｓ４４）。具体的には、頻出回答向け質問木において、複数の回答が１つの分岐判定項目（最終段のもの）に紐付けられている構成（図１４の頻出回答向け質問木９２０における構成９３０）が存在するか判定する。 Subsequently, the question generation device 100 determines whether or not the priority candidate explanatory text (the one obtained in s42) has been completely classified by the branch determination item in the predetermined stage in the question tree for frequent answers obtained by the decision tree analysis of s43. (S44). Specifically, in the question tree for frequent answers, there is a configuration in which a plurality of answers are linked to one branch determination item (the one in the final stage) (configuration 930 in the question tree 920 for frequent answers in FIG. 14). Determine if you want to.

上述の判定の結果、優先候補説明文を区分しきれている場合（ｓ４４：ｙ）、質問生成装置１００は、当該頻出回答向け質問木を、例えばメモリ１０３における所定の記憶領域に格納し（ｓ４５）、処理を終了する。 As a result of the above determination, when the priority candidate explanatory text has been completely divided (s44: y), the question generation device 100 stores the question tree for frequent answers in a predetermined storage area in, for example, the memory 103 (s45). ), End the process.

他方、上述の判定の結果、優先候補説明文を区分しきれていない場合（ｓ４４：ｎ）、質問生成装置１００は、ＱＡ情報１２５における各レコードのうち、上述の頻出回答以外の非頻出回答における形態素解析結果欄の値を取得する（ｓ４６）。 On the other hand, as a result of the above-mentioned determination, when the priority candidate explanatory text is not completely classified (s44: n), the question generation device 100 in each record in the QA information 125, in the non-frequent answer other than the above-mentioned frequent answer. The value in the morphological analysis result column is acquired (s46).

続いて、質問生成装置１００は、上述のｓ４６で得た形態素解析結果欄の値、すなわち単語群に基づき、当該非頻出回答における各単語の有無を記述した非頻出回答向けマトリクス（図１４の非頻出回答向けマトリクス９１５参照）を生成する（ｓ４７）。
また、質問生成装置１００は、この非頻出回答向けマトリクスに関して決定木分析を実行する（ｓ４８）。 Subsequently, the question generator 100 describes a matrix for non-frequent answers (non-frequent in FIG. 14) that describes the presence or absence of each word in the non-frequent answer based on the value in the morphological analysis result column obtained in s46 above, that is, the word group. (See Matrix 915 for Frequent Answers) is generated (s47).
In addition, the question generator 100 performs a decision tree analysis on this matrix for infrequent answers (s48).

これにより、非優先候補説明文である複数のレコードを、それぞれのレコードが含む単語の有無に基づいて分別するシナリオが生成できる。その結果、上述の単語の有無に応じた問いかけである分岐判定項目、を質問文とした非頻出回答向け質問木（図１４の非頻出回答向け質問木９４０参照）が生成される。
また、質問生成装置１００は、当該非頻出回答向け質問木を、例えばメモリ１０３における所定の記憶領域に格納し（ｓ４９）、処理を終了する。 As a result, it is possible to generate a scenario in which a plurality of records that are non-priority candidate explanations are separated based on the presence or absence of words contained in each record. As a result, a question tree for non-frequent answers (see question tree 940 for non-frequent answers in FIG. 14) is generated with the branch determination item, which is a question depending on the presence or absence of the above-mentioned word, as a question sentence.
Further, the question generation device 100 stores the question tree for non-frequent answers in a predetermined storage area in the memory 103, for example (s49), and ends the process.

以上、本発明を実施するための最良の形態などについて具体的に説明したが、本発明はこれに限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能である。
こうした本実施形態によれば、質問木を効率的に自動生成可能となる。 Although the best mode for carrying out the present invention has been specifically described above, the present invention is not limited to this, and various modifications can be made without departing from the gist thereof.
According to this embodiment, the question tree can be efficiently and automatically generated.

本明細書の記載により、少なくとも次のことが明らかにされる。すなわち、本実施形態の質問生成装置において、前記演算部は、前記生成した質問木の段数が所定基準を越える場合、前記端末に追加キーワードの要求メッセージを通知し、当該端末から追加キーワードを取得する処理を更に実行し、前記候補説明文の特定に際し、前記キーワードおよび前記追加キーワードに基づき、前記各説明文のうち前記キーワードおよび前記追加キーワードを含むものを新たな候補説明文として特定し、前記質問木の生成に際し、前記マトリクスにおける前記新たな候補説明文に関する記述に基づき決定木分析を実行し、分岐判定項目を質問文とした新たな質問木を生成し、前記新たな質問木の段数が前記所定基準を超えなくなるまで、前記追加キーワードの取得、前記新たな候補説明文の特定、および、前記新たな質問木の生成、を繰り返し行うものである、としてもよい。 The description herein reveals at least the following: That is, in the question generation device of the present embodiment, when the number of stages of the generated question tree exceeds a predetermined standard, the calculation unit notifies the terminal of a request message for the additional keyword and acquires the additional keyword from the terminal. Further processing is performed, and when the candidate explanation is specified, the keyword and the additional keyword are included in each of the explanations based on the keyword and the additional keyword, and the candidate description is specified as a new candidate explanation, and the question is asked. When generating a tree, a decision tree analysis is performed based on the description of the new candidate explanation in the matrix, a new question tree with a branch judgment item as a question sentence is generated, and the number of stages of the new question tree is the above. The acquisition of the additional keyword, the identification of the new candidate explanatory text, and the generation of the new question tree may be repeated until the predetermined standard is not exceeded.

これによれば、質問木の段数が多すぎる、すなわち問い合わせを行ってきた顧客とのやりとりが多くなりすぎることになる場合、これを効果的に回避すること可能となる。ひいては、質問木をより効率的に自動生成可能となる。 According to this, when the number of steps of the question tree is too large, that is, the number of interactions with the customer who has made an inquiry becomes too large, it is possible to effectively avoid this. As a result, the question tree can be automatically generated more efficiently.

また、本実施形態の質問生成装置において、前記演算部は、前記候補説明文のうち、各候補説明文を横断して頻出する単語を含んでいるものを優先候補説明文として特定し、前記質問木の生成に際し、前記マトリクスにおける前記優先候補説明文に関する記述に基づき決定木分析を実行して、分岐判定項目を質問文とした頻出単語向け質問木を生成するものである、としてもよい。 Further, in the question generation device of the present embodiment, the calculation unit specifies, among the candidate explanations, a word that frequently appears across each candidate explanation as a priority candidate explanation, and asks the question. When generating a tree, a decision tree analysis may be performed based on the description of the priority candidate explanatory text in the matrix to generate a question tree for frequently-used words using a branch determination item as a question sentence.

これによれば、候補説明文の群の中で広く出現する単語を含む候補説明文を優先的に使用し、質問木を生成するため、共通性が推定される事項間でポピュラーな内容の問い合わせに対して効率的に説明文すなわち回答を導く可能性を高める。ひいては、質問木をより効率的に自動生成可能となる。 According to this, since candidate explanations including words that appear widely in the group of candidate explanations are preferentially used to generate a question tree, inquiries about popular contents among items that are presumed to have commonality. Increase the possibility of efficiently deriving an explanation, that is, an answer. As a result, the question tree can be automatically generated more efficiently.

また、本実施形態の質問生成装置において、前記演算部は、前記頻出単語向け質問木における所定段の分岐判定項目で前記候補説明文を区分しきれていない場合、前記候補説明文のうち、前記優先候補説明文以外の非優先候補説明文を特定し、前記マトリクスにおける前記非優先候補説明文に関する記述に基づき、前記所定段の分岐判定項目を起点にした決定木分析を実行して、非頻出単語向け質問木を生成するものである、としてもよい。 Further, in the question generation device of the present embodiment, when the calculation unit cannot completely classify the candidate explanatory text in the branch determination item of the predetermined stage in the question tree for frequently-used words, the candidate explanatory text is described as described above. A non-priority candidate description other than the priority candidate description is specified, and a decision tree analysis starting from the branch determination item of the predetermined stage is executed based on the description of the non-priority candidate description in the matrix, and the non-frequent occurrence occurs. It may be to generate a question tree for words.

これによれば、上述の頻出単語向け質問木での説明文すなわち回答の導出が難しい状況に対して的確に対応し、ひいては、質問木をより効率的に自動生成可能となる。 According to this, it is possible to accurately deal with the situation where it is difficult to derive the explanatory text, that is, the answer in the above-mentioned question tree for frequently-used words, and by extension, the question tree can be automatically generated more efficiently.

また、本実施形態の質問生成装置において、前記演算部は、前記各説明文のうち、所定期間において回答として出力された実績が所定基準以上の頻出回答であるものに関する形態素解析結果に基づき、前記頻出回答における各単語の有無を記述した頻出回答向けマトリクスの生成を行い、所定端末から問い合わせを受信し、当該問い合わせが含むキーワードに基づき、前記頻出回答のうち前記キーワードを含むものを優先候補説明文として特定し、前記頻出回答向けマトリクスにおける前記優先候補説明文に関する記述に基づき決定木分析を実行して、分岐判定項目を質問文とした頻出回答向け質問木を生成するものである、としてもよい。 Further, in the question generation device of the present embodiment, the calculation unit describes the above-mentioned explanatory texts based on the morphological analysis result regarding the frequently-appearing answers whose actual results output as answers in a predetermined period are more than a predetermined standard. A matrix for frequent answers that describes the presence or absence of each word in the frequent answers is generated, an inquiry is received from a predetermined terminal, and based on the keywords included in the inquiry, those containing the keywords are given priority candidate explanations. It is also possible to execute a decision tree analysis based on the description regarding the priority candidate explanatory text in the matrix for frequent answers to generate a question tree for frequent answers with the branch determination item as the question text. ..

これによれば、例えば最近の顧客が望む回答の傾向等を踏まえて、効率的に回答を導く質問木を生成しやすくなる。ひいては、質問木をより効率的に自動生成可能となる。 According to this, it becomes easy to generate a question tree that efficiently guides the answer, for example, based on the tendency of the answer desired by the customer in recent years. As a result, the question tree can be automatically generated more efficiently.

また、本実施形態の質問生成装置において、前記演算部は、前記頻出回答向け質問木における所定段の分岐判定項目で前記優先候補説明文を区分しきれていない場合、前記各説明文のうち、前記頻出回答以外の非頻出回答であるものに関する形態素解析結果に基づき、前記非頻出回答における各単語の有無を記述した非頻出回答向けマトリクスの生成を行い、前記非頻出回答向けマトリクスにおける前記非頻出回答に関する記述に基づき、前記所定段の分岐判定項目を起点にした決定木分析を実行して、非頻出回答向け質問木を生成するものである、としてもよい。 Further, in the question generation device of the present embodiment, when the calculation unit cannot completely classify the priority candidate explanatory text in the branch determination item of the predetermined stage in the question tree for frequent answers, among the above-mentioned explanatory texts, Based on the morphological analysis results for non-frequent answers other than the frequent answers, a matrix for non-frequent answers describing the presence or absence of each word in the non-frequent answers is generated, and the non-frequent answers in the non-frequent answers matrix are generated. Based on the description of the answer, the decision tree analysis starting from the branch determination item of the predetermined stage may be executed to generate the question tree for the infrequent answer.

これによれば、上述の頻出回答向け質問木での説明文すなわち回答の導出が難しい状況に対して的確に対応し、ひいては、質問木をより効率的に自動生成可能となる。 According to this, it is possible to accurately deal with the above-mentioned explanatory text in the question tree for frequent answers, that is, the situation where it is difficult to derive the answer, and by extension, the question tree can be automatically generated more efficiently.

また、本実施形態の質問生成方法において、前記情報処理装置が、前記生成した質問木の段数が所定基準を越える場合、前記端末に追加キーワードの要求メッセージを通知し、当該端末から追加キーワードを取得する処理を更に実行し、前記候補説明文の特定に際し、前記キーワードおよび前記追加キーワードに基づき、前記各説明文のうち前記キーワードおよび前記追加キーワードを含むものを新たな候補説明文として特定し、前記質問木の生成に際し、前記マトリクスにおける前記新たな候補説明文に関する記述に基づき決定木分析を実行し、分岐判定項目を質問文とした新たな質問木を生成し、前記新たな質問木の段数が前記所定基準を超えなくなるまで、前記追加キーワードの取得、前記新たな候補説明文の特定、および、前記新たな質問木の生成、を繰り返し行う、としてもよい。 Further, in the question generation method of the present embodiment, when the information processing device notifies the terminal of a request message for an additional keyword when the number of stages of the generated question tree exceeds a predetermined standard, the additional keyword is acquired from the terminal. Further, when specifying the candidate explanatory text, the keyword and the additional keyword are included in the respective explanatory texts based on the keyword and the additional keyword, and the candidate explanatory text is specified as a new candidate explanatory text. When generating the question tree, the decision tree analysis is executed based on the description about the new candidate explanation in the matrix, a new question tree with the branch judgment item as the question sentence is generated, and the number of stages of the new question tree is set. The acquisition of the additional keyword, the identification of the new candidate explanatory text, and the generation of the new question tree may be repeated until the predetermined criteria are not exceeded.

また、本実施形態の質問生成方法において、前記情報処理装置が、前記候補説明文のうち、各候補説明文を横断して頻出する単語を含んでいるものを優先候補説明文として特定し、前記質問木の生成に際し、前記マトリクスにおける前記優先候補説明文に関する記述に基づき決定木分析を実行して、分岐判定項目を質問文とした頻出単語向け質問木を生成する、としてもよい。 Further, in the question generation method of the present embodiment, the information processing apparatus specifies, among the candidate explanations, a word that frequently appears across each candidate explanation as a priority candidate explanation, and the above-mentioned When generating the question tree, the decision tree analysis may be executed based on the description of the priority candidate explanatory text in the matrix to generate the question tree for frequently-used words with the branch determination item as the question sentence.

また、本実施形態の質問生成方法において、前記情報処理装置が、前記頻出単語向け質問木における所定段の分岐判定項目で前記候補説明文を区分しきれていない場合、前記候補説明文のうち、前記優先候補説明文以外の非優先候補説明文を特定し、前記マトリクスにおける前記非優先候補説明文に関する記述に基づき、前記所定段の分岐判定項目を起点にした決定木分析を実行して、非頻出単語向け質問木を生成する、としてもよい。 Further, in the question generation method of the present embodiment, when the information processing apparatus cannot completely classify the candidate explanatory text in the branch determination item of the predetermined stage in the question tree for frequently-used words, the candidate explanatory text is among the candidate explanatory texts. A non-priority candidate description other than the priority candidate description is specified, and based on the description of the non-priority candidate description in the matrix, a decision tree analysis starting from the branch determination item of the predetermined stage is executed to perform non-priority candidate description. You may generate a question tree for frequently-used words.

また、本実施形態の質問生成方法において、前記情報処理装置が、前記各説明文のうち、所定期間において回答として出力された実績が所定基準以上の頻出回答であるものに関する形態素解析結果に基づき、前記頻出回答における各単語の有無を記述した頻出回答向けマトリクスの生成を行い、所定端末から問い合わせを受信し、当該問い合わせが含むキーワードに基づき、前記頻出回答のうち前記キーワードを含むものを優先候補説明文として特定し、前記頻出回答向けマトリクスにおける前記優先候補説明文に関する記述に基づき決定木分析を実行して、分岐判定項目を質問文とした頻出回答向け質問木を生成する、としてもよい。 Further, in the question generation method of the present embodiment, the information processing apparatus is based on the morphological analysis result of each of the above-mentioned explanatory texts, in which the actual result output as an answer in a predetermined period is a frequent answer of a predetermined standard or more. A matrix for frequent answers that describes the presence or absence of each word in the frequent answers is generated, an inquiry is received from a predetermined terminal, and based on the keywords included in the inquiry, those including the keywords are given as priority candidate explanations. It may be specified as a sentence, and a decision tree analysis may be executed based on the description regarding the priority candidate explanatory statement in the frequently-appearing answer matrix to generate a question tree for frequent-appearing answers using a branch determination item as a question sentence.

また、本実施形態の質問生成方法において、前記情報処理装置が、前記頻出回答向け質問木における所定段の分岐判定項目で前記優先候補説明文を区分しきれていない場合、前記各説明文のうち、前記頻出回答以外の非頻出回答であるものに関する形態素解析結果に基づき、前記非頻出回答における各単語の有無を記述した非頻出回答向けマトリクスの生
成を行い、前記非頻出回答向けマトリクスにおける前記非頻出回答に関する記述に基づき、前記所定段の分岐判定項目を起点にした決定木分析を実行して、非頻出回答向け質問木を生成する、としてもよい。 Further, in the question generation method of the present embodiment, when the information processing apparatus cannot completely classify the priority candidate explanatory text in the branch determination item of the predetermined stage in the question tree for frequent answers, among the above-mentioned explanatory texts. Based on the morphological analysis results for non-frequent answers other than the frequent answers, a matrix for non-frequent answers describing the presence or absence of each word in the non-frequent answers is generated, and the non-frequent answers in the non-frequent answers matrix. Based on the description of the frequent answer, the decision tree analysis starting from the branch determination item of the predetermined stage may be executed to generate the question tree for the non-frequent answer.

１０ネットワーク
１００質問生成装置
１０１記憶部
１０２プログラム
１０３メモリ
１０４演算部
１０５入力部
１０６出力部
１０７通信部
１２５情報
１２６情報
１２７情報
２００ユーザ端末 10 Network 100 Question generator 101 Storage unit 102 Program 103 Memory 104 Calculation unit 105 Input unit 106 Output unit 107 Communication unit 125 Information 126 Information 127 Information 200 User terminal

Claims

A storage unit that holds each explanation for each predetermined item,
Based on the morphological analysis result of each explanation, the process of generating a matrix describing the presence or absence of each word in each explanation, and the inquiry received from a predetermined terminal, each explanation is based on the keyword included in the inquiry. A process of specifying a sentence containing the keyword as a candidate explanation sentence, and a process of executing a decision tree analysis based on the description of the candidate explanation sentence in the matrix and generating a question tree using a branch judgment item as a question sentence. And the arithmetic unit that executes
A question generator characterized by being equipped with.

The arithmetic unit
When the number of steps of the generated question tree exceeds the predetermined standard,
The process of notifying the predetermined terminal of the request message of the additional keyword and further executing the process of acquiring the additional keyword from the predetermined terminal is executed.
In specifying the candidate explanation, based on the keyword and the additional keyword, the one containing the keyword and the additional keyword among the explanations is specified as a new candidate explanation.
When generating the question tree, a decision tree analysis is executed based on the description regarding the new candidate explanation in the matrix, and a new question tree with the branch determination item as the question is generated.
The acquisition of the additional keyword, the identification of the new candidate explanation, and the generation of the new question tree are repeated until the number of stages of the new question tree does not exceed the predetermined standard.
The question generator according to claim 1.

The arithmetic unit
Among the candidate explanations, those containing words that frequently appear across each candidate explanation are specified as priority candidate explanations.
When generating the question tree, a decision tree analysis is executed based on the description regarding the priority candidate explanatory text in the matrix, and a question tree for frequently-used words with the branch determination item as the question sentence is generated.
The question generator according to claim 1.

The arithmetic unit
When the candidate explanations are not completely classified by the branch determination items in the predetermined stage in the question tree for frequently-used words, the non-priority candidate explanations other than the priority candidate explanations are specified among the candidate explanations.
Based on the description of the non-priority candidate explanatory text in the matrix, a decision tree analysis starting from the branch determination item of the predetermined stage is executed to generate a question tree for non-frequent words.
The question generator according to claim 3, wherein the question generator is characterized by the above.

The arithmetic unit
Generation of a matrix for frequent answers that describes the presence or absence of each word in the frequent answers based on the morphological analysis results of each of the above explanations for which the actual results output as answers in a predetermined period are frequent answers exceeding a predetermined standard. And
Upon receiving an inquiry from a predetermined terminal, based on the keyword included in the inquiry, the frequently-used answers including the keyword are specified as priority candidate explanations.
A decision tree analysis is executed based on the description regarding the priority candidate explanatory text in the matrix for frequent answers, and a question tree for frequent answers with the branch determination item as the question text is generated.
The question generator according to claim 1.

The arithmetic unit
When the priority candidate explanation is not completely classified by the branch judgment item in the predetermined stage in the question tree for frequent answers.
Based on the morphological analysis results of the non-frequent answers other than the frequent answers among the above explanations, a matrix for the non-frequent answers describing the presence or absence of each word in the non-frequent answers was generated.
Based on the description of the non-frequent answers in the matrix for non-frequent answers, a decision tree analysis starting from the branch determination item of the predetermined stage is executed to generate a question tree for non-frequent answers.
The question generator according to claim 5.

An information processing device equipped with a storage unit that holds each explanation for each predetermined item
Based on the morphological analysis result of each explanation, a process of generating a matrix describing the presence or absence of each word in each explanation, and a process of generating a matrix.
A process of receiving an inquiry from a predetermined terminal and specifying, among the above-mentioned explanatory texts, those including the keyword as candidate explanatory texts based on the keywords included in the inquiry.
A process of executing a decision tree analysis based on the description of the candidate explanation in the matrix and generating a question tree using the branch determination item as a question.
A question generation method characterized by running.

The information processing device
When the number of steps of the generated question tree exceeds the predetermined standard,
The process of notifying the predetermined terminal of the request message of the additional keyword and further executing the process of acquiring the additional keyword from the predetermined terminal is executed.
In specifying the candidate explanation, based on the keyword and the additional keyword, the one containing the keyword and the additional keyword among the explanations is specified as a new candidate explanation.
When generating the question tree, a decision tree analysis is executed based on the description regarding the new candidate explanation in the matrix, and a new question tree with the branch determination item as the question is generated.
The acquisition of the additional keyword, the identification of the new candidate explanation, and the generation of the new question tree are repeated until the number of stages of the new question tree does not exceed the predetermined standard.
The question generation method according to claim 7, wherein the question is generated.

The information processing device
Among the candidate explanations, those containing words that frequently appear across each candidate explanation are specified as priority candidate explanations.
When generating the question tree, a decision tree analysis is executed based on the description regarding the priority candidate explanatory text in the matrix, and a question tree for frequently-used words with the branch determination item as the question sentence is generated.
The question generation method according to claim 7, wherein the question is generated.

The information processing device
When the candidate explanations are not completely classified by the branch determination items in the predetermined stage in the question tree for frequently-used words, the non-priority candidate explanations other than the priority candidate explanations are specified among the candidate explanations.
Based on the description of the non-priority candidate explanatory text in the matrix, a decision tree analysis starting from the branch determination item of the predetermined stage is executed to generate a question tree for non-frequent words.
The question generation method according to claim 9, wherein the question is generated.

The information processing device
Generation of a matrix for frequent answers that describes the presence or absence of each word in the frequent answers based on the morphological analysis results of each of the above explanations for which the actual results output as answers in a predetermined period are frequent answers exceeding a predetermined standard. And
Upon receiving an inquiry from a predetermined terminal, based on the keyword included in the inquiry, the frequently-used answers including the keyword are specified as priority candidate explanations.
A decision tree analysis is executed based on the description regarding the priority candidate explanatory text in the matrix for frequent answers, and a question tree for frequent answers with the branch determination item as the question text is generated.
The question generation method according to claim 7, wherein the question is generated.

The information processing device
When the priority candidate explanation is not completely classified by the branch judgment item in the predetermined stage in the question tree for frequent answers.
Based on the morphological analysis results of the non-frequent answers other than the frequent answers among the above explanations, a matrix for the non-frequent answers describing the presence or absence of each word in the non-frequent answers was generated.
Based on the description of the non-frequent answers in the matrix for non-frequent answers, a decision tree analysis starting from the branch determination item of the predetermined stage is executed to generate a question tree for non-frequent answers.
The question generation method according to claim 11, wherein the question is generated.