JP2021005391A

JP2021005391A - Text monitoring system, method for monitoring text, and program

Info

Publication number: JP2021005391A
Application number: JP2020149529A
Authority: JP
Inventors: 康高山本; Yasutaka Yamamoto; 貴士大西; Takashi Onishi; 享赤峯; Susumu Akamine
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2019-05-08
Filing date: 2020-09-07
Publication date: 2021-01-14
Anticipated expiration: 2035-03-18
Also published as: JP6954426B2

Abstract

To provide an analysis support system, a method for supporting analysis, and a program which can increase accuracy of detecting an event in a monitor using a text.SOLUTION: An analysis support system (monitoring system 1) includes: text acquiring means 10 for acquiring first texts; reception means for receiving designation of second texts; monitoring means 70 for monitoring to determine whether or not the number of ones of the acquired first texts that include designated second texts satisfies a predetermined condition; and display control means 80 for displaying a screen which can show that a predetermined condition is satisfied when the condition is satisfied. A relation between the second texts and the first texts including the second texts shows that the content of the second texts is true if the content of the first texts is true.SELECTED DRAWING: Figure 2

Description

本発明は、テキスト監視システム、テキスト監視方法、及び、記録媒体に関し、特に、テキストを用いて監視を行うテキスト監視システム、テキスト監視方法、及び、記録媒体に関する。 The present invention relates to a text monitoring system, a text monitoring method, and a recording medium, and more particularly to a text monitoring system, a text monitoring method, and a recording medium for monitoring using text.

収集したテキストから、製品に対する不具合報告や、クレーム等の事象の発生を監視する技術として、特定のキーワードを含むテキストの数を監視する方法が知られている。例えば、特許文献１には、収集した文書から、風評規則に従って、風評表現を抽出する技術が開示されている。特定のキーワードを含むテキストの監視では、予め、監視対象の事象のキーワードを定義する必要がある。 A method of monitoring the number of texts containing a specific keyword is known as a technique for monitoring the occurrence of an event such as a defect report or a complaint for a product from the collected texts. For example, Patent Document 1 discloses a technique for extracting a rumor expression from a collected document in accordance with a rumor rule. In monitoring text containing a specific keyword, it is necessary to define the keyword of the event to be monitored in advance.

キーワードの定義が不要な監視技術として、例えば、テキスト間のキーワードの共有度合いをもとに、テキストをクラスタに分類し、各クラスタに分類されたテキストの数を監視する方法が考えられる。 As a monitoring technique that does not require the definition of keywords, for example, a method of classifying texts into clusters based on the degree of sharing of keywords between texts and monitoring the number of texts classified in each cluster can be considered.

しかしながら、一般に、監視対象のテキストには、複数の観点が混在していることがある。このため、キーワードの共有度合いをもとにテキストの分類を行っても、観点の見落とし、或いは、異なる観点のテキストの同じクラスタへの分類等により、各クラスタの観点が不明確になることがある。したがって、キーワードの共有度合いをもとに生成されたクラスタを用いてテキストの監視をした場合、事象の検出精度が低いことがある。 However, in general, the text to be monitored may have a mixture of viewpoints. For this reason, even if texts are classified based on the degree of sharing of keywords, the viewpoints of each cluster may become unclear due to oversight of viewpoints or classification of texts from different viewpoints into the same cluster. .. Therefore, when text is monitored using a cluster generated based on the degree of keyword sharing, the event detection accuracy may be low.

図２５は、キーワードの共有度合いをもとにしたクラスタリング結果の例を示す図である。図２５のテキストは、自動車の不具合に係るテキストである。これらのテキストは、「不具合」、「発進」、「周辺」等のキーワードを共有しているため、同じクラスタに分類される。しかしながら、このクラスタに分類されるテキストの数が増加したときにアラートが通知されても、クラスタ内に含まれるテキストは同じ事象に係るテキストではないため、ユーザは、どのような事象が増加したかを正確に把握できない。 FIG. 25 is a diagram showing an example of clustering results based on the degree of sharing of keywords. The text in FIG. 25 is a text relating to a malfunction of an automobile. These texts are classified into the same cluster because they share keywords such as "fault", "start", and "periphery". However, even if an alert is notified when the number of texts classified in this cluster increases, the text contained in the cluster is not the text related to the same event, so the user can see what kind of event has increased. I can't figure out exactly.

なお、関連技術として、非特許文献１には、テキスト間の含意関係を抽出し、含意関係があるテキストを同じクラスタに分類する、含意クラスタリング技術が開示されている。また、特許文献２には、テキスト間の含意関係をもとに、含意関係を表す含意グラフを生成する技術が開示されている。特許文献３には、対話テキストの集合から発話を抽出し、含意関係がある発話を発話クラスタとして抽出する技術が開示されている。 As a related technique, Non-Patent Document 1 discloses an implication clustering technique for extracting implications between texts and classifying the texts having implications into the same cluster. Further, Patent Document 2 discloses a technique for generating an implication graph showing an implication relationship based on the implication relationship between texts. Patent Document 3 discloses a technique of extracting utterances from a set of dialogue texts and extracting utterances having an implication relationship as utterance clusters.

特開２００３−２７１６０９号公報Japanese Unexamined Patent Publication No. 2003-271609 特許第５４９４９９９号公報Japanese Patent No. 5494999 特開２０１３−１９０９９１号公報Japanese Unexamined Patent Publication No. 2013-19991

「NEC、大量の文書データを同じ意味で自動グループ化する技術を開発」、[online]、日本電気株式会社、[平成27年2月9日検索]、インターネット<URL:http://jpn.nec.com/press/201411/20141118_02.html>"NEC develops technology to automatically group a large amount of document data with the same meaning", [online], NEC Corporation, [Search on February 9, 2015], Internet <URL: http: // jpn. nec.com/press/201411/20141118_02.html>

上述のように、キーワードの共有度合いをもとに生成されたクラスタを用いて事象の監視を行った場合、事象の検出精度が低いという技術課題があった。 As described above, when event monitoring is performed using a cluster generated based on the degree of keyword sharing, there is a technical problem that the event detection accuracy is low.

本発明の目的は、上述の技術課題を解決し、テキストを用いた監視において、事象の検出精度を向上できる、テキスト監視システム、テキスト監視方法、及び、記録媒体を提供することである。 An object of the present invention is to provide a text monitoring system, a text monitoring method, and a recording medium capable of solving the above-mentioned technical problems and improving the detection accuracy of an event in monitoring using text.

本発明の一態様における分析支援システムは、第１のテキストを取得するテキスト取得手段と、第２のテキストの指定を受け付ける受付手段と、前記取得した第１のテキストのうち、前記指定を受け付けた第２のテキストを含意する第１のテキストの数が所定の条件を満たすか否かを監視する監視手段と、前記所定の条件を満たす場合に、前記所定の条件を満たすことが把握可能な画面を表示する表示制御手段と、を備え、前記第２のテキストと、前記第２のテキストを含意する前記第１のテキストとの関係は、前記第１のテキストの内容が真であるならば前記第２のテキストの内容が真である、ことを示す。 The analysis support system according to one aspect of the present invention has received the designation of the text acquisition means for acquiring the first text, the reception means for accepting the designation of the second text, and the acquired first text. A monitoring means for monitoring whether or not the number of the first texts including the second text satisfies a predetermined condition, and a screen capable of grasping that the predetermined condition is satisfied when the predetermined condition is satisfied. The relationship between the second text and the first text implying the second text is such that if the content of the first text is true, the relationship between the second text and the first text is provided. Indicates that the content of the second text is true.

本発明の一態様における分析支援方法は、コンピュータに具備されたテキスト取得手段が、第１のテキストを取得し、前記コンピュータに具備された受付手段が、第２のテキストの指定を受け付け、前記コンピュータに具備された監視手段が、前記取得した第１のテキストのうち、前記指定を受け付けた第２のテキストを含意する第１のテキストの数が所定の条件を満たすか否かを監視し、前記コンピュータに具備された表示制御手段が、前記所定の条件を満たす場合に、前記所定の条件を満たすことが把握可能な画面を表示する分析支援方法であって、前記第２のテキストと、前記第２のテキストを含意する前記第１のテキストとの関係は、前記第１のテキストの内容が真であるならば前記第２のテキストの内容が真である、ことを示す。 In the analysis support method according to one aspect of the present invention, the text acquisition means provided in the computer acquires the first text, the reception means provided in the computer receives the designation of the second text, and the computer. The monitoring means provided in the above monitors whether or not the number of the first texts including the second text that has received the designation among the acquired first texts satisfies a predetermined condition. An analysis support method for displaying a screen on which it is possible to grasp that the predetermined condition is satisfied when the display control means provided in the computer satisfies the predetermined condition, the second text and the second text. The relationship with the first text, which implies the second text, indicates that if the content of the first text is true, then the content of the second text is true.

本発明の一態様におけるプログラムは、コンピュータに、第１のテキストを取得し、第２のテキストの指定を受け付け、前記取得した第１のテキストのうち、前記指定を受け付けた第２のテキストを含意する第１のテキストの数が所定の条件を満たすか否かを監視し、前記所定の条件を満たす場合に、前記所定の条件を満たすことが把握可能な画面を表示する処理を実行させるプログラムであって、前記第２のテキストと、前記第２のテキストを含意する前記第１のテキストとの関係は、前記第１のテキストの内容が真であるならば前記第２のテキストの内容が真である、ことを示す。 The program according to one aspect of the present invention acquires the first text, accepts the designation of the second text, and includes the second text of the acquired first text that has accepted the designation. A program that monitors whether or not the number of first texts to be used satisfies a predetermined condition, and if the predetermined condition is satisfied, executes a process of displaying a screen that can be grasped that the predetermined condition is satisfied. The relationship between the second text and the first text implying the second text is that if the content of the first text is true, the content of the second text is true. Indicates that.

本発明の技術効果は、テキストを用いた監視において、事象の検出精度を向上できることである。 The technical effect of the present invention is that the accuracy of event detection can be improved in monitoring using text.

本発明の第１の実施の形態の基本的な構成を示すブロック図である。It is a block diagram which shows the basic structure of the 1st Embodiment of this invention. 本発明の第１の実施の形態における、監視システム１の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring system 1 in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、コンピュータにより実現された監視システム１の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring system 1 realized by the computer in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、監視システム１の動作を示すフローチャートである。It is a flowchart which shows the operation of the monitoring system 1 in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、テキストデータの例を示す図である。It is a figure which shows the example of the text data in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、含意関係の抽出結果の例を示す図である。It is a figure which shows the example of the extraction result of the implication relation in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、代表テキスト情報の例を示す図である。It is a figure which shows the example of the representative text information in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、各代表テキストを含意するテキストの数の例である。It is an example of the number of texts implying each representative text in the first embodiment of the present invention. 本発明の第１の実施の形態における、通知画面９０の例を示す図である。It is a figure which shows the example of the notification screen 90 in the 1st Embodiment of this invention. 本発明の第１の実施の形態における、監視期間毎の、各代表テキストを含意するテキストの数の他の例である。It is another example of the number of texts implying each representative text for each monitoring period in the first embodiment of the present invention. 本発明の第２の実施の形態における、監視システム１の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring system 1 in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における、テキストデータの例を示す図である。It is a figure which shows the example of the text data in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における、含意関係の抽出結果の例を示す図である。It is a figure which shows the example of the extraction result of the implication relation in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における、代表テキスト情報の例を示す図である。It is a figure which shows the example of the representative text information in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における、対象種別情報の例を示す図である。It is a figure which shows the example of the object type information in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における、各種別の代表テキストを含意するテキストの数の合計の例である。It is an example of the total number of texts implying different representative texts in the second embodiment of the present invention. 本発明の第２の実施の形態における、通知画面１００の例を示す図である。It is a figure which shows the example of the notification screen 100 in the 2nd Embodiment of this invention. 本発明の第３の実施の形態における、監視システム１の動作を示すフローチャートである。It is a flowchart which shows the operation of the monitoring system 1 in the 3rd Embodiment of this invention. 本発明の第３の実施の形態における、テキストデータの例を示す図である。It is a figure which shows the example of the text data in the 3rd Embodiment of this invention. 本発明の第３の実施の形態における、含意関係の抽出結果の例を示す図である。It is a figure which shows the example of the extraction result of the implication relation in the 3rd Embodiment of this invention. 本発明の第３の実施の形態における、代表テキスト情報の例を示す図である。It is a figure which shows the example of the representative text information in the 3rd Embodiment of this invention. 本発明の第３の実施の形態における、各代表テキストを含意するテキストの数の例である。It is an example of the number of texts implying each representative text in the third embodiment of the present invention. 本発明の第３の実施の形態における、比較画面１１０の例を示す図である。It is a figure which shows the example of the comparison screen 110 in the 3rd Embodiment of this invention. 本発明の実施の形態における、代表テキストと要素テキストの関係の例を示す図である。It is a figure which shows the example of the relationship between the representative text and the element text in embodiment of this invention. キーワードの共有度合いをもとにしたクラスタリング結果の例を示す図である。It is a figure which shows the example of the clustering result based on the degree of sharing of a keyword.

はじめに、本発明の実施の形態で用いるテキストのクラスタリング手法である、含意クラスタリングについて説明する。含意クラスタリングでは、非特許文献１に記載されているように、テキスト間の意味の関係である、含意関係をもとにクラスタリングを行う。本発明の実施の形態では、含意関係を、特許文献２と同様に、次のように定義する。すなわち、第１のテキストの内容が真であるならば第２のテキストの内容が真である場合、第１のテキストが第２のテキストを含意（entailment）すると定義する。また、第１のテキストの内容から第２のテキストの内容が読み取れる場合、第１のテキストが第２のテキストを含意すると定義してもよい。含意クラスタリングを用いることにより、分析対象のテキストに含まれる観点をもれなく、かつ、クラスタ内のテキストが共通に含意し、クラスタの概要を表す代表テキストとともに抽出できる。 First, implication clustering, which is a text clustering method used in the embodiment of the present invention, will be described. In implication clustering, as described in Non-Patent Document 1, clustering is performed based on the implication relationship, which is the relationship of meaning between texts. In the embodiment of the present invention, the implication relationship is defined as follows, as in Patent Document 2. That is, if the content of the first text is true, then if the content of the second text is true, then the first text is defined as entailing the second text. Further, when the content of the second text can be read from the content of the first text, it may be defined that the first text implies the second text. By using implication clustering, it is possible to extract all the viewpoints included in the text to be analyzed, the texts in the cluster have common implications, and the representative text representing the outline of the cluster.

含意関係の理解を容易にするため、具体例を用いて説明する、
＜具体例１＞
第１のテキスト：オバマ大統領はホワイトハウスに住んでいる。
第２のテキスト：オバマ大統領はアメリカに住んでいる。 In order to facilitate understanding of implications, explanations will be given using specific examples.
<Specific example 1>
First text: President Obama lives in the White House.
Second text: President Obama lives in the United States.

この場合、第１のテキストの内容が真であるならば第２のテキストの内容が真であるので、第１のテキストが第２のテキストを含意するといえる。 In this case, if the content of the first text is true, then the content of the second text is true, so it can be said that the first text implies the second text.

＜具体例２＞
第１のテキスト：犬養毅首相は海軍将校らに暗殺された。
第２のテキスト：犬養毅首相は亡くなった
この場合、第１のテキストの内容が真であるならば第２のテキストの内容が真であるので、第１のテキストが第２のテキストを含意するといえる。 <Specific example 2>
First text: Prime Minister Tsuyoshi Inukai was assassinated by naval officers.
Second text: Prime Minister Tsuyoshi Inukai died In this case, if the content of the first text is true, then the content of the second text is true, so if the first text implies the second text I can say.

ここで、「代表テキスト」と「要素テキスト」を定義する。テキストの集合に対して含意クラスタリング処理を実行すると、代表テキストと要素テキストとが決定される。代表テキストと要素テキストとの関係は、要素テキストの内容が真であるならば代表テキストの内容が真である、という関係である。すなわち、代表テキストと要素テキストとの関係は、要素テキストは代表テキストを含意するという関係である。 Here, "representative text" and "element text" are defined. When the implication clustering process is executed on a set of texts, the representative text and the element texts are determined. The relationship between the representative text and the element text is that if the content of the element text is true, then the content of the representative text is true. That is, the relationship between the representative text and the element text is that the element text implies the representative text.

図２４は、本発明の実施の形態における、代表テキストと要素テキストの関係の例を示す図である。代表テキストと要素テキストの理解を容易にするため、図２４を用いて説明する。図２４は、Ｔ１からＴ１１までの１１個のテキストについて、含意クラスタリング処理を実行した様子を示す。図２４における円形のシンボルは一つのテキストを示す。図２４における矢印は、矢印の元のテキストが矢印の先のテキストを含意することを示す。図２４において、テキストＴ６、Ｔ７、Ｔ１１が、テキストＴ１を含意している。同様に、テキストＴ２、Ｔ３、Ｔ７、Ｔ１０が、テキストＴ５を含意しており、テキストＴ２、Ｔ４、Ｔ７、Ｔ８が、テキストＴ９を含意している。このとき、テキストＴ６、Ｔ７、Ｔ１１は、代表テキストＴ１の要素テキストである。同様に、テキストＴ２、Ｔ３、Ｔ７、Ｔ１０は、代表テキストＴ５の要素テキストである。同様に、テキストＴ２、Ｔ４、Ｔ７、Ｔ８は、代表テキストＴ９の要素テキストである。 FIG. 24 is a diagram showing an example of the relationship between the representative text and the element text in the embodiment of the present invention. In order to facilitate the understanding of the representative text and the element text, the description will be made with reference to FIG. FIG. 24 shows how the implication clustering process was executed for 11 texts from T1 to T11. The circular symbol in FIG. 24 indicates a piece of text. The arrow in FIG. 24 indicates that the original text of the arrow implies the text at the tip of the arrow. In FIG. 24, the texts T6, T7, T11 imply text T1. Similarly, texts T2, T3, T7, and T10 imply text T5, and texts T2, T4, T7, and T8 imply text T9. At this time, the texts T6, T7, and T11 are element texts of the representative text T1. Similarly, the texts T2, T3, T7, and T10 are element texts of the representative text T5. Similarly, the texts T2, T4, T7, and T8 are element texts of the representative text T9.

ここで、代表テキスト自身が要素テキストとして扱われてもよい。例えば、テキストＴ１、Ｔ６、Ｔ７、Ｔ１１が代表テキストＴ１の要素テキストでもよい。 Here, the representative text itself may be treated as an element text. For example, the texts T1, T6, T7, and T11 may be element texts of the representative text T1.

（第１の実施の形態）
本発明の第１の実施の形態について説明する。 (First Embodiment)
The first embodiment of the present invention will be described.

はじめに、本発明の第１の実施の形態の構成を説明する。 First, the configuration of the first embodiment of the present invention will be described.

図２は、本発明の第１の実施の形態における、監視システム１の構成を示すブロック図である。 FIG. 2 is a block diagram showing the configuration of the monitoring system 1 according to the first embodiment of the present invention.

図２を参照すると、本発明の第１の実施の形態における監視システム１は、テキスト取得部１０、テキスト記憶部２０、含意関係抽出部３０、代表テキスト抽出部４０、代表テキスト記憶部５０、判定部６０、監視部７０、及び、表示制御部８０を含む。監視システム１は、本発明のテキスト監視システムの一実施形態である。 Referring to FIG. 2, the monitoring system 1 according to the first embodiment of the present invention includes a text acquisition unit 10, a text storage unit 20, an implication relationship extraction unit 30, a representative text extraction unit 40, a representative text storage unit 50, and a determination. A unit 60, a monitoring unit 70, and a display control unit 80 are included. The monitoring system 1 is an embodiment of the text monitoring system of the present invention.

テキスト取得部１０は、監視対象のテキストを取得する。 The text acquisition unit 10 acquires the text to be monitored.

テキスト記憶部２０は、テキスト取得部１０が取得したテキストを示すテキストデータを記憶する。 The text storage unit 20 stores text data indicating the text acquired by the text acquisition unit 10.

図５は、本発明の第１の実施の形態における、テキストデータの例を示す図である。図５の例は、監視対象のテキストが、自動車の不具合報告における「現象」に係る、自然言語のテキストの場合の例である。テキストデータは、テキストの取得日時、及び、テキストを含む。なお、テキストの前の括弧内の符号は、テキストの識別子を示す。 FIG. 5 is a diagram showing an example of text data in the first embodiment of the present invention. The example of FIG. 5 is an example in which the text to be monitored is a natural language text related to the "phenomenon" in the defect report of the automobile. The text data includes the acquisition date and time of the text and the text. The code in parentheses before the text indicates the text identifier.

監視対象のテキストは、例えば、文書（不具合報告書等）から抽出される。この場合、テキストは、例えば、所定の形式に従って、複数のカテゴリ（不具合の現象、原因、対策等）毎に記載された文書中の、監視対象のカテゴリ（現象）に対する記載を取得することにより抽出される。また、テキストは、自由形式で記述された文書から、監視対象のカテゴリに係る記載部分を特定することにより抽出されてもよい。また、テキストは、例えば、コールセンタ等における会話を音声認識することにより生成した、コールログから抽出されてもよい。また、テキストは、口コミサイトや、ブログ、ＳＮＳ（ソーシャルネットワーキングサービス）から抽出されてもよい。 The text to be monitored is extracted from, for example, a document (defect report, etc.). In this case, the text is extracted, for example, by acquiring the description for the category (phenomenon) to be monitored in the document described for each of a plurality of categories (phenomenon, cause, countermeasure, etc. of the defect) according to a predetermined format. Will be done. In addition, the text may be extracted from the document described in free form by specifying the description part related to the category to be monitored. Further, the text may be extracted from a call log generated by voice recognition of a conversation in a call center or the like, for example. In addition, the text may be extracted from word-of-mouth sites, blogs, and SNS (social networking services).

含意関係抽出部３０は、監視対象のテキストの内、含意関係抽出対象のテキスト（以下、抽出対象テキストとも記載する）を用いて、含意関係を抽出する。ここで、抽出対象テキストとして、例えば、複数の所定の長さの監視期間（定期的な監視期間）の内の最初の監視期間等、特定の監視期間の全テキストや、特定の監視期間の一部（所定数や所定の部分期間）のテキストが用いられる。 The implication relationship extraction unit 30 extracts the implication relationship by using the text of the implication relationship extraction target (hereinafter, also referred to as the extraction target text) among the texts to be monitored. Here, as the text to be extracted, for example, all texts of a specific monitoring period such as the first monitoring period in a plurality of monitoring periods of a predetermined length (regular monitoring period), or one of the specific monitoring periods. The text of the part (predetermined number or predetermined subperiod) is used.

代表テキスト抽出部４０は、抽出された含意関係から代表テキストを抽出し、代表テキスト、及び、代表テキストを含意する要素テキストが設定されたクラスタを生成する。 The representative text extraction unit 40 extracts the representative text from the extracted implication relation, and generates a cluster in which the representative text and the element text including the representative text are set.

代表テキスト記憶部５０は、代表テキスト抽出部４０が生成したクラスタに係る情報を示す代表テキスト情報を記憶する。 The representative text storage unit 50 stores representative text information indicating information related to the cluster generated by the representative text extraction unit 40.

判定部６０は、監視期間毎に、新たに取得された（テキストデータに追加された）各テキストが、各代表テキストを含意するか（各クラスタに属するか）どうかを判定する。 The determination unit 60 determines, for each monitoring period, whether each newly acquired text (added to the text data) implies each representative text (belongs to each cluster).

監視部７０は、監視期間毎に、各代表テキストを含意するテキストの数を監視し、監視結果を、表示制御部８０を介して出力する。本発明の第１の実施の形態では、監視結果の出力として、代表テキストを含意するテキストの数が所定の通知条件を満たした場合に、通知を行う。ここで、通知条件として、例えば、テキストの数に係る下限の閾値（テキストの数が閾値以上であれば通知）が用いられる。 The monitoring unit 70 monitors the number of texts including each representative text for each monitoring period, and outputs the monitoring result via the display control unit 80. In the first embodiment of the present invention, as the output of the monitoring result, a notification is given when the number of texts including the representative text satisfies a predetermined notification condition. Here, as the notification condition, for example, a lower threshold value related to the number of texts (notification if the number of texts is equal to or greater than the threshold value) is used.

表示制御部８０は、監視結果（通知内容）を表示するための通知画面９０を生成し、ユーザ等に表示する。 The display control unit 80 generates a notification screen 90 for displaying the monitoring result (notification content) and displays it to the user or the like.

なお、監視システム１は、ＣＰＵ（Central Processing Unit）とプログラムを記憶した記憶媒体を含み、プログラムにもとづく制御によって動作するコンピュータであってもよい。 The monitoring system 1 may be a computer that includes a CPU (Central Processing Unit) and a storage medium that stores a program, and operates under control based on the program.

図３は、本発明の第１の実施の形態における、コンピュータにより実現された監視システム１の構成を示すブロック図である。 FIG. 3 is a block diagram showing a configuration of a monitoring system 1 realized by a computer according to the first embodiment of the present invention.

監視システム１は、ＣＰＵ２、ハードディスクやメモリ等の記憶デバイス３（記憶媒体）、他の装置等と通信を行う通信デバイス４、マウスやキーボード等の入力デバイス５、及び、ディスプレイ等の出力デバイス６を含む。 The monitoring system 1 includes a CPU 2, a storage device 3 (storage medium) such as a hard disk or memory, a communication device 4 that communicates with other devices, an input device 5 such as a mouse or keyboard, and an output device 6 such as a display. Including.

ＣＰＵ２は、テキスト取得部１０、含意関係抽出部３０、代表テキスト抽出部４０、判定部６０、監視部７０、及び、表示制御部８０の機能を実現するためのコンピュータプログラムを実行する。記憶デバイス３は、テキスト記憶部２０、及び、代表テキスト記憶部５０のデータを記憶する。出力デバイス６は、ユーザ等へ、通知画面９０を出力する。入力デバイス５は、ユーザ等から、例えば、通知条件等の指定を受け付ける。また、通信デバイス４が、他の装置等へ通知画面９０を出力してもよい。 The CPU 2 executes a computer program for realizing the functions of the text acquisition unit 10, the implication relationship extraction unit 30, the representative text extraction unit 40, the determination unit 60, the monitoring unit 70, and the display control unit 80. The storage device 3 stores the data of the text storage unit 20 and the representative text storage unit 50. The output device 6 outputs a notification screen 90 to the user or the like. The input device 5 receives, for example, designation of notification conditions and the like from the user and the like. Further, the communication device 4 may output the notification screen 90 to another device or the like.

また、図２に示された監視システム１の各構成要素は、独立した論理回路でもよい。 Further, each component of the monitoring system 1 shown in FIG. 2 may be an independent logic circuit.

また、図２に示された監視システム１の各構成要素は、有線または無線で接続された複数の物理的な装置に分散的に配置されていてもよい。 Further, each component of the monitoring system 1 shown in FIG. 2 may be distributedly arranged in a plurality of physical devices connected by wire or wirelessly.

次に、本発明の第１の実施の形態の動作を説明する。 Next, the operation of the first embodiment of the present invention will be described.

ここでは、テキスト取得部１０が、例えば、図示しない記憶部等に定期的にアクセスすることにより、不具合報告に係る文書から監視対象のテキストを抽出し、図５のようなテキストデータを、テキスト記憶部２０に保存していると仮定する。 Here, the text acquisition unit 10 extracts the text to be monitored from the document related to the defect report by periodically accessing, for example, a storage unit (not shown), and stores the text data as shown in FIG. 5 as text. It is assumed that it is stored in part 20.

また、テキストの監視は監視期間（１ヶ月）毎に行われると仮定する。さらに、含意関係抽出対象のテキスト（抽出対象テキスト）は、最初の監視期間のテキストの内の、先頭の所定数のテキストであると仮定する。 It is also assumed that text monitoring is performed every monitoring period (1 month). Further, it is assumed that the text to be extracted with implications (text to be extracted) is the first predetermined number of texts in the text of the first monitoring period.

図４は、本発明の第１の実施の形態における、監視システム１の動作を示すフローチャートである。 FIG. 4 is a flowchart showing the operation of the monitoring system 1 according to the first embodiment of the present invention.

はじめに、含意関係抽出部３０は、最初の監視期間において、テキスト記憶部２０に保存されている、抽出対象テキスト間の含意関係を抽出する（ステップＳ１０１）。 First, the implication relationship extraction unit 30 extracts the implication relationship between the texts to be extracted stored in the text storage unit 20 during the first monitoring period (step S101).

ここで、含意関係抽出部３０は、例えば、特許文献２と同様の判定処理を行うことにより、テキスト間の含意関係を抽出する。この場合、含意関係抽出部３０は、テキストに含まれる内容語を比較し、被覆率を算出することにより、含意関係の有無を判定する。なお、含意関係抽出部３０は、テキスト間の含意関係を抽出できれば、特許文献２と異なる判定処理により、テキスト間の含意関係を判定してもよい。 Here, the implication relationship extraction unit 30 extracts the implication relationship between texts by, for example, performing the same determination processing as in Patent Document 2. In this case, the implication relationship extraction unit 30 determines the presence or absence of the implication relationship by comparing the content words contained in the text and calculating the coverage ratio. If the implication relationship extraction unit 30 can extract the implication relationship between texts, the implication relationship between texts may be determined by a determination process different from that of Patent Document 2.

図６は、本発明の第１の実施の形態における、含意関係の抽出結果の例を示す図である。図６において、矢印の元のテキストは、先のテキストを含意することを示す。図６の例では、テキストＴ６、Ｔ７、Ｔ１１、…が、テキストＴ１「警告灯が点灯した」を含意している。同様に、テキストＴ２、Ｔ３、Ｔ７、Ｔ１０、…が、テキストＴ５「異音がする」を含意しており、テキストＴ２、Ｔ４、Ｔ７、Ｔ８、…が、テキストＴ９「エンストした」を含意している。 FIG. 6 is a diagram showing an example of the extraction result of the implication relationship in the first embodiment of the present invention. In FIG. 6, the original text of the arrow indicates that it implies the previous text. In the example of FIG. 6, the texts T6, T7, T11, ... Imply the text T1 "warning light turned on". Similarly, texts T2, T3, T7, T10, ... Imply text T5 "abnormal noise", and texts T2, T4, T7, T8, ... Imply text T9 "stall". ing.

例えば、含意関係抽出部３０は、図５における、取得日時が監視期間「２０１５／１」内の抽出対象テキストから、図６のような含意関係を抽出する。 For example, the implication relationship extraction unit 30 extracts the implication relationship as shown in FIG. 6 from the extraction target text whose acquisition date and time is within the monitoring period “2015/1” in FIG.

代表テキスト抽出部４０は、抽出された含意関係から代表テキストを抽出し、クラスタを生成する（ステップＳ１０２）。代表テキスト抽出部４０は、例えば、非特許文献１の技術と同様に、含意関係抽出部３０により抽出された含意関係をもとに、クラスタを生成する。ここで、抽出対象テキストの内、他のテキストにより含意されるテキストがクラスタの代表テキスト、当該代表テキストを含意するテキストが当該クラスタの要素テキストに設定される。テキストが複数の代表テキストを含意する場合、当該テキストは、複数のクラスタの要素テキストに設定される。なお、本発明の実施の形態では、あるクラスタの代表テキストに設定されたテキスト自身も、当該クラスタの代表テキストを含意する要素テキストとして設定される。代表テキスト抽出部４０は、各クラスタの代表テキストの識別子を当該クラスタの要素テキストの識別子と関連付けた代表テキスト情報を、代表テキスト記憶部５０に保存する。 The representative text extraction unit 40 extracts the representative text from the extracted implications and generates a cluster (step S102). The representative text extraction unit 40 generates clusters based on the implication relationships extracted by the implication relationship extraction unit 30, for example, as in the technique of Non-Patent Document 1. Here, among the texts to be extracted, the text implied by other texts is set as the representative text of the cluster, and the text implying the representative text is set as the element text of the cluster. If the text implies more than one representative text, the text is set to the element text of multiple clusters. In the embodiment of the present invention, the text itself set as the representative text of a certain cluster is also set as the element text implying the representative text of the cluster. The representative text extraction unit 40 stores the representative text information in which the identifier of the representative text of each cluster is associated with the identifier of the element text of the cluster in the representative text storage unit 50.

図７は、本発明の第１の実施の形態における、代表テキスト情報の例を示す図である。図７の例では、テキストＴ１「警告灯が点灯した」、Ｔ５「異音がする」、及び、Ｔ９「エンストした」が、それぞれ、クラスタＣ１、Ｃ２、及び、Ｃ３の代表テキストに設定されている。また、テキストＴ１とテキストＴ１を含意するテキストＴ６、Ｔ７、Ｔ１１…が、クラスタＣ１の要素テキストに設定されている。同様に、テキストＴ５とテキストＴ５を含意するテキストが、クラスタＣ２の要素テキストに設定され、テキストＴ９とテキストＴ９を含意するテキストが、クラスタＣ３の要素テキストに設定されている。 FIG. 7 is a diagram showing an example of representative text information in the first embodiment of the present invention. In the example of FIG. 7, the texts T1 "warning light turned on", T5 "abnormal noise", and T9 "stall" are set as the representative texts of clusters C1, C2, and C3, respectively. There is. Further, the texts T6, T7, T11 ... Implying the text T1 and the text T1 are set as the element texts of the cluster C1. Similarly, the text including the text T5 and the text T5 is set as the element text of the cluster C2, and the text including the text T9 and the text T9 is set as the element text of the cluster C3.

例えば、代表テキスト抽出部４０は、図６の含意関係をもとに、図７のような代表テキスト情報を生成する。 For example, the representative text extraction unit 40 generates the representative text information as shown in FIG. 7 based on the implication relationship of FIG.

なお、代表テキスト抽出部４０は、さらに、異なるクラスタ間の要素テキストの重複の度合いをもとに、当該異なるクラスタを一つのクラスタに統合してもよい。 The representative text extraction unit 40 may further integrate the different clusters into one cluster based on the degree of duplication of element texts between different clusters.

次に、判定部６０は、各監視期間において、テキスト記憶部２０に保存されている、当該監視期間に新たに取得された（テキストデータに追加された）各テキストが、各代表テキストを含意するかどうかを判定する（ステップＳ１０３）。 Next, in each monitoring period, the determination unit 60 means that each text newly acquired (added to the text data) stored in the text storage unit 20 during the monitoring period implies each representative text. Whether or not it is determined (step S103).

監視部７０は、各代表テキストについて、当該代表テキストを含意するテキストの数を集計する（ステップＳ１０４）。ここで、監視期間のテキストに、抽出対象テキストが含まれる場合、当該抽出対象テキストの数も、集計対象のテキストとして用いられる。また、監視部７０は、どの代表テキストにも含意しないテキストの数を、「その他」のテキストの数として集計する。 The monitoring unit 70 totals the number of texts including the representative text for each representative text (step S104). Here, when the text of the monitoring period includes the text to be extracted, the number of the texts to be extracted is also used as the text to be aggregated. In addition, the monitoring unit 70 totals the number of texts that are not implied in any representative text as the number of "other" texts.

監視部７０は、監視期間の、各代表テキストを含意するテキストの数が、所定の通知条件を満足するかどうかを判定する（ステップＳ１０５）。 The monitoring unit 70 determines whether or not the number of texts including each representative text in the monitoring period satisfies a predetermined notification condition (step S105).

ステップＳ１０５で、所定の通知条件を満たす代表テキストがある場合（ステップＳ１０５／Ｙ）、監視部７０は、表示制御部８０を介して通知を行う（ステップＳ１０６）。ここで、表示制御部８０は、通知画面９０を生成し、ユーザ等に表示する。 In step S105, when there is a representative text satisfying a predetermined notification condition (step S105 / Y), the monitoring unit 70 notifies via the display control unit 80 (step S106). Here, the display control unit 80 generates a notification screen 90 and displays it on the user or the like.

さらに、「その他」のテキストの数が所定の抽出閾値未満の間、ステップＳ１０３からの処理が、監視期間毎に繰り返される（ステップＳ１０７／Ｎ）。 Further, while the number of "other" texts is less than the predetermined extraction threshold, the process from step S103 is repeated for each monitoring period (step S107 / N).

図８は、本発明の第１の実施の形態における、各代表テキストを含意するテキストの数の例である。 FIG. 8 is an example of the number of texts implying each representative text in the first embodiment of the present invention.

例えば、監視部７０は、監視期間「２０１５／１」、「２０１５／２」、…の各々において、図８のように、各代表テキストＴ１、Ｔ５、Ｔ９を含意するテキストの数を集計する。 For example, the monitoring unit 70 totals the number of texts implying the representative texts T1, T5, and T9 as shown in FIG. 8 in each of the monitoring periods “2015/1”, “2015/2”, and so on.

ここで、例えば、通知条件が、テキストの数に係る下限の閾値「１００以上」の場合、監視期間「２０１５／５」における、代表テキストＴ５「異音がする」を含意するテキストの数が通知条件を満たす。したがって、監視部７０は、監視期間「２０１５／５」の事象「異音がする」に関して、通知を行う。 Here, for example, when the notification condition is the lower limit threshold value “100 or more” related to the number of texts, the number of texts including the representative text T5 “abnormal noise” in the monitoring period “2015/5” is notified. Satisfy the conditions. Therefore, the monitoring unit 70 notifies the event "abnormal noise" in the monitoring period "2015/5".

図９は、本発明の第１の実施の形態における、通知画面９０の例を示す図である。 FIG. 9 is a diagram showing an example of the notification screen 90 in the first embodiment of the present invention.

図９の例では、通知画面９０は、通知領域９１、代表テキスト表示領域９２、時系列表示領域９３、及び、テキスト表示領域９４を含む。 In the example of FIG. 9, the notification screen 90 includes a notification area 91, a representative text display area 92, a time series display area 93, and a text display area 94.

通知領域９１には、通知対象の監視期間（通知閾値の超過が検出された監視期間）や通知対象の代表テキスト（通知閾値を超過した代表テキスト）が表示される。 In the notification area 91, the monitoring period of the notification target (monitoring period in which the notification threshold is detected to be exceeded) and the representative text of the notification target (representative text exceeding the notification threshold) are displayed.

代表テキスト表示領域９２の「クラスタ」欄には、例えば、各クラスタの代表テキストが表示される。また、「件数」欄には、例えば、通知対象の監視期間における、各代表テキストを含意するテキストの数が表示される。 In the "cluster" column of the representative text display area 92, for example, the representative text of each cluster is displayed. Further, in the "number of cases" column, for example, the number of texts including each representative text in the monitoring period to be notified is displayed.

時系列表示領域９３には、例えば、監視期間毎の、各代表テキストを含意するテキストの数（時系列）を示すグラフが表示される。 In the time series display area 93, for example, a graph showing the number of texts (time series) including each representative text for each monitoring period is displayed.

テキスト表示領域９４の「詳細テキスト」欄には、例えば、通知対象の監視期間における、通知対象の代表テキストを含意するテキストが、取得日時の順番で表示される。 In the "detailed text" field of the text display area 94, for example, texts including the representative text of the notification target in the monitoring period of the notification target are displayed in the order of acquisition date and time.

表示制御部８０は、図９のような通知画面９０を、ユーザ等に表示する。 The display control unit 80 displays the notification screen 90 as shown in FIG. 9 on the user or the like.

ユーザ等は、図９の通知画面９０の通知領域９１を参照し、発生数が多い（または、少ない）事象を、概要レベルで把握できる。また、ユーザ等は、テキスト表示領域９４を参照し、当該通知対象の事象の詳細を把握できる。 The user or the like can refer to the notification area 91 of the notification screen 90 of FIG. 9 and grasp the events with a large number (or a small number of occurrences) at the summary level. In addition, the user or the like can refer to the text display area 94 and grasp the details of the event to be notified.

また、「その他」のテキストの数が、所定の抽出閾値以上の場合（ステップＳ１０７／Ｙ）、ステップＳ１０１からの処理が行われる。 Further, when the number of "other" texts is equal to or greater than the predetermined extraction threshold value (step S107 / Y), the processing from step S101 is performed.

図１０は、本発明の第１の実施の形態における、監視期間毎の、各代表テキストを含意するテキストの数の他の例である。 FIG. 10 is another example of the number of texts implying each representative text for each monitoring period in the first embodiment of the present invention.

ここで、例えば、抽出閾値が「１０」の場合、監視期間「２０１６／４」における、「その他」のテキストの数が抽出閾値以上である。したがって、含意関係抽出部３０は、例えば、次の監視期間「２０１６／５」の抽出対象テキストから、再び、含意関係を抽出する。そして、代表テキスト抽出部４０は、抽出された含意関係から、再び、代表テキストを抽出し、クラスタを生成する。ここで、代表テキスト抽出部４０が、代表テキストＴ１、Ｔ５、Ｔ９に加えて、新たな代表テキストＴ２０１「オイルが漏れている」を抽出したと仮定する。 Here, for example, when the extraction threshold value is “10”, the number of “other” texts in the monitoring period “2016/4” is equal to or greater than the extraction threshold value. Therefore, the implication relationship extraction unit 30 extracts the implication relationship again from, for example, the extraction target text of the next monitoring period “2016/5”. Then, the representative text extraction unit 40 extracts the representative text again from the extracted implications and generates a cluster. Here, it is assumed that the representative text extraction unit 40 extracts a new representative text T201 "oil is leaking" in addition to the representative texts T1, T5, and T9.

監視期間「２０１６／８」において、代表テキストＴ２０１「オイルが漏れている」を含意するテキストの数が通知条件を満たす。したがって、監視部７０は、監視期間「２０１６／８」の事象「オイルが漏れている」に関して通知を行う。 In the monitoring period "2016/8", the number of texts implying the representative text T201 "oil is leaking" satisfies the notification condition. Therefore, the monitoring unit 70 notifies the event "oil is leaking" in the monitoring period "2016/8".

以上により、本発明の第１の実施の形態の動作が完了する。 As described above, the operation of the first embodiment of the present invention is completed.

なお、本発明の第１の実施の形態では、クラスタリング対象のテキストが、自動車の不具合報告に係るテキストである場合を例に説明した。しかしながら、これに限らず、クラスタリング対象のテキストは、様々な現象や原因、対策、意見、評価、苦情、要望等、どのような内容に係るテキストでもよい。 In the first embodiment of the present invention, a case where the text to be clustered is a text related to a defect report of an automobile has been described as an example. However, the text to be clustered is not limited to this, and the text related to any content such as various phenomena, causes, countermeasures, opinions, evaluations, complaints, requests, etc. may be used.

また、本発明の第１の実施の形態では、含意関係抽出対象のテキスト（抽出対象テキスト）として、特定の監視期間のテキスト（全テキストや一部のテキスト）を用いた。そして、当該監視期間のテキストから抽出された含意関係をもとに、代表テキストが抽出され、当該代表テキストが、当該監視期間以降の監視期間の監視に用いられた。また、その他の代表テキストの数が所定の抽出閾値以上の場合に、新たな監視期間のテキストから含意関係が再抽出され、当該含意関係をもとに代表テキストが再抽出された。 Further, in the first embodiment of the present invention, the text (all texts or a part of the texts) of a specific monitoring period was used as the text to be extracted (extracted text). Then, a representative text was extracted based on the implication relationship extracted from the text of the monitoring period, and the representative text was used for monitoring the monitoring period after the monitoring period. Further, when the number of other representative texts was equal to or more than a predetermined extraction threshold value, the implication relation was re-extracted from the text of the new monitoring period, and the representative text was re-extracted based on the implication relation.

しかしながら、これに限らず、抽出対象テキストとして、各監視期間のテキストを用いてもよい。この場合、監視期間ごとに、当該監視期間のテキストから含意関係の再抽出、代表テキストの再抽出が行われ、当該代表テキストが、当該監視期間の監視に用いられる。 However, the present invention is not limited to this, and the text of each monitoring period may be used as the text to be extracted. In this case, for each monitoring period, the implication relationship is re-extracted from the text of the monitoring period and the representative text is re-extracted, and the representative text is used for monitoring the monitoring period.

また、これに限らず、抽出対象テキストとして、各監視期間までに取得したテキスト（当該監視期間以前のテキスト）を用いてもよい。この場合、監視期間ごとに、当該監視期間のテキストが抽出対象テキストに追加され、含意関係の再抽出、代表テキストの再抽出が行われる。 Further, the present invention is not limited to this, and the text acquired by each monitoring period (text before the monitoring period) may be used as the text to be extracted. In this case, for each monitoring period, the text of the monitoring period is added to the extraction target text, and the implication relationship is re-extracted and the representative text is re-extracted.

また、代表テキスト抽出部４０は、代表テキストの再抽出により新たな代表テキストが抽出された場合、当該新たな代表テキストを、表示制御部８０を介して通知してもよい。 Further, when a new representative text is extracted by re-extracting the representative text, the representative text extraction unit 40 may notify the new representative text via the display control unit 80.

また、代表テキスト抽出部４０は、抽出された代表テキストの内、監視対象の代表テキストの指定をユーザ等から受け付け、監視部７０は、当該指定された代表テキストについて、含意するテキストの数を監視してもよい。 Further, the representative text extraction unit 40 receives the designation of the representative text to be monitored from the user or the like among the extracted representative texts, and the monitoring unit 70 monitors the number of implied texts for the designated representative text. You may.

また、本発明の第１の実施の形態では、通知条件として、テキストの数に係る下限の閾値を用いた。しかしながら、これに限らず、通知条件として、上限の閾値（テキストの数が閾値以下であれば通知）、または、増加量もしくは減少量の下限の閾値（テキストの数の増加量もしくは減少量が閾値以上であれば通知）が用いられてもよい。また、通知条件として、テキストの数や増減量の閾値以外に、テキストの数や増減量に係る、所定の統計量や分布の条件が設定されていてもよい。 Further, in the first embodiment of the present invention, the lower limit threshold value related to the number of texts is used as the notification condition. However, not limited to this, as a notification condition, an upper threshold (notification if the number of texts is less than or equal to the threshold) or a lower threshold of the increase or decrease (the increase or decrease in the number of texts is the threshold). If the above is the case, notification) may be used. Further, as the notification condition, a predetermined statistic or distribution condition related to the number of texts or the amount of increase / decrease may be set in addition to the threshold value of the number of texts or the amount of increase / decrease.

次に、本発明の第１の実施の形態の基本的な構成を説明する。 Next, the basic configuration of the first embodiment of the present invention will be described.

図１は、本発明の第１の実施の形態の基本的な構成を示すブロック図である。図１を参照すると、本発明の監視システム１（テキスト監視システム）は、テキスト取得部１０、判定部６０、及び、監視部７０を含む。監視システム１は、テキスト間の含意関係に基づいて抽出された、他のテキストが含意するテキストである代表テキストを記憶する記憶部にアクセス可能に接続される。テキスト取得部１０は、監視対象のテキストを取得する。判定部６０は、取得したテキストが代表テキストを含意するかを判定する。監視部７０は、代表テキストを含意するテキストの数を監視し、監視結果を出力する。 FIG. 1 is a block diagram showing a basic configuration of the first embodiment of the present invention. Referring to FIG. 1, the monitoring system 1 (text monitoring system) of the present invention includes a text acquisition unit 10, a determination unit 60, and a monitoring unit 70. The monitoring system 1 is accessiblely connected to a storage unit that stores representative text, which is the text implied by other texts, extracted based on the implications between the texts. The text acquisition unit 10 acquires the text to be monitored. The determination unit 60 determines whether the acquired text implies a representative text. The monitoring unit 70 monitors the number of texts including the representative text and outputs the monitoring result.

次に、本発明の第１の実施の形態の効果を説明する。 Next, the effect of the first embodiment of the present invention will be described.

本発明の第１の実施の形態によれば、テキストを用いた監視において、事象の検出精度を向上できる。その理由は、判定部６０が、取得したテキストが代表テキストを含意するかを判定し、監視部７０は、代表テキストを含意するテキストの数を監視し、監視結果を出力するためである。これにより、ユーザは、明確な概要レベルの観点毎に、発生数が多い（または、少ない）事象を把握できる。 According to the first embodiment of the present invention, the accuracy of detecting an event can be improved in monitoring using text. The reason is that the determination unit 60 determines whether the acquired text implies the representative text, and the monitoring unit 70 monitors the number of texts including the representative text and outputs the monitoring result. This allows the user to grasp the events with a high (or low) number of occurrences for each clear summary level viewpoint.

また、本発明の第１の実施の形態によれば、テキストを用いた監視において、事前に監視対象を定義することなく監視を行うことができる。その理由は、代表テキスト抽出部４０が、監視対象のテキスト間の含意関係から代表テキストを抽出するためである。 Further, according to the first embodiment of the present invention, in monitoring using a text, monitoring can be performed without defining a monitoring target in advance. The reason is that the representative text extraction unit 40 extracts the representative text from the implications between the texts to be monitored.

（第２の実施の形態）
次に、本発明の第２の実施の形態について説明する。 (Second Embodiment)
Next, a second embodiment of the present invention will be described.

本発明の第２の実施の形態では、特定の種別の代表テキストを含意するテキストの数の合計を監視する点において、本発明の第１の実施の形態と異なる。 A second embodiment of the present invention differs from the first embodiment of the present invention in that it monitors the total number of texts implying a particular type of representative text.

はじめに、本発明の第２の実施の形態の構成を説明する。 First, the configuration of the second embodiment of the present invention will be described.

図１１は、本発明の第２の実施の形態における、監視システム１の構成を示すブロック図である。 FIG. 11 is a block diagram showing the configuration of the monitoring system 1 according to the second embodiment of the present invention.

図１１を参照すると、本発明の第２の実施の形態の監視システム１は、本発明の第１の実施の形態の監視システム１の構成に加えて、対象種別記憶部６５を含む。 Referring to FIG. 11, the monitoring system 1 of the second embodiment of the present invention includes the target type storage unit 65 in addition to the configuration of the monitoring system 1 of the first embodiment of the present invention.

対象種別記憶部６５は、監視対象の代表テキストの種別を示す、対象種別情報を記憶する。 The target type storage unit 65 stores target type information indicating the type of the representative text to be monitored.

図１５は、本発明の第２の実施の形態における、対象種別情報の例を示す図である。対象種別情報は、代表テキストの種別（「種別」欄）、通知条件（「通知条件」欄）、及び、当該種別に分類される代表テキスト（「クラスタ欄」）を含む。図１５の例では、代表テキストの種別として、「悪い評価」、及び、「良い評価」が設定されている。また、通知条件として、各種別の代表テキストに含意するテキストの数の合計に係る下限の閾値が設定されている。 FIG. 15 is a diagram showing an example of target type information in the second embodiment of the present invention. The target type information includes the type of the representative text (“type” column), the notification condition (“notification condition” column), and the representative text classified into the type (“cluster column”). In the example of FIG. 15, "bad evaluation" and "good evaluation" are set as the types of representative texts. Further, as a notification condition, a lower limit threshold value relating to the total number of texts implied in each type of representative text is set.

代表テキスト抽出部４０は、抽出した代表テキストを、対象種別情報で示される種別に分類する。ここで、代表テキスト抽出部４０は、所定の分類ルールに従って、抽出した代表テキストを種別に分類する。この場合、分類ルールには、例えば、種別毎に、キーワードや表現が定義され、当該キーワードや表現を含む代表テキストが、対応する種別に分類される。また、代表テキスト抽出部４０は、抽出した代表テキストをユーザ等に出力し、ユーザ等から、当該代表テキストの種別の指定を受け付けてもよい。 The representative text extraction unit 40 classifies the extracted representative text into the types indicated by the target type information. Here, the representative text extraction unit 40 classifies the extracted representative texts into types according to a predetermined classification rule. In this case, in the classification rule, for example, a keyword or expression is defined for each type, and the representative text including the keyword or expression is classified into the corresponding type. Further, the representative text extraction unit 40 may output the extracted representative text to the user or the like and accept the designation of the type of the representative text from the user or the like.

監視部７０は、対象種別情報で示される各種別について、当該種別の代表テキストを含意するテキストの数の合計が、当該種別に対して設定された通知条件を満たす場合に、通知を行う。 The monitoring unit 70 notifies each type indicated by the target type information when the total number of texts including the representative text of the type satisfies the notification condition set for the type.

表示制御部８０は、監視結果（通知内容）を表示するための通知画面１００を生成し、ユーザ等に表示する。 The display control unit 80 generates a notification screen 100 for displaying the monitoring result (notification content) and displays it to the user or the like.

次に、本発明の第２の実施の形態の動作を説明する。 Next, the operation of the second embodiment of the present invention will be described.

上述のステップＳ１０２で、代表テキスト抽出部４０は、抽出した代表テキストを、対象種別情報で示される種別に分類する。 In step S102 described above, the representative text extraction unit 40 classifies the extracted representative text into the types indicated by the target type information.

ステップＳ１０４で、監視部７０は、対象種別情報で示される各種別について、当該種別の代表テキストを含意するテキストの数の合計を集計する。 In step S104, the monitoring unit 70 totals the total number of texts including the representative text of the type for each type indicated by the target type information.

ステップＳ１０５で、監視部７０は、監視期間における、各種別のテキストの数の合計が、当該種別に対して設定された通知条件を満たすかどうかを判定する。通知条件を満たす場合、監視部７０は、ステップＳ１０６で、通知を行う。 In step S105, the monitoring unit 70 determines whether or not the total number of texts for each type in the monitoring period satisfies the notification condition set for the type. If the notification condition is satisfied, the monitoring unit 70 notifies in step S106.

次に、本発明の第２の実施の形態の動作の具体例を説明する。 Next, a specific example of the operation of the second embodiment of the present invention will be described.

図１２は、本発明の第２の実施の形態における、テキストデータの例を示す図である。図１２の例は、監視対象のテキストが、テレビ製品の「評価」に係るテキストである場合の例である。ここでは、テキスト取得部１０が、図１２のようなテキストデータを、テキスト記憶部２０に保存していると仮定する。また、対象種別情報には、図１５のような種別、及び、通知条件が設定されていると仮定する。 FIG. 12 is a diagram showing an example of text data in the second embodiment of the present invention. The example of FIG. 12 is an example in which the text to be monitored is the text related to the "evaluation" of the television product. Here, it is assumed that the text acquisition unit 10 stores the text data as shown in FIG. 12 in the text storage unit 20. Further, it is assumed that the target type information has the type as shown in FIG. 15 and the notification condition.

図１３は、本発明の第２の実施の形態における、含意関係の抽出結果の例を示す図である。 FIG. 13 is a diagram showing an example of the extraction result of the implication relationship in the second embodiment of the present invention.

含意関係抽出部３０は、例えば、図１２における、取得日時が監視期間「２０１５／１」内の抽出対象テキストから、図１３に示すように、含意関係を抽出する。 The implication relationship extraction unit 30 extracts the implication relationship from the extraction target text whose acquisition date and time is within the monitoring period “2015/1” in FIG. 12, for example, as shown in FIG.

図１４は、本発明の第２の実施の形態における、代表テキスト情報の例を示す図である。 FIG. 14 is a diagram showing an example of representative text information in the second embodiment of the present invention.

代表テキスト抽出部４０は、図１３の含意関係をもとに、図１４のような代表テキスト情報を生成する。そして、代表テキスト抽出部４０は、代表テキストＴ３０４「価格が高い」を種別「悪い評価」に、代表テキストＴ３０５「機能性が高い」、Ｔ３０６「デザインがよい」を種別「良い評価」に分類し、図１５のように、対象種別情報に設定する。 The representative text extraction unit 40 generates representative text information as shown in FIG. 14 based on the implication relationship of FIG. Then, the representative text extraction unit 40 classifies the representative text T304 "high price" into the type "bad evaluation", the representative text T305 "high functionality", and T306 "good design" into the type "good evaluation". , As shown in FIG. 15, the target type information is set.

図１６は、本発明の第２の実施の形態における、各種別の代表テキストを含意するテキストの数の合計の例である。 FIG. 16 is an example of the total number of texts implying different representative texts in the second embodiment of the present invention.

例えば、監視部７０は、監視期間「２０１５／１」、「２０１５／２」、…の各々において、図１６のように、種別「悪い評価」、「良い評価」の各々の代表テキストを含意するテキストの数の合計を集計する。 For example, the monitoring unit 70 includes representative texts of the types "bad evaluation" and "good evaluation" in each of the monitoring periods "2015/1", "2015/2", ..., As shown in FIG. Aggregate the total number of texts.

ここで、監視期間「２０１５／５」における、種別「良い評価」の代表テキストを含意するテキストの数の合計が通知条件を満たす。したがって、監視部７０は、監視期間「２０１５／５」の種別「良い評価」の事象に関して通知を行う。 Here, the total number of texts including the representative text of the type "good evaluation" in the monitoring period "2015/5" satisfies the notification condition. Therefore, the monitoring unit 70 notifies the event of the type "good evaluation" of the monitoring period "2015/5".

図１７は、本発明の第２の実施の形態における、通知画面１００の例を示す図である。 FIG. 17 is a diagram showing an example of the notification screen 100 in the second embodiment of the present invention.

図１７の例では、通知画面１００は、通知領域１０１、種別表示領域１０２、代表テキスト表示領域１０３、時系列表示領域１０４、及び、テキスト表示領域１０５を含む。 In the example of FIG. 17, the notification screen 100 includes a notification area 101, a type display area 102, a representative text display area 103, a time series display area 104, and a text display area 105.

通知領域１０１には、通知対象の監視期間や通知対象の種別が表示される。 In the notification area 101, the monitoring period of the notification target and the type of the notification target are displayed.

種別表示領域１０２の「種別」欄には、例えば、各種別が表示される。また、「件数」欄には、例えば、通知対象の監視期間における、各種別の代表テキストを含意するテキストの数の合計が表示される。 In the "type" column of the type display area 102, for example, each type is displayed. Further, in the "number of cases" column, for example, the total number of texts including the representative texts of each type in the monitoring period to be notified is displayed.

代表テキスト表示領域１０３の「クラスタ」欄には、例えば、通知対象の種別の代表テキストが表示される。また、「件数」欄には、例えば、通知対象の監視期間における、各代表テキストを含意するテキストの数が表示される。 In the "cluster" column of the representative text display area 103, for example, the representative text of the type to be notified is displayed. Further, in the "number of cases" column, for example, the number of texts including each representative text in the monitoring period to be notified is displayed.

時系列表示領域１０４には、例えば、監視期間毎の、各種別の代表テキストを含意するテキストの数の合計（時系列）を示すグラフが表示される。 In the time-series display area 104, for example, a graph showing the total number of texts (time-series) including representative texts for each type is displayed for each monitoring period.

テキスト表示領域１０５の「詳細テキスト」欄には、通知対象の監視期間における、通知対象の種別の代表テキストを含意するテキストが、例えば、取得日時の順番で表示される。 In the "detailed text" field of the text display area 105, texts including the representative text of the type of the notification target in the monitoring period of the notification target are displayed in the order of acquisition date and time, for example.

表示制御部８０は、図１７のような通知画面１００を、ユーザ等に表示する。 The display control unit 80 displays the notification screen 100 as shown in FIG. 17 on the user or the like.

ユーザ等は、図１７の通知画面１００の通知領域１０１を参照し、発生数が多い（または、少ない）事象の種別を把握できる。特定の種別が、製品に係る「良い評価」や「悪い評価」であれば、ユーザ等は、当該製品に対してどのような評価がされているかを容易に把握できる。また、ユーザ等は、テキスト表示領域１０５を参照し、「良い評価」や「悪い評価」の具体的な内容を確認できる。 The user or the like can refer to the notification area 101 of the notification screen 100 of FIG. 17 and grasp the types of events with a large number (or a small number of occurrences). If the specific type is "good evaluation" or "bad evaluation" related to the product, the user or the like can easily grasp what kind of evaluation is given to the product. In addition, the user or the like can refer to the text display area 105 and confirm the specific contents of the "good evaluation" and the "bad evaluation".

以上により、本発明の第２の実施の形態の動作が完了する。 As described above, the operation of the second embodiment of the present invention is completed.

次に、本発明の第２の実施の形態の効果を説明する。 Next, the effect of the second embodiment of the present invention will be described.

本発明の第２の実施の形態によれば、発生数が多い（または、少ない）事象の種別を容易に把握できる。その理由は、監視部７０が、所定の種別に属する代表テキストを含意するテキストの数の合計を監視するためである。 According to the second embodiment of the present invention, the types of events with a large number of occurrences (or a small number of occurrences) can be easily grasped. The reason is that the monitoring unit 70 monitors the total number of texts including the representative texts belonging to a predetermined type.

（第３の実施の形態）
次に、本発明の第３の実施の形態について説明する。 (Third Embodiment)
Next, a third embodiment of the present invention will be described.

本発明の第３の実施の形態では、監視結果として、異なる監視期間の間での、各代表テキストを含意するテキストの数の増減傾向を出力する点において、本発明の第１の実施の形態と異なる。 In the third embodiment of the present invention, as a monitoring result, an increasing / decreasing tendency of the number of texts including each representative text is output between different monitoring periods. Different from.

はじめに、本発明の第３の実施の形態の構成を説明する。 First, the configuration of the third embodiment of the present invention will be described.

本発明の第３の実施の形態の構成を示すブロック図は、本発明の第１の実施の形態（図２）と同様である。 The block diagram showing the configuration of the third embodiment of the present invention is the same as that of the first embodiment of the present invention (FIG. 2).

監視部７０は、監視結果として、異なる監視期間の間での、各代表テキストを含意するテキストの数の増減傾向（比較結果）を出力する。 As the monitoring result, the monitoring unit 70 outputs an increase / decrease tendency (comparison result) of the number of texts including each representative text between different monitoring periods.

表示制御部８０は、監視結果（増減傾向）を表示するための比較画面１１０を生成し、ユーザ等に表示（出力）する。 The display control unit 80 generates a comparison screen 110 for displaying the monitoring result (increase / decrease tendency) and displays (outputs) it to the user or the like.

次に、本発明の第３の実施の形態の動作を説明する。 Next, the operation of the third embodiment of the present invention will be described.

図１８は、本発明の第３の実施の形態における、監視システム１の動作を示すフローチャートである。 FIG. 18 is a flowchart showing the operation of the monitoring system 1 according to the third embodiment of the present invention.

はじめに、含意関係抽出部３０は、上述のステップＳ１０１と同様に、抽出対象テキスト間の含意関係を抽出する（ステップＳ２０１）。ここで、含意関係抽出部３０は、抽出対象テキストとして、複数の監視期間の全テキストを用いて、含意関係を抽出する。 First, the implication relationship extraction unit 30 extracts the implication relationship between the texts to be extracted in the same manner as in step S101 described above (step S201). Here, the implication relationship extraction unit 30 extracts the implication relationship by using all the texts of the plurality of monitoring periods as the extraction target texts.

代表テキスト抽出部４０は、上述のステップＳ１０２と同様に、抽出された含意関係から代表テキストを抽出し、クラスタを生成する（ステップＳ２０２）。 The representative text extraction unit 40 extracts the representative text from the extracted implications and generates a cluster in the same manner as in step S102 described above (step S202).

判定部６０は、上述のステップＳ１０３と同様に、各監視期間に取得された各テキストが、各代表テキストを含意するかどうかを判定する（ステップＳ２０３）。 Similar to step S103 described above, the determination unit 60 determines whether or not each text acquired during each monitoring period implies each representative text (step S203).

監視部７０は、上述のステップＳ１０４と同様に、各代表テキストについて、当該代表テキストを含意するテキストの数を集計する（ステップＳ２０４）。 Similar to step S104 described above, the monitoring unit 70 totals the number of texts including the representative text for each representative text (step S204).

判定部６０、及び、監視部７０は、全ての監視期間について、ステップＳ２０３、Ｓ２０４の処理を繰り返す（ステップＳ２０５）。 The determination unit 60 and the monitoring unit 70 repeat the processes of steps S203 and S204 for all monitoring periods (step S205).

監視部７０は、異なる監視期間の間で、各代表テキストを含意するテキストの数を比較する（ステップＳ２０６）。 The monitoring unit 70 compares the number of texts implying each representative text between different monitoring periods (step S206).

監視部７０は表示制御部８０を介して、各代表テキストを含意するテキストの数の増減傾向（比較結果）を出力する（ステップＳ２０７）。ここで、表示制御部８０は、比較画面１１０を生成し、ユーザ等に表示する。 The monitoring unit 70 outputs an increasing / decreasing tendency (comparison result) of the number of texts including each representative text via the display control unit 80 (step S207). Here, the display control unit 80 generates the comparison screen 110 and displays it to the user or the like.

次に、本発明の第３の実施の形態の動作の具体例を説明する。 Next, a specific example of the operation of the third embodiment of the present invention will be described.

図１９は、本発明の第３の実施の形態における、テキストデータの例を示す図である。ここでは、本発明の第２の実施の形態と同様に、監視対象のテキストが、テレビ製品の「評価」に係るテキストであると仮定する。また、監視部７０が、テレビ製品に係るキャンペーンの前の１ヶ月の監視期間（キャンペーン前）とキャンペーンの後の１ヶ月の監視期間（キャンペーン後）との間で、代表テキストを含意するテキストの数を比較すると仮定する。そして、図１９のようなテキストデータが、テキスト記憶部２０に保存されていると仮定する。また、含意関係抽出対象のテキスト（抽出対象テキスト）は、キャンペーン前とキャンペーン後の両方の監視期間の全テキストであると仮定する。 FIG. 19 is a diagram showing an example of text data in the third embodiment of the present invention. Here, it is assumed that the text to be monitored is the text related to the "evaluation" of the television product, as in the second embodiment of the present invention. In addition, the monitoring unit 70 has a text that implies a representative text between the one-month monitoring period (before the campaign) before the campaign related to the TV product and the one-month monitoring period (after the campaign) after the campaign. Suppose you want to compare numbers. Then, it is assumed that the text data as shown in FIG. 19 is stored in the text storage unit 20. In addition, it is assumed that the text to be extracted (extracted text) is the entire text of the monitoring period both before and after the campaign.

図２０は、本発明の第３の実施の形態における、含意関係の抽出結果の例を示す図である。 FIG. 20 is a diagram showing an example of the extraction result of the implication relationship in the third embodiment of the present invention.

含意関係抽出部３０は、例えば、図１９の全テキストから、図２０に示すように、含意関係を抽出する。 The implication relationship extraction unit 30 extracts implication relationships from, for example, all the texts in FIG. 19, as shown in FIG.

図２１は、本発明の第３の実施の形態における、代表テキスト情報の例を示す図である。図２１の例では、テキストＴ５０１「価格が高い」、Ｔ６０１「デザインがよい」、及び、Ｔ６０４「機能性が高い」が、それぞれ、クラスタＣ１、Ｃ２、及び、Ｃ３の代表テキストに設定されている。 FIG. 21 is a diagram showing an example of representative text information in the third embodiment of the present invention. In the example of FIG. 21, the texts T501 "high price", T601 "good design", and T604 "high functionality" are set as the representative texts of the clusters C1, C2, and C3, respectively. ..

代表テキスト抽出部４０は、図２０の含意関係をもとに、図２１のような代表テキスト情報を生成する。 The representative text extraction unit 40 generates representative text information as shown in FIG. 21 based on the implication relationship of FIG. 20.

図２２は、本発明の第３の実施の形態における、各代表テキストを含意するテキストの数の例である。 FIG. 22 is an example of the number of texts implying each representative text in the third embodiment of the present invention.

例えば、監視部７０は、キャンペーン前の監視期間「２０１５／１」とキャンペーン後の監視期間「２０１５／２」の各々において、図２２のように、各代表テキストＴ５０１、Ｔ６０１、Ｔ６０４を含意するテキストの数を集計する。 For example, the monitoring unit 70 includes texts including the representative texts T501, T601, and T604 in each of the pre-campaign monitoring period “2015/1” and the post-campaign monitoring period “2015/2”, as shown in FIG. Aggregate the number of.

また、監視部７０は、監視期間「２０１５／１」と「２０１５／２」との間で、各代表テキストＴ５０１、Ｔ６０１、Ｔ６０４を含意するテキストの数を比較する。ここで、代表テキストＴ５０１を含意するテキストの数は、キャンペーン前の監視期間「２０１５／１」には１００であったが、キャンペーン後の監視期間「２０１５／２」には０である。したがって、監視部７０は、代表テキストＴ５０１のクラスタの「削除」を検出する。また、代表テキストＴ６０１を含意するテキストの数は、キャンペーン前には０であったが、キャンペーン後には１００である。したがって、監視部７０は、代表テキストＴ５０１のクラスタの「追加」を検出する。さらに、代表テキストＴ６０４を含意するテキストの数は、キャンペーン前には５０であったが、キャンペーン後には２００である。したがって、代表テキストＴ６０４を含意するテキストの数の「増加」を検出する。なお、同様に、監視部７０は、代表テキストを含意するテキストの数の「減少」を検出してもよい。 In addition, the monitoring unit 70 compares the number of texts implying the representative texts T501, T601, and T604 between the monitoring periods “2015/1” and “2015/2”. Here, the number of texts including the representative text T501 was 100 in the monitoring period "2015/1" before the campaign, but 0 in the monitoring period "2015/2" after the campaign. Therefore, the monitoring unit 70 detects the "deletion" of the cluster of the representative text T501. The number of texts including the representative text T601 was 0 before the campaign, but 100 after the campaign. Therefore, the monitoring unit 70 detects the “addition” of the cluster of the representative text T501. Furthermore, the number of texts implying the representative text T604 was 50 before the campaign, but 200 after the campaign. Therefore, an "increase" in the number of texts implying the representative text T604 is detected. Similarly, the monitoring unit 70 may detect a “decrease” in the number of texts including the representative text.

図２３は、本発明の第３の実施の形態における、比較画面１１０の例を示す図である。 FIG. 23 is a diagram showing an example of the comparison screen 110 in the third embodiment of the present invention.

図２３の例では、比較画面１１０は、代表テキスト表示領域１１１、及び、時系列表示領域１１２を含む。 In the example of FIG. 23, the comparison screen 110 includes a representative text display area 111 and a time series display area 112.

代表テキスト表示領域１１１の「クラスタ」欄には、例えば、各代表テキストが表示される。また、「件数」欄には、例えば、監視期間毎の、各代表テキストを含意するテキストの数が表示される。「比較結果」欄には、各代表テキストを含意するテキストの数の比較結果（クラスタの「削除」や「追加」、テキストの数の「増加」や「減少」）が表示される。 For example, each representative text is displayed in the "cluster" column of the representative text display area 111. Further, in the "number of cases" column, for example, the number of texts including each representative text is displayed for each monitoring period. In the "Comparison result" column, the comparison result of the number of texts including each representative text ("Delete" or "Add" of the cluster, "Increase" or "Decrease" of the number of texts) is displayed.

例えば、表示制御部８０は、図２３のような比較画面１１０を、ユーザ等に表示する。 For example, the display control unit 80 displays the comparison screen 110 as shown in FIG. 23 on the user or the like.

ユーザ等は、図２３の比較画面１１０を参照し、異なる監視期間の間で追加もしくは削除されたクラスタ、または、発生数が増加もしくは減少した事象を把握できる。ここで、例えば、代表テキストＴ６０１「デザインがよい」やＴ６０４「機能性が高い」のように、良い評価に係る事象のクラスタの追加や、良い評価に係る事象が増加していた場合、キャンペーンにより、評価が改善されたことがわかる。一方、良い評価に係る事象のクラスタの削除や、良い評価に係る事象が減少していた場合、キャンペーンにより、評価が改悪されたことがわかる。同様に、例えば、代表テキストＴ５０１「価格が高い」のように、悪い評価に係る事象のクラスタの削除や、悪い評価に係る事象が減少していた場合も、キャンペーンにより、評価が改善されたことがわかる。また、悪い評価に係る事象のクラスタの追加や、悪い評価に係る事象が増加していた場合も、キャンペーンにより、評価が改悪されたことがわかる。 The user or the like can refer to the comparison screen 110 of FIG. 23 and grasp the clusters added or deleted during different monitoring periods, or the events in which the number of occurrences increased or decreased. Here, for example, when a cluster of events related to good evaluation is added or the number of events related to good evaluation is increasing, such as the representative texts T601 "good design" and T604 "high functionality", the campaign may be used. , It can be seen that the evaluation has improved. On the other hand, if the cluster of events related to good evaluation is deleted or the number of events related to good evaluation is decreasing, it can be seen that the evaluation has been deteriorated by the campaign. Similarly, when the cluster of events related to bad evaluation was deleted or the number of events related to bad evaluation decreased, for example, the representative text T501 "High price", the evaluation was improved by the campaign. I understand. In addition, when the number of events related to bad evaluation is increasing or the number of events related to bad evaluation is increasing, it can be seen that the evaluation has been deteriorated by the campaign.

以上により、本発明の第３の実施の形態の動作が完了する。 As described above, the operation of the third embodiment of the present invention is completed.

次に、本発明の第３の実施の形態の効果を説明する。 Next, the effect of the third embodiment of the present invention will be described.

本発明の第３の実施の形態によれば、異なる監視期間の間で、追加もしくは削除されたクラスタ、または、発生数が増加もしくは減少した事象を容易に把握できる。その理由は、監視部７０が、異なる監視期間の間での、各代表テキストを含意するテキストの数の増減傾向を出力するためである。 According to the third embodiment of the present invention, it is possible to easily grasp the clusters added or deleted, or the events in which the number of occurrences increased or decreased during different monitoring periods. The reason is that the monitoring unit 70 outputs the increasing / decreasing tendency of the number of texts including each representative text between different monitoring periods.

以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 Although the present invention has been described above with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made within the scope of the present invention in terms of the structure and details of the present invention.

以下、参考形態の例を付記する。 Hereinafter, an example of the reference form will be added.

（付記１）
テキスト間の含意関係に基づいて、含意関係があるテキストを同じグループに分類することによりクラスタリングされた情報源と、前記情報源に追加されたテキストと同一グループに属するクラスタを特定する特定手段と、前記特定したクラスタに属するテキストの数を計算して、所定のタイミングで提示する提示手段と、を備える、テキスト監視システム。 (Appendix 1)
Sources clustered by classifying texts with implications into the same group based on the implications between texts, and specific means of identifying clusters that belong to the same group as the text added to the source. A text monitoring system comprising a presentation means for calculating the number of texts belonging to the identified cluster and presenting them at a predetermined timing.

本発明は、大量文書データから事象を監視するシステムに適用できる。例えば、本発明は、製品やサービスの改善、マーケティング、営業活動の効率化のために、コールログや顧客の意見等を監視するシステムに適用できる。また、本発明は、製品の不具合や製品に対する評価や要望を監視するシステム、学術文献等の内容を監視するシステムにも適用できる。 The present invention can be applied to a system for monitoring an event from a large amount of document data. For example, the present invention can be applied to a system for monitoring call logs, customer opinions, etc. in order to improve products and services, improve marketing, and improve the efficiency of sales activities. The present invention can also be applied to a system for monitoring product defects, evaluations and requests for products, and a system for monitoring the contents of academic literature and the like.

１監視システム
２ＣＰＵ
３記憶デバイス
４通信デバイス
５入力デバイス
６出力デバイス
１０テキスト取得部
２０テキスト記憶部
３０含意関係抽出部
４０代表テキスト抽出部
５０代表テキスト記憶部
６０判定部
７０監視部
８０表示制御部
９０通知画面
９１通知領域
９２代表テキスト表示領域
９３時系列表示領域
９４テキスト表示領域
１００通知画面
１０１通知領域
１０２種別表示領域
１０３代表テキスト表示領域
１０４時系列表示領域
１０５テキスト表示領域
１１０比較画面
１１１代表テキスト表示領域
１１２時系列表示領域 1 Monitoring system 2 CPU
3 Storage device 4 Communication device 5 Input device 6 Output device 10 Text acquisition unit 20 Text storage unit 30 Implication relationship extraction unit 40 Representative text extraction unit 50 Representative text storage unit 60 Judgment unit 70 Monitoring unit 80 Display control unit 90 Notification screen 91 Notification Area 92 Representative text display area 93 Time series display area 94 Text display area 100 Notification screen 101 Notification area 102 Type display area 103 Representative text display area 104 Time series display area 105 Text display area 110 Comparison screen 111 Representative text display area 112 Time series Indicated Area

Claims

Text acquisition means to acquire the first text,
A reception means that accepts the designation of the second text,
A monitoring means for monitoring whether or not the number of the first texts including the second text that has received the designation among the acquired first texts satisfies a predetermined condition, and
A display control means for displaying a screen that can be grasped to satisfy the predetermined condition when the predetermined condition is satisfied
With
The relationship between the second text and the first text implying the second text is that if the content of the first text is true, then the content of the second text is true. An analysis support system that shows that.

The analysis support system according to claim 1, wherein the monitoring means monitors whether or not the number of first texts including the second text that has received the designation exceeds a threshold value.

The analysis according to claim 1, wherein the monitoring means monitors whether or not the number of first texts including the second text that has received the designation satisfies a predetermined condition in a plurality of predetermined periods. Support system.

The monitoring means monitors whether or not the total number of the first texts including the second text belonging to a predetermined type satisfies the predetermined condition in the plurality of predetermined periods. The analysis support system according to claim 3.

The predetermined condition is that the number of the first texts is equal to or more than or equal to a predetermined threshold value, or an increase or decrease in the number of the first texts during different periods among the plurality of periods. The analysis support system according to claim 3 or 4, wherein the amount is equal to or greater than a predetermined threshold value.

The analysis support system according to claim 1, wherein the monitoring means monitors an increasing / decreasing tendency of the number of the first texts including the second texts that have received the designation.

The text acquisition means provided in the computer acquires the first text and
The reception means provided in the computer accepts the designation of the second text,
The monitoring means provided in the computer monitors whether or not the number of the first texts including the second text that has received the designation among the acquired first texts satisfies a predetermined condition. ,
An analysis support method for displaying a screen on which it is possible to grasp that the predetermined condition is satisfied when the display control means provided in the computer satisfies the predetermined condition.
The relationship between the second text and the first text implying the second text is that if the content of the first text is true, then the content of the second text is true. An analysis support method that shows that.

On the computer
Get the first text,
Accepts the specification of the second text,
Among the acquired first texts, it is monitored whether or not the number of the first texts including the second texts that have accepted the designation satisfies a predetermined condition.
A program that executes a process of displaying a screen that can be grasped that the predetermined conditions are satisfied when the predetermined conditions are satisfied.
The relationship between the second text and the first text implying the second text is that if the content of the first text is true, then the content of the second text is true. A program that shows that.