JP2006146621A

JP2006146621A - Information management device and method, and information management program

Info

Publication number: JP2006146621A
Application number: JP2004336619A
Authority: JP
Inventors: Kiyomi Yatabe; 清美矢田部; Shinichi Doi; 伸一土井; Shinichi Ando; 真一安藤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2004-11-19
Filing date: 2004-11-19
Publication date: 2006-06-08

Abstract

PROBLEM TO BE SOLVED: To make the knowledge state of each user easy to recognize.

SOLUTION: An information management device includes input/output data storage means 31, 32 for storing input/output data input and output by users and input/output status data indicating the input/output status of that data; feature value extraction means 22 for extracting feature values of the input/output data based on a predetermined criterion; and personal knowledge data generation means 23 for generating, based on the feature values of the input/output data and the input/output status data, personal knowledge data representing the features of the input/output data for each user who input or output that data.

COPYRIGHT: (C)2006, JPO&NCIPI

Description

The present invention relates to an information management apparatus, and more particularly to an information management apparatus that characterizes data input and output by users and manages it for each user. The present invention also relates to an information management method and an information management program.

In recent years, as large volumes of information have become available, demand has grown for individuals to be able to easily search for desired information or have it delivered to them (information retrieval and information distribution), and to find experts who have the desired knowledge (expert search).

Meanwhile, individuals increasingly input and output information on computers as electronic information such as electronic documents, e-mail, and electronic images, and it has become possible to store an individual's input/output information as a record of that individual's interests and expertise.

Here, input/output information refers to, for example, input information that has been copied or viewed and output information that has been created or edited. Conventional techniques have been proposed that use an individual's input/output information to date for search and distribution. Examples include (1) knowledge management services that accumulate each individual's input/output information and later use it for information retrieval and expert search, and (2) information distribution personalization and recommendation services that store an individual's input and output of information and provide information suited to that individual (for example, Patent Document 1).

Patent Document 1: JP 2001-175676 A

However, the first problem with the above conventional examples is that old data becomes buried in the database and difficult to retrieve. An individual's interests and expertise normally shift over time, so that an individual comes to have multiple interests and areas of expertise as time passes. With the conventional methods, interests and expertise extracted from information that was input and output intensively during a certain past period, such as a theme handled only at a certain time in the past, are not very prominent over the whole period from the past to the present. They are therefore buried in the whole and become difficult to extract.

This situation will be described with reference to FIG. 21. Each solid horizontal line in FIG. 21 indicates that an individual P was engaged in activity related to a specific interest or area of expertise during a specific period on the time axis t (horizontal axis). For example, the three solid lines at the top of FIG. 21 indicate that individual P was engaged in activity related to interest or expertise a during the periods t1 to t2, t3 to t5, and t7 to t9. Similarly, the two solid lines below them indicate that individual P was engaged in activity related to interest or expertise b during the period t2 to t4, and in activity related to interest or expertise c during the period t6 to t8.

In the conventional examples described above, a single set of interests and expertise is extracted from the input/output information as a whole, so only interest or expertise a can be extracted, and interests or expertise b and c, which were handled intensively at certain times in the past, cannot. Likewise, the conventional examples make it difficult to extract interests and expertise from input/output information that is old, so even information that was input and output intensively at some point in the past is hard to extract if it is far from the present. As a result, a, which was handled frequently throughout, and c, which was handled intensively recently, can be extracted, but b, which is older than c and was handled intensively at a certain time in the past, cannot.

The second problem with the conventional examples is that it is difficult to extract an individual's interests and expertise accurately. This is because they are extracted from the whole of the individual's input and output of information, without distinguishing whether the individual merely looked at the information or understood it deeply. More specifically, features are extracted from a mixture of both kinds of information, without distinguishing whether the individual only copied or viewed the information or engaged with it as a strong interest or area of expertise to the extent of creating or editing it.

Accordingly, an object of the present invention is to make the knowledge state of each user easy to recognize by managing data according to each individual's input/output status, and thereby to facilitate information retrieval of all kinds. Specifically, in response to the first problem, an object is to extract an individual's interests and expertise for each period using time information, so that interests and expertise extracted from information input and output intensively during a certain past period, such as a theme handled only at a certain time in the past, can be retrieved. In response to the second problem, a further object is to extract an individual's interests and expertise accurately by distinguishing whether the individual merely looked at information or understood it deeply.

Therefore, an information management apparatus according to one aspect of the present invention comprises:
input/output data storage means for storing input/output data input and output by a user and input/output status data representing the input/output status of the input/output data;
feature value extraction means for extracting feature values of the input/output data based on a predetermined criterion; and
personal knowledge data generation means for generating, based on the feature values of the input/output data and the input/output status data, personal knowledge data representing the features of the input/output data for each user who input or output the input/output data.

As a result, personal knowledge data representing the features of the input/output data is generated according to each user's input/output status, so the knowledge state of each user can be easily recognized and accurate information retrieval can be achieved.

The feature value extraction means extracts the feature values based on at least one of the appearance frequency, the appearance position, and the emphasis information of each term contained in the input/output data. Because the personal knowledge data for each user is thus generated according to, for example, the frequency of the terms that were input and output, the knowledge state of each user can be recognized more clearly.

Further, the input/output status data is data representing the time at which the input/output data was input or output. In this case, the personal knowledge data generation means generates the personal knowledge data based on a plurality of input/output data items input or output within a predetermined period. Personal knowledge data can thus be generated for each period of input and output, and the knowledge state of each user in each period can be recognized. Searches can therefore be performed without regard to how old the information is.
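The period-based generation described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `knowledge_by_period`, the record layout, and the choice to sum feature values within each period are all hypothetical and are not specified in the source.

```python
from collections import defaultdict
from datetime import date


def knowledge_by_period(records, periods):
    """Aggregate feature values per (user, period).

    records: list of (user, day, {term: feature value}) tuples.
    periods: list of (label, start, end) tuples with inclusive date bounds.
    Summing values inside a period is an assumption made for this sketch.
    """
    knowledge = defaultdict(lambda: defaultdict(float))
    for user, day, feats in records:
        for label, start, end in periods:
            if start <= day <= end:
                for term, value in feats.items():
                    knowledge[(user, label)][term] += value
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {key: dict(terms) for key, terms in knowledge.items()}


records = [
    ("A", date(2004, 1, 10), {"C++": 1.0}),
    ("A", date(2004, 1, 20), {"C++": 2.0, "class": 0.5}),
    ("A", date(2004, 6, 1), {"Java": 1.0}),
]
periods = [
    ("P1", date(2004, 1, 1), date(2004, 3, 31)),
    ("P2", date(2004, 4, 1), date(2004, 6, 30)),
]
result = knowledge_by_period(records, periods)
```

Keeping one entry per (user, period) pair is what lets a theme handled only in an early period remain visible instead of being averaged away over the whole history.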

Further, the input/output status data is input/output distinction data indicating whether each item of input/output data was input or output. In this case, the personal knowledge data generation means generates the personal knowledge data by weighting the input/output data according to the input/output distinction data. In doing so, the personal knowledge data generation means weights output data more heavily than input data. The input/output status data is also reference status data representing how the input/output data refers to other data. In this case, the personal knowledge data generation means generates the personal knowledge data by weighting the input/output data according to the reference status data. In particular, the weight is applied to data that is referenced by given data among the input/output data. Because the personal knowledge data for each user is thus generated according to the input/output distinction or the reference status, it reflects the individual's knowledge state more closely, enabling more accurate information retrieval.
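As a rough illustration of weighting by input/output distinction, the sketch below multiplies each item's feature values by a weight chosen from its kind. The weight values 1.0 and 2.0, the function name `weighted_knowledge`, and the record layout are assumptions for this example; the source states only that output data is weighted more heavily than input data.

```python
from collections import defaultdict

# Hypothetical weights: the source only requires output > input.
WEIGHTS = {"input": 1.0, "output": 2.0}


def weighted_knowledge(records, weights=WEIGHTS):
    """records: list of (user, kind, {term: value}), kind is 'input' or 'output'."""
    knowledge = defaultdict(lambda: defaultdict(float))
    for user, kind, feats in records:
        w = weights[kind]
        for term, value in feats.items():
            knowledge[user][term] += w * value
    return {user: dict(terms) for user, terms in knowledge.items()}


records = [
    ("A", "input", {"C++": 1.0}),                 # a document user A only viewed
    ("A", "output", {"C++": 1.0, "class": 0.5}),  # a document user A wrote
]
result = weighted_knowledge(records)
```

Weighting by reference status could presumably be handled the same way, by adding a further factor for items that other items refer to.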

The apparatus further comprises similarity calculation means for calculating the similarity between the feature values of a plurality of input/output data items, and the personal knowledge data generation means generates personal knowledge data for a plurality of input/output data items whose similarity is equal to or greater than a predetermined value. Personal knowledge data can thus be generated for input/output data items with similar features, grouped into sets of like data, so that the individual's characteristics are reflected more closely.
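One way to picture the similarity-based grouping is the sketch below. The source does not name a similarity measure or a grouping procedure; cosine similarity over sparse term vectors and a simple greedy grouping are assumptions chosen for illustration, and the names `cosine` and `group_similar` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse vectors given as {term: value}."""
    dot = sum(v * b.get(t, 0.0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def group_similar(features, threshold=0.4):
    """Greedy grouping: an item joins the first group that contains a member
    whose similarity to it is at or above the threshold."""
    groups = []
    for item_id, vec in features.items():
        for group in groups:
            if any(cosine(vec, features[other]) >= threshold for other in group):
                group.append(item_id)
                break
        else:
            groups.append([item_id])
    return groups


features = {
    "A1": {"C++": 1.0, "class": 1.0},
    "A2": {"C++": 1.0, "library": 1.0},
    "A3": {"budget": 1.0},
}
groups = group_similar(features, threshold=0.4)
```

Each resulting group would then serve as the set of items from which one piece of personal knowledge data is generated.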

In addition to the above configuration, the apparatus comprises information storage means for storing the personal knowledge data generated by the personal knowledge data generation means. It also comprises search means for receiving, from another user, search request data containing data corresponding to the input/output data, and for searching the personal knowledge data stored in the information storage means based on that search request data. The search means also retrieves other personal knowledge data that is similar, by a predetermined criterion, to the personal knowledge data found. The apparatus further comprises search information output means that outputs the personal knowledge data found by the search means. The search information output means may also read the input/output data corresponding to that personal knowledge data from the input/output data storage means and output it, or output user identification information identifying the user to whom that personal knowledge data corresponds. Information can thus be searched based on the features of each individual's input/output data, enabling more accurate information retrieval.
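A minimal sketch of the search step follows, assuming the stored personal knowledge data is a mapping from user to term weights and the search request is a list of terms. The scoring rule (sum of the weights of matching terms) and the name `search_experts` are assumptions, not the source's method.

```python
def search_experts(knowledge, query_terms):
    """Return (user, score) pairs ranked by how strongly each user's
    personal knowledge data matches the query terms.

    knowledge: {user: {term: weight}}. Users with no matching term are omitted.
    """
    scores = {}
    for user, terms in knowledge.items():
        score = sum(terms.get(t, 0.0) for t in query_terms)
        if score > 0:
            scores[user] = score
    # Highest score first; ties are broken by user name for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


knowledge = {
    "A": {"C++": 3.0, "class": 1.0},
    "B": {"C++": 1.0},
    "C": {"budget": 2.0},
}
hits = search_experts(knowledge, ["C++", "class"])
```

The returned user names correspond to the user identification information mentioned above; returning the matching personal knowledge data or the underlying input/output data would be straightforward variations.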

An information management method according to another aspect of the present invention is executed by a computer connected to input/output data storage means that stores input/output data input and output by a user and input/output status data representing the input/output status of the input/output data, and comprises:
a feature value extraction step of extracting feature values of the input/output data based on a predetermined criterion; and
a personal knowledge data generation step of generating, based on the feature values of the input/output data and the input/output status data stored in the input/output data storage means, personal knowledge data representing the features of the input/output data for each user who input or output the input/output data.

In the feature value extraction step, the feature values are extracted based on at least one of the appearance frequency, the appearance position, and the emphasis information of each term contained in the input/output data.

Further, the input/output status data is data representing the time at which the input/output data was input or output, and
in the personal knowledge data generation step, the personal knowledge data is generated based on a plurality of input/output data items input or output within a predetermined period.

Further, the input/output status data is input/output distinction data indicating whether each item of input/output data was input or output, and
in the personal knowledge data generation step, the personal knowledge data is generated by weighting the input/output data according to the input/output distinction data.

Furthermore, in the personal knowledge data generation step, the similarity between the feature values of a plurality of input/output data items is calculated, and personal knowledge data is generated for a plurality of input/output data items whose similarity is equal to or greater than a predetermined value.

An information management program according to another aspect of the present invention causes a computer connected to input/output data storage means, which stores input/output data input and output by a user and input/output status data representing the input/output status of the input/output data, to realize:
feature value extraction means for extracting feature values of the input/output data based on a predetermined criterion; and
personal knowledge data generation means for generating, based on the feature values of the input/output data and the input/output status data stored in the input/output data storage means, personal knowledge data representing the features of the input/output data for each user who input or output the input/output data.

The program may also cause the computer to realize feature value extraction means that extracts the feature values based on at least one of the appearance frequency, the appearance position, and the emphasis information of each term contained in the input/output data.

The program may also cause the computer to realize personal knowledge data generation means that, when the input/output status data is data representing the input/output time of the input/output data, generates the personal knowledge data based on a plurality of input/output data items input or output within a predetermined period.

The program may also cause the computer to realize personal knowledge data generation means that, when the input/output status data is input/output distinction data indicating whether each item of input/output data was input or output, generates the personal knowledge data by weighting the input/output data according to the input/output distinction data.

Furthermore, the program may cause the computer to realize:
similarity calculation means for calculating the similarity between the feature values of a plurality of input/output data items; and
personal knowledge data generation means for generating personal knowledge data for a plurality of input/output data items whose similarity is equal to or greater than a predetermined value.

The information management method and information management program configured as above operate in the same manner as the information management apparatus described above, and can therefore achieve the above object.

Because the present invention is configured and functions as described above, the history of a user's input/output data can be generated as personal knowledge data representing the features of that data according to each user's input/output status. The knowledge state of each user can therefore be easily recognized, and by managing this personal knowledge data, easy and accurate information retrieval can be achieved, an excellent effect not available in the prior art.

The present invention is an information management apparatus that accumulates and manages input/output data, which reflects the interests and expertise of the individuals who input and output it, so that it can easily be retrieved later. To this end, the invention is characterized by generating personal knowledge data representing the features of the input/output data according to the input/output status, and managing that data.

In Embodiment 1 below, the time of input or output is taken as an example of the input/output status, and an information management apparatus that manages the input/output data divided by the period in which the input or output took place is described. In Embodiment 2, the input/output distinction is taken as another example of the input/output status, and an information management apparatus that generates and manages features according to whether data was input or output and whether it refers to other data is described. Embodiment 3 describes a combination of the period and the input/output distinction.

A first embodiment of the present invention will be described with reference to FIGS. 1 to 12. FIG. 1 is a block diagram showing the overall configuration of the present invention. FIG. 2 is a functional block diagram showing the configuration of the information management apparatus. FIGS. 3 to 5 are explanatory diagrams showing how information is processed in the information management apparatus. FIGS. 6 to 12 are flowcharts showing the operation of the information management apparatus.

[Configuration]
The present invention is an information management apparatus 2 that manages data input and output by users, and is implemented as an information processing apparatus such as a server computer. The information management apparatus 2 is connected via a network N to user terminals 11, 12, and 13, which are computers operated by users A, B, and C, who input and output data to and from the apparatus, for example by registering or viewing data. It is also connected via the network N to a user terminal 14, which is a computer operated by a user D, who issues data search requests to the information management apparatus 2.

The information management apparatus 2 is not necessarily limited to a single computer and may consist of a plurality of computers. Data input/output and searches are not limited to being performed through the user terminals 11, 12, 13, and 14, and may be performed by a user operating the information management apparatus 2 directly. Further, a "user" is not limited to an individual and may refer to an organization or group made up of multiple members. The specific configuration of the information management apparatus 2 is described below.

The information management apparatus 2 comprises a computation unit 20 that performs arithmetic processing on information and a storage unit 30 that stores information. When the information management program is installed in the computation unit 20, a data input processing unit 21, a feature value extraction processing unit 22, and a personal knowledge generation processing unit 23 are built in it, as shown in FIG. 2, giving it the function of managing data input and output through the user terminals 11, 12, and 13 based on the features of that data. The program also builds a search request reception processing unit 24, a search processing unit 25, and a search result output processing unit 26 in the computation unit 20, giving it the function of performing search processing in response to data search requests from the user terminal 14. Correspondingly, the storage unit 30 contains an input/output data storage unit 31, a time data storage unit 32, a period setting data storage unit 33, and a personal knowledge data storage unit 34. The processing units 21 to 26 and the storage units 31 to 34 are described in detail below.

The data input/output processing unit 21 receives the input/output data that a user inputs to or outputs from the information management apparatus 2 via the user terminals 11, 12, and 13, together with time data representing the time at which the user performed the input or output, and stores them in the input/output data storage unit 31 and the time data storage unit 32 (the input/output data storage means) in the storage unit 30, respectively. Here, the time data represents, for example, the time at which the user accessed the input/output data or the time at which the user updated it. The input/output data is data already stored in the storage unit of the information management apparatus 2. In that case, the time data refers to the time at which the user saved the input/output data in the storage unit 30. The data input and output by the user may instead be data input to or output from a storage unit of another computer. In that case, the information management apparatus 2 monitors that input/output and stores the input/output data and the time data. The input/output data represents the user's interests and expertise and, as described later, is extracted, accumulated, and shared as personal knowledge.

The state in which the input/output data and time data have been stored as described above will be explained with reference to FIGS. 3 and 4. For example, when user A is interested in the programming language "C++" and reads documents to acquire knowledge of it, the data that user A inputs and outputs via the user terminal 11 is saved, as it occurs, in the input/output data storage unit 31 of the storage unit 30. At the same time, time data (time information) indicating when user A input or output the data is saved in the time data storage unit 32 of the storage unit 30. As shown in FIG. 3, each item of user A's input/output data is thus stored with its ID and its input/output time associated with each other. As described later, the input/output data includes terms such as "C++", "library", "class", and "variable", shown in FIG. 4.
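The association between an item of input/output data, its ID, and its input/output time might be modeled as in the sketch below. The class names `IORecord` and `Storage` and their fields are hypothetical; the two dictionaries merely stand in for the input/output data storage unit 31 and the time data storage unit 32, linked by a shared ID.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class IORecord:
    data_id: str           # ID that links the data to its time entry
    user: str              # user who input or output the data
    content: str           # the input/output data itself (text assumed)
    accessed_at: datetime  # time of input or output


class Storage:
    """Toy stand-in for storage unit 30 (names are assumptions)."""

    def __init__(self):
        self.io_data = {}    # plays the role of storage unit 31
        self.time_data = {}  # plays the role of storage unit 32

    def save(self, record):
        # Both stores are keyed by the same ID so they stay associated.
        self.io_data[record.data_id] = (record.user, record.content)
        self.time_data[record.data_id] = record.accessed_at


storage = Storage()
storage.save(IORecord("A1", "A", "C++ class library", datetime(2004, 11, 19, 9, 0)))
```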

Next, the feature value extraction processing unit 22 (feature value extraction means) will be described. The feature value extraction processing unit 22 calculates the feature values of the input/output data. The rule (criterion) for calculating feature values is set in advance and built into the processing unit 22. An example of feature value extraction is described below. Here, the input/output data is assumed to be text information, and feature values are extracted based on the appearance frequency of each term contained in that text.

The feature value extraction processing unit 22 reads the data input or output by user A that is stored in the input/output data storage unit 31 of the storage unit 30, and analyzes it. The unit of analysis is arbitrary and need not be an information unit such as a single file. As for the analysis method, when the input/output data Aj (j >= 1) of a certain period is text information, for example, weighting is performed with the well-known TF/IDF technique and feature words (terms) are extracted. More specifically, words are extracted from each input/output data item Aj by morphological analysis, the frequency of occurrence of content words such as nouns, verbs, and adjectives among the extracted words is counted, and the TF/IDF value of each content word is calculated from its frequency of occurrence in each input/output data item. The top N feature words by TF/IDF value, together with their TF/IDF values, are then taken as the feature values of that input/output data item. The TF/IDF technique is described in, among others, Sparck Jones, Karen (1972), "A statistical interpretation of term specificity and its application in retrieval", Journal of Documentation 28: 11-21, and is well known, so a detailed explanation is omitted.
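The extraction of the top N feature words can be sketched as follows. This is a minimal illustration, not the method fixed by the description: it assumes that morphological analysis has already reduced each input/output data item to a list of content words, and it assumes one common TF/IDF variant (raw term count multiplied by log(N/df)). The function name `top_tfidf_terms` and the sample documents are hypothetical.

```python
import math
from collections import Counter

def top_tfidf_terms(docs, doc_index, n):
    """Return the n highest-scoring (term, tf-idf) pairs for docs[doc_index].

    docs is a list of token lists, each already reduced to content words
    (the morphological analysis step is assumed to have been done elsewhere).
    """
    tf = Counter(docs[doc_index])           # term frequency in the target item
    num_docs = len(docs)
    scores = {}
    for term, count in tf.items():
        df = sum(1 for d in docs if term in d)   # document frequency
        idf = math.log(num_docs / df)            # assumed IDF variant
        scores[term] = count * idf
    # Sort by descending score, breaking ties alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:n]

# Hypothetical content words for three input/output data items.
docs = [
    ["C++", "introduction", "book"],
    ["library", "book", "loan"],
    ["C++", "library", "class", "variable", "class"],
]
features = top_tfidf_terms(docs, 2, 2)
```

Terms that appear in every item receive a weight of zero under this variant, which is why words specific to one item rise to the top.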

Although the case where the input/output data is text information has been described above, the input/output data is not limited to this; besides text information it may be, for example, annotated image information or audio information. Nor is the feature value extraction method limited to the one above: feature values may be extracted on the basis of the position at which each term appears in the input/output data, or on emphasis information such as font information or prosodic information. These methods may also be combined to extract feature values.

An example of the result produced by the feature value extraction processing unit 22 is shown in FIGS. 4 and 5(a). FIG. 4 shows an example in which the input/output data whose ID is A3 is morphologically analyzed, content words such as nouns, verbs, and adjectives are extracted, TF/IDF values are calculated by comparison with the frequency of those content words in the other input/output data, and the top four feature words "C++", "library", "class", and "variable" are extracted together with their TF/IDF values as the feature values. FIG. 5(a) shows an example in which the input/output data A1 to Aj at each time are represented by their feature values.

Next, the personal knowledge generation processing unit 23 (personal knowledge data generation means) will be described. The personal knowledge generation processing unit 23 operates to generate, from the feature values of the input/output data and on the basis of the time data, which is the input/output status data, personal knowledge data representing the features of the input/output data for each user who input or output that data. In this embodiment in particular, personal knowledge data representing the features of an individual user's input/output data is created by computing the features of a plurality of input/output data items whose input/output times are close to one another.

To describe the personal knowledge generation processing unit 23 more specifically, it first reads the period setting data stored in the period setting data storage unit 33. The period setting data serves as the criterion for setting a given period: for example, it may specify that a plurality of input/output data items that are close in time are treated as data within one period, or that the input/output data within a predetermined length of time is treated as one period. A period is then set on the basis of this period setting data; in this embodiment, the span covering A1 to A3 shown in FIG. 5(a) is set as a specific period At1. From the feature values of the input/output data within that period, personal knowledge data representing the features at a higher level is generated. For example, the TF/IDF values of the top N feature words of each input/output data item are each treated as a feature word vector, and their sum is computed. The resulting data is taken as the personal knowledge data, as shown in FIG. 5(b). Personal knowledge data is then also created for other periods, and likewise for the input/output data of users B and C. The personal knowledge generation processing unit 23 also saves the personal knowledge data generated in this way in the personal knowledge data storage unit 34 (information storage means).
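One of the period-setting criteria mentioned above, treating temporally close input/output data as one period, could be sketched as follows. This is only an illustration under stated assumptions: the gap-based rule, the function name `group_into_periods`, the `max_gap` parameter, and all timestamps are hypothetical, since the description does not fix how closeness in time is measured.

```python
from datetime import datetime, timedelta

def group_into_periods(records, max_gap):
    """Split (data_id, timestamp) records into periods of temporally close items.

    A new period starts whenever the gap to the previous item exceeds max_gap
    (an assumed criterion standing in for the period setting data).
    """
    periods = []
    for data_id, ts in sorted(records, key=lambda r: r[1]):
        if periods and ts - periods[-1][-1][1] <= max_gap:
            periods[-1].append((data_id, ts))
        else:
            periods.append([(data_id, ts)])
    return [[data_id for data_id, _ in p] for p in periods]

# Made-up access times for user A's input/output data.
records = [
    ("A1", datetime(2004, 11, 1, 9, 0)),
    ("A2", datetime(2004, 11, 1, 13, 0)),
    ("A3", datetime(2004, 11, 2, 10, 0)),
    ("A4", datetime(2004, 12, 20, 15, 0)),
]
periods = group_into_periods(records, timedelta(days=2))
```

With these sample values, A1 to A3 fall into one period and A4 starts another, mirroring the A1 to A3 grouping used in the example of FIG. 5(a).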

The method by which the personal knowledge generation processing unit 23 generates personal knowledge data is not limited to the one described above. For example, personal knowledge data may be generated by summing the feature words, which are the feature values of the input/output data, and their TF/IDF values over a fixed length of time and taking the average. The way the period is set is likewise not limited to the above; moreover, the period need not be of fixed length and may be determined dynamically.

Next, the search request receiving unit 24 will be described. The search request receiving unit 24 operates to accept, from the user terminal 14, a search request for data managed by the information management device 2 and to notify the search processing unit 25 of it. The user terminal 14 that issues the search request is assumed to be operated by user D, and user D is assumed to issue a search request by sending the search request data "C++ and its libraries".

On receiving notification of the search request data from the search request receiving unit 24, the search processing unit 25 (search means) analyzes the content of the search request data and searches for at least one item of the stored personal knowledge data. As a concrete search process, for example, the request is subjected to morphological analysis and a value of "1.0" is assigned to the extracted content words "C++" and "library". At the same time, a value of "0" is assigned to words that are contained in the search target but not in the search request. As a result, a feature word vector for the search request such as (C++, library, class, variable) = (1.0, 1.0, 0, 0) is created. The similarity between this feature word vector of the search request data and each of one or more items of personal knowledge data is then calculated. Any similarity measure may be used; when the vector space method is used, for example, the similarity between the personal knowledge data At1 described above and the search request data is "0.91". This similarity to the search request is calculated for the personal knowledge data of every period of all users A, B, and C, and the personal knowledge data most similar to the search request data is identified from the result.
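The similarity value quoted above can be reproduced with a short sketch. It assumes that the vector space method here means cosine similarity, which the description does not state explicitly but which yields the quoted 0.91 for the At1 vector (1.89, 2.62, 0.74, 1.19) that the description computes for period t1. The function name `cosine_similarity` is hypothetical.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Shared term order for the query and the personal knowledge data.
vocabulary = ["C++", "library", "class", "variable"]

# Terms in the request get 1.0; terms only in the search target get 0.
query_terms = {"C++", "library"}
query = [1.0 if term in query_terms else 0.0 for term in vocabulary]

# Personal knowledge data At1 as given in the description.
at1 = [1.89, 2.62, 0.74, 1.19]

similarity = cosine_similarity(query, at1)
print(round(similarity, 2))  # 0.91
```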

The identified personal knowledge data itself may be extracted from the personal knowledge data storage unit 34, and other personal knowledge data similar to it may also be identified and extracted. For example, the personal knowledge data with the second-highest similarity may also be extracted by referring to the similarity values. The input/output data corresponding to the identified personal knowledge data may be extracted from the input/output data storage unit 31. Furthermore, user identification information identifying the user who corresponds to the identified personal knowledge data may be extracted. In the example above, information identifying user A (for example, user A's personal name) is extracted. Such a search may be performed when a person search request is made from the user terminal 14 at the time of the search request.

The extracted information is passed to the search result output processing unit 26 (search result output means), which sends it to the user terminal 14 that issued the search request.

[Operation]
Next, the operation of the information management device 2 described above will be explained with reference to FIGS. 6 to 11. FIG. 6 is a flowchart showing the information storage processing that the information management device performs on input/output data from the user terminals 11, 12, and 13 of users A, B, and C, and FIGS. 7 and 8 are flowcharts showing parts of that processing in detail. FIG. 9 is a flowchart showing the search processing that the information management device performs in response to a search request from the user terminal 14 of user D, and FIGS. 10 and 11 are flowcharts showing parts of that processing in detail. In particular, we consider the situation in which user A is interested in the programming language C++ and inputs and outputs data on that knowledge by reading documents and writing notes in order to learn it, and in which user D wants to know about "C++ and its libraries" and searches the personal knowledge of others. The explanation therefore also refers to FIGS. 3 to 5 described above.

First, the data input/output processing unit 21 of the information management device 2 saves the input/output data input or output at user A's user terminal 11 in the input/output data storage unit 31 (step S1). At the same time, the data input/output processing unit 21 saves the time data indicating when user A input or output that data in the time data storage unit 32 (step S2). As a result, as shown in FIG. 3, the IDs of the input/output data from user A's user terminal 11 are stored together with the time data indicating when each item was accessed.

Next, the feature value extraction processing unit 22 of the information management device 2 reads the input/output data saved in the storage unit 30 and extracts its feature values (step S3, feature value extraction step). This extraction processing is described in detail with reference to FIG. 7. When the input/output data is text information, the feature value extraction processing unit 22 weights the input/output data Aj (j >= 1) of a given period using the well-known TF/IDF technique. Specifically, as shown in FIG. 4, morphological analysis is applied to the input/output data A3 to extract content words such as nouns, verbs, and adjectives (step S11). The frequency of occurrence of these content words among the extracted words is counted (step S12), and the TF/IDF value of each content word is calculated from its frequency of occurrence in each input/output data item (step S13). The feature words with the top four TF/IDF values, "C++", "library", "class", and "variable", are then extracted (step S14), their TF/IDF values are extracted, and the feature words together with their TF/IDF values are taken as the feature values (step S15).

Next, the personal knowledge generation processing unit 23 uses the feature values of the input/output data obtained by the feature value extraction processing unit 22 and the time data saved in the time data storage unit 32 of the storage unit 30 to generate the representative features shared by input/output data that are close in time, as personal knowledge data representing the individual's interests and expertise during that period (step S4, personal knowledge data generation step). This operation is described in further detail with reference to FIG. 8.

The personal knowledge generation processing unit 23 first reads the period setting data from the period setting data storage unit 33 (step S21) and, on that basis, sets the period of input/output activity for which personal knowledge data is to be generated (step S22). In this case, as shown in FIG. 5(a), a period t1 is set, and the features of the input/output data A1 to A3 become the target for generating personal knowledge data. The TF/IDF values of the top four feature words of each input/output data item are treated as a feature word vector, and the sum of these vectors is computed and taken as the personal knowledge data (step S23). Specifically, letting At1 be the sum of the feature word vectors within period t1, At1 = A1 + A2 + A3 = (0.96, 0.84, 0, 0) + (0, 0.92, 0, 0.64) + (0.93, 0.86, 0.74, 0.55) = (1.89, 2.62, 0.74, 1.19). Personal knowledge data is created in the same way for each period tk (k >= 1).
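The summation in step S23 can be checked with a few lines of code. This sketch assumes the term order (C++, library, class, variable) and reads A1 as (0.96, 0.84, 0, 0) and A2 as (0, 0.92, 0, 0.64), which is the only reading consistent with the stated total; the variable names are hypothetical.

```python
# Term order assumed for all vectors.
vocabulary = ["C++", "library", "class", "variable"]

# Feature word vectors of the input/output data in period t1.
a1 = [0.96, 0.84, 0.0, 0.0]
a2 = [0.0, 0.92, 0.0, 0.64]
a3 = [0.93, 0.86, 0.74, 0.55]

# Component-wise sum; rounding to two decimals only hides floating-point noise.
at1 = [round(sum(component), 2) for component in zip(a1, a2, a3)]
print(at1)  # [1.89, 2.62, 0.74, 1.19]
```

Dividing each component by the number of items in the period would give the averaged variant that the description mentions as an alternative.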

The generated personal knowledge data is then saved in the personal knowledge data storage unit 34 (step S5). Personal knowledge data for each period is created and saved in the personal knowledge data storage unit 34 in the same way not only for user A but also for users B, C, and so on.

Next, the search processing performed by the information management device 2 when a search request is made from user D's user terminal 14 will be described with reference to FIG. 9. When a search request is made from the user terminal 14, the search request receiving unit 24 accepts it (step S41) and notifies the search processing unit 25. Here we consider the case in which user D sends the search request "C++ and its libraries".

Next, the search processing unit 25 analyzes the content of the search request (step S42). For example, the search request data is subjected to morphological analysis, and a value of "1.0" is assigned to the extracted content words "C++" and "library". At the same time, a value of "0" is assigned to words that are contained in the search target but not in the search request. As a result, a feature word vector for the search request such as (C++, library, class, variable) = (1.0, 1.0, 0, 0) is created.

On this basis, the personal knowledge data storage unit is searched (step S43). Specifically, as shown in FIG. 10, the similarity between the feature word vector of the search request data and each item of personal knowledge data is calculated. For example, when the vector space method is used, the similarity between the personal knowledge data At1 shown in FIG. 5(a) and the search request is "0.91". The similarity to the search request data is calculated in this way for the personal knowledge data of each period of user A, user B, user C, and so on. From the result, the personal knowledge data most similar to the search request is identified (step S52). For example, if the personal knowledge data At1 is determined to have the highest similarity, At1 is extracted as the search result (step S44). The detailed operation of step S44 is also shown in FIG. 11(a) (step S61). The extracted personal knowledge data At1 is then output to user D's user terminal 14 (step S45). The data output as the search result is not limited to At1; personal knowledge data with the second- or third-highest similarity to the search request data may also be extracted and output. Furthermore, the output is not limited to personal knowledge data: the input/output data on which it is based may be extracted from the input/output data storage unit 31 (see step S71 in FIG. 11(b)) and output to user D's user terminal 14.
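The ranking in steps S43 and S44 can be sketched as a sort over all stored personal knowledge data. This is an illustration under assumptions: cosine similarity stands in for the vector space method, the function names `cosine` and `search` are hypothetical, and the vectors for users B and C are made up purely to give the ranking something to compare against.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def search(query, knowledge, top_k=1):
    """Return the top_k (similarity, user, period) entries, most similar first."""
    ranked = sorted(
        ((cosine(query, vec), user, period)
         for (user, period), vec in knowledge.items()),
        reverse=True,
    )
    return ranked[:top_k]

# Personal knowledge data keyed by (user, period); term order is
# (C++, library, class, variable).
knowledge = {
    ("A", "t1"): [1.89, 2.62, 0.74, 1.19],  # At1 from the description
    ("B", "t1"): [0.0, 0.3, 1.5, 0.2],      # made-up values
    ("C", "t1"): [0.4, 0.0, 0.0, 1.1],      # made-up values
}
query = [1.0, 1.0, 0.0, 0.0]

results = search(query, knowledge, top_k=2)
best_user = results[0][1]
```

Because each entry carries its user and period, the same result can serve both the knowledge search and the person search: the top entry's user is the person to report.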

If the request from user D is not a request for personal knowledge data alone but a request to search for "a person who is knowledgeable" about "C++ and its libraries", then, as described above, after the personal knowledge data At1 is identified, user A, who input or output the input/output data on which that personal knowledge data is based, is identified (see step S81 in FIG. 11(c)), and the personal name "A" is output to user D's user terminal 14 as the search result.

In this way, the information management device 2 uses time information to extract each individual's interests and expertise period by period, so information that was input or output intensively during a certain past period can be retrieved. It is therefore possible to search for other individuals who were experts with particular interests or expertise extracted from such information, for example on a theme handled only during a certain past period, and to retrieve their information. Likewise, it is possible to search for what interests and expertise other individuals who currently share one's interests or expertise had in the past.

Furthermore, because time information is used to extract each individual's interests and expertise period by period, and the transition of an individual's interests and expertise from the past to the present can be searched, a searcher looking for desired information can retrieve it from the information held by a person whose transition of interests and expertise is closer to the searcher's own, which promotes the reuse of information.

[Modification]
Next, a modification of the information management device 2 configured as above will be described with reference to FIG. 12. In this modification, the basic configuration is the same as that of the information management device 2 described above, but the method of creating personal knowledge data during information storage processing differs. That is, the personal knowledge generation processing unit 23 has the functions described below, and the operation of step S4 in FIG. 6 becomes as shown in the flowchart of FIG. 12.

[Configuration]
First, the personal knowledge generation processing unit 23 in this modification has a similarity calculation function (similarity calculation means) that calculates the similarity between the feature values of a plurality of input/output data items. On the basis of the calculated similarity, it generates personal knowledge data using the feature values of those input/output data items whose similarity is at or above a predetermined value. In this embodiment in particular, because personal knowledge data is generated from the feature values of the input/output data within the set period as described above, the personal knowledge data is generated from the feature values of input/output data that are both highly similar to one another and close in input/output time.

[Operation]
The specific processing is described with reference to the flowchart of FIG. 12. First, once the feature values of each input/output data item have been calculated, the similarity between feature values is calculated (step S31). Any method may be used to calculate the similarity; for example, it may be determined from the overlap between sets of feature words. Alternatively, using a well-known technique such as the Vector Space Model (vector space method), the features of an item of information can be expressed as a word matrix vector (feature word vector) based on the words that appear in its text, and the degree of similarity between two items can be obtained from the distance between their feature word vectors, their inner product, or the like. The vector space method is described in detail in, for example, chapter 10 of Salton, G. (1989), Automatic Text Processing, Addison-Wesley Publishing Company. It is also possible to use another well-known technique such as Latent Semantic Space (latent semantic space method), in which the feature word vectors are given a low-rank approximation by singular value decomposition so that the calculation is performed with vectors of reduced dimension. The latent semantic space method is described in, among others, Deerwester, S. et al. (1990), "Indexing by latent semantic analysis", Journal of the American Society for Information Science, 41(7), 391-407, so a detailed explanation is omitted.

Using the calculated similarity, the feature values of input/output data that are highly similar to one another are extracted (step S32), and personal knowledge data is generated from the feature values of those input/output data that are also close in time (step S35). That is, as described above, the period setting data is read and a given period is set (step S34), and personal knowledge data is generated that represents the features of only those feature values, among the input/output data within that period, that have high similarity.
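The modification can be sketched as a filter applied before the summation. The description does not specify how mutual similarity is decided, so this sketch makes several assumptions: cosine similarity is the measure, an item is kept when it is similar enough to at least one other item in the same period, and the threshold of 0.7 is arbitrary. The function names and the third sample vector are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def knowledge_from_similar(vectors, threshold):
    """Sum only the vectors that are similar to at least one other vector.

    vectors holds the feature word vectors of one period; returns None when
    no vector passes the (assumed) similarity criterion.
    """
    kept = [
        v for i, v in enumerate(vectors)
        if any(cosine(v, w) >= threshold
               for j, w in enumerate(vectors) if j != i)
    ]
    if not kept:
        return None
    # Rounding to two decimals only hides floating-point noise.
    return [round(sum(component), 2) for component in zip(*kept)]

# Term order: (C++, library, class, variable).
period_vectors = [
    [0.96, 0.84, 0.0, 0.0],
    [0.93, 0.86, 0.74, 0.55],
    [0.0, 0.0, 0.0, 0.9],    # hypothetical off-topic item in the same period
]
result = knowledge_from_similar(period_vectors, 0.7)
```

Here the off-topic item is excluded, so the resulting personal knowledge data reflects only the two mutually similar items.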

Next, a second embodiment of the present invention will be described with reference to FIGS. 13 to 17. FIG. 13 is a functional block diagram showing the configuration of the information management device. FIGS. 14 and 15 are explanatory diagrams showing how information is processed in the information management device. FIGS. 16 and 17 are flowcharts showing the operation of the information management device.

[Configuration]
The information management device 102 in this embodiment basically has the same configuration as that of the first embodiment described above. Accordingly, the information management device 102 consists of an information processing apparatus such as one or more server computers, and is connected via a network N to user terminals 11, 12, and 13, which are computers operated by users A, B, and C who input and output data, for example by registering or browsing it. It is also connected via the network N to a user terminal 14, a computer operated by user D, who issues data search requests to the information management device 102.

The information management device 102 includes a computation unit 120 that performs information processing and a storage unit 130 that stores information. With the information management program installed, the computation unit 120 implements, as shown in FIG. 13, a data input/output processing unit 121, a feature value extraction processing unit 122, and a personal knowledge generation processing unit 123, and has the function of managing the input/output data from the user terminals 11, 12, and 13 on the basis of its features. In this embodiment, the method of generating personal knowledge data from the input/output data differs from that of the first embodiment, so the functions of the processing units 121, 122, and 123 differ. Accordingly, the storage unit 130 contains an input/output data storage unit 131, an input/output classification data storage unit 132, a weighting setting data storage unit 133, and a personal knowledge data storage unit 134. These are described in detail below.

With the program installed, the computation unit 120 also implements a search request reception processing unit 124, a search processing unit 125, and a search result output processing unit 126, and has the function of performing search processing in response to data search requests from the user terminal 14. The functions of the processing units 124, 125, and 126 are almost the same as in the first embodiment, so their description is omitted.

The data input/output processing unit 121 has the function of accepting the input/output data that users input to or output from the information management device 102 via the user terminals 11, 12, and 13, together with input/output distinction data, which indicates whether each item is an input or an output and thus represents the input/output status of that data, and of storing them in the input/output data storage unit 131 and the input/output classification data storage unit 132 in the storage unit 130, respectively. The data input/output processing unit 121 also has the function of accepting reference status data, which represents how the input/output data refers to other data, and storing it in the input/output classification data storage unit 132.

Each of these data items is now described in detail. The input/output data is a user's personal input/output data, stored in advance in the information management apparatus 102, for which input and output can be distinguished, such as sent and received e-mail. The input/output distinction data indicates whether the input/output data is input data or output data. For example, a sent mail is stored with distinction data "1", indicating output data, and a received mail is stored with distinction data "0", indicating input data. FIG. 14 shows a storage example in which the IDs, input/output distinction data, and reference relationships of user A's input/output data are recorded. First, an ID is assigned to each item of input/output data and stored together with the content of the data. Then distinction data is assigned, "1" if user A's input/output data is output data and "0" if it is input data, and stored as shown in the figure.

The reference status data indicates reference relationships between items of input/output data. A reference relationship can be obtained, for example, by taking a reference cited in the output data, or a passage enclosed in quotation marks, and identifying the input data in which that content is written. For example, as shown in FIG. 14, when output data A3 was created by referring to input data A2 (see the arrow in FIG. 14), the reference relationship is attached to the data of A3 in the form "A3->A2" and stored.
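As an illustration only, the storage example of FIG. 14 could be modeled along the following lines. This is a minimal sketch: the class name `IORecord`, its field names, and the helper `referenced_ids` are hypothetical and do not appear in the specification, and the content strings are placeholders.

```python
from dataclasses import dataclass, field

# Distinction codes as used in the text: "0" = input data, "1" = output data.
INPUT, OUTPUT = 0, 1


@dataclass
class IORecord:
    """One stored item of input/output data (hypothetical layout)."""
    data_id: str                  # ID assigned to the item, e.g. "A3"
    content: str                  # stored content of the item
    io_flag: int                  # INPUT or OUTPUT
    references: list = field(default_factory=list)  # IDs this item refers to


def referenced_ids(records):
    """Return the IDs that at least one output item refers to."""
    return {ref for r in records if r.io_flag == OUTPUT for ref in r.references}


# User A's data as described for FIG. 14: A1 and A2 are input, A3 is output,
# and A3 refers to A2 (recorded in the text as "A3->A2").
records = [
    IORecord("A1", "placeholder content", INPUT),
    IORecord("A2", "placeholder content", INPUT),
    IORecord("A3", "placeholder content", OUTPUT, references=["A2"]),
]
```

Keeping the reference list on the referring item mirrors the text, which attaches the relationship to the data of A3 rather than to A2.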

The feature value extraction processing unit 122 operates in the same manner as described in the first embodiment. For example, it extracts words from each item of input/output data Aj by morphological analysis, counts the appearance frequency of content words such as nouns, verbs, and adjectives among the extracted words, and calculates the TF/IDF value of each content word from its appearance frequency in each item of input/output data. As shown in FIG. 4, described in the first embodiment, the top N feature words and their TF/IDF values are extracted as the feature values.
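The extraction step can be sketched roughly as follows. The exact TF/IDF formula is not given in this section, so the common form tf * log(N / df) is assumed here. Morphological analysis is also assumed to have been done already, so each item arrives as a list of content words. The function name `tfidf_top_n` and the sample words are hypothetical.

```python
import math
from collections import Counter


def tfidf_top_n(docs, n):
    """docs maps a data ID to its list of content words (already tokenized).

    Returns a dict mapping each data ID to its top-n (word, score) pairs,
    using the assumed weighting tf * log(N / df).
    """
    num_docs = len(docs)
    # Document frequency: in how many items each word appears.
    df = Counter(word for words in docs.values() for word in set(words))
    result = {}
    for data_id, words in docs.items():
        tf = Counter(words)
        scores = {w: tf[w] * math.log(num_docs / df[w]) for w in tf}
        # Highest score first; ties are broken alphabetically for stability.
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        result[data_id] = ranked[:n]
    return result


docs = {
    "A1": ["class", "class", "pointer"],
    "A2": ["pointer", "template"],
    "A3": ["class", "template", "template"],
}
top = tfidf_top_n(docs, 1)
```

With this toy input every word appears in two of the three items, so the ranking inside each item is decided by term frequency alone.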

Next, the personal knowledge generation processing unit 123 (personal knowledge data generation means) is described. The personal knowledge generation processing unit 123 generates personal knowledge data by weighting the feature values of the input/output data according to the input/output distinction data. For example, output data is weighted more heavily than input data. The input/output data is also weighted according to the reference status data. For example, a predetermined weight is added to input data that is referred to.

The specific processing by which the personal knowledge generation processing unit 123 creates personal knowledge data is described with reference to FIGS. 14 and 15. FIG. 15(a) shows an example of input/output data. As shown in FIG. 14, A1 and A2 are input data, A3 is output data, and A3 refers to A2. In this case, among the feature values of the input/output data, an arbitrary weight, for example w = 0.8 (0 <= w <= 1), is added to the feature value of the output data A3, and a weight of 1 - w = 0.2 is added to the feature values of the input data A1 and A2. A further weight of, for example, 0.5 is added to the input data A2, which A3 refers to, so that A2 receives a total weight of (1 - w) + 0.5 = 0.7. The weighted feature values of each item of input/output data are treated as a word feature vector, and the sum Aw of the word feature vectors over the items is taken. This gives Aw = (0.94, 1.50, 0.59, 0.89), which is generated as the personal knowledge data representing the representative features of user A's input/output data. The result reflects more strongly the features of the information that user A handled as output data.
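The weighting described above can be sketched as follows. This is a sketch under stated assumptions: the text says the weight is "added" to the feature value, and the code interprets this as scaling each item's feature vector by its weight before summing, which is one plausible reading. The feature vectors are made-up values, not those of FIG. 15, so the result is not expected to reproduce Aw = (0.94, 1.50, 0.59, 0.89). The function names are hypothetical.

```python
def document_weight(io_flag, is_referenced, w=0.8, ref_bonus=0.5):
    """Weight of one item: w for output data, 1 - w for input data,
    plus ref_bonus if some other item refers to it."""
    base = w if io_flag == 1 else 1.0 - w
    return base + (ref_bonus if is_referenced else 0.0)


def personal_knowledge(features, io_flags, references, w=0.8, ref_bonus=0.5):
    """Sum the weighted word feature vectors of all items.

    features:   data ID -> feature vector (same word order in every vector)
    io_flags:   data ID -> 1 for output data, 0 for input data
    references: data ID -> list of IDs that the item refers to
    """
    referenced = {ref for refs in references.values() for ref in refs}
    dim = len(next(iter(features.values())))
    total = [0.0] * dim
    for data_id, vec in features.items():
        weight = document_weight(io_flags[data_id], data_id in referenced,
                                 w, ref_bonus)
        for i, value in enumerate(vec):
            total[i] += weight * value  # assumed: weight scales the vector
    return total


# Hypothetical two-word feature vectors for A1, A2 (input) and A3 (output).
features = {"A1": [1.0, 0.0], "A2": [0.0, 1.0], "A3": [1.0, 1.0]}
io_flags = {"A1": 0, "A2": 0, "A3": 1}
references = {"A3": ["A2"]}  # "A3->A2"
aw = personal_knowledge(features, io_flags, references)
```

With w = 0.8 and a reference weight of 0.5, the three items receive weights of 0.2, 0.7, and 0.8, matching the values in the text, and the toy vectors sum to approximately (1.0, 1.5).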

The personal knowledge data generated as described above is stored in the personal knowledge data storage unit 134. Personal knowledge data is generated and stored in the same way for the input/output data of other users.

The weighting values given above are only examples, and the invention is not limited to them. The method of calculating personal knowledge data described above adds a weight to the feature values of the input/output data based on the reference relationship, but this is not essential, and the weight based on the reference relationship may be omitted. Conversely, only the weight based on the reference relationship may be applied, without the weight based on the input/output distinction.

When another user issues a search request against the personal knowledge data generated and stored as described above, the information management apparatus 102 executes the search processing through the search request reception processing unit 124, the search processing unit 125, and the search result output processing unit 126. Since the functions of the processing units 124, 125, and 126 are almost the same as in the first embodiment, their description is omitted.

[Operation]
Next, the operation of the information management apparatus 102 is described with reference to FIGS. 16 and 17. FIG. 16 is a flowchart showing the information accumulation processing that the information management apparatus performs on the input/output data from the user terminals 11, 12, and 13 of users A, B, and C, and FIG. 17 is a flowchart showing part of that operation in detail. As in the first embodiment, the description covers the information accumulation processing when user A is interested in the programming language C++ and reads documents and writes memos in order to acquire knowledge.

First, the data input/output processing unit 121 of the information management apparatus 102 stores the input/output data that was input or output at user A's user terminal 11 in the input/output data storage unit 131 (step S101). At the same time, the data input/output processing unit 121 stores input/output distinction data, which indicates whether the data handled by user A was input or output, in the input/output distinction data storage unit 132 (step S102). The reference relationships of the input/output data are also stored in the input/output distinction data storage unit 132 at this time. FIG. 14 shows an example of data stored in this way. As described above, in this storage example an input/output data ID is assigned to each item of data that user A input or output, and the data content and the ID are stored. Distinction data is also assigned and stored: "1" if the data that user A accessed or saved is output data, and "0" if it is input data. Furthermore, the reference relationship indicating that output data A3 was created by referring to input data A2 is attached to A3 in the form "A3->A2" and stored. The input/output data is, for example, data for which input and output can be distinguished, such as sent and received e-mail.

Next, the feature value extraction processing unit 122 of the information management apparatus 102 reads the stored input/output data and extracts its feature values in the same way as in the first embodiment (step S103).

Next, the personal knowledge generation processing unit 123 uses the feature values of the input/output data obtained by the feature value extraction processing unit 122 and the input/output distinction data stored in the input/output distinction data storage unit 132 to generate, as personal knowledge data, representative features that express the individual's interests and expertise in a way that reflects the input/output status (step S104). This operation is described in more detail with reference to FIG. 17.

The personal knowledge generation processing unit 123 first reads the weighting setting data from the weighting setting data storage unit 133 (step S111) and reads the input/output distinction data and the reference status data from the input/output distinction data storage unit 132 (step S112). Based on these, it weights the feature values of each item of input/output data (step S113). In this embodiment, the settings give a higher weight to output data and also a higher weight to data that is a reference destination. Specifically, the weights are added as shown in FIG. 15(a), and the sum of the word feature vectors of the input/output data A1, A2, and A3 is calculated to generate the personal knowledge data shown as Aw in FIG. 15(b) (step S114).

The generated personal knowledge data is then stored in the personal knowledge data storage unit 134 (step S105). Personal knowledge data is created and stored in the personal knowledge data storage unit 134 in the same way not only for user A but also for users B, C, and so on.

The search processing that the information management apparatus 102 performs when a search request is later made from user D's user terminal 14 is the same as in the first embodiment, so its description is omitted.

In this way, the input/output distinction data is used to create personal knowledge data in which the knowledge derived from output data is weighted. The personal knowledge data therefore reflects more strongly the features of the information that the user handled as output data, so that the individual's interests and expertise can be extracted accurately in a later search. The reasoning is as follows. Input data, which an individual browses or copies, may have been merely looked at, and its relevance to the individual's interests and expertise may be low. Output data, which an individual creates or edits, cannot come into being unless the individual goes through steps of selection and creation, and is therefore considered to be strongly related to the individual's interests and expertise.

In addition, creating personal knowledge by weighting the input data that is in a reference relationship with output data makes it possible to store the individual's interests and expertise accurately, just as weighting the output data does. Input data that was consulted when creating output data was actually used by the individual to produce that output, unlike input data in general, which the individual may have merely looked at. Such input data is therefore considered, like the output data, to be strongly related to the individual's interests and expertise.

Next, a third embodiment of the present invention is described with reference to FIGS. 18 to 20. FIG. 18 is a functional block diagram showing the configuration of the information management apparatus. FIGS. 19 and 20 are flowcharts showing the operation of the information management apparatus.

[Configuration]
The information management apparatus 202 of this embodiment combines the processing units of the information management apparatuses 2 and 102 described in the first and second embodiments. The information management apparatus 202 includes a calculation unit 220 that performs arithmetic processing on information and a storage unit 230 that stores information. By incorporating an information management program into the calculation unit 220, a data input/output processing unit 221, a feature value extraction processing unit 222, and a personal knowledge generation processing unit 223 are constructed, as shown in FIG. 18, and the apparatus has a function of managing the input/output data from the user terminals 11, 12, and 13 on the basis of its features. In this embodiment, the method of generating personal knowledge data from input/output data differs, so the functions of the data input/output processing unit 221 and the personal knowledge generation processing unit 223 differ accordingly. The storage unit 230 therefore contains an input/output data storage unit 231, a time data and input/output distinction data storage unit 232, a period setting data and weighting setting data storage unit 233, and a personal knowledge data storage unit 234. These are described in detail below.

Incorporating the program into the calculation unit 220 also constructs a search request reception processing unit 224, a search processing unit 225, and a search result output processing unit 226, which provide a function of performing search processing in response to a data search request from the user terminal 14. Since the functions of the processing units 224, 225, and 226 are almost the same as in the first embodiment, their description is omitted.

The data input/output processing unit 221 receives input/output data and stores it in the input/output data storage unit 231. It also receives time data representing when the input/output data was input or output, input/output distinction data representing whether the data is input or output, and reference status data representing the reference relationships between items of input/output data, and stores them in the time data and input/output distinction data storage unit 232.

The personal knowledge generation processing unit 223 generates personal knowledge data on the basis of the time data and the input/output distinction data. When calculating the features of a plurality of items of input/output data that fall within a predetermined period, and are therefore close together in input/output time, it adds weights that take the input/output status into account, for example by weighting the output data.

[Operation]
Next, the operation of the information management apparatus 202 is described with reference to FIGS. 19 and 20. FIG. 19 is a flowchart showing the information accumulation processing that the information management apparatus performs on the input/output data from the user terminals 11, 12, and 13 of users A, B, and C, and FIG. 20 is a flowchart showing part of that operation in detail. As in the first and second embodiments, the description covers the information accumulation processing when user A is interested in the programming language C++ and reads documents and writes memos in order to acquire knowledge.

First, the data input/output processing unit 221 of the information management apparatus 202 stores the input/output data that was input or output at user A's user terminal 11 in the input/output data storage unit 231 (step S201). At the same time, the data input/output processing unit 221 stores time data representing when the input/output data was input or output, and input/output distinction data indicating whether the data handled by user A was input or output, in the time data and input/output distinction data storage unit 232 (step S202). The reference relationships of the input/output data are also stored in the time data and input/output distinction data storage unit 232 at this time.

Next, the feature value extraction processing unit 222 of the information management apparatus 202 reads the stored input/output data and extracts its feature values in the same way as in the first embodiment (step S203).

Next, the personal knowledge generation processing unit 223 uses the feature values of the input/output data obtained by the feature value extraction processing unit 222, together with the time data and input/output distinction data stored in the time data and input/output distinction data storage unit 232, to generate, as personal knowledge data, representative features that express the individual's interests and expertise according to the input/output status of the input/output data within a predetermined period (step S204). This operation is described in more detail with reference to FIG. 20.

The personal knowledge generation processing unit 223 first reads the period setting data from the period setting data and weighting setting data storage unit 233 (step S211) and, based on it, sets the period of input/output activity whose input/output data will be used to generate the personal knowledge data (step S212). Next, it reads the weighting setting data from the period setting data and weighting setting data storage unit 233 (step S213) and, based on it, weights the feature values of each item of input/output data within the set period, as described in the second embodiment (step S214). It then generates the personal knowledge data by calculating the sum of the weighted word feature vectors of the input/output data within the set period (step S215). This method of generating personal knowledge data is only an example, and the invention is not limited to it.
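Steps S211 to S215 can be sketched as a period filter followed by the weighted sum of the second embodiment. Several points are assumptions made for illustration: the record layout (plain dicts), the function name `knowledge_for_period`, the treatment of the weight as a scale factor on the feature vector, and the choice to count only references made by items inside the selected period. The dates and vectors are invented sample values.

```python
from datetime import datetime


def knowledge_for_period(records, start, end, w=0.8, ref_bonus=0.5):
    """Sum the weighted feature vectors of the items handled within [start, end].

    Each record is a dict with keys "id", "time" (datetime), "io" (1 = output,
    0 = input), "refs" (list of referenced IDs) and "vec" (feature vector).
    """
    # S212: keep only the items whose input/output time lies in the period.
    in_period = [r for r in records if start <= r["time"] <= end]
    if not in_period:
        return []
    # Assumed: only references made by items inside the period are counted.
    referenced = {ref for r in in_period for ref in r["refs"]}
    total = [0.0] * len(in_period[0]["vec"])
    for r in in_period:
        # S214: weight by input/output distinction and by reference status.
        weight = w if r["io"] == 1 else 1.0 - w
        if r["id"] in referenced:
            weight += ref_bonus
        # S215: accumulate the weighted word feature vector.
        for i, value in enumerate(r["vec"]):
            total[i] += weight * value
    return total


records = [
    {"id": "A1", "time": datetime(2004, 1, 5), "io": 0, "refs": [], "vec": [1.0, 0.0]},
    {"id": "A2", "time": datetime(2004, 11, 1), "io": 0, "refs": [], "vec": [0.0, 1.0]},
    {"id": "A3", "time": datetime(2004, 11, 3), "io": 1, "refs": ["A2"], "vec": [1.0, 1.0]},
]
november = knowledge_for_period(records, datetime(2004, 11, 1), datetime(2004, 11, 30))
```

In this sample A1 falls outside the period and is ignored, so only A2 (weight 0.7) and A3 (weight 0.8) contribute to the result.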

The generated personal knowledge data is then stored in the personal knowledge data storage unit 234 (step S205). Personal knowledge data is created and stored in the personal knowledge data storage unit 234 in the same way not only for user A but also for users B, C, and so on.

The search processing that the information management apparatus 202 performs when a search request is later made from user D's user terminal 14 is the same as in the first embodiment, so its description is omitted.

In this way, personal knowledge data is created that takes into account both the period in which the input/output data was input or output and the input/output distinction. It therefore reflects the characteristics of the user who handled the input/output data more closely and makes later use of the data easier. In particular, information that was input or output intensively during a certain past period can be retrieved, and the individual's interests and expertise can be extracted with higher accuracy.

The present invention has industrial applicability because it can be used as a database that accumulates and manages input/output data from users and allows that data to be searched.

FIG. 1 is a block diagram showing the overall configuration of the present invention.
FIG. 2 is a functional block diagram showing the configuration of the information management apparatus in the first embodiment.
FIG. 3 is an explanatory diagram showing an example of stored input/output data and time data.
FIG. 4 is an explanatory diagram showing an example of calculated feature values of input/output data.
FIG. 5 is an explanatory diagram showing an example of generating personal knowledge data. FIG. 5(a) shows an example of a plurality of temporally consecutive items of input/output data, and FIG. 5(b) shows an example of personal knowledge data generated from the input/output data of a predetermined period.
FIG. 6 is a flowchart showing the information accumulation processing performed by the information management apparatus in the first embodiment.
FIG. 7 is a flowchart showing the detailed operation of step S3 in FIG. 6.
FIG. 8 is a flowchart showing the detailed operation of step S4 in FIG. 6.
FIG. 9 is a flowchart showing the search processing performed by the information management apparatus in the first embodiment.
FIG. 10 is a flowchart showing the detailed operation of step S43 in FIG. 9.
FIGS. 11(a), 11(b), and 11(c) are flowcharts each showing the detailed operation of step S44 in FIG. 9.
FIG. 12 is a flowchart showing another example of the detailed operation of step S4 in FIG. 6.
FIG. 13 is a functional block diagram showing the configuration of the information management apparatus in the second embodiment.
FIG. 14 is an explanatory diagram showing an example of stored input/output data and input/output distinction data.
FIG. 15 is an explanatory diagram showing an example of generating personal knowledge data. FIG. 15(a) shows an example in which weights are applied to the input/output data based on the input/output distinction, and FIG. 15(b) shows an example of the generated personal knowledge data.
FIG. 16 is a flowchart showing the information accumulation processing performed by the information management apparatus in the second embodiment.
FIG. 17 is a flowchart showing the detailed operation of step S104 in FIG. 16.
FIG. 18 is a functional block diagram showing the configuration of the information management apparatus in the third embodiment.
FIG. 19 is a flowchart showing the information accumulation processing performed by the information management apparatus in the third embodiment.
FIG. 20 is a flowchart showing the detailed operation of step S204 in FIG. 19.
FIG. 21 is an explanatory diagram for explaining a problem in the conventional example.

Explanation of symbols

2 Information management apparatus
11, 12, 13, 14 User terminals
21, 121 Data input/output processing unit
22, 122 Feature value extraction processing unit (feature value extraction means)
23, 123 Personal knowledge generation processing unit (personal knowledge data generation means)
24, 124 Search request reception processing unit (search means)
25, 125 Search processing unit (search means)
26, 126 Search result output processing unit (search result output means)
31, 131 Input/output data storage unit (input/output data storage means)
32 Time data storage unit (input/output data storage means)
33 Period setting data storage unit
34, 134 Personal knowledge data storage unit (information storage means)
132 Input/output distinction data storage unit (input/output data storage means)
133 Weighting setting data storage unit

Claims

1. An information management apparatus comprising:
input/output data storage means for storing input/output data input or output by users and input/output status data representing the input/output status of the input/output data;
feature value extraction means for extracting feature values of the input/output data based on a predetermined criterion; and
personal knowledge data generation means for generating, based on the feature values of the input/output data and the input/output status data, personal knowledge data representing the features of the input/output data for each user who input or output the input/output data.

The information management apparatus according to claim 1, wherein the feature value extraction unit extracts a feature value based on an appearance frequency of each term included in the input / output data.

3. The information management apparatus according to claim 1, wherein the feature value extraction unit extracts a feature value based on an appearance position of each term included in the input / output data.

4. The information management apparatus according to claim 1, wherein the feature value extracting unit extracts a feature value based on emphasis information of each term included in the input / output data.

5. The information management apparatus according to claim 1, wherein the input / output status data is data representing a time when the input / output data is input / output.

6. The information management according to claim 5, wherein the personal knowledge data generating means generates the personal knowledge data based on a plurality of the input / output data input / output within a predetermined period. apparatus.

7. The information management apparatus according to claim 1, wherein the input / output status data is input / output distinction data representing input / output distinction of the input / output data.

The information management apparatus according to claim 7, wherein the personal knowledge data generating unit generates the personal knowledge data by weighting the input / output data according to the input / output distinction data.

9. The information management device according to claim 8, wherein the personal knowledge data generation means weights output data more heavily than input data among the input/output data.
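
Weighting by input/output distinction, with output data weighted more heavily than input data, could be sketched as follows. The numeric weights, the record keys (`user`, `direction`, `features`), and the function name `weighted_profiles` are assumptions; the claims require only that the weight for output data be the higher of the two.

```python
# Hypothetical weights: the only stated requirement is output > input.
DIRECTION_WEIGHT = {"input": 1.0, "output": 2.0}


def weighted_profiles(records):
    """Build {user: {term: score}} from a list of records.

    Each record is a dict with the hypothetical keys "user",
    "direction" ("input" or "output") and "features" ({term: score}).
    """
    profiles = {}
    for rec in records:
        weight = DIRECTION_WEIGHT[rec["direction"]]
        profile = profiles.setdefault(rec["user"], {})
        for term, score in rec["features"].items():
            # Scale each feature value by the direction weight.
            profile[term] = profile.get(term, 0.0) + weight * score
    return profiles
```

Under this sketch, a document a user wrote contributes twice as much to that user's profile as a document the user merely received.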

10. The information management device according to claim [...], wherein the input/output status data is reference status data representing the status of references to other data in the input/output data.

11. The information management device according to claim 10, wherein the personal knowledge data generation means generates the personal knowledge data by weighting the input/output data according to the reference status data.

12. The information management device according to claim 11, wherein the personal knowledge data generation means applies a weight to data that is the reference destination of predetermined data among the input/output data.
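
One possible reading of weighting by reference status is that data referred to by other data receives a larger weight. The sketch below follows that reading; the bonus value, the base weight of 1.0, the shape of the `references` mapping, and the function name `reference_weights` are all assumptions not found in the claims.

```python
REFERENCE_BONUS = 0.5  # hypothetical; no value is given in the claims


def reference_weights(references):
    """Map each data ID to a weight that grows with how often it is referenced.

    `references` maps a data ID to the list of data IDs it refers to.
    Every item starts at a base weight of 1.0 (an assumption).
    """
    weights = {data_id: 1.0 for data_id in references}
    for targets in references.values():
        for target in targets:
            # Each incoming reference raises the target's weight.
            weights[target] = weights.get(target, 1.0) + REFERENCE_BONUS
    return weights
```

The resulting weights could then be multiplied into the feature values of each piece of data before they are accumulated per user.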

13. The information management device according to claim [...], 6, 7, 8, 9, 10, 11, or 12, further comprising similarity calculation means for calculating the similarity between the feature values of a plurality of pieces of the input/output data,
wherein the personal knowledge data generation means generates the personal knowledge data for a plurality of pieces of the input/output data whose similarity is equal to or greater than a predetermined value.
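
Selecting pieces of input/output data whose similarity is equal to or greater than a predetermined value could be sketched as below. Cosine similarity and the greedy grouping strategy are choices made here for illustration; the claims do not name a similarity measure or a grouping procedure, and the function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two {term: score} dicts."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def group_similar(feature_list, threshold):
    """Greedily group feature dicts.

    An item joins the first existing group whose first member it
    matches with similarity >= threshold; otherwise it starts a new group.
    """
    groups = []
    for features in feature_list:
        for group in groups:
            if cosine_similarity(group[0], features) >= threshold:
                group.append(features)
                break
        else:
            groups.append([features])
    return groups
```

Each resulting group could then be merged into one entry of personal knowledge data, so that a user's profile is organized by topic rather than by individual document.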

14. The information management device according to claim 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13, further comprising information storage means for storing the personal knowledge data generated by the personal knowledge data generation means.

15. The information management device according to claim 14, further comprising search means for accepting, from another user, search request data including data corresponding to the input/output data, and for searching the personal knowledge data stored in the information storage means based on the search request data.

16. The information management device according to claim 15, wherein the search means searches, based on a predetermined criterion, for other personal knowledge data similar to the retrieved personal knowledge data.

17. The information management device according to claim 15 or 16, further comprising search result output means for outputting the personal knowledge data retrieved by the search means.

18. The information management device according to claim [...] or 17, further comprising search result output means for reading, from the input/output data storage means, the input/output data corresponding to the personal knowledge data retrieved by the search means, and outputting that input/output data.

19. The information management device according to claim 15, 16, 17, or 18, further comprising search result output means for outputting user identification information that identifies the user corresponding to the personal knowledge data retrieved by the search means.
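
Searching stored personal knowledge data with a request from another user, and outputting information that identifies the matching users, could be sketched as follows. The scoring rule (summing profile scores over the query terms), the `top_n` parameter, and the function name `search_users` are assumptions made for illustration.

```python
def search_users(profiles, query, top_n=3):
    """Rank users by how strongly their profile matches the query terms.

    `profiles` maps a user ID to {term: score}; `query` is free text
    split on whitespace (an assumption). Returns up to `top_n`
    (user_id, score) pairs, best match first, omitting users whose
    profile shares no term with the query.
    """
    query_terms = set(query.lower().split())
    hits = []
    for user_id, profile in profiles.items():
        score = sum(profile.get(term, 0.0) for term in query_terms)
        if score > 0.0:
            hits.append((user_id, score))
    # Highest score first; ties are broken by user ID for stable output.
    hits.sort(key=lambda hit: (-hit[1], hit[0]))
    return hits[:top_n]
```

Returning user IDs rather than documents corresponds to the expert search use case: the requester learns who holds the knowledge, not only which data contains it.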

20. An information management method executed by a computer connected to input/output data storage means for storing input/output data input or output by a user and input/output status data representing the input/output status of the input/output data, the method comprising:
a feature value extraction step of extracting a feature value of the input/output data based on a predetermined criterion; and
a personal knowledge data generation step of generating, based on the feature value of the input/output data and the input/output status data stored in the input/output data storage means, personal knowledge data representing the characteristics of the input/output data for each user who input or output the input/output data.

21. The information management method according to claim 20, wherein the feature value extraction step extracts the feature value based on at least one of the appearance frequency, the appearance position, and the emphasis information of each term included in the input/output data.

22. The information management method according to claim [...], wherein the input/output status data is data representing the time at which the input/output data was input or output, and
the personal knowledge data generation step generates the personal knowledge data based on a plurality of pieces of the input/output data input or output within a predetermined period.

23. The information management method according to claim 20, wherein the input/output status data is input/output distinction data indicating whether the input/output data was input or output, and
the personal knowledge data generation step generates the personal knowledge data by weighting the input/output data according to the input/output distinction data.

24. The information management method according to claim 20, 21, 22, or 23, wherein the personal knowledge data generation step calculates the similarity between the feature values of a plurality of pieces of the input/output data, and generates the personal knowledge data for a plurality of pieces of the input/output data whose similarity is equal to or greater than a predetermined value.

25. An information management program for causing a computer, connected to input/output data storage means for storing input/output data input or output by a user and input/output status data representing the input/output status of the input/output data, to realize:
feature value extraction means for extracting a feature value of the input/output data based on a predetermined criterion; and
personal knowledge data generation means for generating, based on the feature value of the input/output data and the input/output status data stored in the input/output data storage means, personal knowledge data representing the characteristics of the input/output data for each user who input or output the input/output data.

26. The information management program according to claim 25, for causing the computer to realize the feature value extraction means, which extracts the feature value based on at least one of the appearance frequency, the appearance position, and the emphasis information of each term included in the input/output data.

27. The information management program according to claim 25 or 26, for causing the computer to realize the personal knowledge data generation means, which, when the input/output status data is data representing the time at which the input/output data was input or output, generates the personal knowledge data based on a plurality of pieces of the input/output data input or output within a predetermined period.

28. The information management program according to claim 25, 26, or 27, for causing the computer to realize the personal knowledge data generation means, which, when the input/output status data is input/output distinction data indicating whether the input/output data was input or output, generates the personal knowledge data by weighting the input/output data according to the input/output distinction data.

29. The information management program according to claim 25, 26, 27, or 28, for causing the computer to realize:
similarity calculation means for calculating the similarity between the feature values of a plurality of pieces of the input/output data; and
the personal knowledge data generation means, which generates the personal knowledge data for a plurality of pieces of the input/output data whose similarity is equal to or greater than a predetermined value.