JP2011175362A

JP2011175362A - Information processing apparatus, importance level calculation method, and program

Info

Publication number: JP2011175362A
Application number: JP2010037469A
Authority: JP
Inventors: Mitsuhiro Miyazaki; 充弘宮嵜
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2010-02-23
Filing date: 2010-02-23
Publication date: 2011-09-08
Also published as: CN102163211A; US20110208750A1; US8234311B2

Abstract

<P>PROBLEM TO BE SOLVED: To flexibly evaluate importance levels about various combinations between attributes from attributes of contents. <P>SOLUTION: The information processing apparatus includes: a memory holding an attribute table storing an attribute value imparted to each content about the plurality of contents; and an importance level calculation unit calculating the importance levels of other one or more attributes to a prescribed attribute of the content by use of the attribute values stored in the attribute table. The importance level calculation unit calculates the importance level by use of a determination table wherein the prescribed attribute is set as a determination attribute and wherein the one or more attributes are set as condition attributes. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、情報処理装置、重要度算出方法及びプログラムに関する。 The present invention relates to an information processing apparatus, an importance calculation method, and a program.

近年、情報通信技術の発展に伴い、音楽、映像、電子書籍、ニュース記事、商品情報又はイベント情報などの様々なコンテンツがネットワークを介してユーザに提供されている。このような膨大なコンテンツから個々のユーザが自己に見合った情報を探し出すことは容易でない。そのため、ユーザがコンテンツを探し出すことを支援するキーワード検索又はジャンル検索などの検索サービス、及びシステムがユーザにふさわしいコンテンツを推薦する推薦サービスなどが提供されている。例えば、下記特許文献１は、多くのコンテンツ推薦サービスに取り入れられている協調フィルタリングと呼ばれる手法について記述している。また、下記特許文献２は、ベクトル空間法によるマッチングを利用して推薦すべきＴＶ番組を決定する手法について記述している。 In recent years, with the development of information communication technology, various contents such as music, video, electronic books, news articles, product information or event information are provided to users via a network. It is not easy for individual users to search for information suitable for themselves from such a vast amount of content. Therefore, there are provided search services such as keyword search or genre search that assist the user in finding content, and recommendation services in which the system recommends content suitable for the user. For example, Patent Document 1 below describes a technique called collaborative filtering that is incorporated in many content recommendation services. Patent Document 2 below describes a method for determining a TV program to be recommended using matching based on a vector space method.

上述したようなコンテンツ検索サービス、コンテンツ推薦サービス及びその他のコンテンツ関連サービスにおいてサービスの有効性を左右する鍵となるのが、コンテンツの属性の取り扱いである。一般的に、コンテンツの属性には、コンテンツに人為的に付与される属性、コンテンツデータを解析することにより生成される属性、コンテンツに対するユーザアクションに基づいて算出される属性など、様々な種類のものがある。今日では、ユーザに提供されるコンテンツが増加しただけではなく、サービスが取り扱うべきコンテンツの属性の種類も多くなっている。そこで、コンテンツの属性の中から、コンテンツの分類、検索又は推薦等のために重要な属性を効率的に抽出することのできる技術が注目されている。例えば、下記特許文献３は、カテゴリ分類されるデータについての複数の属性の中から重要な属性を抽出するための技術の一例を記述している。 In the content search service, content recommendation service, and other content-related services as described above, the key to the effectiveness of the service is the handling of content attributes. In general, there are various types of content attributes, such as attributes that are artificially given to content, attributes that are generated by analyzing content data, and attributes that are calculated based on user actions on content. There is. Today, not only the content provided to users has increased, but the types of content attributes that the service should handle are also increasing. Therefore, a technique that can efficiently extract important attributes for content classification, search, recommendation, and the like from content attributes has attracted attention. For example, Patent Document 3 below describes an example of a technique for extracting an important attribute from a plurality of attributes for data classified into categories.

特開２００７−３２３３１５号公報JP 2007-323315 A 特開２００７−２００３３９号公報Japanese Patent Laid-Open No. 2007-200339 特開平９−３２５９６９号公報JP 9-325969 A

しかしながら、上記特許文献３に記載された技術では、個々の属性ごとに重要性が判定されるため、相互に関連する属性の組合せについての重要性を認識することができない。例えば、音楽コンテンツの属性の例として「ジャンル」及び「年代」が挙げられる。「ジャンル」についての属性値は、「ロック」、「ポップ」又は「クラシック」などである。また、「年代」についての属性値は、「１９７０年代」、「１９８０年代」又は「１９９０年代」などである。このような場合に、同じ「ジャンル」（＝例えば「ロック」）であっても、どの「年代」の「ロック」であるかがユーザにとって異なる意味を持つケースは少なくない。即ち、一般的に、多数のコンテンツの属性の中から属性の様々な組合せについての重要度を柔軟に評価することができれば、コンテンツ推薦の際の推薦理由の呈示、コンテンツ検索のためのジャンルの付与など、その重要度を様々な用途に活用できるものと期待される。 However, in the technique described in Patent Document 3, since importance is determined for each attribute, it is not possible to recognize the importance of a combination of attributes that are related to each other. For example, “genre” and “age” are examples of attributes of music content. The attribute value for “genre” is “rock”, “pop”, “classic”, or the like. The attribute value for “age” is “1970s”, “1980s”, “1990s”, or the like. In such a case, even if the “genre” (= “rock”, for example) is the same, the “rock” of which “age” has a different meaning for the user. That is, in general, if the importance of various combinations of attributes among a large number of content attributes can be flexibly evaluated, the reason for recommendation at the time of content recommendation and the addition of a genre for content search are provided. It is expected that the importance can be used for various purposes.

そこで、本発明は、コンテンツの属性の中から属性の様々な組合せについての重要度を柔軟に評価することのできる、新規かつ改良された情報処理装置、重要度算出方法及びプログラムを提供しようとするものである。 Therefore, the present invention aims to provide a new and improved information processing apparatus, importance calculation method, and program capable of flexibly evaluating the importance of various combinations of attributes among the attributes of content. Is.

本発明のある実施形態によれば、複数のコンテンツについて、各コンテンツに付与される属性値を記憶する属性テーブルを保持する記憶部と、上記属性テーブルに記憶されている属性値を用いて、コンテンツの所定の属性に対する他の１つ以上の属性の重要度を算出する重要度算出部と、を備え、上記重要度算出部は、上記重要度を、上記所定の属性を決定属性とし、上記１つ以上の属性を条件属性とする決定表を用いて算出する、情報処理装置が提供される。 According to an embodiment of the present invention, for a plurality of contents, a content is stored using a storage unit that stores an attribute table that stores attribute values assigned to each content, and attribute values stored in the attribute table. An importance calculating unit that calculates the importance of one or more other attributes with respect to the predetermined attribute, wherein the importance calculating unit uses the predetermined attribute as the determination attribute, There is provided an information processing apparatus that calculates using a decision table having two or more attributes as condition attributes.

かかる構成によれば、上記１つ以上の属性を条件属性のセットとして、様々な組合せについて、決定表を用いて上記所定の属性に対する重要度が算出され得る。 According to such a configuration, the importance for the predetermined attribute can be calculated for various combinations using the one or more attributes as a set of condition attributes using the determination table.

また、上記重要度算出部は、上記重要度を、上記決定表について上記決定属性の正領域を形成するコンテンツの数に基づいて算出してもよい。 In addition, the importance calculation unit may calculate the importance based on the number of contents forming a positive region of the determination attribute for the determination table.

また、上記情報処理装置は、上記重要度算出部により算出される上記重要度に応じて、コンテンツに関するユーザに呈示すべき情報を生成するために使用される１つ以上の重要属性を抽出する抽出部、をさらに備えてもよい。 In addition, the information processing apparatus extracts one or more important attributes used to generate information about the content to be presented to the user according to the importance calculated by the importance calculation unit. May be further provided.

また、上記情報処理装置は、上記属性テーブルに記憶されている属性値を用いてユーザに推薦すべきコンテンツを選択する推薦部であって、上記抽出部により抽出される上記１つ以上の重要属性に基づいて推薦の理由を生成する推薦部、をさらに備えてもよい。 The information processing apparatus is a recommendation unit that selects content to be recommended to a user using an attribute value stored in the attribute table, and the one or more important attributes extracted by the extraction unit A recommendation unit that generates a reason for recommendation based on the information may be further provided.

また、上記推薦部は、コンテンツの選択のために算出したコンテンツごとのスコアを上記属性テーブルにさらに記憶させ、上記抽出部は、上記スコアを決定属性として上記重要度算出部により算出される上記重要度に応じて、上記１つ以上の重要属性を抽出してもよい。 Further, the recommendation unit further stores a score for each content calculated for content selection in the attribute table, and the extraction unit calculates the importance calculated by the importance calculation unit using the score as a determination attribute. Depending on the degree, the one or more important attributes may be extracted.

また、上記属性テーブルは、各コンテンツに対するユーザのフィードバックに基づいて付与されるフィードバック属性の属性値をさらに記憶し、上記推薦部は、上記フィードバック属性の属性値を用いて、ユーザに推薦すべきコンテンツを選択し、上記抽出部は、上記フィードバック属性を決定属性として上記重要度算出部により算出される上記重要度に応じて、上記１つ以上の重要属性を抽出してもよい。 Further, the attribute table further stores attribute values of feedback attributes given based on user feedback for each content, and the recommendation unit uses the attribute values of the feedback attributes to recommend content to the user The extraction unit may extract the one or more important attributes according to the importance calculated by the importance calculation unit using the feedback attribute as a determination attribute.

また、上記属性テーブルは、各コンテンツに対するユーザアクションの状況に基づいて付与されるコンテキスト属性の属性値をさらに記憶し、上記推薦部は、上記コンテキスト属性の属性値を用いて、ユーザに推薦すべきコンテンツを選択し、上記抽出部は、上記コンテキスト属性を決定属性として上記重要度算出部により算出される上記重要度に応じて、上記１つ以上の重要属性を抽出してもよい。 The attribute table further stores an attribute value of a context attribute given based on a user action status for each content, and the recommendation unit should recommend to the user using the attribute value of the context attribute The content may be selected, and the extraction unit may extract the one or more important attributes according to the importance calculated by the importance calculation unit using the context attribute as a determination attribute.

また、上記属性テーブルは、各コンテンツに付与される基本属性の属性値に加えて、当該基本属性の属性値を解析することにより得られる拡張属性の属性値を記憶してもよい。 The attribute table may store an attribute value of an extended attribute obtained by analyzing the attribute value of the basic attribute in addition to the attribute value of the basic attribute given to each content.

また、上記抽出部は、上記所定の属性を決定属性とし、上記拡張属性に含まれる１つ以上の属性を条件属性として上記重要度算出部により算出される第１の重要度と、上記拡張属性に含まれる属性を決定属性とし、上記基本属性に含まれる１つ以上の属性を条件属性として上記重要度算出部により算出される第２の重要度とに応じて、上記１つ以上の重要属性を抽出してもよい。 In addition, the extraction unit includes the first importance calculated by the importance calculation unit using the predetermined attribute as a determination attribute and one or more attributes included in the extended attribute as a condition attribute, and the extended attribute. The one or more important attributes according to the second importance calculated by the importance calculating unit using the attribute included in the basic attribute as a decision attribute and one or more attributes included in the basic attribute as a condition attribute May be extracted.

また、上記情報処理装置は、ＰＬＳＡ（Probabilistic Latent Semantic Analysis）又はＬＤＡ（Latent Dirichlet Allocation）による確率的分類法に従って、上記基本属性の属性値に基づいて上記拡張属性の属性値を算出する解析部、をさらに備えてもよい。 Further, the information processing apparatus includes an analysis unit that calculates an attribute value of the extended attribute based on an attribute value of the basic attribute according to a probabilistic classification method based on PLSA (Probabilistic Latent Semantic Analysis) or LDA (Latent Dirichlet Allocation), May be further provided.

また、上記情報処理装置は、上記属性テーブルに記憶されている属性値を用いて再生すべきコンテンツのリストを生成するコンテンツリスト生成部であって、上記抽出部により抽出される上記１つ以上の重要属性に基づいてコンテンツリストのタイトルを生成するコンテンツリスト生成部、をさらに備え、上記抽出部は、上記コンテンツリスト生成部による上記コンテンツリストの生成のために使用された属性を決定属性として上記重要度算出部により算出される上記重要度に応じて、上記１つ以上の重要属性を抽出してもよい。 The information processing apparatus may be a content list generation unit that generates a list of contents to be played back using the attribute values stored in the attribute table, and the one or more extracted by the extraction unit A content list generation unit that generates a title of the content list based on the important attribute, and the extraction unit uses the attribute used for generation of the content list by the content list generation unit as the determination attribute. The one or more important attributes may be extracted according to the importance calculated by the degree calculation unit.

また、上記情報処理装置は、ユーザの指定に応じて再生すべきコンテンツのリストを生成するコンテンツリスト生成部であって、上記抽出部により抽出される上記１つ以上の重要属性に基づいてコンテンツリストのタイトルを生成するコンテンツリスト生成部、をさらに備え、上記抽出部は、ユーザによる指定の有無に応じて属性値が決定される属性を決定属性として上記重要度算出部により算出される上記重要度に応じて、上記１つ以上の重要属性を抽出してもよい。 The information processing apparatus may be a content list generation unit that generates a list of contents to be played according to a user's specification, and the content list is based on the one or more important attributes extracted by the extraction unit A content list generation unit that generates a title of the image, and the extraction unit calculates the importance calculated by the importance calculation unit using an attribute whose attribute value is determined according to whether or not specified by the user as a determination attribute Depending on, the one or more important attributes may be extracted.

また、上記記憶部は、各コンテンツに対するユーザアクションの履歴を表す履歴データ、をさらに保持し、上記重要度算出部は、上記履歴データに含まれるコンテンツについての属性値を用いて、ユーザごとに上記１つ以上の属性についての上記重要度を算出してもよい。 Further, the storage unit further holds history data representing a history of user actions for each content, and the importance calculation unit uses the attribute value for the content included in the history data, for each user. The importance degree for one or more attributes may be calculated.

また、上記重要度算出部は、Ｒｏｕｇｈ集合理論に従って上記決定表における決定属性の正領域を導出してもよい。 In addition, the importance calculation unit may derive a positive region of the decision attribute in the decision table according to a Rough set theory.

また、本発明の別の実施形態によれば、複数のコンテンツについて、各コンテンツに付与される属性値を記憶する属性テーブルを記憶媒体を用いて保持している情報処理装置において、上記属性テーブルに記憶されている属性値を用いて、コンテンツの所定の属性に対する他の１つ以上の属性の重要度を算出するステップ、を含む重要度算出方法であって、上記重要度は、上記所定の属性を決定属性とし、上記１つ以上の属性を条件属性とする決定表を用いて算出される、重要度算出方法が提供される。 According to another embodiment of the present invention, in an information processing apparatus that uses a storage medium to store an attribute table that stores an attribute value assigned to each content for a plurality of contents, Calculating an importance level of one or more other attributes with respect to a predetermined attribute of the content using a stored attribute value, wherein the importance level is calculated based on the predetermined attribute value. Is a decision attribute, and an importance calculation method is provided that is calculated using a decision table having the one or more attributes as condition attributes.

また、本発明の別の実施形態によれば、複数のコンテンツについて、各コンテンツに付与される属性値を記憶する属性テーブルを記憶媒体を用いて保持している情報処理装置を制御するコンピュータを、上記属性テーブルに記憶されている属性値を用いて、コンテンツの所定の属性に対する他の１つ以上の属性の重要度を算出する重要度算出部、として機能させるためのプログラムであって、上記重要度算出部は、上記重要度を、上記所定の属性を決定属性とし、上記１つ以上の属性を条件属性とする決定表を用いて算出する、プログラムが提供される。 According to another embodiment of the present invention, for a plurality of contents, a computer that controls an information processing apparatus that holds an attribute table that stores attribute values assigned to each content using a storage medium, A program for functioning as an importance calculation unit that calculates importance of one or more other attributes with respect to a predetermined attribute of content using attribute values stored in the attribute table, The degree calculation unit is provided with a program for calculating the importance using a determination table in which the predetermined attribute is the determination attribute and the one or more attributes are condition attributes.

以上説明したように、本発明に係る情報処理装置、重要度算出方法及びプログラムによれば、コンテンツの属性の中から属性の様々な組合せについての重要度を柔軟に評価することができる。 As described above, according to the information processing apparatus, the importance calculation method, and the program according to the present invention, the importance of various combinations of attributes can be flexibly evaluated from the attributes of the content.

一実施形態に係る情報処理装置の構成を示すブロック図である。It is a block diagram which shows the structure of the information processing apparatus which concerns on one Embodiment. 一実施形態に係る属性テーブルの概要を説明するための説明図である。It is explanatory drawing for demonstrating the outline | summary of the attribute table which concerns on one Embodiment. 属性テーブルにより記憶される基本属性のうちのメタデータの一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the metadata of the basic attributes memorize | stored by the attribute table. 属性テーブルにより記憶される基本属性のうちのコンテキストデータの一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the context data among the basic attributes memorize | stored by the attribute table. 属性テーブルにより記憶される拡張属性の概要を説明するための説明図である。It is explanatory drawing for demonstrating the outline | summary of the extended attribute memorize | stored by an attribute table. 図５に示した拡張属性の具体的な一例を説明するための説明図である。It is explanatory drawing for demonstrating a specific example of the extended attribute shown in FIG. 属性テーブルにより記憶される評価属性の一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the evaluation attribute memorize | stored by the attribute table. 決定表に関連する諸概念について説明するための説明図である。It is explanatory drawing for demonstrating the various concepts relevant to a decision table. 決定表に基づく重要度の算出について説明するための説明図である。It is explanatory drawing for demonstrating calculation of the importance based on a determination table. 決定表に基づいて算出される重要度の一例について説明するための説明図である。It is explanatory drawing for demonstrating an example of the importance calculated based on a determination table. 決定表に基づいて算出される重要度の他の例について説明するための説明図である。It is explanatory drawing for demonstrating the other example of the importance calculated based on a determination table. 重要属性の段階的抽出について説明するための説明図である。It is explanatory drawing for demonstrating the stepwise extraction of an important attribute. 推薦理由が呈示される画面の一例について説明するための説明図である。It is explanatory drawing for demonstrating an example of the screen as which a reason for recommendation is shown. 推薦理由が呈示される画面の他の例について説明するための説明図である。It is explanatory drawing for demonstrating the other example of the screen as which the reason for recommendation is shown. 一実施形態に係る情報処理装置による事前処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of the pre-processing by the information processing apparatus which concerns on one Embodiment. 一実施形態に係る情報処理装置による推薦処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of the recommendation process by the information processing apparatus which concerns on one Embodiment. 一変形例に係る情報処理装置の構成を示すブロック図である。It is a block diagram which shows the structure of the information processing apparatus which concerns on one modification. 他の変形例に係る個人化について説明するための説明図である。It is explanatory drawing for demonstrating the personalization which concerns on another modification. ハードウェア構成の一例を示すブロック図である。It is a block diagram which shows an example of a hardware configuration.

以下に添付図面を参照しながら、本発明の好適な実施の形態について詳細に説明する。なお、本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付すことにより重複説明を省略する。 Exemplary embodiments of the present invention will be described below in detail with reference to the accompanying drawings. In addition, in this specification and drawing, about the component which has the substantially same function structure, duplication description is abbreviate | omitted by attaching | subjecting the same code | symbol.

また、以下の順序にしたがって当該「発明を実施するための形態」を説明する。
１．用語の説明
２．一実施形態に係る情報処理装置の構成例
２−１．属性テーブル
２−２．記憶部
２−３．解析部
２−４．ユーザインタフェース制御部
２−５．推薦部
２−６．重要度算出部
２−７．抽出部
２−８．画面例
３．一実施形態に係る処理の流れ
３−１．事前処理
３−２．推薦処理
４．変形例
４−１．プレイリストの提供
４−２．個人化
５．ハードウェア構成例
６．まとめ Further, the “DETAILED DESCRIPTION OF THE INVENTION” will be described in the following order.
1. Explanation of terms 2. Configuration example of information processing apparatus according to one embodiment 2-1. Attribute table 2-2. Storage unit 2-3. Analysis unit 2-4. User interface controller 2-5. Recommendation section 2-6. Importance calculation unit 2-7. Extraction unit 2-8. Screen example 3. Flow of processing according to one embodiment 3-1. Pre-processing 3-2. Recommendation process Modified example 4-1. Provision of playlist 4-2. Personalization 5. 5. Hardware configuration example Summary

＜１．用語の説明＞
まず、本明細書において使用する主な用語の説明を以下に記述する。
・確率的分類法：コンテンツ又はテキスト等の集合の要素を部分集合に分類するための手法の１つ。確率的分類法においては、１つのコンテンツ又はテキスト等が、複数の部分集合に確率を伴って帰属し得る。
・潜在トピック：確率的分類法における個々の部分集合に対応し、各コンテンツ又はテキストの生起に対して潜在的に寄与する概念。個々の部分集合の分野又は話題等を表現するものと考えることができる。
・ＰＬＳＡ（Probabilistic Latent Semantic Analysis）：確率的分類法の１つ。潜在トピックによる確率的生成モデルを提供し、テキスト分類の分野において広く使用されている。
・ＬＤＡ（Latent Dirichlet Allocation）：ＰＬＳＡを発展させた確率的分類法の１つ。潜在トピックによる確率的生成モデルを提供し、テキスト分類の分野において広く使用されている。
・トピック集合：ある潜在トピックに帰属確率を伴って分類されているコンテンツ又はテキスト等の集合
・Ｒｏｕｇｈ集合理論：識別不能性（indiscernibility）による不確実性を伴う決定表（decision table）を解析するための基礎となる理論。
・Ｃｒｉｓｐ−Ｒｏｕｇｈ集合理論：Ｒｏｕｇｈ集合理論の一種。対象の集合をあるレベルで特定できる範囲で、属性を選択することによって対象の程よい近似を求める手法。
・Ｆｕｚｚｙ−Ｒｏｕｇｈ集合理論：Ｃｒｉｓｐ−Ｒｏｕｇｈ集合理論を発展させたＲｏｕｇｈ集合理論の一種。対象属性を名義属性から数値属性に拡張することにより、連続値による属性の取り扱いと対象の記述とを可能とした手法。
・推薦エンジン：ユーザの嗜好（Preference）又はユーザのコンテンツに対するアクション等に基づいてコンテンツを推薦するシステムモジュール。ＣＦ又はＣＢＦなどの様々な推薦アルゴリズムに基づく推薦エンジンが既に実用化されている。
・ＣＦ（Collaborative Filtering）：協調フィルタリング。推薦アルゴリズムの一種。複数のユーザの嗜好データを蓄積しておき、あるユーザと嗜好の類似する他のユーザに関するデータに基づいて推薦等を行う手法。
・ＣＢＦ（Content Based Filtering）：コンテンツベースフィルタリング。推薦アルゴリズムの一種。コンテンツの属性データの類似度に基づいてコンテンツの推薦等を行う手法。
・推薦理由：推薦エンジンがコンテンツを推薦する際にユーザに呈示する推薦の根拠に関する説明。 <1. Explanation of terms>
First, explanations of main terms used in this specification will be described below.
Probabilistic classification: A method for classifying elements of a set such as content or text into subsets. In the probabilistic classification method, one content or text can belong to a plurality of subsets with probability.
Latent topic: a concept that corresponds to an individual subset in a probabilistic taxonomy and potentially contributes to the occurrence of each content or text. It can be thought of as representing the field or topic of an individual subset.
PLSA (Probabilistic Latent Semantic Analysis): A probabilistic classification method. Provides a probabilistic generation model with latent topics and is widely used in the field of text classification.
LDA (Latent Dirichlet Allocation): One of the probabilistic classification methods developed from PLSA. Provides a probabilistic generation model with latent topics and is widely used in the field of text classification.
-Topic set: A set of content or text classified as belonging to a potential topic with attribution probability-Rough set theory: To analyze a decision table with uncertainty due to indiscernibility The theory underlying
Crisp-Rough set theory: A kind of Rough set theory. A technique for finding a reasonable approximation of an object by selecting attributes within a range that can identify the set of objects at a certain level.
Fuzzy-Rough set theory: A type of Rough set theory developed from Crisp-Rough set theory. A technique that enables the handling of attributes by continuous values and the description of the target by extending the target attribute from the nominal attribute to the numeric attribute.
Recommendation engine: A system module that recommends content based on the user's preferences or actions on the user's content. Recommendation engines based on various recommendation algorithms such as CF or CBF have already been put into practical use.
CF (Collaborative Filtering): collaborative filtering. A kind of recommendation algorithm. A method of accumulating preference data of a plurality of users and making recommendations based on data relating to other users who have similar preferences to a certain user.
CBF (Content Based Filtering): Content-based filtering. A kind of recommendation algorithm. A method for recommending content based on the similarity of content attribute data.
・ Reason for recommendation: Explanation regarding the basis of recommendation presented to the user when the recommendation engine recommends content.

＜２．一実施形態に係る情報処理装置の構成例＞
本発明の一実施形態に係る情報処理装置１００は、典型的には、複数のコンテンツについての属性値を記憶する属性テーブルを保持し、コンテンツの所定の属性への寄与の程度を表す重要度を、他の１つ以上の属性について算出する装置である。本実施形態では、情報処理装置１００は、さらに推薦機能を有する。そして、情報処理装置１００は、ユーザに推薦すべきコンテンツを選択すると共に、算出した上記重要度に応じて推薦理由を生成して、推薦すべきコンテンツに関する情報と推薦理由とをユーザに呈示する。 <2. Configuration Example of Information Processing Device According to One Embodiment>
The information processing apparatus 100 according to an embodiment of the present invention typically holds an attribute table that stores attribute values for a plurality of contents, and has an importance level indicating the degree of contribution of the contents to a predetermined attribute. A device that calculates one or more other attributes. In the present embodiment, the information processing apparatus 100 further has a recommendation function. Then, the information processing apparatus 100 selects content to be recommended to the user, generates a reason for recommendation according to the calculated importance, and presents the information about the content to be recommended and the reason for recommendation to the user.

図１は、本発明の一実施形態に係る情報処理装置１００の構成を示すブロック図である。図１を参照すると、情報処理装置１００は、記憶部１１０、解析部１２０、ユーザインタフェース（ＵＩ）制御部１３０、推薦部１４０、重要度算出部１５０及び抽出部１６０を備える。情報処理装置１００は、例えば、記憶部１１０に記憶されるコンテンツをＵＩ制御部１３０を介して再生可能なコンテンツプレーヤであってもよい。その代わりに、情報処理装置１００は、例えば、ＵＩ制御部１３０を介して端末装置にネットワーク経由でコンテンツデータを提供するコンテンツサーバであってもよい。より一般的には、情報処理装置１００は、例えば、高性能コンピュータ、ＰＣ（Personal Computer）、デジタル家電機器、ゲーム機器、ＡＶプレーヤ又はスマートフォンなどの任意の種類の装置であってよい。 FIG. 1 is a block diagram showing a configuration of an information processing apparatus 100 according to an embodiment of the present invention. Referring to FIG. 1, the information processing apparatus 100 includes a storage unit 110, an analysis unit 120, a user interface (UI) control unit 130, a recommendation unit 140, an importance calculation unit 150, and an extraction unit 160. The information processing apparatus 100 may be, for example, a content player that can reproduce content stored in the storage unit 110 via the UI control unit 130. Instead, the information processing apparatus 100 may be, for example, a content server that provides content data to the terminal device via the UI control unit 130 via a network. More generally, the information processing apparatus 100 may be any type of apparatus such as a high-performance computer, a PC (Personal Computer), a digital home appliance, a game machine, an AV player, or a smartphone.

［２−１．属性テーブル］
まず、本実施形態に係る情報処理装置１００の記憶部１１０により保持される属性テーブルについて説明する。属性テーブルは、複数のコンテンツについて、各コンテンツに付与される１つ以上の属性値を記憶するテーブルである。 [2-1. Attribute table]
First, an attribute table held by the storage unit 110 of the information processing apparatus 100 according to the present embodiment will be described. The attribute table is a table that stores one or more attribute values assigned to each content for a plurality of contents.

図２は、本実施形態に係る属性テーブルの概要を説明するための説明図である。図２を参照すると、属性テーブルは、大きく分けて基本属性、拡張属性及び評価属性の３種類の属性のカテゴリを有する。基本属性は、さらにメタデータ及びコンテキストデータの２種類のカテゴリに分類される。これら基本属性（メタデータ）、基本属性（コンテキストデータ）、拡張属性及び評価属性は、それぞれ１つ以上の属性項目（個々の属性）を含む。そして、これら属性項目ごとに、各コンテンツについて属性値が付与される。以下、図２に示した属性のカテゴリごとに、より具体的な属性の例を説明する。 FIG. 2 is an explanatory diagram for explaining an overview of the attribute table according to the present embodiment. Referring to FIG. 2, the attribute table is roughly divided into three types of attribute categories: basic attributes, extended attributes, and evaluation attributes. Basic attributes are further classified into two categories, metadata and context data. Each of these basic attributes (metadata), basic attributes (context data), extended attributes, and evaluation attributes includes one or more attribute items (individual attributes). An attribute value is assigned to each content item for each content item. Hereinafter, more specific examples of attributes will be described for each attribute category shown in FIG.

（１）基本属性：メタデータ
図３は、属性テーブルにより記憶される基本属性のうちのメタデータの一例を説明するための説明図である。図３を参照すると、メタデータは、「ジャンル」、「年代」、「ムード」、「キーワード」及び「アーティスト」の５種類の属性を含む。これら属性は、図３の例では、数値属性として表現されている。例えば、「ジャンル」については、「Ｇ１：Ｒｏｃｋ」、「Ｇ２：Ｐｏｐ」及びその他のジャンルに各コンテンツが属するか否かを表す数値が属性値として与えられている。例えば、コンテンツＣ１の「ジャンル」についての属性値は、（Ｇ１，Ｇ２，…）＝（１．０，０．０，…）である。また、コンテンツＣ２の「ジャンル」についての属性値は、（Ｇ１，Ｇ２，…）＝（０．０，１．０，…）である。同様に、「年代」については、「Ｅ１：’７０ｓ（１９７０年代）」、「Ｅ２：’８０ｓ（１９８０年代）」及びその他の年代に各コンテンツが属するか否かを表す数値が属性値として与えられている。コンテンツＣ１の「年代」についての属性値は、（Ｅ１，Ｅ２，…）＝（１．０，０．０，…）である。また、コンテンツＣ２の「年代」についての属性値は、（Ｅ１，Ｅ２，…）＝（０．０，１．０，…）である。なお、１つのコンテンツが重みを伴って複数の「ジャンル」又は「年代」等に属してもよい。 (1) Basic Attributes: Metadata FIG. 3 is an explanatory diagram for explaining an example of metadata among basic attributes stored by the attribute table. Referring to FIG. 3, the metadata includes five types of attributes of “genre”, “age”, “mood”, “keyword”, and “artist”. These attributes are expressed as numerical attributes in the example of FIG. For example, for “genre”, a numerical value indicating whether each content belongs to “G1: Rock”, “G2: Pop”, and other genres is given as an attribute value. For example, the attribute value for “genre” of the content C1 is (G1, G2,...) = (1.0, 0.0,...). Further, the attribute value for the “genre” of the content C2 is (G1, G2,...) = (0.0, 1.0,...). Similarly, for “age”, “E1: '70s (1970s)”, “E2:' 80s (1980s)”, and other numerical values indicating whether each content belongs to the other age are given as attribute values. It has been. The attribute value for the “age” of the content C1 is (E1, E2,...) = (1.0, 0.0,...). Further, the attribute value for the “age” of the content C2 is (E1, E2,...) = (0.0, 1.0,...). One content may belong to a plurality of “genres” or “age” with weights.

「ムード」は、例えば、特開２００７−２０７２１８号公報に記載された技術を用いて音楽コンテンツの音声信号の特徴量を解析することにより付与される属性である。例えば、各コンテンツが「Ｍ１：明るさ」又は「Ｍ２：楽しさ」などの印象を有するか否かを表す数値が、「ムード」についての属性値として各コンテンツに与えられる。「キーワード」は、例えば、各コンテンツと関連付けてサービス提供者又はユーザから供給されるレビュー文をテキスト解析することにより付与される属性である。例えば、レビュー文に含まれる名詞や形容詞などの単語ごとの出現頻度が、「キーワード」についての属性値として各コンテンツに与えられる。「アーティスト」は、例えば、音楽コンテンツと関連する人物名ごとに付与される属性である。例えば、作曲者、作詞者、歌手又は共演者などの人物と関連を有するか否かを表す数値が、「アーティスト」についての属性値として各コンテンツに与えられる。 The “mood” is an attribute given by analyzing the feature amount of the audio signal of the music content using, for example, the technique described in Japanese Patent Application Laid-Open No. 2007-207218. For example, a numerical value indicating whether or not each content has an impression such as “M1: brightness” or “M2: fun” is given to each content as an attribute value for “mood”. The “keyword” is an attribute given by, for example, text analysis of a review sentence supplied from a service provider or a user in association with each content. For example, the appearance frequency for each word such as a noun or adjective included in the review sentence is given to each content as an attribute value for “keyword”. “Artist” is an attribute assigned to each person name associated with the music content, for example. For example, a numerical value indicating whether or not there is a relationship with a person such as a composer, a lyricist, a singer or a co-star is given to each content as an attribute value for “artist”.

なお、図３に示した属性は、数値属性の形式ではなく名義属性の形式によっても表現され得る。例えば、名義属性の形式によれば、コンテンツＣ１について「ジャンル」＝「Ｇ１：Ｒｏｃｋ」、「年代」＝「Ｅ１：’７０ｓ」、コンテンツＣ２について「ジャンル」＝「Ｇ２：Ｐｏｐ」、「年代」＝「Ｅ２：’８０ｓ」と表現することもできる（「属性」＝「属性値」）。 Note that the attributes shown in FIG. 3 can be expressed not in the form of numerical attributes but also in the form of nominal attributes. For example, according to the format of the nominal attribute, “genre” = “G1: Rock”, “age” = “E1: '70s” for the content C1, and “genre” = “G2: Pop”, “age” for the content C2. = “E2: '80s” (“attribute” = “attribute value”).

これらメタデータは、典型的には、ユーザによる閲覧、視聴若しくは購買等（以下、閲覧等という）のアクション、又は後に説明する推薦処理等とは独立して、各コンテンツに予め付与することのできるデータである。 Typically, these metadata can be given in advance to each content independently of actions such as browsing, viewing, purchasing, etc. (hereinafter referred to as browsing) by the user, or recommendation processing described later. It is data.

（２）基本属性：コンテキスト
図４は、属性テーブルにより記憶される基本属性のうちのコンテキストデータの一例を説明するための説明図である。コンテキストデータは、典型的には、各コンテンツに対するユーザアクションの状況に基づいて付与されるデータである。図４を参照すると、コンテキストデータは、「時間帯」及び「場所」の２種類の属性を含む。例えば、「時間帯」は、「Ｔ１：１０〜１２（時）」、「Ｔ２：１２〜１４（時）」及びその他の時間帯において、各コンテンツが閲覧等された回数を表す。例えば、コンテンツＣ１の「時間帯」についての属性値は、（Ｔ１，Ｔ２，…）＝（２，１０，…）である。また、コンテンツＣ２の「時間帯」についての属性値は、（Ｔ１，Ｔ２，…）＝（３，１，…）である。同様に、「場所」は、「Ｐ１：Ｔｏｋｙｏ」、「Ｐ２：Ｏｓａｋａ」及びその他の場所において、各コンテンツが閲覧等された回数を表す。コンテンツＣ１の「場所」についての属性値は、（Ｐ１，Ｐ２，…）＝（８，４，…）である。また、コンテンツＣ２の「場所」についての属性値は、（Ｐ１，Ｐ２，…）＝（０，４，…）である。なお、コンテキストデータは、図４に例示した属性以外に、例えば、曜日、日付、国（場所）、ユーザの性別又は年齢層など、ユーザアクションと関連付けて取得可能な任意の属性を含んでもよい。また、図４に例示した「Ｔｏｋｙｏ」又は「Ｏｓａｋａ」などの地理的位置の代わりに、「家庭」、「オフィス」又は「車の中」などの「場所」についての属性値がコンテキストデータに含まれてもよい。 (2) Basic Attributes: Context FIG. 4 is an explanatory diagram for explaining an example of context data among basic attributes stored by the attribute table. The context data is typically data that is given based on the state of user action for each content. Referring to FIG. 4, the context data includes two types of attributes “time zone” and “location”. For example, “time zone” represents the number of times each content is browsed in “T1: 10-12 (hours)”, “T2: 12-14 (hours)”, and other time zones. For example, the attribute value for the “time zone” of the content C1 is (T1, T2,...) = (2, 10,...). Also, the attribute value for the “time zone” of the content C2 is (T1, T2,...) = (3, 1,...). Similarly, “location” represents the number of times each content is browsed in “P1: Tokyo”, “P2: Osaka” and other locations. The attribute value for the “location” of the content C1 is (P1, P2,...) = (8, 4,...). Further, the attribute value for “location” of the content C2 is (P1, P2,...) = (0, 4,...). In addition to the attributes illustrated in FIG. 4, the context data may include any attribute that can be acquired in association with the user action, such as a day of the week, a date, a country (location), a user gender, or an age group. Further, instead of the geographical location such as “Tokyo” or “Osaka” illustrated in FIG. 4, attribute values for “location” such as “home”, “office”, or “in the car” are included in the context data. May be.

（３）拡張属性
図５は、属性テーブルにより記憶される拡張属性の概要を説明するための説明図である。拡張属性は、各コンテンツに付与される基本属性の属性値を解析することにより得られる属性である。本実施形態において、属性テーブルは、レビュー文層及び人物層という２つの層に分けられる拡張属性を有する。このうち、レビュー文層は、基本属性のうち「キーワード」の属性値を確率的分類法に従って解析することにより得られる、潜在トピックＸ１、…、Ｘｍごとの各コンテンツの帰属確率を表す。一方、人物層は、基本属性のうち「アーティスト」の属性値を確率的分類法に従って解析することにより得られる、潜在トピックＹ１、…、Ｙｍごとの各コンテンツの帰属確率を表す。 (3) Extended Attributes FIG. 5 is an explanatory diagram for explaining an outline of extended attributes stored by the attribute table. The extended attribute is an attribute obtained by analyzing the attribute value of the basic attribute given to each content. In the present embodiment, the attribute table has extended attributes that are divided into two layers, a review sentence layer and a person layer. Among these, the review sentence layer represents the attribution probability of each content for each of the latent topics X1,..., Xm obtained by analyzing the attribute value of “keyword” among the basic attributes according to the probabilistic classification method. On the other hand, the person layer represents the attribution probability of each content for each of the latent topics Y1,..., Ym, obtained by analyzing the attribute value of “artist” among the basic attributes according to the probabilistic classification method.

図６は、図５に示した拡張属性の具体的な例をさらに説明するための説明図である。図６の上部には、コンテンツＣ１、Ｃ２及びＣ３のそれぞれについて、キーワード（Ｋ１，Ｋ２，Ｋ３，Ｋ４，Ｋ５，…）の属性値（各キーワードのレビュー文における頻度）が示されている。キーワードＫ１、Ｋ２、Ｋ３、Ｋ４、Ｋ５は、それぞれ、「ｓｃａｎｄａｌ」、「ａｌｂｕｍ」、「ｐｏｐｕｌａｒｉｔｙ」、「ｒｅｃｏｒｄ」、「ａｂｉｌｉｔｙ」である。そして、コンテンツＣ１について、キーワード（Ｋ１，Ｋ２，Ｋ３，Ｋ４，Ｋ５，…）＝（１，０，２，０，１，…）である。また、コンテンツＣ２について、キーワード（Ｋ１，Ｋ２，Ｋ３，Ｋ４，Ｋ５，…）＝（０，１，０，２，１，…）である。コンテンツＣ３について、キーワード（Ｋ１，Ｋ２，Ｋ３，Ｋ４，Ｋ５，…）＝（３，０，１，１，０，…）である。ＰＬＳＡ又はＬＤＡによりモデル化される確率的分類法によれば、これらキーワードは、各コンテンツが潜在的に帰属する潜在トピックの寄与によって、各コンテンツのレビュー文に現れる。逆に、キーワード（Ｋ１，Ｋ２，Ｋ３，Ｋ４，Ｋ５，…）の属性値をＰＬＳＡ又はＬＤＡによる確率的分類法に従って解析すれば、各コンテンツの潜在トピックごとの帰属確率を算出することができる。 FIG. 6 is an explanatory diagram for further explaining a specific example of the extended attribute shown in FIG. The upper part of FIG. 6 shows the attribute values (frequency of each keyword in the review sentence) of the keywords (K1, K2, K3, K4, K5,...) For each of the contents C1, C2, and C3. The keywords K1, K2, K3, K4, and K5 are “scandal”, “album”, “popularity”, “record”, and “ability”, respectively. For the content C1, the keywords (K1, K2, K3, K4, K5,...) = (1, 0, 2, 0, 1,...). For the content C2, the keywords (K1, K2, K3, K4, K5,...) = (0, 1, 0, 2, 1,...). For the content C3, the keyword (K1, K2, K3, K4, K5,...) = (3, 0, 1, 1, 0,...). According to the probabilistic taxonomy modeled by PLSA or LDA, these keywords appear in the review text of each content due to the contribution of potential topics to which each content potentially belongs. Conversely, if the attribute values of the keywords (K1, K2, K3, K4, K5,...) Are analyzed according to the probabilistic classification method by PLSA or LDA, the attribution probability for each potential topic of each content can be calculated.

より具体的には、各コンテンツのレビュー文ｄにおけるキーワードｗの生起確率をｐ（ｗ｜ｄ）とすると、生起確率ｐ（ｗ｜ｄ）は式（１）により表される。 More specifically, when the occurrence probability of the keyword w in the review sentence d of each content is p (w | d), the occurrence probability p (w | d) is expressed by Expression (1).

式（１）において、ｘ_ｉは潜在トピック、ｐ（ｗ｜ｘ_ｉ）は潜在トピックｘ_ｉについての単語ｗの生起確率、ｐ（ｘ_ｉ｜ｄ）は各コンテンツの（レビュー文ｄの）トピック分布である。なお、潜在トピックｘ_ｉの数は、解析の対象とするデータ空間の次元等に応じて予め適切な値（例えば１６など）に設定される。 In Expression (1), x _i is a latent topic, p (w | x _i ) is the probability of occurrence of the word w for the latent topic x _i , and p (x _i | d) is the topic (of the review sentence d) of each content Distribution. Note that the number of latent topics x _i is set to an appropriate value (for example, 16) in advance according to the dimension of the data space to be analyzed.

図６の下部には、キーワード（Ｋ１，Ｋ２，Ｋ３，Ｋ４，Ｋ５，…）の属性値をＰＬＳＡ又はＬＤＡによる確率的分類法に従って解析することにより得られる潜在トピック（Ｘ１，Ｘ２，…，Ｘｎ）ごとの帰属確率の一例が示されている。例えば、コンテンツＣ１について、帰属確率（Ｘ１，Ｘ２，…，Ｘｎ）＝（０．４，０．１，…，０．３）である。コンテンツＣ２について、帰属確率（Ｘ１，Ｘ２，…，Ｘｎ）＝（０．１，０．２，…，０．１）である。コンテンツＣ３について、帰属確率（Ｘ１，Ｘ２，…，Ｘｎ）＝（０．６，０．１，…，０．１）である。本実施形態では、これら各コンテンツの潜在トピックごとの帰属確率が、拡張属性の１つの層の属性値として、属性テーブルにより記憶される。なお、確率的分類法に従って拡張属性を算出する際には、基本属性の属性値を１つの属性項目（例えば個々のキーワード）の範囲内で正規化した上で（最大値を１とした上で）確率的分類法を適用するのが好適である。 In the lower part of FIG. 6, latent topics (X1, X2,..., Xn) obtained by analyzing attribute values of keywords (K1, K2, K3, K4, K5,...) According to a probabilistic classification method by PLSA or LDA. An example of the probability of belonging to each) is shown. For example, for the content C1, the attribution probability (X1, X2,..., Xn) = (0.4, 0.1,..., 0.3). For the content C2, the attribution probability (X1, X2,..., Xn) = (0.1, 0.2,..., 0.1). For the content C3, the attribution probability (X1, X2,..., Xn) = (0.6, 0.1,..., 0.1). In this embodiment, the attribution probability for each potential topic of each content is stored in the attribute table as an attribute value of one layer of extended attributes. When calculating the extended attribute according to the probabilistic classification method, the attribute value of the basic attribute is normalized within the range of one attribute item (for example, an individual keyword) (with the maximum value set to 1). It is preferred to apply a probabilistic classification method.

（４）評価属性
図７は、属性テーブルにより記憶される評価属性の一例を説明するための説明図である。評価属性は、典型的には、各コンテンツについての推薦エンジン又はユーザによる評価を表す属性である。図７を参照すると、評価属性は、アルゴリズムスコア及びユーザフィードバック（ＦＢ）の２種類のカテゴリに分類される。 (4) Evaluation Attributes FIG. 7 is an explanatory diagram for explaining an example of evaluation attributes stored by the attribute table. The evaluation attribute is typically an attribute representing evaluation by a recommendation engine or a user for each content. Referring to FIG. 7, evaluation attributes are classified into two categories: algorithm score and user feedback (FB).

このうち、アルゴリズムスコアは、１つ以上の推薦アルゴリズムにより算出されるコンテンツごとのスコアＳ１、Ｓ２、…を含む。例えば、スコアＳ１はＣＦ（協調フィルタリング）により算出されるスコア、スコアＳ２はＣＢＦ（コンテンツベースフィルタリング）により算出されるスコアであってよい。この場合、例えば、コンテンツＣ１についてのスコアＳ１は、推薦の対象とするユーザのユーザ嗜好と、コンテンツＣ１を閲覧等した他のユーザのユーザ嗜好との間の類似度などに相当する。また、コンテンツＣ１についてのスコアＳ２は、推薦の対象とするユーザが閲覧等したコンテンツのメタデータと、コンテンツＣ１のメタデータとの間の類似度などに相当する。 Among these, the algorithm score includes scores S1, S2,... For each content calculated by one or more recommendation algorithms. For example, the score S1 may be a score calculated by CF (collaborative filtering), and the score S2 may be a score calculated by CBF (content-based filtering). In this case, for example, the score S1 for the content C1 corresponds to the degree of similarity between the user preference of the user to be recommended and the user preference of another user who has browsed the content C1. The score S2 for the content C1 corresponds to the degree of similarity between the metadata of the content browsed by the user to be recommended and the metadata of the content C1.

一方、ユーザＦＢは、各コンテンツに対するユーザからのフィードバックに基づいて付与される属性値を含む。ユーザＦＢの属性値は、例えば、「Ｙ：好き」又は「Ｎ：嫌い」の二値データ、又は多段階（例えば５段階）評価における点数などを表す。その代わりに、ユーザフィードバックは、例えば、ユーザによる閲覧等のアクションの回数を表してもよい。 On the other hand, the user FB includes attribute values given based on feedback from the user with respect to each content. The attribute value of the user FB represents, for example, binary data of “Y: likes” or “N: dislikes” or a score in multi-level (for example, 5-level) evaluation. Instead, the user feedback may represent the number of actions such as browsing by the user, for example.

次に、図１に示した情報処理装置１００の各部の動作について、順に説明する。 Next, the operation of each unit of the information processing apparatus 100 illustrated in FIG. 1 will be described in order.

［２−２．記憶部］
記憶部１１０は、ハードディスク又は半導体メモリなどの記憶媒体を用いて構成され、図２〜図８を用いて説明した属性テーブルを保持する。属性テーブルには、上述した基本属性、拡張属性及び評価属性の各々の属性項目の属性値が格納される。そして、記憶部１１０は、情報処理装置１００の各部との間で、これら属性値を入出力する。さらに、記憶部１１０は、コンテンツデータそのものを記憶してもよい。例えば、情報処理装置１００が音楽プレーヤである場合には、音楽コンテンツのオーディオデータが記憶部１１０により記憶されてもよい。 [2-2. Storage unit]
The storage unit 110 is configured using a storage medium such as a hard disk or a semiconductor memory, and holds the attribute table described with reference to FIGS. The attribute table stores attribute values of the attribute items of the basic attribute, the extended attribute, and the evaluation attribute described above. The storage unit 110 inputs and outputs these attribute values with each unit of the information processing apparatus 100. Furthermore, the storage unit 110 may store the content data itself. For example, when the information processing apparatus 100 is a music player, audio data of music content may be stored in the storage unit 110.

［２−３．解析部］
解析部１２０は、ＰＬＳＡ又はＬＤＡによる確率的分類法に従って、属性テーブルの基本属性の属性値に基づいて、拡張属性の属性値を算出する。例えば、解析部１２０は、基本属性の属性値のうち、キーワードに関する属性値を確率的分類法に従って解析することにより、拡張属性のレビュー文層の属性値を算出する。より具体的には、拡張属性のレビュー文層の属性値は、例えば、潜在トピックごとのキーワードの生起確率との積和によりキーワードに関する属性値（出現頻度）を導く、各コンテンツのトピック分布であってよい（式（１）参照）。また、例えば、解析部１２０は、基本属性の属性値のうち、アーティストに関する属性値を確率的分類法に従って解析することにより、拡張属性の人物層の属性値を算出する。そして、解析部１２０は、算出した拡張属性の属性値を属性テーブルに格納する。解析部１２０による確率的分類法に従った解析は、例えば、コンテンツのデータベース（例えば記憶部１１０）に一定の数のコンテンツが蓄積された時、又は１ヶ月ごと若しくは１年ごとなどのように定期的に実行され得る。 [2-3. Analysis Department]
The analysis unit 120 calculates the attribute value of the extended attribute based on the attribute value of the basic attribute of the attribute table according to the probabilistic classification method by PLSA or LDA. For example, the analysis unit 120 calculates the attribute value of the review sentence layer of the extended attribute by analyzing the attribute value related to the keyword among the attribute values of the basic attribute according to the probabilistic classification method. More specifically, the attribute value of the review sentence layer of the extended attribute is, for example, the topic distribution of each content that derives the attribute value (appearance frequency) related to the keyword by the product sum with the occurrence probability of the keyword for each latent topic. (See equation (1)). For example, the analysis unit 120 calculates the attribute value of the extended attribute person layer by analyzing the attribute value related to the artist among the attribute values of the basic attribute according to the probabilistic classification method. Then, the analysis unit 120 stores the calculated attribute value of the extended attribute in the attribute table. The analysis according to the probabilistic classification method by the analysis unit 120 is performed periodically, for example, when a certain number of contents are accumulated in the content database (for example, the storage unit 110), or every month or every year. Can be implemented automatically.

なお、ＰＬＳＡについては、Thomas Hofmannによる“Probabilistic latent semantic indexing”（1999, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval）において詳しく説明されている。また、ＬＤＡについては、David M. Blei, Andrew Y. Ng, Michael I. Jordanによる“Latent Dirichlet Allocation”（2003, Journal of Machine Learning Research, Volume 3）において詳しく説明されている。 The PLSA is described in detail in “Probabilistic latent semantic indexing” by Thomas Hofmann (1999, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval). LDA is described in detail in “Latent Dirichlet Allocation” (2003, Journal of Machine Learning Research, Volume 3) by David M. Blei, Andrew Y. Ng, Michael I. Jordan.

［２−４．ユーザインタフェース制御部］
ＵＩ制御部１３０は、情報処理装置１００とユーザとの間のユーザインタフェースを制御する。かかるユーザインタフェースは、典型的には、表示装置により表示される画面インタフェースと、マウス、キーボード、タッチパネル又はキーパッドなどの入力インタフェースとを含む。さらに、音声出力回路又は映像出力回路などのコンテンツ再生用のインタフェースが、ユーザインタフェースに含まれてもよい。これらユーザインタフェースは、例えば、情報処理装置１００上に実装されてもよく、その代わりに情報処理装置１００とネットワークを介して接続される端末装置上に実装されてもよい。 [2-4. User interface control unit]
The UI control unit 130 controls a user interface between the information processing apparatus 100 and the user. Such a user interface typically includes a screen interface displayed by a display device and an input interface such as a mouse, a keyboard, a touch panel, or a keypad. Further, an interface for content reproduction such as an audio output circuit or a video output circuit may be included in the user interface. For example, these user interfaces may be mounted on the information processing apparatus 100, or instead, may be mounted on a terminal device connected to the information processing apparatus 100 via a network.

より具体的には、ＵＩ制御部１３０は、例えば、あるコンテンツについてのユーザによるアクションに応じて、当該コンテンツについてのコンテキストデータを更新する。即ち、例えば、ユーザが午前１１時に東京においてコンテンツＣ１を閲覧したとする。その場合、ＵＩ制御部１３０は、コンテンツＣ１についての時間帯「Ｔ１：１０〜１２（時）」及び場所「Ｐ１：Ｔｏｋｙｏ」の属性値を更新する。ＵＩ制御部１３０は、例えば、端末装置のＩＰアドレスに基づいて、又はＧＰＳ（Global Positioning System）を用いて、ユーザによるコンテンツの閲覧場所を認識することができる。 More specifically, the UI control unit 130 updates the context data for the content according to, for example, an action by the user for the certain content. That is, for example, assume that the user browses the content C1 in Tokyo at 11:00 am. In that case, the UI control unit 130 updates the attribute values of the time zone “T1: 10 to 12 (hours)” and the location “P1: Tokyo” for the content C1. For example, the UI control unit 130 can recognize the viewing location of the content by the user based on the IP address of the terminal device or using the GPS (Global Positioning System).

また、ＵＩ制御部１３０は、例えば、あるコンテンツについての図７に例示したユーザフィードバックの値を入力インタフェースを介して取得し、取得した値を属性テーブルに格納する。また、ＵＩ制御部１３０は、例えば、後に説明する推薦部１４０により生成される推薦画面を表示装置に表示させる。 For example, the UI control unit 130 acquires the user feedback value illustrated in FIG. 7 for a certain content via the input interface, and stores the acquired value in the attribute table. In addition, the UI control unit 130 causes the display device to display a recommendation screen generated by the recommendation unit 140 described later, for example.

さらに、ＵＩ制御部１３０は、例えば、情報処理装置１００又は端末装置がユーザの状態を検知することのできるセンサを備える場合には、当該センサにより取得されるユーザの状態をコンテキストデータの属性値として属性テーブルに格納してもよい。例えば、スマイルセンサ機能を有する撮像モジュールによりユーザの笑顔の状態を取得すること、又は生体センサによりユーザの脈拍又は呼吸数等の状態を取得することなどが考えられる。 Furthermore, for example, when the information processing apparatus 100 or the terminal device includes a sensor that can detect the user state, the UI control unit 130 uses the user state acquired by the sensor as an attribute value of the context data. It may be stored in the attribute table. For example, it is conceivable to acquire the state of the user's smile using an imaging module having a smile sensor function, or to acquire the state of the user's pulse or respiration rate using a biological sensor.

［２−５．推薦部］
推薦部１４０は、記憶部１１０が保持している属性テーブルに記憶されている属性値を用いて、ユーザに推薦すべきコンテンツを選択する推薦エンジンとして動作する。また、推薦部１４０は、後に説明する抽出部１６０により抽出される１つ以上の重要属性に基づいて、推薦の理由を生成する。そして、推薦部１４０は、推薦すべきコンテンツに関する情報と推薦理由とをユーザに呈示する推薦画面を生成し、生成した推薦画面をＵＩ制御部１３０を介して表示装置に表示させる。 [2-5. Recommendation section]
The recommendation unit 140 operates as a recommendation engine that selects content to be recommended to the user using the attribute values stored in the attribute table held by the storage unit 110. The recommendation unit 140 generates a reason for recommendation based on one or more important attributes extracted by the extraction unit 160 described later. And the recommendation part 140 produces | generates the recommendation screen which shows the information regarding the content which should be recommended, and a recommendation reason to a user, and displays the produced | generated recommendation screen on a display apparatus via UI control part 130. FIG.

推薦部１４０によるコンテンツの選択は、例えば、上述したＣＦ又はＣＢＦなど、任意の公知の推薦アルゴリズムに従って行われてよい。例えば、推薦部１４０が推薦アルゴリズムとしてＣＦを用いる場合には、推薦部１４０は、ユーザの嗜好と類似する嗜好を有する他のユーザを特定する。次に、推薦部１４０は、当該他のユーザにより閲覧等された一群のコンテンツを抽出する。このとき、推薦部１４０は、例えば、嗜好の類似度又は当該他のユーザによる各コンテンツの評価などに応じて、抽出した一群のコンテンツの各々についてのスコア（各コンテンツのユーザへの適合度、又は推薦についての確信度など）を算出する。ここで算出されるコンテンツごとのスコアは、属性テーブルに格納される。そして、推薦部１４０は、例えば、スコアの相対的に高いコンテンツを、ユーザに推薦すべきコンテンツとして選択する。 The selection of content by the recommendation unit 140 may be performed according to any known recommendation algorithm such as the above-described CF or CBF. For example, when the recommendation unit 140 uses CF as a recommendation algorithm, the recommendation unit 140 specifies another user having a preference similar to the user's preference. Next, the recommendation unit 140 extracts a group of contents browsed by the other user. At this time, the recommendation unit 140, for example, according to the similarity of the preference or the evaluation of each content by the other user, the score for each of the extracted group of content (the degree of suitability of each content to the user, or Calculate certainty about recommendation). The score for each content calculated here is stored in the attribute table. And the recommendation part 140 selects the content with a relatively high score as content which should be recommended to a user, for example.

また、例えば、推薦部１４０が推薦アルゴリズムとしてＣＢＦを用いる場合には、推薦部１４０は、ユーザが閲覧等したコンテンツとの間でメタデータが類似する一群のコンテンツを抽出する。このとき、推薦部１４０は、例えば、メタデータの類似度に応じて、抽出した一群のコンテンツの各々についてのスコアを算出する。ここで算出されるコンテンツごとのスコアは、属性テーブルに格納される。そして、推薦部１４０は、例えば、スコアの相対的に高いコンテンツを、ユーザに推薦すべきコンテンツとして選択する。 Further, for example, when the recommendation unit 140 uses CBF as a recommendation algorithm, the recommendation unit 140 extracts a group of contents whose metadata is similar to content viewed by the user. At this time, the recommendation unit 140 calculates a score for each of the extracted group of contents, for example, according to the similarity of the metadata. The score for each content calculated here is stored in the attribute table. And the recommendation part 140 selects the content with a relatively high score as content which should be recommended to a user, for example.

また、例えば、推薦部１４０は、属性テーブルに記憶されているユーザＦＢについての属性値を用いて、ユーザに推薦すべきコンテンツを選択してもよい。例えば、複数のコンテンツのうち、ユーザＦＢについての属性値が良好な値を示すコンテンツが、推薦すべきコンテンツとして優先的に選択され得る。また、例えば、推薦部１４０は、属性テーブルに記憶されているコンテキストデータについての属性値を用いて、ユーザに推薦すべきコンテンツを選択してもよい。例えば、複数のコンテンツのうち、推薦処理を実行している時間帯及び推薦の対象とするユーザが位置する場所においてより多く閲覧等された実績のあるコンテンツが、推薦すべきコンテンツとして優先的に選択され得る。 For example, the recommendation unit 140 may select content to be recommended to the user using the attribute value for the user FB stored in the attribute table. For example, among the plurality of contents, a content having a good attribute value for the user FB can be preferentially selected as the content to be recommended. Further, for example, the recommendation unit 140 may select content to be recommended to the user by using the attribute value for the context data stored in the attribute table. For example, among a plurality of contents, content that has been browsed more frequently in the time zone where the recommendation process is being executed and the location of the user to be recommended is preferentially selected as the content to be recommended Can be done.

推薦部１４０は、推薦すべきコンテンツを選択すると、コンテンツの選択に使用した所定の属性に対する重要度を、当該所定の属性以外の１つ以上の属性について、重要度算出部１５０に算出させる。また、推薦部１４０は、ユーザに呈示すべき推薦理由を生成するために使用される１つ以上の重要属性を、抽出部１６０に抽出させる。そして、推薦部１４０は、抽出部１６０により抽出された１つ以上の重要属性を用いて推薦理由を生成し、推薦すべきコンテンツに関する情報と推薦理由とを表示する推薦画面を、ＵＩ制御部１３０へ出力する。 When selecting the content to be recommended, the recommendation unit 140 causes the importance level calculation unit 150 to calculate the importance level for the predetermined attribute used for selecting the content for one or more attributes other than the predetermined attribute. In addition, the recommendation unit 140 causes the extraction unit 160 to extract one or more important attributes used to generate a recommendation reason to be presented to the user. Then, the recommendation unit 140 generates a recommendation reason using one or more important attributes extracted by the extraction unit 160, and displays a recommendation screen that displays information about the content to be recommended and the recommendation reason, and the UI control unit 130. Output to.

［２−６．重要度算出部］
重要度算出部１５０は、属性テーブルに記憶されている属性値を用いて、コンテンツの所定の属性に対する重要度を、コンテンツの当該所定の属性以外の１つ以上の属性について算出する。特に、本実施形態において、重要度算出部１５０は、上記１つ以上の属性についての重要度を、上記所定の属性を決定属性とし、上記１つ以上の属性を条件属性とする決定表を用いて算出する。例えば、以下に説明するように、重要度算出部１５０は、上述した決定表について決定属性の正領域を形成するコンテンツの数に基づいて、重要度を算出することができる。決定表における決定属性の正領域は、Ｒｏｕｇｈ集合理論に従って導出され得る。 [2-6. Importance calculator]
The importance level calculation unit 150 uses the attribute values stored in the attribute table to calculate the importance level for the predetermined attribute of the content for one or more attributes other than the predetermined attribute of the content. In particular, in the present embodiment, the importance level calculation unit 150 uses a determination table in which the importance level for the one or more attributes is set as the determination attribute and the one or more attributes are condition attributes. To calculate. For example, as will be described below, the importance level calculation unit 150 can calculate the importance level based on the number of contents forming the positive region of the determination attribute for the determination table described above. The positive region of the decision attribute in the decision table can be derived according to Rough set theory.

（１）決定表に関連する諸概念
図８は、Ｒｏｕｇｈ集合理論が適用される決定表に関連する諸概念について説明するための説明図である。図８の上部には、決定表の基本的な構成が示されている。図８に例示される決定表は、二次元の表であって、縦軸に対象全体集合Ｕ、横軸に属性全体集合Ａをとる。対象全体集合Ｕは、コンテンツの集合である。一方、属性全体集合Ａは、属性項目の集合である。属性全体集合Ａは、条件属性Ｃ及び決定属性Ｄの和集合である。条件属性Ｃ及び決定属性Ｄは、共に、１つ以上の属性を含む。属性値集合Ｖは、属性全体集合Ａに含まれる各属性が取り得る名義属性の属性値の集合である。例えば、図３に例示した属性「ジャンル」及び「年代」が属性全体集合Ａの元（member）である場合には、Ｖ＝｛｛Ｒｏｃｋ，Ｊａｚｚ，…｝，｛‘７０ｓ，’８０ｓ，…｝｝などとなる。属性値関数ρは、対象全体集合Ｕの１つの元（コンテンツ）と属性全体集合Ａの１つ元（属性）とが与えられた場合に、それに応じて属性値を特定する関数である（ρ：Ｕ×Ａ→Ｖ）。 (1) Various Concepts Related to Decision Table FIG. 8 is an explanatory diagram for explaining various concepts related to the decision table to which the Rough set theory is applied. The basic configuration of the decision table is shown in the upper part of FIG. The decision table illustrated in FIG. 8 is a two-dimensional table, in which the vertical axis represents the target total set U and the horizontal axis represents the attribute total set A. The entire target set U is a set of contents. On the other hand, the entire attribute set A is a set of attribute items. The entire attribute set A is a union of the condition attribute C and the decision attribute D. Both the condition attribute C and the decision attribute D include one or more attributes. The attribute value set V is a set of attribute values of nominal attributes that each attribute included in the attribute total set A can take. For example, when the attributes “genre” and “age” illustrated in FIG. 3 are members of the entire attribute set A, V = {{Rock, Jazz,...}, {'70s,' 80s,. }} Etc. The attribute value function ρ is a function that specifies an attribute value in response to one element (content) of the entire object set U and one element (attribute) of the entire attribute set A (ρ). : U × A → V).

図８の左下には、概念集合Ｘが示されている。概念集合Ｘは、特定の属性の属性値を１つの概念とみなした場合の、当該概念に属するコンテンツの集合である。例えば、属性Ａが“Ｙ”又は“Ｎ”の二値の属性値をとり得る場合に、属性Ａ＝“Ｙ”という概念についての概念集合Ｘは、属性Ａ＝“Ｙ”であるコンテンツを要素として含む。 A concept set X is shown in the lower left of FIG. The concept set X is a set of contents belonging to the concept when the attribute value of a specific attribute is regarded as one concept. For example, when the attribute A can take a binary attribute value “Y” or “N”, the concept set X for the concept of the attribute A = “Y” Include as.

図８の右下には、属性部分集合Ｐ（Ｐ⊆Ｃ）及びＱ（Ｑ⊆Ｄ）が示されている。属性部分集合Ｐは、条件属性Ｃのうち、重要度の算出に用いる条件属性の集合である。一方、属性部分集合Ｑは、決定属性Ｃのうち、重要度の算出に用いる決定属性の集合である。 In the lower right of FIG. 8, attribute subsets P (P⊆C) and Q (Q⊆D) are shown. The attribute subset P is a set of condition attributes used for calculating the importance of the condition attributes C. On the other hand, the attribute subset Q is a set of decision attributes used for calculation of importance among the decision attributes C.

（２）ＣｒｉｓｐＲｏｕｇｈ集合に関する定義式
表１は、上述した決定表に関連する諸概念を用いて記述される、ＣｒｉｓｐＲｏｕｇｈ集合に関する定義式を表す。 (2) Definition Formula for Crisp Rough Set Table 1 shows a definition formula for the Crisp Rough set described using various concepts related to the decision table described above.

図９を参照しながら、表１の各項目について説明する。図９の上部には、コンテンツＣ１〜Ｃ６を対象全体集合Ｕに含む決定表の一例が示されている。図９の決定表の条件属性の属性部分集合Ｐは、基本属性のうちのメタデータ「ジャンル」、「年代」、「キーワード（“世界”）」及び「ムード（“明るさ”）」を含む。ここでは、説明の簡明さの観点から、「ジャンル」、「年代」及び「ムード（“明るさ”）」は名義属性の形式で表現されるものとする。「決定属性の属性部分集合Ｑは、評価属性のうちのユーザＦＢ「好み」を含む。 Each item in Table 1 will be described with reference to FIG. In the upper part of FIG. 9, an example of a decision table including the contents C1 to C6 in the target entire set U is shown. The attribute subset P of the condition attributes in the decision table of FIG. 9 includes metadata “genre”, “age”, “keyword (“ world ”)”, and “mood (“ brightness ”)” among the basic attributes. . Here, from the viewpoint of simplicity of explanation, it is assumed that “genre”, “age”, and “mood (“ brightness ”)” are expressed in the form of nominal attributes. “The attribute subset Q of the decision attributes includes the user FB“ preference ”among the evaluation attributes.

表１において、Ｐ−同値関係の式は、条件属性の属性部分集合Ｐにおいて、対象全体集合Ｕに属するコンテンツｘとコンテンツｙとが同値関係であるための条件を表す。例えば、図９の決定表において、コンテンツＣ２の条件属性の属性値とコンテンツＣ６の条件属性の属性値とは等しく、コンテンツＣ２とコンテンツＣ６とは同値関係にある。 In Table 1, the expression of P-equivalence relationship represents a condition for content x and content y belonging to the target entire set U to have an equivalence relationship in the attribute subset P of the condition attributes. For example, in the determination table of FIG. 9, the attribute value of the condition attribute of the content C2 is equal to the attribute value of the condition attribute of the content C6, and the content C2 and the content C6 are in the same value relationship.

Ｐ−同値類は、条件属性の属性部分集合Ｐにおいて、あるコンテンツｘと同値関係にあるコンテンツの集合を表す。図９の決定表において、コンテンツＣ２及びコンテンツＣ６は、１つのＰ−同値類を形成する。 The P-equivalence class represents a set of contents having an equivalence relation with a certain content x in the attribute subset P of the condition attributes. In the decision table of FIG. 9, the content C2 and the content C6 form one P-equivalence class.

Ｐ−同値類によるＵ分割は、各々のＰ−同値類を部分集合として対象全体集合Ｕが分割され得ることを表す。図９の決定表において、５つの部分集合｛Ｃ１｝、｛Ｃ２，Ｃ６｝、｛Ｃ３｝、｛Ｃ４｝及び｛Ｃ５｝により、対象全体集合Ｕが分割され得る。 The U division by P-equivalence classes indicates that the target entire set U can be divided with each P-equivalence class as a subset. In the decision table of FIG. 9, the target entire set U can be divided by five subsets {C1}, {C2, C6}, {C3}, {C4}, and {C5}.

概念ＸのＰ−上近似は、条件属性の属性部分集合Ｐについての属性値に基づいて、概念集合Ｘに含まれる可能性があると判断されるコンテンツの集合を表す。図９の決定表において、決定属性Ｑの「好み」＝“Ｙ”の概念集合Ｘに含まれる可能性があるコンテンツは、コンテンツＣ１、Ｃ２，Ｃ４及びＣ６である。よって、「好み」＝“Ｙ”という概念の上近似は、集合｛Ｃ１，Ｃ２，Ｃ４，Ｃ６｝である。なお、コンテンツＣ２が当該上近似に含まれるのは、コンテンツＣ２と同値関係にあるコンテンツＣ６の「好み」が“Ｙ”であるためである。 The P-top approximation of the concept X represents a set of contents that are determined to be included in the concept set X based on the attribute values for the attribute subset P of the condition attribute. In the decision table of FIG. 9, contents C1, C2, C4, and C6 are contents that may be included in the concept set X of “preference” = “Y” of the decision attribute Q. Therefore, the upper approximation of the concept of “preference” = “Y” is the set {C1, C2, C4, C6}. The reason why the content C2 is included in the upper approximation is that the “favorite” of the content C6 that is equivalent to the content C2 is “Y”.

概念ＸのＰ−下近似は、条件属性の属性部分集合Ｐについての属性値に基づいて、概念集合Ｘに必然的に含まれると判断されるコンテンツの集合を表す。図９の決定表において、決定属性Ｑの「好み」＝“Ｙ”の概念集合Ｘに必然的に含まれると判断されるコンテンツは、コンテンツＣ１及びＣ４である。よって、「好み」＝“Ｙ”という概念の下近似は、集合｛Ｃ１，Ｃ４｝である。なお、コンテンツＣ６が当該下近似に含まれないのは、コンテンツＣ６と同値関係にあるコンテンツＣ２の「好み」が“Ｎ”であるためである。 The P-lower approximation of the concept X represents a set of contents that are determined to be necessarily included in the concept set X based on the attribute values for the attribute subset P of the condition attribute. In the decision table of FIG. 9, the contents that are inevitably determined to be included in the concept set X of “preference” = “Y” of the decision attribute Q are the contents C1 and C4. Therefore, the lower approximation of the concept of “preference” = “Y” is the set {C1, C4}. The reason why the content C6 is not included in the lower approximation is that the “favorite” of the content C2 having the same relationship with the content C6 is “N”.

Ｑ−同値類によるＵ分割は、各々のＱ−同値類を部分集合として対象全体集合Ｕが分割され得ることを表す。図９の決定表において、それぞれ決定属性Ｑの同値類である２つの部分集合｛Ｃ１，Ｃ４，Ｃ６｝及び｛Ｃ２，Ｃ３，Ｃ５｝により、対象全体集合Ｕが分割され得る。 The U division by the Q-equivalence class indicates that the target entire set U can be divided by using each Q-equivalence class as a subset. In the decision table of FIG. 9, the target entire set U can be divided by two subsets {C1, C4, C6} and {C2, C3, C5} that are equivalence classes of the decision attribute Q.

Ｐ，Ｑにおける正領域は、決定属性Ｑの同値類についてのＰ−下近似の、全ての同値類にわたっての和集合を表す。図９の決定表において、「好み」＝“Ｙ”という概念の下近似は、集合｛Ｃ１，Ｃ４｝である。「好み」＝“Ｎ”という概念の下近似は、集合｛Ｃ３，Ｃ５｝である。従って、図９の決定表における条件属性Ｐ，決定属性Ｑにおける正領域は、集合｛Ｃ１，Ｃ４｝，｛Ｃ３，Ｃ５｝｝である。 The positive region in P and Q represents the union over all equivalence classes of the P-lower approximation for the equivalence class of the decision attribute Q. In the decision table of FIG. 9, the lower approximation of the concept of “preference” = “Y” is the set {C1, C4}. The lower approximation of the concept of “preference” = “N” is the set {C3, C5}. Accordingly, the primary regions in the condition attribute P and the decision attribute Q in the decision table of FIG. 9 are the set {C1, C4}, {C3, C5}}.

（３）ＦｕｚｚｙＲｏｕｇｈ集合に関する定義式
表２は、上述した決定表に関連する諸概念についての、ＦｕｚｚｙＲｏｕｇｈ集合に関する定義式を表す。ＣｒｉｓｐＲｏｕｇｈ集合の代わりにＦｕｚｚｙＲｏｕｇｈ集合を用いることにより、名義属性に関する近似集合だけでなく、数値属性に関する近似集合をも扱うことが可能となる。例えば、表２において、ファジィ同値類μ_Ｆｉ（ｘ）（対象ｘにおける属性Ｆ_ｉのメンバシップ値）は、０．０〜１．０の連続値で表現される。 (3) Definition Formula for Fuzzy Rough Set Table 2 shows a definition formula for the Fuzzy Rough set for the concepts related to the decision table described above. By using the Fuzzy Rough set instead of the Crisp Rough set, it is possible to handle not only the approximate set related to the nominal attribute but also the approximate set related to the numerical attribute. For example, in Table 2, (membership value of the attribute _{F i} in the subject x) fuzzy equivalence classes _mu Fi (x) is represented by consecutive values of 0.0 to 1.0.

なお、ＦｕｚｚｙＲｏｕｇｈ集合理論については、Richard Jensen and Qiang Shenによる“Fuzzy-Rough Data Reduction with Ant Colony Optimization”（Fuzzy Sets and Systems, vol. 149, no. 1, pp. 5-20, 2005）において詳しく説明されているため、ここではその説明を省略する。 The Fuzzy Rough set theory is detailed in “Fuzzy-Rough Data Reduction with Ant Colony Optimization” by Richard Jensen and Qiang Shen (Fuzzy Sets and Systems, vol. 149, no. 1, pp. 5-20, 2005). Since it is described, its description is omitted here.

（４）重要度算出式
重要度算出部１５０による、ＣｒｉｓｐＲｏｕｇｈ集合についての重要度算出式を式（２）に示す。γ_Ｐ（Ｑ）は、決定属性（集合）Ｑに対する条件属性（集合）Ｐの重要度である。 (4) Importance Calculation Formula The importance calculation formula for the Crisp Rough set by the importance calculation unit 150 is shown in Formula (2). γ _P (Q) is the importance of the condition attribute (set) P with respect to the decision attribute (set) Q.

式（２）において、ｃａｒｄ（Ｋ）は、集合Ｋの濃度（cardinality）、即ち集合Ｋに含まれる元の数を表す。即ち、右辺の分母は、対象全体集合Ｕの濃度を表す。右辺の分子は、条件属性（集合）Ｐ、決定属性（集合）Ｑについての正領域の濃度を表す。 In Equation (2), card (K) represents the cardinality of the set K, that is, the original number included in the set K. That is, the denominator on the right side represents the density of the entire target set U. The numerator on the right side represents the density of the positive region for the condition attribute (set) P and the decision attribute (set) Q.

図９の例では、対象全体集合Ｕの濃度は、コンテンツＣ１〜Ｃ６の数、即ち６である。条件属性（集合）Ｐ、決定属性（集合）Ｑについての正領域の濃度は、「好み」＝“Ｙ”の下近似の濃度（＝２）及び「好み」＝“Ｎ”の下近似の濃度（＝２）の和に相当し、２＋２＝４である。従って、式（２）によれば、決定属性（「好み」）に対する条件属性の組合せ「ジャンル」、「年代」、「キーワード（“世界”）」及び「ムード（“明るさ”）」の重要度は、４／６＝０．６７と算出される。 In the example of FIG. 9, the density of the entire target set U is the number of contents C1 to C6, that is, six. The density of the positive region for the condition attribute (set) P and the decision attribute (set) Q is the density of the lower approximation (= 2) of “Preference” = “Y” and the density of the lower approximation of “Preference” = “N”. It corresponds to the sum of (= 2), and 2 + 2 = 4. Therefore, according to the formula (2), the combination of the condition attribute “genre”, “age”, “keyword (“ world ”)” and “mood (“ brightness ”)” with respect to the decision attribute (“preference”) is important. The degree is calculated as 4/6 = 0.67.

一般的に、正領域の濃度は、選択された条件属性の属性値が決定属性の属性値の決定により大きく寄与する場合、即ち、条件属性の属性値が与えられることで決定属性の属性値をより高い確度で予測できる場合に、高い値となる。逆に、条件属性の属性値が与えられても、決定属性の属性値がある程度の確度（あるいは必然性）をもって定まらない場合には、正領域の濃度は低くなる。これが意味するところは、対象全体集合Ｕの濃度を用いて式（２）のように正規化された正領域の濃度が、任意に選択される条件属性の組合せ（条件属性の属性部分集合Ｐ）の、決定属性（決定属性の属性部分集合Ｑ）の属性値に対する情報伝達性（informativity）の指標（あるいは近似の質の指標）として扱い得るということである。従って、例えば、属性テーブルに含まれる評価属性を決定属性とし、その他の任意の属性の組合せを条件属性として各組合せについて重要度を算出することにより、コンテンツの推薦の基礎となるスコア又はユーザＦＢがどのような属性の組合せによって有意に決定され又は予測され得るかを、数値的に評価することが可能となる。 Generally, the density of the positive region is determined when the attribute value of the selected condition attribute greatly contributes to the determination of the attribute value of the determination attribute, that is, the attribute value of the determination attribute is given by the attribute value of the condition attribute. A high value is obtained when prediction can be made with higher accuracy. On the other hand, if the attribute value of the decision attribute is not determined with a certain degree of accuracy (or necessity) even if the attribute value of the condition attribute is given, the density of the normal region is low. This means that a combination of condition attributes (condition attribute attribute subset P) in which the density of the normal region normalized by using the density of the entire object set U is arbitrarily selected as shown in Equation (2). It can be treated as an index (or an approximate quality index) of informativity for the attribute value of the determination attribute (attribute subset Q of the determination attribute). Therefore, for example, by calculating the importance for each combination using the evaluation attribute included in the attribute table as the determination attribute and the combination of any other attribute as the condition attribute, the score or the user FB as the basis for recommending the content can be obtained. It is possible to numerically evaluate what combination of attributes can be determined or predicted significantly.

重要度算出部１５０による、ＦｕｚｚｙＲｏｕｇｈ集合についての重要度算出式は、式（３）の通りである。式（３）に従って算出される重要度もまた、任意に選択される条件属性の組合せ（条件属性の属性部分集合Ｐ）の、決定属性（決定属性の属性部分集合Ｑ）の属性値に対する情報伝達性の指標として扱い得る。 The importance calculation formula for the Fuzzy Rough set by the importance calculation unit 150 is as shown in Expression (3). The importance calculated in accordance with the expression (3) is also transmitted to the attribute value of the decision attribute (the attribute subset Q of the decision attribute) of the arbitrarily selected combination of the condition attributes (the attribute subset P of the condition attribute) It can be treated as an index of sex.

重要度算出部１５０は、様々な条件属性の組合せ（以下、条件属性セットという）について、式（２）又は式（３）に従って、決定属性Ｑに対する重要度を計算する。 The importance calculation unit 150 calculates the importance for the decision attribute Q according to the formula (2) or the formula (3) for various combinations of condition attributes (hereinafter referred to as condition attribute sets).

図１０は、図９の上部に例示した決定表に基づいて算出される、様々な条件属性セットについての重要度の一例について説明するための説明図である。図１０を参照すると、「ジャンル」、「年代」、「キーワード（“世界”）」及び「ムード（“明るさ”）」という４つの元を含む条件属性Ｃから選択され得る１５通りの条件属性セットＡＳ１〜ＡＳ１５が示されている（そのうち一部は省略されている）。例えば、条件属性セットＡＳ１は、「ジャンル」のみを含む。条件属性セットＡＳ２は、「年代」のみを含む。また、条件属性セットＡＳ５は、「ジャンル」と「年代」とを含む。条件属性セットＡＳ１５は、「ジャンル」、「年代」、「キーワード（“世界”）」及び「ムード（“明るさ”）」の全てを含む。 FIG. 10 is an explanatory diagram for explaining an example of importance levels for various condition attribute sets calculated based on the determination table illustrated in the upper part of FIG. 9. Referring to FIG. 10, 15 condition attributes that can be selected from condition attributes C including four elements of “genre”, “age”, “keyword (“ world ”)”, and “mood (“ brightness ”)”. Sets AS1 to AS15 are shown (some of which are omitted). For example, the condition attribute set AS1 includes only “genre”. The condition attribute set AS2 includes only “age”. The condition attribute set AS5 includes “genre” and “age”. The condition attribute set AS15 includes all of “genre”, “age”, “keyword (“ world ”)”, and “mood (“ brightness ”)”.

重要度算出部１５０は、例えば、１つの属性（図１０の例では「好み」）を決定属性とし、各条件属性セットＡＳ１〜ＡＳ１５を条件属性の属性部分集合として、各条件属性セットについての重要度を式（２）又は式（３）に従って算出する。その結果、例えば、条件属性セットＡＳ７の重要度が最も高い値を示したものとする。図１０の例では、条件属性セットＡＳ７の重要度は０．５５である。この場合、条件属性セットＡＳ７に含まれる「ジャンル」及び「ムード（“明るさ”）」の組合せが、決定属性「好み」の属性値を決定し又は予測するために最も有意な属性であるということができる。 The importance calculation unit 150 uses, for example, one attribute (“preference” in the example of FIG. 10) as a decision attribute, and sets each condition attribute set AS1 to AS15 as an attribute subset of condition attributes. The degree is calculated according to formula (2) or formula (3). As a result, for example, it is assumed that the condition attribute set AS7 shows the highest importance value. In the example of FIG. 10, the importance of the condition attribute set AS7 is 0.55. In this case, the combination of “genre” and “mood (“ brightness ”)” included in the condition attribute set AS7 is the most significant attribute for determining or predicting the attribute value of the determination attribute “preference”. be able to.

重要度算出部１５０は、このような条件属性セットごとの重要度の算出結果を、抽出部１６０へ出力する。 The importance calculation unit 150 outputs the calculation result of the importance for each condition attribute set to the extraction unit 160.

［２−７．抽出部］
抽出部１６０は、重要度算出部１５０により算出される上述した重要度に応じて、コンテンツに関するユーザに呈示すべき情報を生成するために使用される１つ以上の重要属性を抽出する。コンテンツに関するユーザに呈示すべき情報とは、例えば、推薦すべきコンテンツに関する情報と共に呈示される推薦理由などであってよい。その代わりに、コンテンツに関するユーザに呈示すべき情報とは、例えば、コンテンツリスト（例えば音楽コンテンツを一覧化したプレイリストなど）のタイトルであってもよい。抽出部１６０による重要属性の抽出処理には、主に（１）直接抽出と（２）段階的抽出の２通りのパターンが存在する。 [2-7. Extraction unit]
The extraction unit 160 extracts one or more important attributes used to generate information about the content to be presented to the user according to the above-described importance calculated by the importance calculation unit 150. The information to be presented to the user regarding the content may be, for example, the reason for recommendation presented together with the information regarding the content to be recommended. Instead, the information about the content to be presented to the user may be, for example, the title of a content list (for example, a playlist that lists music content). The important attribute extraction processing by the extraction unit 160 mainly includes two patterns: (1) direct extraction and (2) stepwise extraction.

（１）直接抽出
直接抽出の場合には、抽出部１６０は、例えば、コンテンツの推薦の基礎とされる属性を決定属性とし、推薦理由の生成のために用い得る１つ以上の属性を条件属性として重要度算出部１５０により算出される重要度を、重要属性の抽出のために使用する。 (1) Direct extraction In the case of direct extraction, the extraction unit 160 uses, for example, an attribute that is a basis for content recommendation as a decision attribute, and uses one or more attributes that can be used for generating a recommendation reason as a conditional attribute. The importance calculated by the importance calculation unit 150 is used for extracting important attributes.

例えば、推薦部１４０がユーザＦＢについての属性値を用いて推薦すべきコンテンツを選択する場合に、その推薦理由をコンテンツのメタデータを用いて生成することを想定する。その場合、例えば、図１０に例示したように、重要度算出部１５０は、コンテンツのメタデータ（「ジャンル」、「年代」等）の属性の様々な組合せ（条件属性セット）について、ユーザＦＢ（「好み」）を決定属性とした場合の重要度を算出する。そして、抽出部１６０は、例えば、算出された重要度が最も高い条件属性セットに含まれる１つ以上の属性を、重要属性として抽出する。即ち、図１０の例では、「ジャンル」及び「ムード（“明るさ”）」が重要属性として抽出され得る。 For example, it is assumed that when the recommendation unit 140 selects content to be recommended using an attribute value for the user FB, the reason for recommendation is generated using content metadata. In this case, for example, as illustrated in FIG. 10, the importance calculation unit 150 performs user FB (condition attribute set) on various combinations (condition attribute sets) of content metadata (“genre”, “age”, etc.). The degree of importance is calculated when “preference”) is used as a decision attribute. Then, the extraction unit 160 extracts, for example, one or more attributes included in the condition attribute set having the highest calculated importance as the important attributes. That is, in the example of FIG. 10, “genre” and “mood (“ brightness ”)” can be extracted as important attributes.

図１１は、決定表に基づいて算出される重要度の他の例について説明するための説明図である。図１１の例に関連し、例えば、推薦処理を実行している時間帯（例えばＴ１）においてより多く閲覧等された実績のあるコンテンツが、推薦すべきコンテンツとして選択され得る状況を想定する。その場合、例えば、重要度算出部１５０は、コンテンツのメタデータ（「ジャンル」、「年代」等）の属性の様々な組合せ（条件属性セット）について、コンテキストデータ（例えば「時間帯：Ｔ１」）を決定属性とした場合の重要度を算出する。そして、抽出部１６０は、例えば、算出された重要度が最も高い条件属性セットに含まれる１つ以上の属性を、重要属性として抽出する。図１１の例では、「ジャンル」及び「年代」を含む条件属性セットＡＳ５の重要度が最も高いため（０．６５）、「ジャンル」及び「年代」が重要属性として抽出され得る。 FIG. 11 is an explanatory diagram for describing another example of the importance calculated based on the determination table. In relation to the example of FIG. 11, for example, a situation is assumed in which content with a history of browsing or the like in a time zone (for example, T1) in which recommendation processing is executed can be selected as content to be recommended. In this case, for example, the importance calculation unit 150 uses context data (for example, “time zone: T1”) for various combinations (condition attribute sets) of attributes of content metadata (“genre”, “age”, etc.). The degree of importance is calculated when the is a decision attribute. Then, the extraction unit 160 extracts, for example, one or more attributes included in the condition attribute set having the highest calculated importance as the important attributes. In the example of FIG. 11, since the condition attribute set AS5 including “genre” and “age” has the highest importance (0.65), “genre” and “age” can be extracted as important attributes.

なお、抽出部１６０は、算出された重要度が最も高い条件属性セットに含まれる属性を重要属性として抽出する代わりに、予め設定される閾値を超える重要度を示す１つ以上の条件属性セットに含まれる属性を重要属性として抽出してもよい。また、抽出部１６０は、算出された重要度が高い順にＮ個（Ｎは予め設定される）の属性を重要属性として抽出してもよい。 Note that the extraction unit 160 does not extract the attribute included in the calculated condition attribute set having the highest importance level as an important attribute, but extracts one or more condition attribute sets that indicate an importance level exceeding a preset threshold. The included attributes may be extracted as important attributes. In addition, the extraction unit 160 may extract N (N is preset) attributes as important attributes in descending order of the calculated importance.

（２）段階的抽出
段階的抽出の場合には、抽出部１６０は、例えば、コンテンツの推薦の基礎とされる属性を決定属性とし、拡張属性に含まれる１つ以上の属性を条件属性として重要度算出部１５０により算出される第１の重要度と、拡張属性に含まれる属性を決定属性、推薦理由の生成のために用い得る１つ以上の属性を条件属性として重要度算出部１５０により算出される第２の重要度とを、重要属性の抽出のために使用する。 (2) Stepwise extraction In the case of stepwise extraction, the extraction unit 160 uses, for example, an attribute that is a basis for content recommendation as a decision attribute, and one or more attributes included in an extended attribute as important as a condition attribute. Calculated by the importance calculation unit 150 using the first importance calculated by the degree calculation unit 150 and the attribute included in the extended attribute as a decision attribute and one or more attributes that can be used for generating a recommendation reason as a condition attribute The second importance degree to be used is used for extraction of important attributes.

図１２は、抽出部１６０による重要属性の段階的抽出について説明するための説明図である。図１２の上部では、コンテンツの推薦の基礎とされる推薦アルゴリズムのスコアを決定属性Ｑ１とし、拡張属性のレビュー文層に含まれる潜在トピックＸ１〜Ｘｎの任意の組合せＰ１を条件属性として、重要度算出部１５０により第１の重要度γ_Ｐ１（Ｑ１）（Ｐ１⊆Ｃ１）が算出されている。ここで、例えば、Ｐ１＝｛Ｘ１，Ｘ２｝のとき第１の重要度γ_Ｐ１（Ｑ１）が最大であったものとする。 FIG. 12 is an explanatory diagram for explaining the stepwise extraction of important attributes by the extraction unit 160. In the upper part of FIG. 12, the score of the recommendation algorithm that is the basis of content recommendation is the determination attribute Q1, and any combination P1 of the latent topics X1 to Xn included in the review sentence layer of the extended attribute is the condition attribute. The calculation unit 150 calculates the first importance γ _P1 (Q1) ( _P1算出 C1). Here, for example, it is assumed that the first importance γ _P1 (Q1) is the maximum when P1 = {X1, X2}.

図１２の下部では、Ｐ１＝｛Ｘ１，Ｘ２｝を決定属性Ｑ２とし、推薦理由の生成のために用い得るキーワードＫ１、Ｋ２、Ｋ３、…の任意の組合せＰ２を条件属性として、重要度算出部１５０により第２の重要度γ_Ｐ２（Ｑ２）が算出されている。これら、第１の重要度及び第２の重要度の算出結果から、抽出部１６０は、例えば、第２の重要度γ_Ｐ２（Ｑ２）が最も高い条件属性セットＰ２（Ｑ２＝Ｐ１＝｛Ｘ１，Ｘ２｝）に含まれるキーワードを、重要属性として抽出する。 In the lower part of FIG. 12, P1 = {X1, X2} is a decision attribute Q2, and an arbitrary combination P2 of keywords K1, K2, K3,. 150, the second importance γ _P2 (Q2) is calculated. From the calculation results of the first importance level and the second importance level, the extraction unit 160, for example, sets the condition attribute set P2 (Q2 = P1 = {X1, Q2) having the highest second importance level γ _P2 (Q2). X2}) are extracted as important attributes.

なお、段階的抽出の手法は、かかる例に限定されない。図１２を用いて説明した手法は、第１の重要度を用いて推薦アルゴリズムのスコアに最も寄与する拡張属性（レビュー文層の潜在トピック）のセットを特定した後、特定した拡張属性のセットに寄与する基本属性（キーワード）のセットを第２の重要度を用いて抽出する手法であった。その代わりに、抽出部１６０は、例えば、第１の重要度と第２の重要度とを共通する拡張属性のセットについて乗算し、その乗算結果がより大きくなる基本属性のセットに含まれる属性を重要属性として抽出してもよい。 Note that the stepwise extraction method is not limited to such an example. The method described with reference to FIG. 12 uses the first importance to identify a set of extended attributes (latent topics in the review sentence layer) that most contributes to the score of the recommendation algorithm, and then to the specified set of extended attributes. This is a method of extracting a set of contributing basic attributes (keywords) using the second importance. Instead, for example, the extraction unit 160 multiplies the first importance level and the second importance level with respect to the common extended attribute set, and adds the attributes included in the basic attribute set whose multiplication result is larger. It may be extracted as an important attribute.

なお、直接抽出か段階的抽出かによらず、条件属性と決定属性との組は、上述した例に限定されない。抽出部１６０は、このように抽出した重要属性を、推薦部１４０へ出力する。 Regardless of direct extraction or stepwise extraction, the combination of the condition attribute and the determination attribute is not limited to the above-described example. The extraction unit 160 outputs the important attributes extracted in this way to the recommendation unit 140.

［２−８．画面例］
図１３及び図１４は、推薦理由が呈示される画面の例をそれぞれ示している。図１３を参照すると、推薦部１４０により生成され、ＵＩ制御部１３０を介して表示装置により表示される一例としての推薦画面１３２ａが示されている。推薦画面１３２ａには、ユーザに推薦される音楽コンテンツのジャケット画像１３４ａ、当該コンテンツの説明文１３６ａ及び推薦理由１３８ａが含まれる。推薦理由１３８ａには、抽出部１６０により抽出された重要属性を表す文字列が列挙されている。このような推薦理由１３８ａを表示することにより、推薦されたコンテンツについての推薦の理由をユーザに納得させることができる。また、ユーザは、推薦されたコンテンツについてのアクション（視聴、購買、又は無視等）をより容易に決定することができる。 [2-8. Screen example]
13 and 14 show examples of screens on which the reasons for recommendation are presented. Referring to FIG. 13, an example recommendation screen 132 a generated by the recommendation unit 140 and displayed on the display device via the UI control unit 130 is shown. The recommendation screen 132a includes a jacket image 134a of music content recommended by the user, a description sentence 136a of the content, and a recommendation reason 138a. In the recommendation reason 138a, character strings representing the important attributes extracted by the extraction unit 160 are listed. By displaying such a recommendation reason 138a, it is possible to convince the user of the reason for recommendation for the recommended content. Also, the user can more easily determine an action (viewing, purchasing, ignoring, etc.) for the recommended content.

図１４を参照すると、推薦部１４０により生成され、ＵＩ制御部１３０を介して表示装置により表示される他の例としての推薦画面１３２ｂが示されている。推薦画面１３２ｂには、ユーザに推薦される音楽コンテンツのジャケット画像１３４ｂ、当該コンテンツの説明文１３６ｂ及び推薦理由１３８ｂが含まれる。推薦理由１３８ｂには、抽出部１６０により抽出された重要属性を用いて生成された推薦理由の文章が示されている。このような推薦理由１３８ｂによっても、推薦されたコンテンツについての推薦の理由をユーザに納得させ、かつ、推薦されたコンテンツについてのユーザのアクションの決定をより容易にすることができる。 Referring to FIG. 14, a recommendation screen 132 b as another example generated by the recommendation unit 140 and displayed on the display device via the UI control unit 130 is shown. The recommendation screen 132b includes a jacket image 134b of music content recommended by the user, a description 136b of the content, and a reason for recommendation 138b. In the recommendation reason 138b, a sentence of the reason for recommendation generated by using the important attribute extracted by the extraction unit 160 is shown. Such a recommendation reason 138b can also convince the user of the reason for recommendation for the recommended content, and can more easily determine the user's action for the recommended content.

＜３．一実施形態に係る処理の流れ＞
次に、図１５及び図１６を用いて、本実施形態に係る情報処理装置１００による処理の流れを説明する。 <3. Flow of processing according to one embodiment>
Next, the flow of processing by the information processing apparatus 100 according to the present embodiment will be described with reference to FIGS. 15 and 16.

［３−１．事前処理］
図１５は、本実施形態に係る情報処理装置１００による事前処理の流れの一例を示すフローチャートである。事前処理は、コンテンツの推薦処理よりも以前に行われ得る処理である。事前処理は、例えば、一定の数のコンテンツが蓄積された時、又は所定の期間ごとに定期的に実行され得る。 [3-1. Pre-processing]
FIG. 15 is a flowchart illustrating an example of the flow of pre-processing by the information processing apparatus 100 according to the present embodiment. The pre-processing is processing that can be performed before the content recommendation processing. The pre-processing can be executed, for example, when a certain number of contents are accumulated or periodically every predetermined period.

図１５を参照すると、まず、解析部１２０により、記憶部１１０の属性テーブルに記憶されているコンテンツの基本属性が取得される（ステップＳ１０２）。次に、解析部１２０は、ＰＬＳＡ又はＬＤＡによる確率的分類法に従って、基本属性の属性値に基づいて、拡張属性の属性値を算出する（ステップＳ１０４）。解析部１２０は、ここで算出した拡張属性の属性値を属性テーブルに格納する。 Referring to FIG. 15, first, the basic attribute of content stored in the attribute table of the storage unit 110 is acquired by the analysis unit 120 (step S <b> 102). Next, the analysis unit 120 calculates the attribute value of the extended attribute based on the attribute value of the basic attribute according to the probabilistic classification method by PLSA or LDA (step S104). The analysis unit 120 stores the attribute value of the extended attribute calculated here in the attribute table.

次に、重要度算出部１５０により、推薦部１４０による推薦処理の中で重要属性の段階的抽出が行われるか否かが判定される（ステップＳ１０６）。ここで、重要属性の段階的抽出が行われない場合には、ステップＳ１０８はスキップされる。重要属性の段階的抽出が行われる場合には、重要度算出部１５０は、解析部１２０により算出された拡張属性を決定属性とし、基本属性を条件属性とした場合の重要度（段階的抽出における第２の重要度）を算出する（ステップＳ１０８）。重要度算出部１５０は、例えば、ここで算出した重要度を、後の推薦処理のために記憶部１１０に記憶させる。 Next, the importance level calculation unit 150 determines whether or not stepwise extraction of important attributes is performed in the recommendation process by the recommendation unit 140 (step S106). Here, if the important attributes are not extracted stepwise, step S108 is skipped. When the important attributes are stepwise extracted, the importance degree calculation unit 150 uses the extended attribute calculated by the analysis unit 120 as the decision attribute and the basic attribute as the condition attribute (in the stepwise extraction). The second importance) is calculated (step S108). For example, the importance calculation unit 150 stores the importance calculated here in the storage unit 110 for later recommendation processing.

このような事前処理のほかに、例えば、ユーザによるコンテンツの閲覧等のアクションが行われた場合には、ＵＩ制御部１３０によるコンテキストデータの更新、ユーザＦＢの登録などの処理が行われ得る。また、それに応じて、拡張属性の再計算などが行われてもよい。 In addition to such pre-processing, for example, when an action such as browsing of content is performed by the user, processing such as updating of context data and registration of the user FB by the UI control unit 130 may be performed. Further, recalculation of extended attributes may be performed accordingly.

［３−２．推薦処理］
図１６は、本実施形態に係る情報処理装置１００による推薦処理の流れの一例を示すフローチャートである。推薦処理は、例えば、ユーザ又は端末装置からの要求に応じて行われ得る処理である。 [3-2. Recommendation process]
FIG. 16 is a flowchart illustrating an example of a flow of recommendation processing by the information processing apparatus 100 according to the present embodiment. The recommendation process is a process that can be performed in response to a request from a user or a terminal device, for example.

図１６を参照すると、まず、推薦部１４０により、記憶部１１０が保持している属性テーブルに記憶されている属性値を用いて、ユーザに推薦すべきコンテンツが選択される（ステップＳ２０２）。次に、推薦部１４０は、推薦アルゴリズムにより出力されるスコアを、評価属性の属性値として属性テーブルに格納する（ステップＳ２０４）。次に、重要度算出部１５０により、重要属性の段階的抽出が行われるか否かが判定される（ステップＳ２０６）。ここで、重要属性の段階的抽出が行われる場合には、処理はステップＳ２０８へ進む。一方、重要属性の段階的抽出が行われない場合には、処理はステップＳ２１２へ進む。 Referring to FIG. 16, first, the recommendation unit 140 selects content to be recommended to the user using the attribute values stored in the attribute table held by the storage unit 110 (step S202). Next, the recommendation unit 140 stores the score output by the recommendation algorithm in the attribute table as the attribute value of the evaluation attribute (step S204). Next, it is determined by the importance level calculation unit 150 whether or not the stepwise extraction of important attributes is performed (step S206). Here, when the important attribute is extracted stepwise, the process proceeds to step S208. On the other hand, if the important attributes are not extracted stepwise, the process proceeds to step S212.

ステップＳ２０８では、重要度算出部１５０は、例えば、推薦部１４０により出力されたスコア（又はユーザＦＢなどの他の属性）を決定属性、拡張属性を条件属性とした場合の重要度（段階的抽出における第１の重要度）を算出する（ステップＳ２０８）。次に、抽出部１６０は、第１の重要度及び第２の重要度に応じて、重要属性を抽出する（ステップＳ２１０）。 In step S208, the importance calculation unit 150, for example, the importance (stepwise extraction) when the score (or other attribute such as the user FB) output by the recommendation unit 140 is the determination attribute and the extended attribute is the condition attribute. (First importance level) is calculated (step S208). Next, the extraction unit 160 extracts important attributes according to the first importance level and the second importance level (step S210).

一方、ステップＳ２１２では、重要度算出部１５０は、例えば、推薦部１４０により出力されたスコア（又はユーザＦＢなどの他の属性）を決定属性、基本属性を条件属性とした場合の重要度（直接抽出における重要度）を算出する（ステップＳ２１２）。次に、抽出部１６０は、重要度算出部１５０により算出された重要度に応じて、重要属性を抽出する（ステップＳ２１４）。 On the other hand, in step S212, the importance calculation unit 150 uses, for example, the importance (directly) when the score (or another attribute such as the user FB) output by the recommendation unit 140 is the determination attribute and the basic attribute is the condition attribute. The importance in the extraction is calculated (step S212). Next, the extraction unit 160 extracts an important attribute according to the importance calculated by the importance calculation unit 150 (step S214).

次に、推薦部１４０は、抽出部１６０により抽出された重要属性に基づいて、ユーザに呈示すべき推薦理由を含む推薦画面を生成する（ステップＳ２１６）。次に、ＵＩ制御部１３０は、推薦部１４０により生成された推薦画面を表示装置に表示させる（ステップＳ２１８）。その後、推薦すべきコンテンツを変更する場合には、処理はステップＳ２０２へ戻り、上述した処理が繰り返される。一方、推薦すべき新たなコンテンツが存在しない場合には、推薦処理は終了する。 Next, the recommendation unit 140 generates a recommendation screen including a recommendation reason to be presented to the user based on the important attributes extracted by the extraction unit 160 (step S216). Next, the UI control unit 130 causes the display device to display the recommendation screen generated by the recommendation unit 140 (step S218). Thereafter, when the content to be recommended is changed, the processing returns to step S202, and the above-described processing is repeated. On the other hand, when there is no new content to be recommended, the recommendation process ends.

なお、図１６に示したフローチャートでは、１つのコンテンツについて重要属性の直接抽出と段階的抽出のいずれか一方が行われる例を示したが、１つのコンテンツについて重要属性の直接抽出と段階的抽出の双方が行われてもよい。例えば、情報処理装置１００は、属性値集合の元の少ない基本属性（「ジャンル」、「年代」、「ムード」など）については直接抽出を行い、属性値集合の元の多い基本属性（「キーワード」、「アーティスト」など）については拡張属性を利用した段階的抽出を行ってもよい。拡張属性の属性空間の次元は基本属性の属性空間の次元に比べて一般的に低い（典型的には、潜在トピックの数に相当する）ため、拡張属性を条件属性として中間的に利用することで、推薦処理の最中の重要度算出処理の計算コストを抑えることができる。また、確率的分類法の利点として、潜在的なキーワード間又はアーティスト間の関連性（例えば同義関係など）を重要度の算出結果に反映させることができる。 Note that the flowchart shown in FIG. 16 shows an example in which either one of the important attribute direct extraction or stepwise extraction is performed for one content. However, the important attribute direct extraction or stepwise extraction is performed for one content. Both may be performed. For example, the information processing apparatus 100 directly extracts basic attributes (“genre”, “age”, “mood”, etc.) with a small original attribute value set, and basic attributes (“keyword” with a large attribute value set) ”,“ Artist ”, etc.) may be extracted step by step using extended attributes. The attribute space dimension of the extended attribute is generally lower than the attribute space dimension of the basic attribute (typically equivalent to the number of potential topics), so the extended attribute should be used as a conditional attribute in the middle Thus, the calculation cost of the importance calculation process during the recommendation process can be suppressed. Further, as an advantage of the probabilistic classification method, it is possible to reflect the relationship between potential keywords or artists (for example, synonymous relationships) in the calculation result of the importance.

＜４．変形例＞
［４−１．プレイリストの提供］
図１７は、本実施形態の一変形例に係る情報処理装置２００の構成を示すブロック図である。図１７を参照すると、情報処理装置２００は、記憶部１１０、解析部１２０、ユーザインタフェース（ＵＩ）制御部１３０、コンテンツリスト生成部２４０、重要度算出部２５０及び抽出部２６０を備える。情報処理装置２００においては、図１に示した推薦部１４０の代わりに、コンテンツリスト生成部２４０が設けられている。 <4. Modification>
[4-1. Provision of playlist]
FIG. 17 is a block diagram illustrating a configuration of an information processing apparatus 200 according to a modification of the present embodiment. Referring to FIG. 17, the information processing apparatus 200 includes a storage unit 110, an analysis unit 120, a user interface (UI) control unit 130, a content list generation unit 240, an importance calculation unit 250, and an extraction unit 260. In the information processing apparatus 200, a content list generation unit 240 is provided instead of the recommendation unit 140 illustrated in FIG.

コンテンツリスト生成部２４０は、ユーザの指定に応じて、又は属性テーブルに記憶されている属性値を用いて、ＵＩ制御部１３０が再生すべきコンテンツのリストを生成する。また、コンテンツリスト生成部２４０は、抽出部２６０により抽出される１つ以上の重要属性に基づいて、コンテンツリストのタイトルを生成する。そして、コンテンツリスト生成部２４０は、タイトルを付したコンテンツリストをＵＩ制御部１３０を介してユーザに呈示する。コンテンツリスト生成部２４０により生成されるコンテンツリストは、例えば、再生すべき音楽コンテンツ又は映像コンテンツなどを一覧化したプレイリストであってよい。 The content list generation unit 240 generates a list of contents to be played back by the UI control unit 130 in accordance with a user designation or using attribute values stored in the attribute table. In addition, the content list generation unit 240 generates a title of the content list based on one or more important attributes extracted by the extraction unit 260. Then, the content list generation unit 240 presents the content list with the title to the user via the UI control unit 130. The content list generated by the content list generation unit 240 may be, for example, a playlist that lists music content or video content to be reproduced.

コンテンツリスト生成部２４０により生成されるコンテンツリストには、例えば、ユーザにより指定される一群のコンテンツが含まれる。コンテンツリストに含めるべきコンテンツをユーザが指定した場合には、コンテンツリスト生成部２４０は、例えば、属性テーブルの評価属性として、ユーザによる指定の有無に応じた属性値（例えば、指定ありなら“１”、指定なしなら“０”）を有するスコアを格納する。一方、コンテンツリスト生成部２４０は、上述したいずれかの推薦アルゴリズムを用いてコンテンツリストに含めるべきコンテンツを自ら選択した場合には、推薦アルゴリズムが出力するスコアを属性テーブルに格納する。 The content list generated by the content list generation unit 240 includes, for example, a group of contents specified by the user. When the user specifies content to be included in the content list, the content list generation unit 240, for example, as an evaluation attribute of the attribute table, an attribute value according to the presence / absence of designation by the user (for example, “1” if designated) If there is no designation, a score having “0”) is stored. On the other hand, the content list generation unit 240 stores the score output by the recommendation algorithm in the attribute table when one of the above-described recommendation algorithms is used to select content to be included in the content list.

さらに、例えば、コンテンツリスト生成部２４０は、属性テーブルに記憶されているユーザＦＢについての属性値を用いて、コンテンツリストに含めるべきコンテンツを選択してもよい。また、例えば、コンテンツリスト生成部２４０は、属性テーブルに記憶されているコンテキストデータについての属性値を用いて、コンテンツリストに含めるべきコンテンツを選択してもよい。例えば、特定の時間帯においてより多く再生された実績のあるコンテンツが、コンテンツリストに含めるべきコンテンツとして優先的に選択され得る。 Further, for example, the content list generation unit 240 may select content to be included in the content list using the attribute value for the user FB stored in the attribute table. In addition, for example, the content list generation unit 240 may select content to be included in the content list using the attribute value for the context data stored in the attribute table. For example, content that has been played more frequently in a specific time zone can be preferentially selected as content to be included in the content list.

コンテンツリスト生成部２４０は、コンテンツリストを生成すると、コンテンツリストの生成のために使用した所定の属性に対する重要度を、当該所定の属性以外の１つ以上の属性について、重要度算出部２５０に算出させる。また、コンテンツリスト生成部２４０は、コンテンツリストのタイトルを生成するために使用される１つ以上の重要属性を、抽出部２６０に抽出させる。そして、コンテンツリスト生成部２４０は、抽出部２６０により抽出された１つ以上の重要属性を用いてコンテンツリストのタイトルを生成し、当該タイトルを付したコンテンツリストをＵＩ制御部１３０を介してユーザに呈示する。 When the content list is generated, the content list generation unit 240 calculates the importance for the predetermined attribute used for generating the content list to the importance calculation unit 250 for one or more attributes other than the predetermined attribute. Let In addition, the content list generation unit 240 causes the extraction unit 260 to extract one or more important attributes used for generating the title of the content list. Then, the content list generation unit 240 generates a title of the content list using one or more important attributes extracted by the extraction unit 260, and sends the content list with the title to the user via the UI control unit 130. Present.

重要度算出部２５０は、属性テーブルに記憶されている属性値を用いて、コンテンツの所定の属性に対する重要度を、コンテンツの当該所定の属性以外の１つ以上の属性について算出する。例えば、重要度算出部２５０は、ユーザによる指定の有無に応じて属性値が決定された評価属性を決定属性とする決定表を用いて重要度を算出してもよい。また、例えば、重要度算出部２５０は、コンテンツリスト生成部２４０によるコンテンツリストの生成のために使用された属性（例えば、推薦アルゴリズムのスコアなど）を決定属性とする決定表を用いて重要度を算出してもよい。重要度算出部２５０は、上述した重要度算出部１５０と同様に複数の条件属性セットごとに算出した重要度を、抽出部２６０へ出力する。 The importance level calculation unit 250 uses the attribute values stored in the attribute table to calculate the importance level for the predetermined attribute of the content for one or more attributes other than the predetermined attribute of the content. For example, the importance level calculation unit 250 may calculate the importance level using a determination table in which an evaluation attribute whose attribute value is determined according to whether or not a user designates is used as a determination attribute. Further, for example, the importance level calculation unit 250 uses the decision table having the attribute (for example, the recommendation algorithm score) used for generating the content list by the content list generation unit 240 as the determination attribute. It may be calculated. Similar to the importance calculation unit 150 described above, the importance calculation unit 250 outputs the importance calculated for each of the plurality of condition attribute sets to the extraction unit 260.

抽出部２６０は、重要度算出部２５０により算出される上述した重要度に応じて、コンテンツリストのタイトルを生成するために使用される１つ以上の重要属性を抽出する。上述した抽出部１６０と同様、抽出部２６０による重要属性の抽出処理もまた、直接抽出と段階的抽出の２通りのパターンによって行われ得る。 The extraction unit 260 extracts one or more important attributes used to generate the title of the content list according to the above-described importance calculated by the importance calculation unit 250. Similar to the extraction unit 160 described above, the important attribute extraction processing by the extraction unit 260 can also be performed by two patterns of direct extraction and stepwise extraction.

このような変形例によれば、例えば、ユーザの指定に応じてコンテンツリストが生成される場合にも、直接的にはシステムが知り得ないユーザの意図又は感情などに関連する重要属性の組合せが抽出され、その重要属性を用いてコンテンツリストのタイトルを動的に生成することができる。また、例えば、推薦アルゴリズムを用いてシステムがコンテンツリストを生成する場合にも、その推薦の結果にふさわしいタイトルを動的に生成することができる。なお、コンテンツリストのタイトルは、例えば、「Ｒｏｃｋ，Ｓｃａｎｄａｌ」のような属性文字列の単純な結合であってもよく、その代わりに、よりタイトルらしく加工されたものであってもよい。 According to such a modification, for example, even when a content list is generated in accordance with a user's specification, there are combinations of important attributes related to the user's intention or emotion that the system cannot directly know. The title of the content list can be dynamically generated using the extracted important attributes. Further, for example, when the system generates a content list using a recommendation algorithm, a title suitable for the result of the recommendation can be dynamically generated. The title of the content list may be, for example, a simple combination of attribute character strings such as “Rock, Scandal”, or may be processed more like a title instead.

［４−２．個人化］
図１８は、本実施形態の他の変形例について説明するための説明図である。図１８の上部には、図１に示した記憶部１１０により保持される属性テーブルが示されている。また、属性テーブルの下に、ユーザＵ１の各コンテンツに対するユーザアクションの履歴を表す履歴データが示されている。かかる履歴データは、例えば、記憶部１１０によりユーザごとに（又は装置ごとに）保持される。そして、例えば、重要度算出部１５０又は２５０は、履歴データに含まれるコンテンツについての属性テーブルのサブセットを取得し、取得したサブセットに含まれる属性値のみに基づいて、重要度を算出する。図１８の例では、ユーザＵ１の履歴データにはコンテンツＣ１、Ｃ４及びＣ２に対するユーザアクションが記述されており、コンテンツＣ１、Ｃ２及びＣ４についての属性テーブルのサブセットが取得されている。このような場合には、抽出部１６０又は２６０から出力される重要属性は、全てのユーザではなく、特定のユーザ（図１８の例ではユーザＵ１）にとって重要な属性となる。このような履歴データに基づく属性テーブルのフィルタリングにより、ユーザ個人の特性に応じた重要度を算出することが可能となる。その結果、情報処理装置１００又は２００により呈示される推薦理由又はプレイリストのタイトル等の個人化（Personalization）を図ることが可能となり、ユーザの満足度を向上させることができる。 [4-2. Personalization]
FIG. 18 is an explanatory diagram for explaining another modified example of the present embodiment. In the upper part of FIG. 18, an attribute table held by the storage unit 110 shown in FIG. 1 is shown. Also, below the attribute table, history data representing the history of user actions for each content of the user U1 is shown. Such history data is held by the storage unit 110 for each user (or for each device), for example. For example, the importance level calculation unit 150 or 250 acquires a subset of the attribute table for the content included in the history data, and calculates the importance level based only on the attribute value included in the acquired subset. In the example of FIG. 18, user actions for the contents C1, C4, and C2 are described in the history data of the user U1, and a subset of the attribute table for the contents C1, C2, and C4 is acquired. In such a case, the important attribute output from the extraction unit 160 or 260 is an important attribute for a specific user (user U1 in the example of FIG. 18), not all users. By filtering the attribute table based on such history data, it is possible to calculate the importance according to the characteristics of the individual user. As a result, it is possible to achieve personalization of the reason for recommendation presented by the information processing apparatus 100 or 200 or the title of the playlist, and the user satisfaction can be improved.

＜５．ハードウェア構成例＞
上述した情報処理装置１００又は２００による一連の処理は、典型的には、ソフトウェアを用いて実現される。ソフトウェアを構成するプログラムは、例えば図１９に示した構成を有する汎用コンピュータを用いて実行される。 <5. Hardware configuration example>
A series of processing by the information processing apparatus 100 or 200 described above is typically realized using software. The program constituting the software is executed using, for example, a general-purpose computer having the configuration shown in FIG.

図１９において、ＣＰＵ（Central Processing Unit）９０２は、汎用コンピュータの動作全般を制御する。ＲＯＭ（Read Only Memory）９０４には、一連の処理の一部又は全部を記述したプログラム又はデータが格納される。ＲＡＭ（Random Access Memory）９０６には、処理の実行時にＣＰＵ９０２により用いられるプログラムやデータなどが一時的に記憶される。 In FIG. 19, a CPU (Central Processing Unit) 902 controls the overall operation of the general-purpose computer. A ROM (Read Only Memory) 904 stores a program or data describing a part or all of a series of processes. A RAM (Random Access Memory) 906 temporarily stores programs and data used by the CPU 902 when processing is executed.

ＣＰＵ９０２、ＲＯＭ９０４、及びＲＡＭ９０６は、バス９１０を介して相互に接続される。バス９１０にはさらに、入出力インタフェース９１２が接続される。 The CPU 902, ROM 904, and RAM 906 are connected to each other via a bus 910. An input / output interface 912 is further connected to the bus 910.

入出力インタフェース９１２は、ＣＰＵ９０２、ＲＯＭ９０４、及びＲＡＭ９０６と、入力装置９２０、出力装置９２２、記憶装置９２４、通信装置９２６、及びドライブ９３０とを接続するためのインタフェースである。 The input / output interface 912 is an interface for connecting the CPU 902, ROM 904, and RAM 906 to the input device 920, output device 922, storage device 924, communication device 926, and drive 930.

入力装置９２０は、例えばマウス、キーボード又はタッチパネルなどの入力手段を介して、ユーザからの指示や情報入力を受け付ける。出力装置９２２は、例えばＣＲＴ（Cathode Ray Tube）、液晶ディスプレイ、ＯＬＥＤ（Organic Light Emitting Diode）などの表示装置、又はスピーカなどの音声出力装置を介してユーザに情報を出力する。 The input device 920 receives an instruction and information input from the user via an input unit such as a mouse, a keyboard, or a touch panel. The output device 922 outputs information to the user via a display device such as a CRT (Cathode Ray Tube), a liquid crystal display, an OLED (Organic Light Emitting Diode), or an audio output device such as a speaker.

記憶装置９２４は、例えばハードディスク又はフラッシュメモリなどにより構成され、プログラムやプログラムデータなどを記憶する。通信装置９２６は、ＬＡＮ又はインターネットなどのネットワークを介する通信処理を行う。ドライブ９３０は、必要に応じて汎用コンピュータに設けられ、例えばドライブ９３０にはリムーバブルメディア９３２が装着される。 The storage device 924 is configured by, for example, a hard disk or a flash memory, and stores programs, program data, and the like. The communication device 926 performs communication processing via a network such as a LAN or the Internet. The drive 930 is provided in a general-purpose computer as necessary. For example, a removable medium 932 is attached to the drive 930.

＜６．まとめ＞
ここまで、図１〜図１９を用いて、本発明の一実施形態及びその変形例について説明した。本実施形態によれば、複数のコンテンツについて各コンテンツに付与される属性値を用いて、コンテンツの所定の属性への寄与の程度を表す重要度が、コンテンツの上記所定の属性以外の１つ以上の属性について算出される。その際、重要度は、上記所定の属性を決定属性とし、上記１つ以上の属性を条件属性とする決定表を用いて算出される。それにより、条件属性の様々な組合せについて重要度を柔軟に評価することが可能となる。 <6. Summary>
Up to this point, an embodiment of the present invention and its modifications have been described with reference to FIGS. According to the present embodiment, one or more importance levels other than the predetermined attribute of the content are represented by using the attribute value given to each content with respect to a plurality of contents, and representing the degree of contribution to the predetermined attribute of the content. Are calculated for the attributes. At this time, the importance is calculated using a decision table in which the predetermined attribute is a decision attribute and the one or more attributes are condition attributes. As a result, the importance can be flexibly evaluated for various combinations of condition attributes.

また、本実施形態によれば、重要度は、Ｒｏｕｇｈ集合理論に従い、上記決定表について決定属性の正領域を形成するコンテンツの数に基づいて算出される。一般的に、決定表の正領域の濃度は、条件属性の組合せの、決定属性の属性値に対する情報伝達性の指標として扱い得る。従って、本実施形態に係る手法により算出される重要度は、条件属性の組合せごとの情報伝達性に応じた有意な指標である。 Further, according to the present embodiment, the importance is calculated based on the number of contents forming the positive region of the decision attribute for the decision table according to the Rough set theory. In general, the density of the positive region of the decision table can be treated as an index of information transferability with respect to the attribute value of the decision attribute of the combination of condition attributes. Therefore, the importance calculated by the method according to the present embodiment is a significant index corresponding to information transmissibility for each combination of condition attributes.

また、本実施形態によれば、算出された重要度に応じて抽出される重要属性を用いて、コンテンツに関するユーザに呈示すべき情報、例えば推薦理由又はコンテンツリストのタイトルなどが生成される。それにより、相互に関連する属性の組合せが意味を持つ場合にも、その属性の組合せを的確に推薦理由又はコンテンツリストのタイトルなどに反映することができる。また、ユーザによるフィードバック又はユーザアクションの際の状況（即ち、コンテキスト）に応じて、適応的に推薦理由又はコンテンツリストのタイトルなどを変化させることができる。 Further, according to the present embodiment, information to be presented to the user regarding the content, for example, the reason for recommendation or the title of the content list, is generated using the important attribute extracted according to the calculated importance. Thereby, even when a combination of mutually related attributes is meaningful, the combination of the attributes can be accurately reflected in a recommendation reason or a title of a content list. In addition, the reason for recommendation or the title of the content list can be adaptively changed in accordance with the situation (that is, context) at the time of user feedback or user action.

また、本実施形態によれば、ＰＬＳＡ又はＬＤＡによる確率的分類法に従って、基本属性の属性値に基づいて拡張属性の属性値が算出される。そして、拡張属性の属性値を中間的に利用することにより、重要属性を段階的に抽出することができる。その結果、重要度の算出結果の精度が基本属性（例えばキーワードや人物名など）の表記揺れを原因として低下するリスクが排除され得る。また、基本属性間の同義語関係などの潜在的な関連性を適切に踏まえた上で、推薦理由等を生成することができる。 Further, according to the present embodiment, the attribute value of the extended attribute is calculated based on the attribute value of the basic attribute according to the probabilistic classification method by PLSA or LDA. And an important attribute can be extracted in steps by using the attribute value of an extended attribute in the middle. As a result, it is possible to eliminate the risk that the accuracy of the calculation result of the degree of importance is reduced due to the shake of the notation of the basic attribute (for example, keyword or person name). In addition, it is possible to generate a recommendation reason or the like after properly considering a potential relationship such as a synonym relationship between basic attributes.

なお、本明細書において説明した決定表を用いた重要度算出方法によれば、条件属性及び決定属性の設定に応じて、様々な種類の重要度を算出することができる。例えば、上述したスマイルセンサ又は生体センサにより取得されるユーザの状態を表すコンテキストデータを決定属性とすることで、ユーザの心理状態又は体調に応じて異なる重要度を算出することも可能である。 Note that according to the importance calculation method using the determination table described in this specification, various types of importance can be calculated according to the setting of the condition attribute and the determination attribute. For example, by using the context data representing the user's state acquired by the above-described smile sensor or biometric sensor as a determination attribute, it is possible to calculate a different degree of importance depending on the user's psychological state or physical condition.

以上、添付図面を参照しながら本発明の好適な実施形態について詳細に説明したが、本発明はかかる例に限定されない。本発明の属する技術の分野における通常の知識を有する者であれば、特許請求の範囲に記載された技術的思想の範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、これらについても、当然に本発明の技術的範囲に属するものと了解される。 The preferred embodiments of the present invention have been described in detail above with reference to the accompanying drawings, but the present invention is not limited to such examples. It is obvious that a person having ordinary knowledge in the technical field to which the present invention pertains can come up with various changes or modifications within the scope of the technical idea described in the claims. Of course, it is understood that these also belong to the technical scope of the present invention.

１００，２００情報処理装置
１１０記憶部
１２０解析部
１３０ユーザインタフェース制御部
１４０推薦部
１５０，２５０重要度算出部
１６０，２６０抽出部
２４０コンテンツリスト生成部
100, 200 Information processing device 110 Storage unit 120 Analysis unit 130 User interface control unit 140 Recommendation unit 150, 250 Importance calculation unit 160, 260 Extraction unit 240 Content list generation unit

Claims

A storage unit that holds an attribute table that stores attribute values assigned to each content for a plurality of contents;
An importance calculation unit that calculates the importance of one or more other attributes with respect to a predetermined attribute of the content using the attribute value stored in the attribute table;
With
The importance calculating unit calculates the importance using a determination table in which the predetermined attribute is a determination attribute and the one or more attributes are condition attributes;
Information processing device.

The information processing apparatus according to claim 1, wherein the importance calculation unit calculates the importance based on a number of contents forming a positive region of the determination attribute for the determination table.

The information processing apparatus includes:
An extraction unit that extracts one or more important attributes used to generate information about the content to be presented to the user according to the importance calculated by the importance calculation unit;
The information processing apparatus according to claim 2, further comprising:

The information processing apparatus includes:
A recommendation unit that selects content to be recommended to a user using an attribute value stored in the attribute table, and generates a reason for recommendation based on the one or more important attributes extracted by the extraction unit Recommendation section,
The information processing apparatus according to claim 3, further comprising:

The recommendation unit further stores a score for each content calculated for content selection in the attribute table,
The extraction unit extracts the one or more important attributes according to the importance calculated by the importance calculation unit using the score as a determination attribute.
The information processing apparatus according to claim 4.

The attribute table further stores attribute values of feedback attributes given based on user feedback for each content,
The recommendation unit selects content to be recommended to the user using the attribute value of the feedback attribute,
The extraction unit extracts the one or more important attributes according to the importance calculated by the importance calculation unit using the feedback attribute as a determination attribute;
The information processing apparatus according to claim 4.

The attribute table further stores an attribute value of a context attribute that is given based on a user action status for each content,
The recommendation unit selects content to be recommended to the user using the attribute value of the context attribute,
The extraction unit extracts the one or more important attributes according to the importance calculated by the importance calculation unit using the context attribute as a determination attribute;
The information processing apparatus according to claim 4.

The information processing according to claim 3, wherein the attribute table stores attribute values of extended attributes obtained by analyzing attribute values of the basic attributes in addition to attribute values of the basic attributes assigned to each content. apparatus.

The extraction unit includes the first attribute calculated by the importance calculation unit using the predetermined attribute as a determination attribute and one or more attributes included in the extended attribute as a condition attribute, and the extended attribute. The one or more important attributes are extracted according to the second importance calculated by the importance calculation unit using the attribute to be determined as the determination attribute and one or more attributes included in the basic attribute as the condition attributes The information processing apparatus according to claim 8.

The information processing apparatus includes:
In accordance with a probabilistic classification method based on PLSA (Probabilistic Latent Semantic Analysis) or LDA (Latent Dirichlet Allocation), an analysis unit that calculates the attribute value of the extended attribute based on the attribute value of the basic attribute;
The information processing apparatus according to claim 9, further comprising:

The information processing apparatus includes:
A content list generation unit that generates a list of contents to be played back using attribute values stored in the attribute table, wherein the content list is generated based on the one or more important attributes extracted by the extraction unit. A content list generator for generating titles;
Further comprising
The extraction unit includes the one or more important attributes according to the importance calculated by the importance calculation unit using the attribute used for generating the content list by the content list generation unit as a determination attribute. Extract,
The information processing apparatus according to claim 3.

The information processing apparatus includes:
A content list generation unit that generates a list of contents to be played according to a user's specification, wherein the content list generation unit generates a title of the content list based on the one or more important attributes extracted by the extraction unit Part,
Further comprising
The extraction unit extracts the one or more important attributes according to the importance calculated by the importance calculation unit using an attribute whose attribute value is determined according to whether or not specified by the user as a determination attribute. ,
The information processing apparatus according to claim 3.

The storage unit further holds history data representing a history of user actions for each content,
The importance level calculation unit calculates the importance level for the one or more attributes for each user using an attribute value for content included in the history data.
The information processing apparatus according to claim 1.

The information processing apparatus according to claim 1, wherein the importance calculation unit derives a positive region of a decision attribute in the decision table according to a Rough set theory.

In an information processing apparatus that uses a storage medium to store an attribute table that stores attribute values assigned to each content for a plurality of contents,
Calculating the importance of one or more other attributes with respect to a predetermined attribute of the content using the attribute values stored in the attribute table;
An importance calculation method including
The importance is calculated using a determination table in which the predetermined attribute is a determination attribute and the one or more attributes are condition attributes.
Importance calculation method.

For a plurality of contents, a computer that controls an information processing apparatus that holds an attribute table that stores attribute values assigned to each content using a storage medium,
An importance calculation unit that calculates the importance of one or more other attributes with respect to a predetermined attribute of the content using the attribute value stored in the attribute table;
Is a program for functioning as
The importance calculating unit calculates the importance using a determination table in which the predetermined attribute is a determination attribute and the one or more attributes are condition attributes;
program.