JP2019036087A

JP2019036087A - Generation device, method for generation, generation program, learning data, and model

Info

Publication number: JP2019036087A
Application number: JP2017156462A
Authority: JP
Inventors: 直晃山下; Naoaki Yamashita; 修平西村; Shuhei Nishimura; 智大田中; Tomohiro Tanaka
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2017-08-14
Filing date: 2017-08-14
Publication date: 2019-03-07
Anticipated expiration: 2037-08-14
Also published as: JP6985059B2

Abstract

[Problem] To provide a generation device, a generation method, a generation program, learning data, and a model that make it possible to appropriately estimate the ratio at which target information is classified into each class. [Solution] A generation device 100 includes an acquisition unit 131 and a generation unit 132 in a control unit 130. The acquisition unit 131 acquires target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users. Based on the target information and the ratio information acquired by the acquisition unit 131, the generation unit 132 generates a model that, when one piece of target information is input, estimates the ratio at which that piece of target information is classified into each class. [Selected drawing] FIG. 4

Description

本発明は、生成装置、生成方法、生成プログラム、学習データ、及びモデルに関する。 The present invention relates to a generation device, a generation method, a generation program, learning data, and a model.

In recent years, crowdsourcing, in which work (tasks) is requested from an unspecified large number of people (users) over the Internet, has become known. For example, techniques have been provided for making efficient use of contractors' human resources in such crowdsourcing.

特開２０１４−１５３７５６号公報JP 2014-153756 A

However, with the conventional technology described above, it may be difficult to appropriately estimate the ratio at which target information is classified into each class. For example, in crowdsourcing, simply asking users to perform a task of classifying target information such as images into desired classes makes it difficult to appropriately estimate the ratio at which new target information is classified into each class when such target information arises. For example, if the consensus (agreement) of the answers obtained from the users who were asked to perform the task is to be estimated, cases in which no consensus is reached cannot be used as data at all. Also, for example, when only information on cases in which no consensus is reached is used, it is difficult to estimate a useful quantity, such as the tendency of the answers, from the target information.

The present application has been made in view of the above, and its object is to provide a generation device, a generation method, a generation program, learning data, and a model that make it possible to appropriately estimate the ratio at which target information is classified into each class.

The generation device according to the present application includes: an acquisition unit that acquires target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users; and a generation unit that generates, based on the target information and the ratio information acquired by the acquisition unit, a model that, when one piece of target information is input, estimates the ratio at which that piece of target information is classified into each class.

実施形態の一態様によれば、対象情報の各クラスに分類される割合を適切に推定可能にすることができるという効果を奏する。 According to one aspect of the embodiment, there is an effect that it is possible to appropriately estimate the ratio classified into each class of the target information.

FIG. 1 is a diagram illustrating an example of the generation process according to the embodiment.
FIG. 2 is a diagram illustrating an example of the estimation process according to the embodiment.
FIG. 3 is a diagram illustrating a configuration example of the generation system according to the embodiment.
FIG. 4 is a diagram illustrating a configuration example of the generation device according to the embodiment.
FIG. 5 is a diagram illustrating an example of the learning data storage unit according to the embodiment.
FIG. 6 is a diagram illustrating an example of the model information storage unit according to the embodiment.
FIG. 7 is a diagram illustrating an example of the user information storage unit according to the embodiment.
FIG. 8 is a diagram illustrating an example of the estimated information storage unit according to the embodiment.
FIG. 9 is a flowchart illustrating an example of the generation process according to the embodiment.
FIG. 10 is a flowchart illustrating an example of the estimation process according to the embodiment.
FIG. 11 is a hardware configuration diagram illustrating an example of a computer that realizes the functions of the generation device.

以下に、本願に係る生成装置、生成方法、生成プログラム、学習データ、及びモデルを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る生成装置、生成方法、生成プログラム、学習データ、及びモデルが限定されるものではない。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。 In the following, a generation apparatus, a generation method, a generation program, learning data, and a form for implementing a model (hereinafter referred to as “embodiment”) according to the present application will be described in detail with reference to the drawings. Note that the generation device, generation method, generation program, learning data, and model according to the present application are not limited by this embodiment. In the following embodiments, the same portions are denoted by the same reference numerals, and redundant description is omitted.

(Embodiment)
[1. Generation process]
First, an example of the generation process according to the embodiment will be described with reference to FIG. 1. FIG. 1 is a diagram illustrating an example of the generation process according to the embodiment. FIG. 1 illustrates a case where the generation device 100 generates a model based on target information to be classified and correct answer information indicating the ratio at which the target information is classified into each class. Hereinafter, target information associated with correct answer information is also referred to as "learning data". In the examples of FIGS. 1 and 2, a case where the target information is image information (hereinafter also simply referred to as an "image") is described as an example. The target information is not limited to images and may be various other kinds of information, such as character information or article content combining images and character information.

ここで、図１において、生成装置１００が生成するモデル（学習器）について簡単に説明する。生成装置１００が生成するモデルは、例えば、入力されたデータに対する演算結果を出力する複数のノードを多層に接続したモデルであって、教師あり学習により抽象化された画像の特徴を学習されたモデルである。例えば、モデルは、複数のノードを有する層を多段に接続したニューラルネットワークであり、いわゆるディープラーニングの技術により実現されるＤＮＮ（Deep Neural Network）であってもよい。また、画像の特徴とは、画像に含まれる文字の有無、色、構成等、画像内に現れる具体的な特徴のみならず、撮像されている物体が何であるか、画像がどのような利用者に好かれるか、画像の雰囲気等、抽象化（メタ化）された画像の特徴をも含む概念である。 Here, a model (learning device) generated by the generation device 100 in FIG. 1 will be briefly described. The model generated by the generation apparatus 100 is, for example, a model in which a plurality of nodes that output calculation results for input data are connected in multiple layers, and a model in which image features abstracted by supervised learning are learned It is. For example, the model is a neural network in which layers having a plurality of nodes are connected in multiple stages, and may be a DNN (Deep Neural Network) realized by a so-called deep learning technique. Image features include not only the specific features that appear in the image, such as the presence / absence of characters in the image, color, composition, etc., but also what the object is being imaged and what kind of user the image is It is a concept that also includes the characteristics of an abstracted (meta-) image such as the atmosphere of the image.

例えば、モデルは、ディープラーニングの技術により、以下のような学習手法により生成される。例えば、モデルは、各ノードの間の接続係数が初期化され、様々な特徴を有する画像が入力される。そして、モデルは、モデルにおける出力と、入力した画像との誤差が少なくなるようにパラメータ（接続係数）を補正するバックプロパゲーション（誤差逆伝播法）等の処理により生成される。例えば、モデルは、誤差関数等、所定の損失（ロス）関数を最小化するようにバックプロパゲーション等の処理を行うことにより生成される。上述のような処理を繰り返すことで、モデルは、入力された画像をより良く再現できる出力、すなわち入力された画像の特徴を出力することができる。 For example, the model is generated by the following learning method using the deep learning technique. For example, in the model, the connection coefficient between each node is initialized, and images having various characteristics are input. Then, the model is generated by a process such as back propagation (error back propagation method) that corrects a parameter (connection coefficient) so that an error between the output of the model and the input image is reduced. For example, the model is generated by performing processing such as back propagation so as to minimize a predetermined loss function such as an error function. By repeating the processing as described above, the model can output an output that can better reproduce the input image, that is, a feature of the input image.

なお、モデルの学習手法については、上述した手法に限定されるものではなく、任意の公知技術が適用可能である。また、モデルに対する画像の入力方法、モデルが出力するデータの形式、モデルに対して明示的に学習させる特徴の内容等は、任意の手法が適用できる。すなわち、生成装置１００は、画像から抽象化された特徴を示す特徴量を算出できるのであれば、任意のモデルを用いることができる。 Note that the model learning technique is not limited to the technique described above, and any known technique can be applied. In addition, any method can be applied to the image input method for the model, the format of the data output by the model, the content of features that are explicitly learned for the model, and the like. That is, the generation apparatus 100 can use an arbitrary model as long as it can calculate a feature amount indicating a feature abstracted from an image.

図１では、生成装置１００は、入力画像の局所領域の畳み込みとプーリングとを繰り返す、いわゆる畳み込みニューラルネットワーク（ＣＮＮ：Convolutional Neural Network）によるモデルＭ１〜Ｍ３等を生成するものとする。以下では、畳み込みニューラルネットワークをＣＮＮと記載する場合がある。例えば、ＣＮＮによるモデルＭ１〜Ｍ３等は、画像から特徴を抽出して出力する機能に加え、画像内に含まれる文字や撮像対象等の位置的変異に対し、出力の不変性を有する。このため、モデルＭ１〜Ｍ３等は、画像の抽象化された特徴を精度良く算出することができる。なお、上記のように、「モデルＭ＊（＊は任意の数値）」と記載した場合、そのモデルはモデルＩＤ「Ｍ＊」により識別されるモデルであることを示す。例えば、「モデルＭ１」と記載した場合、そのモデルはモデルＩＤ「Ｍ１」により識別されるモデルである。 In FIG. 1, the generation device 100 generates models M1 to M3 and the like based on a so-called convolutional neural network (CNN) that repeats convolution and pooling of a local region of an input image. Hereinafter, the convolutional neural network may be referred to as CNN. For example, the models M1 to M3 by CNN have output invariance with respect to positional variations such as characters and imaging objects included in the image in addition to the function of extracting and outputting features from the image. Therefore, the models M1 to M3 and the like can accurately calculate the abstract features of the image. As described above, when “model M * (* is an arbitrary numerical value)” is described, this indicates that the model is a model identified by the model ID “M *”. For example, when “model M1” is described, the model is a model identified by the model ID “M1”.
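The convolution and pooling steps mentioned above can be sketched in a few lines. This is a minimal pure-Python illustration, not the patent's implementation: the function names `conv2d` and `max_pool`, the 4x4 images, and the trivial 1x1 kernel are all assumptions chosen so that the effect of pooling on a small positional shift is easy to see.

```python
def conv2d(image, kernel):
    """Valid 2-D convolution (cross-correlation, as is conventional in CNNs)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]


def max_pool(fmap, size=2):
    """Non-overlapping max pooling; rows/columns that do not fill a window are dropped."""
    return [[max(fmap[r + i][c + j] for i in range(size) for j in range(size))
             for c in range(0, len(fmap[0]) - size + 1, size)]
            for r in range(0, len(fmap) - size + 1, size)]


# Two hypothetical 4x4 images: the same bright pixel, shifted by one position.
image_a = [[1, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
image_b = [[0, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
kernel = [[1]]  # trivial kernel, used only to isolate the effect of pooling

pooled_a = max_pool(conv2d(image_a, kernel))
pooled_b = max_pool(conv2d(image_b, kernel))
print(pooled_a == pooled_b)  # the shift stays inside one pooling window
```

In this particular case both images pool to the same feature map, which is the kind of tolerance to small positional variation the paragraph describes. A shift that crosses a pooling-window boundary would still change the output.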

[Configuration of the generation system]
Before describing FIG. 1, the generation system 1 shown in FIG. 3 will be described. As illustrated in FIG. 3, the generation system 1 includes a terminal device 10 and a generation device 100. The terminal device 10 and the generation device 100 are communicably connected, by wire or wirelessly, via a predetermined network N. FIG. 3 is a diagram illustrating a configuration example of the generation system according to the embodiment. Note that the generation system 1 illustrated in FIG. 3 may include a plurality of terminal devices 10 and a plurality of generation devices 100.

端末装置１０は、ユーザによって利用される情報処理装置である。例えば、ユーザは、クラウドソーシングなどによるタスクを行う複数のワーカである。端末装置１０は、例えば、スマートフォンや、タブレット型端末や、ノート型ＰＣ（Personal Computer）や、デスクトップＰＣや、携帯電話機や、ＰＤＡ（Personal Digital Assistant）等により実現される。図１に示す例においては、端末装置１０がユーザが利用するスマートフォンである場合を示す。なお、以下では、端末装置１０をユーザと表記する場合がある。すなわち、以下では、ユーザを端末装置１０と読み替えることもできる。具体的には、図１では、端末装置１０がユーザＩＤ「Ｕ１」により識別されるユーザ（以下、「ユーザＵ１」とする場合がある）が利用するスマートフォンである場合を示す。 The terminal device 10 is an information processing device used by a user. For example, the user is a plurality of workers who perform tasks such as crowdsourcing. The terminal device 10 is realized by, for example, a smartphone, a tablet terminal, a notebook PC (Personal Computer), a desktop PC, a mobile phone, a PDA (Personal Digital Assistant), or the like. In the example shown in FIG. 1, the case where the terminal device 10 is a smartphone used by the user is shown. Hereinafter, the terminal device 10 may be referred to as a user. That is, hereinafter, the user can be read as the terminal device 10. Specifically, FIG. 1 illustrates a case where the terminal device 10 is a smartphone used by a user identified by the user ID “U1” (hereinafter, may be referred to as “user U1”).

また、図１に示す例においては、端末装置１０を利用するユーザに応じて、端末装置１０を端末装置１０−１〜１０−５として説明する。例えば、端末装置１０−１は、ユーザＵ１により使用される端末装置１０である。また、例えば、端末装置１０−２は、ユーザＵ２により使用される端末装置１０である。また、以下では、端末装置１０−１〜１０−５について、特に区別なく説明する場合には、端末装置１０と記載する。なお、上記のように、「ユーザＵ＊（＊は任意の数値）」と記載した場合、そのユーザはユーザＩＤ「Ｕ＊」により識別されるユーザであることを示す。例えば、「ユーザＵ１」と記載した場合、そのユーザはユーザＩＤ「Ｕ１」により識別されるユーザである。 In the example illustrated in FIG. 1, the terminal device 10 will be described as terminal devices 10-1 to 10-5 in accordance with a user who uses the terminal device 10. For example, the terminal device 10-1 is the terminal device 10 used by the user U1. For example, the terminal device 10-2 is the terminal device 10 used by the user U2. Hereinafter, the terminal devices 10-1 to 10-5 are referred to as the terminal device 10 when they are not particularly distinguished. Note that, as described above, “user U * (* is an arbitrary numerical value)” indicates that the user is a user identified by the user ID “U *”. For example, when “user U1” is described, the user is a user identified by the user ID “U1”.

The generation device 100 is an information processing device that generates a model based on target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users. For example, the generation device 100 generates a model that, when one piece of target information is input, estimates the ratio at which that piece of target information is classified into each class. The generation device 100 also estimates the ratio at which new target information is classified into each class by inputting that target information into the model.

First, in the example shown in FIG. 1, the generation device 100 collects information for generating correct answer information. Specifically, the generation device 100 has users (workers) classify the image group IML1, which includes the images IM101 to IM103 and the like, through crowdsourcing, and acquires their answers. In the example of FIG. 1, the generation device 100 has the users perform a task of classifying each image as either a dog or a cat.

例えば、生成装置１００は、ワーカであるユーザＵ１が利用する端末装置１０−１に対象情報を提供する（ステップＳ１１−１）。図１の例では、生成装置１００は、端末装置１０−１に対象情報である画像ＩＭ１０１を提供する。そして、生成装置１００は、ユーザＵ１から画像ＩＭ１０１に対する回答を取得する（ステップＳ１２−１）。図１の例では、生成装置１００は、ユーザＵ１から画像ＩＭ１０１が「猫」であるとの回答を取得する。 For example, the generation device 100 provides the target information to the terminal device 10-1 used by the user U1 who is a worker (Step S11-1). In the example of FIG. 1, the generation device 100 provides an image IM101 that is target information to the terminal device 10-1. Then, the generation apparatus 100 acquires an answer to the image IM101 from the user U1 (Step S12-1). In the example of FIG. 1, the generation apparatus 100 acquires a reply from the user U1 that the image IM101 is “cat”.

また、例えば、生成装置１００は、ワーカであるユーザＵ２が利用する端末装置１０−２に対象情報を提供する（ステップＳ１１−２）。図１の例では、生成装置１００は、端末装置１０−２に対象情報である画像ＩＭ１０１を提供する。そして、生成装置１００は、ユーザＵ２から画像ＩＭ１０１に対する回答を取得する（ステップＳ１２−２）。図１の例では、生成装置１００は、ユーザＵ２から画像ＩＭ１０１が「猫」であるとの回答を取得する。 Further, for example, the generation device 100 provides the target information to the terminal device 10-2 used by the user U2 who is a worker (Step S11-2). In the example of FIG. 1, the generation device 100 provides an image IM101 that is target information to the terminal device 10-2. Then, the generation device 100 acquires an answer to the image IM101 from the user U2 (Step S12-2). In the example of FIG. 1, the generation apparatus 100 acquires a reply from the user U2 that the image IM101 is “cat”.

また、例えば、生成装置１００は、ワーカであるユーザＵ３が利用する端末装置１０−３に対象情報を提供する（ステップＳ１１−３）。図１の例では、生成装置１００は、端末装置１０−３に対象情報である画像ＩＭ１０１を提供する。そして、生成装置１００は、ユーザＵ３から画像ＩＭ１０１に対する回答を取得する（ステップＳ１２−３）。図１の例では、生成装置１００は、ユーザＵ３から画像ＩＭ１０１が「犬」であるとの回答を取得する。 Further, for example, the generation device 100 provides the target information to the terminal device 10-3 used by the user U3 who is a worker (Step S11-3). In the example of FIG. 1, the generation device 100 provides an image IM101 that is target information to the terminal device 10-3. Then, the generation apparatus 100 acquires a response to the image IM101 from the user U3 (Step S12-3). In the example of FIG. 1, the generation apparatus 100 acquires a reply from the user U3 that the image IM101 is “dog”.

また、例えば、生成装置１００は、ワーカであるユーザＵ４が利用する端末装置１０−４に対象情報を提供する（ステップＳ１１−４）。図１の例では、生成装置１００は、端末装置１０−４に対象情報である画像ＩＭ１０１を提供する。そして、生成装置１００は、ユーザＵ４から画像ＩＭ１０１に対する回答を取得する（ステップＳ１２−４）。図１の例では、生成装置１００は、ユーザＵ４から画像ＩＭ１０１が「犬」であるとの回答を取得する。 Further, for example, the generation device 100 provides the target information to the terminal device 10-4 used by the user U4 who is a worker (Step S11-4). In the example of FIG. 1, the generation device 100 provides the terminal device 10-4 with an image IM101 that is target information. Then, the generation device 100 acquires an answer to the image IM101 from the user U4 (Step S12-4). In the example of FIG. 1, the generation apparatus 100 obtains an answer from the user U4 that the image IM101 is “dog”.

また、例えば、生成装置１００は、ワーカであるユーザＵ５が利用する端末装置１０−５に対象情報を提供する（ステップＳ１１−５）。図１の例では、生成装置１００は、端末装置１０−５に対象情報である画像ＩＭ１０１を提供する。そして、生成装置１００は、ユーザＵ５から画像ＩＭ１０１に対する回答を取得する（ステップＳ１２−５）。図１の例では、生成装置１００は、ユーザＵ５から画像ＩＭ１０１が「猫」であるとの回答を取得する。 Further, for example, the generation device 100 provides the target information to the terminal device 10-5 used by the user U5 who is a worker (Step S11-5). In the example of FIG. 1, the generation device 100 provides an image IM101 that is target information to the terminal device 10-5. Then, the generation device 100 acquires an answer to the image IM101 from the user U5 (Step S12-5). In the example of FIG. 1, the generation apparatus 100 acquires a reply from the user U5 that the image IM101 is “cat”.

Hereinafter, steps S11-1 to S11-5 are collectively referred to as step S11 when they are described without distinction. The provision of target information to each user is not limited to steps S11-1 to S11-5 and may be performed multiple times for the images IM102, IM103, and the like of the image group IMF1. Likewise, steps S12-1 to S12-5 are collectively referred to as step S12 when they are described without distinction. Although FIG. 1 illustrates five users U1 to U5, the generation device 100 acquires answers to the target information not only from the users U1 to U5 but from a large number of users (for example, 1 million or 10 million users). For example, the generation device 100 may provide the image group IMF1 to the terminal devices 10 in step S11 and acquire the users' answers to each of the images IM101 to IM103 and the like included in the image group IMF1 in step S12.

Then, based on the information acquired in step S12, the generation device 100 generates ratio information indicating the ratio at which the target information was classified into each class. The generation device 100 also adds, as learning data, a combination of the target information and aggregate information that includes the ratio information as correct answer information (step S13). Specifically, the generation device 100 adds data DT101 to DT103 and the like, corresponding to the images IM101 to IM103 and the like that are the target information, to the learning data storage unit 121.

なお、上記のように、「データＤＴ＊（＊は任意の数値）」と記載した場合、そのデータはデータＩＤ「ＤＴ＊」により識別されるデータであることを示す。例えば、「データＤＴ１」と記載した場合、そのデータはデータＩＤ「ＤＴ１」により識別されるデータである。 As described above, when “data DT * (* is an arbitrary numerical value)” is described, it indicates that the data is data identified by the data ID “DT *”. For example, when “data DT1” is described, the data is data identified by the data ID “DT1”.

図１中の学習データ記憶部１２１に示す「データＩＤ」は、データを識別するための識別情報を示す。図１中の学習データ記憶部１２１に示す「対象情報」は、データＩＤにより識別されるデータに含まれる対象情報を示す。図１中の学習データ記憶部１２１に示す「集計情報」は、クラウドソーシングによってワーカにより行われたタスクの回答を集計した情報（集計情報）を示す。 “Data ID” shown in the learning data storage unit 121 in FIG. 1 indicates identification information for identifying data. “Target information” shown in the learning data storage unit 121 in FIG. 1 indicates target information included in the data identified by the data ID. The “aggregation information” shown in the learning data storage unit 121 in FIG. 1 indicates information (aggregation information) obtained by aggregating answers to tasks performed by workers by crowdsourcing.

図１中の学習データ記憶部１２１に示す「集計情報」中の「正解情報（割合情報）」は、データＩＤにより識別されるデータに対応する正解情報（割合情報）を示す。例えば、「正解情報（割合情報）」は、ワーカの全回答における各クラスの回答の割合を示す。 “Correct information (ratio information)” in “aggregated information” shown in the learning data storage unit 121 in FIG. 1 indicates correct information (ratio information) corresponding to data identified by the data ID. For example, “correct answer information (ratio information)” indicates a ratio of answers of each class in all answers of workers.

「集計情報」中の「ユーザ数」は、対応する対象情報について回答を行ったユーザ数を示す。「集計情報」中の「猫（ＣＬ１）」は、対応する対象情報について猫と回答を行ったユーザ数を示す。また、図１の例では、「集計情報」中の「猫（ＣＬ１）」は、対応する対象情報について猫と回答を行ったユーザを識別する情報も含まれる。また、「集計情報」中の「犬（ＣＬ２）」は、対応する対象情報について犬と回答を行ったユーザ数を示す。また、図１の例では、「集計情報」中の「犬（ＣＬ２）」は、対応する対象情報について犬と回答を行ったユーザを識別する情報も含まれる。 The “number of users” in the “aggregated information” indicates the number of users who answered about the corresponding target information. “Cat (CL1)” in the “aggregated information” indicates the number of users who answered as cats for the corresponding target information. In the example of FIG. 1, “cat (CL1)” in “aggregated information” also includes information for identifying a user who has made a reply with a cat regarding the corresponding target information. Further, “dog (CL2)” in the “aggregated information” indicates the number of users who have made a reply with the dog regarding the corresponding target information. In the example of FIG. 1, “dog (CL2)” in “aggregated information” also includes information for identifying a user who made a reply with a dog regarding the corresponding target information.

例えば、図１に示す例において、データＩＤ「ＤＴ１０１」により識別されるデータ（データＤＴ１０１）は、分類対象となる対象情報が画像ＩＭ１０１であることを示す。また、データＤＴ１０１は、正解情報が「猫」と分類された割合が「０．５８（５８％）」であり、「犬」と分類された割合が「０．４２（４２％）」であることを示す。また、データＤＴ１０１は、ユーザ数が１０００人であることを示す。また、データＤＴ１０１は、猫と回答したユーザ数が５８０人であり、そのユーザにはユーザＵ１やユーザＵ２等が含まれることを示す。また、データＤＴ１０１は、犬と回答したユーザ数が４２０人であり、そのユーザにはユーザＵ３やユーザＵ４等が含まれることを示す。すなわち、データＤＴ１０１は、画像ＩＭ１０１について、１０００人のユーザ（ワーカ）のうち、５８０人が猫と回答し、４２０人が犬と回答したことを示す。そのため、データＤＴ１０１は、正解情報における猫の割合が「５８０／１０００＝０．５８」となり、正解情報における犬の割合が「４２０／１０００＝０．４２」となる。 For example, in the example illustrated in FIG. 1, the data identified by the data ID “DT101” (data DT101) indicates that the target information to be classified is the image IM101. In addition, in the data DT101, the ratio of correct information classified as “cat” is “0.58 (58%)”, and the ratio classified as “dog” is “0.42 (42%)”. It shows that. Data DT101 indicates that the number of users is 1000. The data DT101 indicates that the number of users who answered “cat” is 580, and the user includes the user U1, the user U2, and the like. The data DT101 indicates that the number of users who answered “dog” is 420, and the user includes the user U3 and the user U4. That is, the data DT101 indicates that, for the image IM101, out of 1000 users (workers), 580 responded as cats and 420 responded as dogs. Therefore, in the data DT101, the ratio of the cat in the correct answer information is “580/1000 = 0.58”, and the ratio of the dog in the correct answer information is “420/1000 = 0.42”.

The data identified by the data ID "DT102" (data DT102) indicates that, for the image IM102, 1700 out of 2000 users (workers) answered "cat" and 300 answered "dog". Therefore, in the data DT102, the ratio of cat in the correct answer information is "1700/2000 = 0.85", and the ratio of dog in the correct answer information is "300/2000 = 0.15".

また、データＩＤ「ＤＴ１０３」により識別されるデータ（データＤＴ１０３）は、画像ＩＭ１０３について、１００００人のユーザ（ワーカ）のうち、５５００人が猫と回答し、４５００人が犬と回答したことを示す。そのため、データＤＴ１０３は、正解情報における猫の割合が「５５００／１００００＝０．５５」となり、正解情報における犬の割合が「４５００／１００００＝０．４５」となる。 Further, the data (data DT103) identified by the data ID “DT103” indicates that, for the image IM103, among 10000 users (workers), 5500 responded as cats and 4500 responded as dogs. . Therefore, in the data DT103, the ratio of the cat in the correct answer information is “5500/10000 = 0.55”, and the ratio of the dog in the correct answer information is “4500/10000 = 0.45”.
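The aggregation described for data DT101 to DT103 amounts to dividing per-class answer counts by the number of answering users. The following is a minimal sketch of that step; the function name `ratio_info` and the list-of-strings answer format are illustrative assumptions, not structures defined in the source.

```python
from collections import Counter


def ratio_info(answers, classes):
    """Turn per-worker answers into per-class ratios (the 'ratio information')."""
    total = len(answers)
    if total == 0:
        raise ValueError("no answers to aggregate")
    counts = Counter(answers)
    return {c: counts.get(c, 0) / total for c in classes}


# Answers for image IM101 as in the example: 580 "cat" and 420 "dog".
answers_im101 = ["cat"] * 580 + ["dog"] * 420
ratios_im101 = ratio_info(answers_im101, ["cat", "dog"])
print(ratios_im101)  # {'cat': 0.58, 'dog': 0.42}
```

The same call with 1700 "cat" and 300 "dog" answers gives 0.85 and 0.15, matching data DT102.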

上記のように、学習データとして用いられる画像群ＩＭＦ１には、ワーカの分類結果がクラス間で差が小さい画像ＩＭ１０１、ＩＭ１０３等やワーカの分類結果がクラス間で差が大きい画像ＩＭ１０２等が含まれる。例えば、図１中の画像群ＩＭＦ１には、画像ＩＭ１０１や画像ＩＭ１０３のような各クラスの割合の差が所定の閾値（例えば０．１や０．２等）未満の画像が含まれる。また、例えば、図１中の画像群ＩＭＦ１には、画像ＩＭ１０２のような、各クラスの割合の差が所定の閾値（例えば０．１や０．２等）以上である画像も含まれる。そこで、生成装置１００は、割合情報を正解情報として、モデルを学習（生成）する。この点について、以下詳述する。 As described above, the image group IMF1 used as learning data includes the images IM101 and IM103 that have a small difference in worker classification results between classes, and the image IM102 that has a large difference in worker classification results between classes. . For example, the image group IMF1 in FIG. 1 includes images such as the image IM101 and the image IM103 in which the difference in the ratio of each class is less than a predetermined threshold (for example, 0.1 or 0.2). Further, for example, the image group IMF1 in FIG. 1 includes images such as the image IM102 in which the difference in the ratio of each class is equal to or greater than a predetermined threshold (for example, 0.1 or 0.2). Therefore, the generation device 100 learns (generates) a model using the ratio information as correct answer information. This point will be described in detail below.
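The distinction between images with a small and a large gap between class ratios can be expressed as a simple threshold test. This sketch assumes a threshold of 0.2 (one of the example values in the text) and a hypothetical helper name `is_contested`; the source does not prescribe how the gap is computed for more than two classes, so the top-two gap used here is an assumption.

```python
def is_contested(ratios, threshold=0.2):
    """Return True when the gap between the two largest class ratios is below threshold."""
    top = sorted(ratios.values(), reverse=True)
    return (top[0] - top[1]) < threshold


examples = {
    "IM101": {"cat": 0.58, "dog": 0.42},
    "IM102": {"cat": 0.85, "dog": 0.15},
    "IM103": {"cat": 0.55, "dog": 0.45},
}
contested = {image_id: is_contested(r) for image_id, r in examples.items()}
print(contested)  # {'IM101': True, 'IM102': False, 'IM103': True}
```

Both kinds of image stay in the learning data; the flag only illustrates the grouping, it is not a filter.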

生成装置１００は、上記のような画像ＩＭ１０１〜ＩＭ１０３等を含む学習データに基づいてモデルを生成する（ステップＳ１４）。例えば、生成装置１００は、学習データ記憶部１２１中のデータＤＴ１０１〜ＤＴ１０３等を学習データ（教師データ）として、学習を行なうことにより、モデルを生成する。 The generation device 100 generates a model based on learning data including the images IM101 to IM103 as described above (step S14). For example, the generating apparatus 100 generates a model by performing learning using the data DT101 to DT103 in the learning data storage unit 121 as learning data (teacher data).

In the example illustrated in FIG. 1, the generation device 100 generates a model using the images IM101 to IM103 and the like together with ratio information indicating the ratio at which each of those images was classified into each class. Here, the image IM101 and the ratio information indicating the ratio at which the image IM101 was classified into each class (hereinafter referred to as "correct answer information RDT101") are used as an example.

まず、モデルＭ１には、画像ＩＭ１０１が入力される。これにより、モデルＭ１は、各クラスに対応するスコアを出力する。図１の例では、モデルＭ１は、猫（クラスＣＬ１）に対応するスコアと犬（クラスＣＬ２）に対応するスコアとを出力する。 First, the image IM101 is input to the model M1. As a result, the model M1 outputs a score corresponding to each class. In the example of FIG. 1, the model M1 outputs a score corresponding to a cat (class CL1) and a score corresponding to a dog (class CL2).

上述したように、例えば、生成装置１００は、ディープラーニングの技術により、モデルＭ１を学習し、生成する。例えば、生成装置１００は、画像ＩＭ１０１と、猫（クラスＣＬ１）の割合「０．５８」及び犬（クラスＣＬ２）の割合「０．４２」とを含む正解情報との組み合わせを学習データとして用いる。例えば、正解情報ＲＤＴ１０１には、猫（クラスＣＬ１）の割合「０．５８」を示す情報や犬（クラスＣＬ２）の割合「０．４２」を示す情報が含まれる。例えば、生成装置１００は、モデルＭ１における出力（各クラスのスコア）と、学習データに含まれる各クラスの割合（値）との誤差が少なくなるようにパラメータ（接続係数）を補正するバックプロパゲーション（誤差逆伝播法）等の処理を行うことにより、モデルＭ１を学習する。例えば、生成装置１００は、所定の誤差（ロス）関数を最小化するようにバックプロパゲーション等の処理を行うことによりモデルＭ１を生成する。 As described above, for example, the generation device 100 learns and generates the model M1 using a deep learning technique. For example, the generating apparatus 100 uses, as learning data, a combination of the image IM101 and correct answer information including a cat (class CL1) ratio “0.58” and a dog (class CL2) ratio “0.42”. For example, the correct answer information RDT101 includes information indicating the ratio “0.58” of the cat (class CL1) and information indicating the ratio “0.42” of the dog (class CL2). For example, the generation device 100 corrects the parameter (connection coefficient) so that the error between the output (score of each class) in the model M1 and the ratio (value) of each class included in the learning data is reduced. The model M1 is learned by performing processing such as (error back propagation method). For example, the generation apparatus 100 generates the model M1 by performing processing such as back propagation so as to minimize a predetermined error (loss) function.
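Training against ratio information rather than a single correct class can be illustrated with a very small model. The sketch below is not the CNN of the embodiment: it assumes a linear softmax model over a hypothetical two-element feature vector standing in for the image, and plain gradient descent in place of full backpropagation. All names (`predict`, `train_step`, `features`) are illustrative.

```python
import math


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def predict(weights, features):
    """Per-class scores of a linear model, normalized to sum to 1."""
    return softmax([sum(w * f for w, f in zip(row, features)) for row in weights])


def cross_entropy(target, pred):
    return -sum(t * math.log(p) for t, p in zip(target, pred) if t > 0)


def train_step(weights, features, target, lr=0.5):
    """One gradient step; returns the loss measured before the update."""
    pred = predict(weights, features)
    # For softmax followed by cross-entropy, dL/dscore_n = pred_n - target_n.
    for n, row in enumerate(weights):
        for i, f in enumerate(features):
            row[i] -= lr * (pred[n] - target[n]) * f
    return cross_entropy(target, pred)


features = [1.0, 0.5]               # hypothetical stand-in for features of image IM101
target = [0.58, 0.42]               # ratio information: cat (CL1), dog (CL2)
weights = [[0.0, 0.0], [0.0, 0.0]]  # one weight row per class
losses = [train_step(weights, features, target) for _ in range(200)]
print(predict(weights, features))
```

After training, the model's output for this input approaches the target ratios rather than collapsing to a single class, which is the behavior the paragraph describes.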

例えば、生成装置１００は、下記の式（１）に示すような、誤差関数Ｌを用いる。下記の式（１）に示すように、生成装置１００は、例えば、Ｎ−クラス分類問題の場合、交差エントロピーを誤差関数として用いる。なお、誤差関数Ｌは、識別結果の確信度を表すものであれば、どのような関数であっても良い。例えば、誤差関数Ｌは、識別確率から求められるエントロピーであってもよい。また、例えば、誤差関数Ｌは、モデルＭ１の認識の精度を示すものであれば、どのような関数であってもよい。 For example, the generation apparatus 100 uses an error function L as shown in the following equation (1). As shown in the following formula (1), for example, in the case of an N-class classification problem, the generation device 100 uses cross entropy as an error function. The error function L may be any function as long as it represents the certainty of the identification result. For example, the error function L may be entropy obtained from the identification probability. For example, the error function L may be any function as long as it indicates the accuracy of recognition of the model M1.
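Equation (1) itself is not reproduced in this text. A cross-entropy error function consistent with the surrounding definitions can be written as follows, where t_n(x) is the ratio for class n defined in the description and y_n(x) is an assumed symbol (not taken from the source) for the model's output score for class n:

```latex
L(x) = -\sum_{n=1}^{N} t_n(x) \, \log y_n(x) \tag{1}
```

This is a sketch of the standard soft-label cross-entropy, not a confirmed transcription of the published formula.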

Here, “x” in equation (1) above and in equations (3) to (6) below denotes an image. For example, in the example shown in FIG. 1, “x” in equation (1) and in equations (3) to (6) corresponds to an image IM. The values 1 to N assigned to the variable “n” correspond to the classes that the model M1 identifies (classifies). For example, the model M1 corresponding to equation (1) identifies N classes. For example, the classes correspond to “cat (class CL1)”, “dog (class CL2)”, and so on.

In addition, “t_n(x)” in equation (1) above and in equations (4) and (5) below denotes the ratio at which the image IM101 is classified into class n (any of 1 to N). For example, “t_n(x)” in equation (1) denotes the ratio corresponding to class n in the correct answer information RDT101. In this case, for example, if the target corresponding to class 1 is “cat”, “t_1(x)” is “0.58 (58%)”. For example, “t_n(x)”, which denotes the ratio of class n (any of 1 to N), satisfies the relationship shown in equation (2) below.
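
The image of equation (2) is not reproduced in this text. Based on the statement that the class ratios sum to “1”, it can be reconstructed as the following normalization constraint:

```latex
\sum_{n=1}^{N} t_n(x) = 1 \tag{2}
```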

Here, as equation (2) above shows, the values “t_n(x)” denoting the ratio at which an image is classified into class n (any of 1 to N) sum to “1”. For example, “t_1(x)” and “t_2(x)”, the ratios of the two classes into which the image IM101 is classified, sum to “1”. For example, for the image IM101, “t_1(x)” is “0.58 (58%)” when the target corresponding to class 1 is “cat”, and “t_2(x)” is “0.42 (42%)” when the target corresponding to class 2 is “dog”. In this case, “t_1(x) + t_2(x)” is “0.58 + 0.42”, that is, “1”. In this way, the values “t_n(x)” denoting the ratio of each class n (any of 1 to N) are such that their total over classes 1 to N is “1”.

In addition, “p_n(x)” in equation (1) above and in equations (3) and (4) below denotes the ratio, based on the output of the model M1, for class n (any of 1 to N) for the image IM101. For example, “p_n(x)” in equation (1) denotes the ratio corresponding to class n that the model M1 outputs. For example, if the target corresponding to class 1 is “cat”, “p_1(x)” takes various values such as “0.55 (55%)” or “0.57 (57%)” as the training of the model M1 progresses.

In addition, “p_n(x)” in equation (1) above is the probability of class n for x and is defined by the Softmax function shown in equation (3) below.
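
The image of equation (3) is not reproduced in this text. Assuming the usual Softmax over the class scores f_n with parameters θ, as named in the surrounding description (the summation index m is introduced here for illustration), it would read:

```latex
p_n(x) = \frac{\exp\bigl(f_n(x;\theta)\bigr)}{\sum_{m=1}^{N} \exp\bigl(f_m(x;\theta)\bigr)} \tag{3}
```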

The function “f_n” in equation (3) above is the score of class n output by the CNN (model M1). “θ” denotes the parameters of the CNN (model M1). The function “exp” is the exponential function. In this case, the gradient of the error function L of equation (1) above is calculated by equation (4) below.
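
The image of equation (4) is not reproduced in this text. Assuming a cross-entropy error function combined with a Softmax output, and target ratios that sum to 1, the chain rule gives a gradient of the following form. Writing it with respect to θ is an assumption; the original may express it with respect to the scores f_n instead.

```latex
\frac{\partial L(x)}{\partial \theta}
  = \sum_{n=1}^{N} \bigl(p_n(x) - t_n(x)\bigr)\,
    \frac{\partial f_n(x;\theta)}{\partial \theta} \tag{4}
```

In this form, every term vanishes when p_n(x) = t_n(x) for all classes, which matches the description of the gradient becoming 0.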

As equation (4) above shows, when p_n(x) = t_n(x) for all classes 1 to N, the gradient of the error function L(x) is 0 and L(x) is at an extremum. For example, the generation device 100 performs feedback processing so that the gradient of the error function L(x) becomes 0. For example, by having the generation device 100 repeat the processing described above, the model M1 becomes able to appropriately output scores indicating the ratio at which an input image is classified into each class.
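
The feedback processing described above can be illustrated with a minimal Python sketch. It assumes the cross-entropy and Softmax forms named in the text and, for brevity, updates two class scores directly rather than the parameters θ of a CNN. The function names, initial scores, step size, and iteration count are hypothetical and are not taken from the original.

```python
import math

def softmax(scores):
    """Convert raw class scores f_n into ratios p_n that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(targets, probs):
    """Cross entropy between target ratios t_n and model outputs p_n."""
    return -sum(t * math.log(p) for t, p in zip(targets, probs) if t > 0)

def grad_wrt_scores(targets, probs):
    """Gradient of the cross entropy with respect to each score: p_n - t_n."""
    return [p - t for t, p in zip(targets, probs)]

# Target ratios for the image IM101: cat 0.58, dog 0.42 (from the text).
targets = [0.58, 0.42]
scores = [0.0, 0.0]      # hypothetical initial scores
learning_rate = 0.5      # hypothetical step size

for _ in range(200):
    probs = softmax(scores)
    grads = grad_wrt_scores(targets, probs)
    # Move each score against its gradient so that p_n approaches t_n.
    scores = [s - learning_rate * g for s, g in zip(scores, grads)]

probs = softmax(scores)
print([round(p, 3) for p in probs])
```

Because the targets are ratios rather than 0/1 labels, the update stops changing the scores once the outputs match the worker ratios, not once one class reaches 1.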

Although the example above takes the image IM101 as an example and shows the case where each piece of target information is processed using equation (1), the generation device 100 may instead use an error function L covering all images, as shown in equation (5) below.
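
The image of equation (5) is not reproduced in this text. Assuming it sums the per-image cross entropy over the M images indexed by x, a consistent reconstruction would be:

```latex
L = -\sum_{x=1}^{M} \sum_{n=1}^{N} t_n(x) \log p_n(x) \tag{5}
```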

For example, the values 1 to M assigned to the variable “x” in equation (5) above correspond to the individual images, such as the images IM101 to IM103, contained in the image group IMF1. For example, when the generation device 100 performs feedback processing so that the gradient of the error function L of equation (5) becomes 0, the model M1 becomes able to appropriately output scores indicating the ratio at which an input image is classified into each class.

When the target information is classified into two classes, equation (6) below may be used instead of equation (5) above.
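
The image of equation (6) is not reproduced in this text. Assuming the two-class (binary) cross-entropy form, with t_A(x) and p_A(x) as the target and output ratios for one class and their complements for the other, a consistent reconstruction would be:

```latex
L = -\sum_{x=1}^{M} \Bigl[\, t_A(x) \log p_A(x)
      + \bigl(1 - t_A(x)\bigr) \log\bigl(1 - p_A(x)\bigr) \Bigr] \tag{6}
```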

For example, in the example of FIG. 1, “t_A(x)” in equation (6) above denotes the ratio at which the image IM101 is classified as cat (class CL1). In this case, for example, “t_A(x)”, which corresponds to the ratio of classification as cat (class CL1) in the correct answer information RDT101, is “0.58 (58%)”. Also, for example, “(1 - t_A(x))” is “0.42 (= 1 - 0.58) (42%)” and corresponds to the ratio of classification as dog (class CL2) in the correct answer information RDT101.

In addition, “p_A(x)” in equation (6) above denotes the ratio, based on the output of the model M1, for cat (class CL1) for the image IM101.

Note that the model training method is not limited to the method described above, and any known technique is applicable. Each model may be generated using various conventional machine learning techniques as appropriate. For example, a model may be generated using a supervised machine learning technique such as an SVM (Support Vector Machine). A model may also be generated using an unsupervised machine learning technique. For example, a model may be generated using a deep learning technique, such as an RNN (Recurrent Neural Network) or a CNN, as appropriate. The above description of model generation is illustrative, and a model may be generated by a learning method selected as appropriate according to the information that can be acquired. In other words, the generation device 100 may generate the model M1 by any method, as long as the model M1 can be trained to output scores corresponding to the correct answer information when the target information contained in the learning data is input.

Through the processing described above, in the example of FIG. 1, the generation device 100 generates the model identified by the model ID “M1” (model M1), as shown in the model information storage unit 122. As the model information storage unit 122 in FIG. 1 also shows, the model M1 has the use “image (dog/cat classification)”, that is, it is a model used to estimate which of the two classes, dog or cat, an image is classified into, and its concrete model data is “model data MDT1”. For example, the generation device 100 inputs image information into the model M1, causes the model M1 to output scores indicating the ratio at which the input image information is classified into each class, and estimates the ratio at which the image is classified into each class based on the scores output by the model M1.

As described above, by training with learning data in which image information is associated with correct answer information, the generation device 100 can generate a model that makes it possible to appropriately estimate the ratio at which target information is classified into each class. Therefore, by using the model generated as described above, the generation device 100 makes it possible to accurately estimate, for example, the ratio at which an image is classified into each class (dog/cat classification).

For example, suppose that, in crowdsourcing, users are simply asked to perform the task of classifying target information such as images into desired classes, and the consensus of the resulting answers is estimated. In that case, when no consensus is obtained, the answers cannot be used as data at all. Consensus here may mean, for example, agreement among the users' answers, as reflected in a bias in the answers. For example, consensus may mean that at least a predetermined ratio of users gave a certain answer. For example, consensus may mean that at least a predetermined ratio (70%) of users gave a certain answer (for example, “cat”). Also, for example, when only information from cases where no consensus is obtained is used, it is difficult to estimate useful quantities, such as the tendency of the answers, from the target information.

For example, in conventional training, images for which the ratios of the classes assigned by the workers did not differ clearly, such as the image IM101 and the image IM103 in the example of FIG. 1, were not used for model generation. For example, conventional training used only images with a marked difference between the class ratios, such as the image IM102. For example, in conventional training, the image IM102 was used for training with the cat (class CL1) label set to “1” and the dog (class CL2) label set to “0”, and the images IM101 and IM103 were not used for training. However, cases where the class ratios do not differ clearly are also important as learning data, and there was the problem that the amount of unused data grows as the number of classes increases.

Therefore, by using the ratio of the number of workers who selected each class as the training target (correct answer information), the generation device 100 makes data that was not used conventionally available, and enables appropriate estimation even for images that are themselves difficult to classify. Also, for example, even when the model is used as an ordinary classifier by applying a threshold, the increase in the amount of learning data can improve the generalization performance of the learner.
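
The contrast between the conventional approach and the ratio-based approach can be sketched as follows. This is only an illustration: the function names are hypothetical, the 70% threshold is borrowed from the consensus example in the description, and the answer counts for IM102 and IM103 are made-up values (only the counts for IM101 appear in the text).

```python
def hard_label_dataset(records, threshold=0.7):
    """Conventional approach: keep only images with a clear majority, as 0/1 labels."""
    dataset = []
    for image_id, counts in records.items():
        total = sum(counts.values())
        top_class = max(counts, key=counts.get)
        if counts[top_class] / total >= threshold:
            labels = {cls: 1.0 if cls == top_class else 0.0 for cls in counts}
            dataset.append((image_id, labels))
    return dataset

def ratio_dataset(records):
    """Ratio-based approach: keep every image, using the worker ratios as targets."""
    dataset = []
    for image_id, counts in records.items():
        total = sum(counts.values())
        dataset.append((image_id, {cls: n / total for cls, n in counts.items()}))
    return dataset

records = {
    "IM101": {"cat": 580, "dog": 420},  # counts taken from the text
    "IM102": {"cat": 900, "dog": 100},  # hypothetical counts
    "IM103": {"cat": 450, "dog": 550},  # hypothetical counts
}

hard = hard_label_dataset(records)
soft = ratio_dataset(records)
print(len(hard), len(soft))
```

With these values, only IM102 survives the consensus filter, while all three images remain usable as learning data under the ratio-based approach.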

[2. Estimation process]
An example of the estimation process according to the embodiment is described with reference to FIG. 2. FIG. 2 is a diagram illustrating an example of the estimation process according to the embodiment. FIG. 2 shows a case where, upon acquiring new target information, the generation device 100 estimates the ratio at which that target information is classified into each class and provides information based on the estimate.

First, the generation device 100 acquires an image IM10 as target information (step S21). In the example of FIG. 2, the generation device 100 acquires the image IM10 to be classified from the terminal device 10 used by the user U101.

Having acquired the image IM10, the generation device 100 inputs the image IM10 into a model. For example, the generation device 100 inputs the data of the image IM10 (hereinafter “data IM10”) into the model M1.

In the example of FIG. 2, the generation device 100 calculates scores indicating the ratio at which the image IM10 is classified into each class through the processing shown as processing group PS21. The generation device 100 inputs the data IM10 into the model M1 (step S22). The model M1, having received the data IM10, outputs scores (step S23). The model M1 outputs a score for each class. In the example of FIG. 2, the model M1 that received the data IM10 outputs a score of “0.55” for cat (class CL1) and a score of “0.45” for dog (class CL2), as shown in score SC11. The generation device 100 thereby estimates, using the model M1, that the ratio (probability) at which the image IM10 is classified as cat (class CL1) is 55% and the ratio (probability) at which it is classified as dog (class CL2) is 45%.

The generation device 100 then generates an estimation result (step S24). In the example of FIG. 2, the generation device 100 generates estimation result information ER21 stating that the ratio at which the image IM10 under estimation is classified as cat is 55% and the ratio at which it is classified as dog is 45%.

After that, the generation device 100 provides information based on the estimation result (step S25). In the example of FIG. 2, the generation device 100 provides the generated estimation result information ER21 to the terminal device 10.

As described above, the generation device 100 uses a model to estimate the ratio at which target information is classified into each class. In the example of FIG. 2, the generation device 100 inputs the image IM10 into the model M1 and thereby causes the model M1 to output scores indicating the ratio at which the image IM10 is classified into each class. The generation device 100 then estimates that the higher the score output by the model M1 for a class, the more likely the target information is to be classified into that class. In the example of FIG. 2, the generation device 100 estimates that the higher the cat (class CL1) score output by the model M1, the more likely the image is to be classified as cat. The generation device 100 then provides the estimated result to the terminal device 10. As a result, the user who receives the information from the generation device 100 can grasp which class the target information is classified into. Moreover, the user who receives the ratio information from the generation device 100 can grasp how difficult the target information is to classify.
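
Steps S23 to S25 can be sketched in Python as follows. The function name and the layout of the result dictionary are hypothetical; the description does not specify the format of the estimation result information ER21. The scores are those given for score SC11.

```python
def build_estimation_result(scores):
    """Turn per-class scores into percentages plus the highest-scoring class."""
    top_class = max(scores, key=scores.get)
    percentages = {cls: round(score * 100) for cls, score in scores.items()}
    return {"percentages": percentages, "most_likely": top_class}

# Scores output by the model M1 for the image IM10 (score SC11 in the text).
score_sc11 = {"cat": 0.55, "dog": 0.45}

result_er21 = build_estimation_result(score_sc11)
print(result_er21)
```

Returning the full set of percentages, rather than only the top class, is what lets the recipient judge how ambiguous the image is.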

[2-1. Target of estimation]
The examples of FIG. 1 and FIG. 2 show cases where a model that estimates whether an image is classified as cat (class CL1) or dog (class CL2) is generated and where estimation is performed using that model, but the generation device 100 may generate a model that performs any kind of classification. For example, the generation device 100 may generate a plurality of models by associating various kinds of correct answer information with each piece of target information.

The generation device 100 may also generate models that target not only images but also various other kinds of target information, such as text information or article content combining images and text information.

[2-2. About answers]
To simplify the description, the example of FIG. 1 shows a case where each user (worker) chooses either dog or cat and answers with one of them, but the generation device 100 may instead acquire from each user an answer expressed as a ratio for each class. For example, the generation device 100 may acquire from the user U1 an answer expressed as class ratios, such as a ratio of “0.6 (60%)” for the image IM101 being cat (class CL1) and a ratio of “0.4 (40%)” for it being dog (class CL2). For example, the generation device 100 may acquire from the user U2 an answer expressed as class ratios, such as a ratio of “0.7 (70%)” for the image IM101 being cat (class CL1) and a ratio of “0.3 (30%)” for it being dog (class CL2).

The generation device 100 may also generate correct answer information based on the answers expressed as class ratios from each user. For example, when generating correct answer information based on the answers of the users U1 and U2 above, the generation device 100 may set the ratio of cat (class CL1) for the image IM101 to “0.65 (= (0.6 + 0.7) / 2) (65%)”. Likewise, the generation device 100 may set the ratio of dog (class CL2) for the image IM101 to “0.35 (= (0.4 + 0.3) / 2) (35%)”. Note that the above is an example, and the generation device 100 may acquire various kinds of answers from each user to generate correct answer information.
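
The averaging in this example can be written as a short Python sketch. The function name is hypothetical, and a simple unweighted mean is assumed because that is what the worked figures in the text use.

```python
def average_ratio_answers(answers):
    """Average per-user class ratios into one correct-answer distribution."""
    classes = answers[0].keys()
    return {cls: sum(a[cls] for a in answers) / len(answers) for cls in classes}

# Ratio answers for the image IM101 from the text.
answers_im101 = [
    {"cat": 0.6, "dog": 0.4},  # user U1
    {"cat": 0.7, "dog": 0.3},  # user U2
]

correct_answer = average_ratio_answers(answers_im101)
print({cls: round(v, 2) for cls, v in correct_answer.items()})
```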

[2-3. Weighting according to the answering user (worker)]
The generation device 100 may also weight answers according to the user who gave them. For example, the generation device 100 may weight each user's answer based on information indicating each user's evaluation or reliability in crowdsourcing, such as each user's history and skill level as a worker.

For example, the generation device 100 may set a weight greater than “1” for answers from users whose history as a worker is at least a predetermined number of years. For example, the generation device 100 may set the weight to “2” for answers from such users, so that each answer counts as the answers of two people. Also, for example, the generation device 100 may set a weight greater than “1” for answers from users whose skill level as a worker is at or above a predetermined threshold. For example, the generation device 100 may set the weight to “1.5” for answers from users whose history as a worker is at least a predetermined number of years, so that each answer counts as the answers of 1.5 people. Note that the above is an example, and the generation device 100 may weight each user's answer using various kinds of information.
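
Weighted aggregation of this kind can be sketched as follows. The function name is hypothetical, and which users receive the weights “2” and “1.5” is an assumption made purely for illustration; the description does not assign weights to specific users.

```python
def weighted_class_ratios(votes, weights):
    """Compute class ratios where each user's vote counts by that user's weight."""
    totals = {}
    for user, cls in votes.items():
        # Users without an explicit weight count as one person.
        totals[cls] = totals.get(cls, 0.0) + weights.get(user, 1.0)
    grand_total = sum(totals.values())
    return {cls: t / grand_total for cls, t in totals.items()}

votes = {"U1": "cat", "U2": "cat", "U3": "dog", "U4": "dog"}
weights = {"U1": 2.0, "U3": 1.5}  # hypothetical assignment of the weights in the text

ratios = weighted_class_ratios(votes, weights)
print({cls: round(v, 3) for cls, v in ratios.items()})
```

Here the cat answers total 3.0 and the dog answers total 2.5, so an even 2-to-2 split by head count becomes a ratio that slightly favors cat.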

[3. Configuration of the generation device]
Next, the configuration of the generation device 100 according to the embodiment is described with reference to FIG. 4. FIG. 4 is a diagram illustrating a configuration example of the generation device according to the embodiment. As shown in FIG. 4, the generation device 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The generation device 100 may also include an input unit (for example, a keyboard or a mouse) that accepts various operations from an administrator or the like of the generation device 100, and a display unit (for example, a liquid crystal display) for displaying various kinds of information.

(Communication unit 110)
The communication unit 110 is implemented by, for example, a NIC (Network Interface Card). The communication unit 110 is connected to a network by wire or wirelessly and transmits and receives information to and from the terminal device 10.

(Storage unit 120)
The storage unit 120 is implemented by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disc. As shown in FIG. 4, the storage unit 120 according to the embodiment includes a learning data storage unit 121, a model information storage unit 122, a user information storage unit 123, and an estimation information storage unit 124.

(Learning data storage unit 121)
The learning data storage unit 121 according to the embodiment stores various kinds of information about learning data. FIG. 5 is a diagram illustrating an example of the learning data storage unit according to the embodiment. For example, the learning data storage unit 121 stores the teacher data used to generate models. The learning data storage unit 121 shown in FIG. 5 contains items such as “data ID”, “target information”, and “aggregate information”. “Aggregate information” contains items such as “correct answer information (ratio information)”, “number of users”, “cat (CL1)”, and “dog (CL2)”.

“Data ID” indicates identification information for identifying data. For example, the data identified by the data ID “DT101” corresponds to the data DT101 shown in the example of FIG. 1. “Target information” indicates the target information contained in the data identified by the data ID. For example, “target information” indicates the target information to be classified. “Aggregate information” indicates information obtained by aggregating the answers to the task performed by workers through crowdsourcing.

“Correct answer information (ratio information)” in “aggregate information” indicates the correct answer information (ratio information) corresponding to the data identified by the data ID. For example, “correct answer information (ratio information)” indicates the ratio of answers for each class among all worker answers.

“Number of users” in “aggregate information” indicates the number of users who answered for the corresponding target information. “Cat (CL1)” in “aggregate information” indicates the number of users who answered “cat” for the corresponding target information. In the example of FIG. 5, “cat (CL1)” in “aggregate information” also contains information identifying the users who answered “cat” for the corresponding target information. Likewise, “dog (CL2)” in “aggregate information” indicates the number of users who answered “dog” for the corresponding target information. In the example of FIG. 5, “dog (CL2)” in “aggregate information” also contains information identifying the users who answered “dog” for the corresponding target information.

For example, in the example shown in FIG. 5, the data identified by the data ID “DT101” (data DT101) indicates that the target information to be classified is the image IM101. The data DT101 also indicates that, in the correct answer information, the ratio classified as “cat” is “0.58 (58%)” and the ratio classified as “dog” is “0.42 (42%)”. The data DT101 also indicates that the number of users is 1000. The data DT101 also indicates that 580 users answered “cat”, including the users U1 and U2, and that 420 users answered “dog”, including the users U3 and U4. In other words, the data DT101 indicates that, of the 1000 users (workers), 580 answered “cat” and 420 answered “dog” for the image IM101. Accordingly, in the data DT101, the ratio of cat in the correct answer information is “580 / 1000 = 0.58” and the ratio of dog is “420 / 1000 = 0.42”.
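
One possible in-memory representation of such a record is sketched below. The class and field names are hypothetical and are not taken from FIG. 5; the sketch only shows how the “number of users” and “correct answer information (ratio information)” items follow from the per-class answer counts.

```python
from dataclasses import dataclass, field

@dataclass
class LearningRecord:
    """One row of learning data: an item plus the aggregated worker answers."""
    data_id: str
    target: str
    answer_counts: dict = field(default_factory=dict)

    @property
    def num_users(self):
        # Total number of users who answered for this target.
        return sum(self.answer_counts.values())

    @property
    def correct_answer(self):
        # Ratio of answers for each class among all answers.
        total = self.num_users
        return {cls: n / total for cls, n in self.answer_counts.items()}

# Values for the data DT101 from the text.
dt101 = LearningRecord("DT101", "IM101", {"cat": 580, "dog": 420})
print(dt101.num_users, dt101.correct_answer)
```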

Note that the learning data storage unit 121 is not limited to the above and may store various kinds of information according to the purpose. For example, the learning data storage unit 121 may store information about the date and time at which learning data was added. Also, for example, the learning data storage unit 121 may store information indicating by what kind of determination process each piece of learning data was added. For example, the learning data storage unit 121 may store information indicating, for instance, whether each piece of learning data was determined by an administrator's selection.

(Model information storage unit 122)
The model information storage unit 122 according to the embodiment stores information about models. For example, the model information storage unit 122 stores the model information (model data) generated by the generation process. FIG. 6 is a diagram illustrating an example of the model information storage unit according to the embodiment. The model information storage unit 122 shown in FIG. 6 contains items such as “model ID”, “use”, and “model data”. Although FIG. 6 shows only the models M1 to M3, information on many models, such as M4 and M5, may be stored according to each use (target of estimation).

“Model ID” indicates identification information for identifying a model. For example, the model identified by the model ID “M1” corresponds to the model M1 shown in the example of FIG. 1. “Use” indicates the use of the corresponding model. “Model data” indicates the data of the corresponding model. For example, “model data” contains information including the nodes in each layer, the function adopted by each node, the connection relationships between nodes, and the connection coefficients set for the connections between nodes.

For example, in the example shown in FIG. 6, the model identified by the model ID “M1” (model M1) has the use “image (dog/cat classification)”, indicating that it is used to estimate which of the two classes, dog or cat, an input image is classified into. The model data of the model M1 is the model data MDT1.

Also in the example shown in FIG. 6, the model identified by the model ID “M3” (model M3) has the use “character information (category classification)”, indicating that it is used to estimate which of a plurality of categories (classes) input character information is classified into. For example, the categories (classes) may include three or more categories such as politics, sports, and entertainment. The model data of the model M3 is the model data MDT3.
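As a rough illustration of the record layout described above, the following Python sketch models one entry of the model information storage unit. The class names `Connection`, `ModelData`, and `ModelRecord`, the field names, and the toy contents given for MDT1 are all assumptions made for illustration; the text specifies only the items “model ID”, “use”, and “model data” and the kinds of information the model data contains.

```python
from dataclasses import dataclass, field


@dataclass
class Connection:
    src: str        # node that passes a value on
    dst: str        # node that receives the value
    weight: float   # connection coefficient set for this connection


@dataclass
class ModelData:
    layers: dict                      # layer name -> list of node names
    functions: dict                   # node name -> name of the function it adopts
    connections: list = field(default_factory=list)


@dataclass
class ModelRecord:
    model_id: str
    use: str
    model_data: ModelData


# Hypothetical contents for MDT1; the real structure is not given in the text.
mdt1 = ModelData(
    layers={"input": ["x1", "x2"], "output": ["cat", "dog"]},
    functions={"cat": "softmax", "dog": "softmax"},
    connections=[
        Connection("x1", "cat", 0.8),
        Connection("x2", "dog", 0.5),
    ],
)

# The store is keyed by model ID, mirroring the "model ID" item of FIG. 6.
model_store = {
    "M1": ModelRecord("M1", "image (dog/cat classification)", mdt1),
}
```

Keying the store by model ID makes it straightforward to hold one record per use, as the text suggests for M4, M5, and so on.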

For example, the model M1 (model data MDT1) is a model that includes an input layer to which target information to be classified is input, an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a second element whose value is calculated based on the first element and the weight of the first element. The model causes a computer to function so that, for the target information input to the input layer, an operation based on the first element and the weight of the first element is performed with each element belonging to each layer other than the output layer treated as the first element, and a score value used for estimating the ratio at which the target information is classified into each class is output from the output layer.

Here, suppose that the models M1 to M3 and the like are realized by a regression model expressed as “y = a_1*x_1 + a_2*x_2 + ... + a_i*x_i”. In this case, for example, the first element included in the model M1 corresponds to input data (x_i) such as x_1 and x_2, and the weight of the first element corresponds to the coefficient a_i associated with x_i. A regression model can be regarded as a simple perceptron having an input layer and an output layer. When each model is regarded as a simple perceptron, the first element corresponds to one of the nodes of the input layer, and the second element can be regarded as the node of the output layer.
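The regression form can be written out directly. The following is a minimal Python sketch; the function name `regression_score` and the sample numbers are assumptions chosen for illustration. Each `x[i]` plays the role of a first element (an input-layer node), each `a[i]` is its weight, and the returned value is the second element (the output-layer node).

```python
def regression_score(x, a):
    """Compute y = a_1*x_1 + a_2*x_2 + ... + a_i*x_i."""
    if len(x) != len(a):
        raise ValueError("x and a must have the same length")
    # Each product a_i * x_i is one first element scaled by its weight.
    return sum(a_i * x_i for a_i, x_i in zip(a, x))


# Hypothetical inputs and coefficients.
y = regression_score(x=[1.0, 2.0], a=[0.5, 0.25])
```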

Alternatively, suppose that the models M1 to M3 and the like are realized by a neural network having one or more intermediate layers, such as a DNN. In this case, for example, the first element included in the model M1 corresponds to one of the nodes of the input layer or an intermediate layer. The second element corresponds to a next-stage node, that is, a node to which a value is passed from the node corresponding to the first element. The weight of the first element corresponds to the connection coefficient, which is the weight applied to the value passed from the node corresponding to the first element to the node corresponding to the second element.
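The relationship between first elements, weights, and second elements in a multi-layer network can be sketched as a plain forward pass. This is an illustrative sketch only: the function names, the choice of ReLU and identity activations, and the weight values are assumptions, since the text does not fix any of them.

```python
def relu(z):
    return max(0.0, z)


def identity(z):
    return z


def forward(x, layers):
    """Propagate values layer by layer.

    `layers` is a list of (weights, activation) pairs, where weights[j][i]
    is the connection coefficient from node i (a first element) to node j
    of the next stage (a second element).
    """
    values = list(x)
    for weights, activation in layers:
        values = [
            activation(sum(w * v for w, v in zip(row, values)))
            for row in weights
        ]
    return values


# Hypothetical network: 2 inputs -> 2 intermediate nodes -> 1 output node.
output = forward(
    [1.0, 2.0],
    [
        ([[1.0, -1.0], [0.5, 0.5]], relu),
        ([[2.0, 1.0]], identity),
    ],
)
```

Every node outside the output layer acts as a first element for the layer that follows it, which is why the same loop body serves for every layer.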

Note that the model information storage unit 122 is not limited to the above and may store various model information depending on the purpose. For example, it may store a model used for estimating the ratio at which article content combining an image and character information is classified into each class.

(User information storage unit 123)
The user information storage unit 123 according to the embodiment stores various information about users. For example, it stores information about a plurality of workers (users) who perform tasks through crowdsourcing or the like. FIG. 7 is a diagram illustrating an example of the user information storage unit according to the embodiment. The user information storage unit 123 illustrated in FIG. 7 includes items such as “user ID”, “age”, “gender”, “home”, “work location”, and “interest”.

“User ID” indicates identification information for identifying a user. For example, the user identified by the user ID “U1” corresponds to the user U1 shown in the example of FIG. 1. “Age” indicates the age of the user identified by the user ID. The “age” may be a specific age of the user, such as 35 years old. “Gender” indicates the gender of the user identified by the user ID.

“Home” indicates location information of the home of the user identified by the user ID. In the example shown in FIG. 7, “home” is represented by an abstract code such as “LC11”, but it may be information indicating latitude and longitude. “Home” may also be, for example, a region name or an address.

“Work location” indicates location information of the workplace of the user identified by the user ID. In the example shown in FIG. 7, “work location” is represented by an abstract code such as “LC12”, but it may be information indicating latitude and longitude. “Work location” may also be, for example, a region name or an address.

“Interest” indicates the interests of the user identified by the user ID, that is, the subjects in which the user is highly interested. In the example shown in FIG. 7, one “interest” is shown for each user, but there may be more than one.

For example, the example shown in FIG. 7 indicates that the user identified by the user ID “U1” has the age “20s” and the gender “male”. It also indicates that this user's home is “LC11”, that the work location is “LC12”, and that the user is interested in “sports”.

Note that the user information storage unit 123 is not limited to the above and may store various other information depending on the purpose. For example, it may store information on the user's demographic attributes and psychographic attributes, such as name, family structure, income, interests, and lifestyle. It may also store information indicating each user's evaluation and reliability in crowdsourcing, such as each user's history and skill level as a worker.

(Estimation information storage unit 124)
The estimation information storage unit 124 according to the embodiment stores various information related to estimation, such as estimation targets and estimation results. FIG. 8 shows an example of the estimation information storage unit 124 according to the embodiment. The estimation information storage unit 124 illustrated in FIG. 8 has items such as “estimation target”, “class”, and “score”, and shows information on the class classification (dog/cat classification) of the image IM10 estimated in FIG. 2.

“Estimation target” indicates the classification target to be estimated. “Class” indicates a class into which the estimation target is classified. “Score” indicates the score that serves as the evaluation value of the corresponding class. For example, “score” indicates the estimated ratio at which the target is classified into the corresponding class.

For example, the example shown in FIG. 8 indicates that the estimation target is the image IM10. It also indicates that the score of the image IM10 for cat (class CL1) is “0.55”, that is, the ratio at which the image IM10 is classified as a cat (class CL1) is 55%. Likewise, the score of the image IM10 for dog (class CL2) is “0.45”, that is, the ratio at which the image IM10 is classified as a dog (class CL2) is 45%.

Note that the estimation information storage unit 124 is not limited to the above and may store various other information depending on the purpose.

(Control unit 130)
Returning to the description of FIG. 4, the control unit 130 is a controller and is realized, for example, by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like executing various programs (corresponding to an example of the generation program) stored in a storage device inside the generation device 100, using a RAM as a work area. The control unit 130 may alternatively be realized by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array). The control unit 130 performs information processing according to the model M1 or the like stored in the model information storage unit 122. The model includes an input layer to which target information to be classified is input, an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a second element whose value is calculated based on the first element and the weight of the first element. For the target information input to the input layer, the control unit 130 performs an operation based on the first element and the weight of the first element, with each element belonging to each layer other than the output layer treated as the first element, and thereby outputs from the output layer a score value used for estimating the ratio at which the target information is classified into each class.

As shown in FIG. 4, the control unit 130 includes an acquisition unit 131, a generation unit 132, an estimation unit 133, and a providing unit 134, and implements or executes the information processing functions and operations described below. The internal configuration of the control unit 130 is not limited to the configuration shown in FIG. 4 and may be any other configuration that performs the information processing described later. By information processing according to the model M1 (model data MDT1) stored in the storage unit 120, which includes a second element whose value is calculated based on a first element and the weight of the first element, the control unit 130 performs, for the target information input to the input layer, an operation based on the first element and the weight of the first element with each element belonging to each layer other than the output layer treated as the first element, and outputs from the output layer a score value used for estimating the ratio at which the target information is classified into each class.

(Acquisition unit 131)
The acquisition unit 131 acquires various information. For example, it acquires various information from the learning data storage unit 121, the model information storage unit 122, the user information storage unit 123, the estimation information storage unit 124, and the like. The acquisition unit 131 may also acquire various information from an external information processing device or from the terminal device 10 or the like. For example, the acquisition unit 131 acquires workers' answers regarding the target information.

For example, the acquisition unit 131 acquires target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by a plurality of users. For example, it acquires target information to be classified and ratio information indicating the ratio at which the target information was classified into each class as selected by each of the plurality of users. For example, it acquires ratio information indicating the ratio at which the target information was classified into each class by a plurality of users in crowdsourcing. For example, it acquires ratio information based on the total number of users and the number of users who selected each class for the target information. For example, the acquisition unit 131 acquires target information including image information, or target information including character information.

In the example of FIG. 1, the acquisition unit 131 collects information for generating correct answer information. For example, it has users (workers) classify the image group IML1, which includes the images IM101 to IM103 and others, through crowdsourcing and acquires their answers.

In the example of FIG. 1, the acquisition unit 131 acquires answers regarding the image IM101 from the users. For example, it acquires the answer that the image IM101 is a “cat” from the user U1, a “cat” from the user U2, a “dog” from the user U3, a “dog” from the user U4, and a “cat” from the user U5.
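The ratio information based on the number of users and the number of users who selected each class can be computed as a simple tally. The sketch below uses only the five answers listed above for the image IM101; the function name `ratio_information` and the dictionary layout are assumptions for illustration.

```python
from collections import Counter


def ratio_information(answers, classes):
    """Return, for each class, the share of users who selected it."""
    total = len(answers)
    if total == 0:
        raise ValueError("at least one answer is required")
    counts = Counter(answers.values())
    return {c: counts[c] / total for c in classes}


# Answers for image IM101 from users U1 to U5, as in the example of FIG. 1.
answers_im101 = {"U1": "cat", "U2": "cat", "U3": "dog", "U4": "dog", "U5": "cat"}
ratios_im101 = ratio_information(answers_im101, ["cat", "dog"])
# Three of five users chose "cat" and two chose "dog": cat 0.6, dog 0.4.
```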

For example, the acquisition unit 131 acquires new target information. In the example of FIG. 2, it acquires the image IM10 to be classified from the terminal device 10 used by the user U101.

(Generation unit 132)
The generation unit 132 generates various information. For example, it uses the learning data stored in the learning data storage unit 121 to generate a model such as those shown in the model information storage unit 122. For example, based on the learning data acquired by the acquisition unit 131, it generates a model used for estimating the ratio at which target information is classified into each class. For example, it generates such a model based on learning data that includes target information and correct answer information indicating the ratio at which the target information is classified into each class.

For example, based on the target information and the ratio information acquired by the acquisition unit 131, the generation unit 132 generates a model that, when one piece of target information is input, estimates the ratio at which that target information is classified into each class. For example, the generation unit 132 generates such a model as a neural network. For example, it generates a model that is a neural network performing convolution processing and pooling processing. For example, it generates the model based on target information for which the difference between the ratios corresponding to the respective classes in the ratio information is within a predetermined range.
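Two pieces of this description lend themselves to a short sketch: selecting target information whose per-class ratios differ by no more than a given range, and scoring a prediction against ratio labels. Both parts below are assumptions for illustration. The text does not define how the difference is measured (here it is taken as the largest ratio minus the smallest), and it does not name a loss function (cross-entropy against the ratio labels is swapped in as a common choice).

```python
import math


def within_range(ratios, max_gap):
    """True if the gap between the largest and smallest class ratio is at most max_gap."""
    values = list(ratios.values())
    return max(values) - min(values) <= max_gap


def soft_cross_entropy(target, predicted, eps=1e-12):
    """Cross-entropy between ratio labels and predicted ratios (lower is better)."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(target, predicted))


# Hypothetical learning data: image ID -> ratio information.
candidates = {
    "IM101": {"cat": 0.6, "dog": 0.4},
    "IM102": {"cat": 0.9, "dog": 0.1},
}
selected = [k for k, r in candidates.items() if within_range(r, max_gap=0.3)]

loss_good = soft_cross_entropy([0.6, 0.4], [0.6, 0.4])
loss_bad = soft_cross_entropy([0.6, 0.4], [0.9, 0.1])
```

With ratio labels the loss is smallest when the predicted ratios match the labels, so a model trained this way is pushed toward reproducing the split among users rather than a single winning class.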

For example, the generation unit 132 generates the models M1 to M3 and the like and stores them in the model information storage unit 122. The generation unit 132 may use any learning algorithm to generate the models M1 to M3 and the like. For example, it generates them using a learning algorithm such as a neural network, a support vector machine (SVM), clustering, or reinforcement learning. As one example, when the generation unit 132 generates the models M1 to M3 and the like using a neural network, each model has an input layer including one or more neurons, an intermediate layer including one or more neurons, and an output layer including one or more neurons.

The generation unit 132 generates a model and stores the generated model in the model information storage unit 122. Specifically, the generation unit 132 generates a model that includes an input layer to which target information to be classified is input, an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a second element whose value is calculated based on the first element and the weight of the first element. For the target information input to the input layer, the model performs an operation based on the first element and the weight of the first element, with each element belonging to each layer other than the output layer treated as the first element, and thereby outputs from the output layer a score value used for estimating the ratio at which the target information is classified into each class.

In the example of FIG. 1, the generation unit 132 generates ratio information indicating the ratio at which target information is classified into each class. For example, it adds, as learning data, a combination of target information and aggregate information that includes the ratio information as correct answer information. For example, it adds the data DT101 to DT103 and the like, corresponding respectively to the images IM101 to IM103 and the like that are target information, to the learning data storage unit 121.

In the example of FIG. 1, the generation unit 132 generates a model based on learning data including the images IM101 to IM103 and the like described above. For example, it generates the model by performing learning with the data DT101 to DT103 and the like in the learning data storage unit 121 as learning data (teacher data). Through the processing shown in FIG. 1, the generation unit 132 generates the model identified by the model ID “M1” (model M1), as shown in the model information storage unit 122.

For example, the generation unit 132 generates an estimation result to be provided to the user based on the information estimated by the estimation unit 133. In the example of FIG. 2, it generates the estimation result information ER21 indicating that the ratio at which the image IM10, the estimation target, is classified as a cat is 55% and the ratio at which it is classified as a dog is 45%.

(Estimation unit 133)
The estimation unit 133 estimates various information. It does so using the information stored in the learning data storage unit 121, the model information storage unit 122, the user information storage unit 123, the estimation information storage unit 124, and the like. For example, the estimation unit 133 estimates various information based on the information acquired by the acquisition unit 131.

For example, the estimation unit 133 uses the model to estimate the ratio at which target information is classified into each class. By inputting new target information into the model, the estimation unit 133 estimates the ratio at which the new target information is classified into each class.

In the example of FIG. 2, the estimation unit 133 calculates scores indicating the ratio at which the image IM10 is classified into each class by the processing shown in the processing group PS21. The estimation unit 133 inputs the image IM10 into the model M1, and the model M1 outputs a score corresponding to each class. In the example of FIG. 2, the model M1 to which the image IM10 has been input outputs, as shown in the score SC11, a score of “0.55” for cat (class CL1) and a score of “0.45” for dog (class CL2). The estimation unit 133 thus uses the model M1 to estimate that the ratio (probability) at which the image IM10 is classified as a cat (class CL1) is 55% and the ratio (probability) at which it is classified as a dog (class CL2) is 45%.
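One common way for an output layer to produce per-class scores that sum to 1 is a softmax over raw output values. The text does not state that the model M1 uses softmax, so the sketch below is an assumption for illustration; the raw values are chosen so that the scores come out at approximately 0.55 and 0.45, matching the example of FIG. 2.

```python
import math


def softmax(raw):
    """Normalize raw output values into scores that sum to 1."""
    m = max(raw)                       # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in raw]
    total = sum(exps)
    return [e / total for e in exps]


def estimate(raw_outputs, classes):
    """Pair each class with its score."""
    return dict(zip(classes, softmax(raw_outputs)))


# Hypothetical raw outputs for image IM10.
scores_im10 = estimate([math.log(0.55), math.log(0.45)], ["CL1 (cat)", "CL2 (dog)"])
```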

For example, the estimation unit 133 calculates scores using a model having an arbitrary structure, such as the regression model or neural network described above. Specifically, the coefficients of the model M1 are set so that, when target information to be classified (that is, each element used for calculating the score described above) is input, the model outputs a value quantifying the estimation of a predetermined target (that is, a score suggesting how likely it is that the one piece of target information is classified into each class). The estimation unit 133 uses such a model M1 to calculate a score for each listed target.

The above example describes the case in which the model M1 is a model that, when target information to be classified is input, outputs a value quantifying the estimated ratio at which the target information is classified into each class. However, the model according to the embodiment (model X) may be a model generated based on results obtained by repeatedly inputting data to and outputting data from the model M1. For example, the model X may be a model (model Y) trained to take target information to be classified as input and to output the score that the model M1 outputs. Alternatively, the model M1 may be a model trained to take target information to be classified as input and to output the output value of the model Y. When the estimation unit 133 performs estimation processing using a GAN (Generative Adversarial Network), the model M1 may be a model constituting part of the GAN.
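Training a model Y to reproduce the scores output by the model M1 starts from pairs of inputs and M1 outputs, an arrangement that resembles what is often called knowledge distillation (a term the text itself does not use). The following sketch only builds those pairs; `make_training_pairs` and `toy_m1` are hypothetical names, and `toy_m1` is a stand-in rather than the actual model M1.

```python
def make_training_pairs(teacher, inputs):
    """Pair each input with the teacher model's scores, for use as training targets."""
    return [(x, teacher(x)) for x in inputs]


def toy_m1(x):
    """Hypothetical stand-in for model M1: maps one feature value to (cat, dog) scores."""
    cat = min(max(x, 0.0), 1.0)
    return (cat, 1.0 - cat)


pairs_for_model_y = make_training_pairs(toy_m1, [0.25, 0.5])
```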

(Providing unit 134)
The providing unit 134 provides various information. For example, it provides various information to the terminal device 10. The providing unit 134 provides a service based on the ratio, estimated by the estimation unit 133, at which one piece of target information is classified into each class. For example, based on that estimated ratio, it provides information indicating which class the target information belongs to. The providing unit 134 may also provide information about the model generated by the generation unit 132, or information output by the model, to an external information processing device.

In the example of FIG. 1, the providing unit 134 provides target information to the terminal device 10-1 used by the user U1, who is a worker. For example, it provides the image IM101 and the like, or the image group IML1 and the like, as target information to the terminal devices 10-1 to 10-5.

For example, the providing unit 134 provides information based on the estimation result. In the example of FIG. 2, it provides the estimation result information ER21 generated by the generation unit 132 to the terminal device 10.

[4. Flow of generation processing]
Next, the procedure of the generation processing performed by the generation system 1 according to the embodiment will be described with reference to FIG. 9. FIG. 9 is a flowchart illustrating an example of the generation processing according to the embodiment.

As shown in FIG. 9, the generation device 100 acquires learning data (step S101). For example, the generation device 100 acquires the learning data from the learning data storage unit 121.

The generation device 100 then generates a model based on the learning data (step S102). In the example of FIG. 1, the generation device 100 generates the model M1 using the learning data acquired from the learning data storage unit 121.
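Steps S101 and S102 can be sketched end to end with a very small model. The example below is a sketch under stated assumptions, not the method of the embodiment: it uses a single-layer softmax model trained by per-sample gradient descent on a cross-entropy loss, and the feature vectors, ratio labels, learning rate, and function names are all hypothetical.

```python
import math


def softmax(raw):
    m = max(raw)
    exps = [math.exp(z - m) for z in raw]
    total = sum(exps)
    return [e / total for e in exps]


def predict(weights, x):
    """One score per class: softmax over the weighted sums of the inputs."""
    return softmax([sum(w * v for w, v in zip(row, x)) for row in weights])


def generate_model(learning_data, n_classes, n_features, lr=0.5, epochs=200):
    """Step S102: fit weights so that predicted ratios approach the ratio labels."""
    weights = [[0.0] * n_features for _ in range(n_classes)]
    for _ in range(epochs):
        for x, target in learning_data:
            p = predict(weights, x)
            for k in range(n_classes):
                # For softmax with cross-entropy, the gradient with respect
                # to the k-th weighted sum is (p_k - t_k).
                for i in range(n_features):
                    weights[k][i] -= lr * (p[k] - target[k]) * x[i]
    return weights


# Step S101: acquire learning data (toy feature vectors with ratio labels).
learning_data = [
    ([1.0, 0.0], [0.6, 0.4]),
    ([0.0, 1.0], [0.2, 0.8]),
]
model_weights = generate_model(learning_data, n_classes=2, n_features=2)
```

After training, `predict(model_weights, [1.0, 0.0])` returns ratios close to the label `[0.6, 0.4]`, which is the behavior the generated model is meant to have: it reproduces the split among users instead of a single hard class.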

[5. Flow of estimation processing]
Next, the procedure of the estimation processing performed by the generation system 1 according to the embodiment will be described with reference to FIG. 10. FIG. 10 is a flowchart illustrating an example of the estimation processing according to the embodiment.

As shown in FIG. 10, the generation device 100 acquires target information (step S201). In the example of FIG. 2, the generation device 100 acquires the image IM10 as target information from the terminal device 10 used by the user U101.

また、生成装置１００は、対象情報とモデルを用いて対象情報に対する各クラスに分類される割合を推定する（ステップＳ２０２）。図２の例では、生成装置１００は、モデルＭ１を用いて、画像ＩＭ１０が猫（クラスＣＬ１）に分類される割合（確率）が５５％であり、犬（クラスＣＬ２）に分類される割合（確率）が４５％であると推定する。 Further, the generation apparatus 100 estimates the ratio of classification into each class for the target information using the target information and the model (Step S202). In the example of FIG. 2, the generation apparatus 100 uses the model M1, and the ratio (probability) that the image IM10 is classified as a cat (class CL1) is 55%, and the ratio (class CL2) that is classified as a dog (class CL2) ( (Probability) is estimated to be 45%.

また、生成装置１００は、推定した対象情報に対する各クラスに分類される割合に関する情報を提供する（ステップＳ２０３）。図２の例では、生成装置１００は、推定対象である画像ＩＭ１０が猫に分類される割合が５５％であり、犬に分類される割合が４５％であるとの推定結果情報ＥＲ２１をユーザＵ１０１が利用する端末装置１０へ提供する。 In addition, the generation device 100 provides information related to the ratio classified into each class with respect to the estimated target information (step S203). In the example of FIG. 2, the generation apparatus 100 uses the estimation result information ER21 that the image IM10 to be estimated is classified as a cat by 55% and the ratio classified as a dog by 45% as user U101. Is provided to the terminal device 10 used.
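The flow of steps S201 to S203 can be sketched as follows. The sketch assumes that the model returns one raw score per class and that a softmax turns those scores into ratios; neither detail is stated in this section. The function names, the stub model, and the layout of the result are hypothetical, and the stub's scores are chosen only to reproduce the 55% and 45% figures of the FIG. 2 example.

```python
import math

CLASS_NAMES = {"CL1": "cat", "CL2": "dog"}  # class IDs from the FIG. 2 example

def stub_model_m1(target_info):
    """Hypothetical stand-in for model M1: returns one raw score per class."""
    # Fixed scores chosen so the softmax below yields the 55% / 45% example.
    return {"CL1": math.log(0.55), "CL2": math.log(0.45)}

def estimate_ratios(model, target_info):
    """Step S202: turn the model's scores into ratios that sum to 1."""
    scores = model(target_info)
    peak = max(scores.values())
    exps = {cls: math.exp(s - peak) for cls, s in scores.items()}
    total = sum(exps.values())
    return {cls: e / total for cls, e in exps.items()}

def build_estimation_result(target_id, ratios):
    """Step S203: package the ratios (as percentages) for the terminal device."""
    return {
        "target": target_id,
        "ratios": {CLASS_NAMES[cls]: round(r * 100) for cls, r in ratios.items()},
    }

# Step S201: the target information (here only an ID) arrives from the terminal.
estimated = estimate_ratios(stub_model_m1, "IM10")
ESTIMATION_RESULT = build_estimation_result("IM10", estimated)
```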

[6. Effects]
As described above, the generation device 100 according to the embodiment includes the acquisition unit 131 and the generation unit 132. The acquisition unit 131 acquires target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users. The generation unit 132 generates, based on the target information and the ratio information acquired by the acquisition unit 131, a model that estimates, when one piece of target information is input, the ratio at which that piece of target information is classified into each class.

In this way, the generation device 100 according to the embodiment generates, based on the target information and the ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users, a model that estimates, when one piece of target information is input, the ratio at which that piece of target information is classified into each class. This makes it possible to appropriately estimate the ratio at which target information is classified into each class.

In the generation device 100 according to the embodiment, the acquisition unit 131 acquires ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users in crowdsourcing.

In this way, by acquiring ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users in crowdsourcing, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.

In the generation device 100 according to the embodiment, the acquisition unit 131 acquires ratio information based on the number of the plurality of users and the number of users who selected each class for the target information.

In this way, by acquiring ratio information based on the number of the plurality of users and the number of users who selected each class for the target information, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.
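As a minimal sketch of ratio information based on these two counts, the ratio for a class can be taken as the number of users who selected that class divided by the total number of users. The function name `ratio_information` and the five votes below are hypothetical.

```python
from collections import Counter

def ratio_information(selections, classes):
    """Return, per class, (users who selected the class) / (all users)."""
    total_users = len(selections)
    if total_users == 0:
        raise ValueError("at least one user selection is required")
    counts = Counter(selections.values())
    return {cls: counts[cls] / total_users for cls in classes}

# Hypothetical votes by five workers on one image: user ID -> selected class.
votes = {"U1": "CL1", "U2": "CL1", "U3": "CL2", "U4": "CL1", "U5": "CL2"}
RATIOS = ratio_information(votes, ["CL1", "CL2"])
```

Passing the class list explicitly keeps classes that no user selected in the result with a ratio of 0.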

In the generation device 100 according to the embodiment, the acquisition unit 131 acquires target information including image information.

In this way, by acquiring target information including image information, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.

In the generation device 100 according to the embodiment, the acquisition unit 131 acquires target information including character information.

In this way, by acquiring target information including character information, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.

In the generation device 100 according to the embodiment, the generation unit 132 generates a model that is a neural network that estimates, when one piece of target information is input, the ratio at which that piece of target information is classified into each class.

In this way, by generating a model that is a neural network that estimates, when one piece of target information is input, the ratio at which that piece of target information is classified into each class, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.
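The forward computation of such a network can be sketched as follows, borrowing the "first element" and "second element" wording used in the claims. Each value in one layer is combined with its weight to compute the values of the next layer, and the output layer yields one score per class. The layer sizes, the weights, the ReLU activation on the hidden layer, and the softmax that converts scores into ratios are all assumptions made for illustration.

```python
import math

def forward(layers, inputs):
    """Propagate inputs through fully connected layers.

    Each value in the current layer acts as a "first element"; each value in
    the next layer is a "second element" computed from those values and
    their weights.
    """
    values = inputs
    for index, weights in enumerate(layers):
        sums = [sum(w * v for w, v in zip(row, values)) for row in weights]
        is_output = index == len(layers) - 1
        # ReLU on hidden layers is an assumption; the output layer stays linear.
        values = sums if is_output else [max(0.0, s) for s in sums]
    return values

def to_ratios(scores):
    """Softmax (an assumption): convert output scores into ratios summing to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical weights: one row per element of the next layer.
LAYERS = [
    [[1.0, -1.0], [0.5, 0.5]],  # input layer (2 values) -> hidden layer (2 values)
    [[1.0, 0.0], [0.0, 1.0]],   # hidden layer -> output layer (2 classes)
]
SCORES = forward(LAYERS, [2.0, 1.0])
RATIOS = to_ratios(SCORES)
```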

In the generation device 100 according to the embodiment, the generation unit 132 generates a model that is a neural network that performs convolution processing and pooling processing.

In this way, by generating a model that is a neural network that performs convolution processing and pooling processing, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.
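For reference, convolution processing and pooling processing can be sketched on a small two-dimensional array as follows. The sketch assumes a convolution without padding (computed as a cross-correlation, as is common in convolutional neural networks) and non-overlapping max pooling. The image values, the kernel, and the pooling window size are hypothetical, and the embodiment may configure these operations differently.

```python
def convolve2d(image, kernel):
    """Valid (no padding) 2D convolution, computed as a cross-correlation."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]

def max_pool2d(feature_map, size=2):
    """Non-overlapping max pooling; rows/columns that do not fill a window are dropped."""
    rows = len(feature_map) // size
    cols = len(feature_map[0]) // size
    return [
        [
            max(feature_map[r * size + dr][c * size + dc]
                for dr in range(size) for dc in range(size))
            for c in range(cols)
        ]
        for r in range(rows)
    ]

# Hypothetical 5x5 single-channel image and 2x2 kernel.
IMAGE = [
    [1, 2, 0, 1, 3],
    [0, 1, 2, 3, 1],
    [1, 0, 1, 2, 0],
    [2, 1, 0, 1, 1],
    [0, 3, 1, 0, 2],
]
KERNEL = [[1, 0], [0, 1]]
FEATURE_MAP = convolve2d(IMAGE, KERNEL)  # 4x4 feature map
POOLED = max_pool2d(FEATURE_MAP)         # 2x2 pooled map
```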

The generation device 100 according to the embodiment also includes the estimation unit 133. The estimation unit 133 uses the model to estimate the ratio at which target information is classified into each class.

In this way, by using the model to estimate the ratio at which target information is classified into each class, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.

In the generation device 100 according to the embodiment, the acquisition unit 131 acquires new target information. The estimation unit 133 estimates the ratio at which the new target information is classified into each class by inputting the new target information into the model.

In this way, by inputting new target information into the model and estimating the ratio at which the new target information is classified into each class, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.

In the generation device 100 according to the embodiment, the generation unit 132 generates the model based on target information for which the difference between the ratios corresponding to the respective classes in the ratio information falls within a predetermined range.

In this way, by generating the model based on target information for which the difference between the ratios corresponding to the respective classes in the ratio information falls within a predetermined range, the generation device 100 according to the embodiment makes it possible to appropriately estimate the ratio at which target information is classified into each class.
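One way to select such target information is sketched below. It assumes that the "difference between the ratios" is the gap between the largest and smallest class ratio and that the predetermined range is a closed interval [lower, upper]. Both readings, the function names, the bounds, and the sample ratios are hypothetical.

```python
def ratio_spread(ratios):
    """Gap between the largest and smallest class ratio (assumed definition)."""
    return max(ratios.values()) - min(ratios.values())

def select_learning_data(samples, lower, upper):
    """Keep only samples whose ratio spread lies within [lower, upper]."""
    return [
        (target_id, ratios)
        for target_id, ratios in samples
        if lower <= ratio_spread(ratios) <= upper
    ]

# Hypothetical ratio information for three images.
SAMPLES = [
    ("IM101", {"CL1": 0.9, "CL2": 0.1}),
    ("IM102", {"CL1": 0.55, "CL2": 0.45}),
    ("IM103", {"CL1": 0.4, "CL2": 0.6}),
]
# Keep images on which the workers' selections were closely split.
SELECTED = select_learning_data(SAMPLES, lower=0.0, upper=0.3)
```

With these bounds, only the images on which the workers' selections were split are kept as learning data.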

[7. Hardware configuration]
The generation device 100 according to the embodiment described above is realized by, for example, a computer 1000 configured as illustrated in FIG. 11. FIG. 11 is a hardware configuration diagram illustrating an example of a computer that realizes the functions of the generation device. The computer 1000 includes a CPU 1100, a RAM 1200, a ROM (Read Only Memory) 1300, an HDD (Hard Disk Drive) 1400, a communication interface (I/F) 1500, an input/output interface (I/F) 1600, and a media interface (I/F) 1700.

The CPU 1100 operates based on a program stored in the ROM 1300 or the HDD 1400 and controls each unit. The ROM 1300 stores a boot program executed by the CPU 1100 when the computer 1000 starts up, programs that depend on the hardware of the computer 1000, and the like.

The HDD 1400 stores programs executed by the CPU 1100, data used by those programs, and the like. The communication interface 1500 receives data from other devices via the network N and sends it to the CPU 1100, and transmits data generated by the CPU 1100 to other devices via the network N.

The CPU 1100 controls output devices such as a display and a printer and input devices such as a keyboard and a mouse via the input/output interface 1600. The CPU 1100 acquires data from the input devices via the input/output interface 1600. The CPU 1100 also outputs generated data to the output devices via the input/output interface 1600.

The media interface 1700 reads a program or data stored in the recording medium 1800 and provides it to the CPU 1100 via the RAM 1200. The CPU 1100 loads the program from the recording medium 1800 onto the RAM 1200 via the media interface 1700 and executes the loaded program. The recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or a PD (Phase change rewritable Disk), a magneto-optical recording medium such as an MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory.

For example, when the computer 1000 functions as the generation device 100 according to the embodiment, the CPU 1100 of the computer 1000 realizes the functions of the control unit 130 by executing the program or data (for example, the model M1 (model data MDT1)) loaded onto the RAM 1200. The CPU 1100 of the computer 1000 reads these programs or data (for example, the model M1 (model data MDT1)) from the recording medium 1800 and executes them; as another example, it may acquire these programs from another device via the network N.

Some embodiments and modifications of the present application have been described above in detail with reference to the drawings, but these are examples. The present invention can be carried out in other forms with various modifications and improvements based on the knowledge of those skilled in the art, including the aspects described in the disclosure of the invention.

[8. Others]
Of the processes described in the above embodiment and modifications, all or part of the processes described as being performed automatically can be performed manually, and all or part of the processes described as being performed manually can be performed automatically by known methods. In addition, the processing procedures, specific names, and information including various data and parameters shown in the above document and drawings can be changed arbitrarily unless otherwise specified. For example, the various types of information shown in each drawing are not limited to the illustrated information.

The components of each illustrated device are functional and conceptual, and do not necessarily need to be physically configured as illustrated. That is, the specific form of distribution and integration of each device is not limited to that illustrated, and all or part of each device can be functionally or physically distributed or integrated in arbitrary units according to various loads, usage conditions, and the like.

The embodiment and modifications described above can be combined as appropriate to the extent that the processing contents do not contradict each other.

The "unit (section, module, unit)" described above can be read as "means", "circuit", or the like. For example, the acquisition unit can be read as an acquisition means or an acquisition circuit.

DESCRIPTION OF SYMBOLS
1 Generation system
100 Generation device
121 Learning data storage unit
122 Model information storage unit
123 User information storage unit
124 Estimation information storage unit
130 Control unit
131 Acquisition unit
132 Generation unit
133 Estimation unit
134 Providing unit
10 Terminal device
N Network

Claims

A generation device comprising:
an acquisition unit that acquires target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users; and
a generation unit that generates, based on the target information and the ratio information acquired by the acquisition unit, a model that estimates, when one piece of target information is input, the ratio at which the one piece of target information is classified into each class.

The generation device according to claim 1, wherein the acquisition unit acquires the ratio information indicating the ratio at which the target information was classified into each class by each of the plurality of users in crowdsourcing.

The generation device according to claim 1 or claim 2, wherein the acquisition unit acquires the ratio information based on the number of the plurality of users and the number of users who selected each class for the target information.

The generation device according to claim 1, wherein the acquisition unit acquires the target information including image information.

The generation device according to claim 1, wherein the acquisition unit acquires the target information including character information.

The generation device according to claim 1, wherein the generation unit generates the model that is a neural network that estimates, when the one piece of target information is input, the ratio at which the one piece of target information is classified into each class.

The generation device according to claim 6, wherein the generation unit generates the model that is the neural network that performs convolution processing and pooling processing.

The generation device according to claim 1, further comprising an estimation unit that uses the model to estimate the ratio at which target information is classified into each class.

The generation device according to claim 8, wherein the acquisition unit acquires new target information, and the estimation unit estimates the ratio at which the new target information is classified into each class by inputting the new target information into the model.

The generation device according to claim 1, wherein the generation unit generates the model based on the target information for which the difference between the ratios corresponding to the respective classes in the ratio information falls within a predetermined range.

A generation method executed by a computer, the generation method comprising:
an acquisition step of acquiring target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users; and
a generation step of generating, based on the target information and the ratio information acquired in the acquisition step, a model that estimates, when one piece of target information is input, the ratio at which the one piece of target information is classified into each class.

A program for causing a computer to execute:
an acquisition procedure of acquiring target information to be classified and ratio information indicating the ratio at which the target information was classified into each class by each of a plurality of users; and
a generation procedure of generating, based on the target information and the ratio information acquired by the acquisition procedure, a model that estimates, when one piece of target information is input, the ratio at which the one piece of target information is classified into each class.

Learning data including target information to be classified and correct-answer information indicating the ratio at which the target information was classified into each class by each of a plurality of users,
the learning data being input to the input layer of a model that has an input layer and an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a second element whose value is calculated based on the first element and a weight of the first element, and that performs, on the target information input to the input layer, an operation based on the first element and the weight of the first element with each element belonging to each layer other than the output layer taken as the first element, so that an output value indicating the result of the operation is output from the output layer of the model,
the learning data causing a computer to function so as to perform learning based on a comparison between the correct-answer information corresponding to the target information and the output value.

A model comprising:
an input layer to which target information to be classified is input;
an output layer;
a first element belonging to any layer from the input layer to the output layer other than the output layer; and
a second element whose value is calculated based on the first element and a weight of the first element,
the model causing a computer to function so as to output, from the output layer, a score value indicating the ratio at which the target information is classified into each class, by performing, on the target information input to the input layer, an operation based on the first element and the weight of the first element with each element belonging to each layer other than the output layer taken as the first element.