JP6373243B2

JP6373243B2 - Information processing apparatus, information processing method, and information processing program

Info

Publication number: JP6373243B2
Application number: JP2015226736A
Authority: JP
Inventors: 賢太郎西; 陽本谷; 井関　洋平; 洋平井関; 淳基寺山; 渉祐川▲崎▼
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2015-11-19
Filing date: 2015-11-19
Publication date: 2018-08-15
Anticipated expiration: 2035-11-19
Also published as: JP2017097488A

Description

本発明は、情報処理装置、情報処理方法および情報処理プログラムに関する。 The present invention relates to an information processing apparatus, an information processing method, and an information processing program.

近年、用語の解説を提示する技術が知られている。例えば、このような技術では、コンテンツに含まれる用語の解説を取得してコンテンツの出力されている映像と用語の解説を合成することで、出力されている映像に対してリアルタイムに用語解説を表示する。 In recent years, techniques for presenting explanations of terms have been known. For example, in such a technology, explanations of terms included in the content are obtained, and the explanation of the terms is displayed in real time on the output video by synthesizing the explanation of the terms with the output video of the content. To do.

特開２０１１−２５９１７６号公報JP 2011-259176 A

しかしながら、上記の従来技術では、聞き手にとって適切な用語の解説を表示することができるとは限らなかった。例えば、上記の従来技術では、コンテンツに含まれる用語の解説を画一的に取得して表示するので、聞き手にとって常識的な用語の解説を表示する場合がある。このようなことから、上記の従来技術では、聞き手にとって適切な用語の解説を表示することができるとは限らなかった。 However, in the above prior art, it is not always possible to display explanations of terms suitable for the listener. For example, in the above-described conventional technology, explanations of terms included in the content are uniformly acquired and displayed, so that explanations of terms common to the listener may be displayed. For this reason, in the above-described conventional technology, it is not always possible to display explanations of terms appropriate for the listener.

本願は、上記に鑑みてなされたものであって、聞き手にとって適切な用語の解説を表示することができる情報処理装置、情報処理方法および情報処理プログラムを提供することを目的とする。 The present application has been made in view of the above, and an object thereof is to provide an information processing apparatus, an information processing method, and an information processing program capable of displaying an explanation of terms suitable for a listener.

本願に係る情報処理装置は、音声に関する情報を受信する受信部と、前記受信部によって受信された音声に関する情報に含まれる用語を抽出する抽出部と、前記抽出部によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する出力部とを備えたことを特徴とする。 An information processing apparatus according to the present application includes a receiving unit that receives information related to speech, an extracting unit that extracts terms included in information related to speech received by the receiving unit, and a listener among terms extracted by the extracting unit And an output unit for outputting a summary of technical terms corresponding to the.

実施形態の一態様によれば、聞き手にとって適切な用語の解説を表示することができるという効果を奏する。 According to one aspect of the embodiment, it is possible to display an explanation of terms appropriate for the listener.

図１は、実施形態に係る提示システムによる提示処理の一例を示す説明図である。Drawing 1 is an explanatory view showing an example of presentation processing by a presentation system concerning an embodiment. 図２は、実施形態に係る情報処理装置の構成例を示す図である。FIG. 2 is a diagram illustrating a configuration example of the information processing apparatus according to the embodiment. 図３は、実施形態に係る音声情報記憶部の一例を示す図である。FIG. 3 is a diagram illustrating an example of a voice information storage unit according to the embodiment. 図４は、実施形態に係る用語情報記憶部の一例を示す図である。FIG. 4 is a diagram illustrating an example of the term information storage unit according to the embodiment. 図５は、実施形態に係るユーザ情報記憶部の一例を示す図である。FIG. 5 is a diagram illustrating an example of a user information storage unit according to the embodiment. 図６は、要約の一例を示す図である。FIG. 6 is a diagram illustrating an example of the summary. 図７は、提示システムによる提示処理手順を示すシーケンスである。FIG. 7 is a sequence showing a presentation processing procedure by the presentation system. 図８は、表示画面の一例を示す図である。FIG. 8 is a diagram illustrating an example of a display screen. 図９は、設定画面の一例を示す図である。FIG. 9 is a diagram illustrating an example of the setting screen. 図１０は、選択画面の一例を示す図である。FIG. 10 is a diagram illustrating an example of the selection screen. 図１１は、情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 11 is a hardware configuration diagram illustrating an example of a computer that implements the functions of the information processing apparatus.

以下に、本願に係る情報処理装置、情報処理方法および情報処理プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る情報処理装置、情報処理方法および情報処理プログラムが限定されるものではない。また、以下の実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。 Hereinafter, a mode for carrying out an information processing apparatus, an information processing method, and an information processing program according to the present application (hereinafter referred to as “embodiment”) will be described in detail with reference to the drawings. Note that the information processing apparatus, the information processing method, and the information processing program according to the present application are not limited by this embodiment. Moreover, in the following embodiment, the same code | symbol is attached | subjected to the same site | part and the overlapping description is abbreviate | omitted.

〔１．実施形態〕
〔１−１．実施形態に係る提示処理〕
まず、図１を用いて、実施形態に係る提示処理の一例について説明する。図１は、実施形態に係る提示システム１による提示処理の一例を示す説明図である。提示システム１では、図１に示すように、情報処理装置１００が、講演者ＳＭ（話し手の一例に相当）の講話に含まれる専門用語の要約を聴衆Ｕ１〜Ｕ２（聞き手の一例に相当）に提示する提示処理が行われる。 [1. Embodiment)
[1-1. Presentation processing according to embodiment]
First, an example of presentation processing according to the embodiment will be described with reference to FIG. Drawing 1 is an explanatory view showing an example of presentation processing by presentation system 1 concerning an embodiment. In the presentation system 1, as shown in FIG. 1, the information processing apparatus 100 gives a summary of technical terms included in a lecture of a speaker SM (corresponding to an example of a speaker) to audiences U1 to U2 (corresponding to an example of a listener). The presentation process to present is performed.

図１に示すように提示システム１には、話者端末１０と、聴衆端末２０Ａ〜２０Ｂと、情報提供装置５０と、情報処理装置１００とが含まれる。話者端末１０、聴衆端末２０Ａ〜２０Ｂ、情報提供装置５０、情報処理装置１００は、それぞれネットワークと無線または有線により通信可能に接続される。なお、以下では、聴衆端末２０Ａ〜２０Ｂの各装置を区別なく総称する場合には、「聴衆端末２０」と記載する場合がある。 As shown in FIG. 1, the presentation system 1 includes a speaker terminal 10, audience terminals 20 A to 20 B, an information providing apparatus 50, and an information processing apparatus 100. The speaker terminal 10, the audience terminals 20A to 20B, the information providing apparatus 50, and the information processing apparatus 100 are communicably connected to the network by wireless or wired communication. Hereinafter, when the devices of the audience terminals 20A to 20B are collectively referred to without distinction, they may be referred to as “audience terminal 20”.

話者端末１０は、スマートフォンや、タブレット型端末や、携帯電話機、ＰＣ（Personal Computer）や、ＰＤＡ（Personal Digital Assistant）等の情報処理装置である。具体的には、話者端末１０は、講話の話し手である講演者ＳＭに所有される。例えば、話者端末１０は、話し手から発せられる音声を録音する機能や、音声を認識する機能を有する。 The speaker terminal 10 is an information processing apparatus such as a smartphone, a tablet terminal, a mobile phone, a PC (Personal Computer), or a PDA (Personal Digital Assistant). Specifically, the speaker terminal 10 is owned by the speaker SM who is the speaker of the lecture. For example, the speaker terminal 10 has a function of recording sound emitted from a speaker and a function of recognizing sound.

聴衆端末２０は、スマートフォンや、タブレット型端末や、携帯電話機、ＰＣや、ＰＤＡ等の情報処理装置である。具体的には、聴衆端末２０は、講話の聞き手である聴衆Ｕ１〜Ｕ２に所有される。例えば、聴衆端末２０は、専門用語の要約を画面に表示する機能を有する。 The audience terminal 20 is an information processing apparatus such as a smartphone, a tablet terminal, a mobile phone, a PC, or a PDA. Specifically, the audience terminal 20 is owned by the audience U1 to U2 who are listeners of the lecture. For example, the audience terminal 20 has a function of displaying a summary of technical terms on a screen.

情報提供装置５０は、各種の情報を提供するサーバ装置である。具体的には、情報提供装置５０は、インターネット百科事典として用語の意味を提供する。例えば、情報提供装置５０は、検索クエリを受信した場合に、かかる検索クエリに対応する用語の解説情報を送信元の装置に提供する。 The information providing device 50 is a server device that provides various types of information. Specifically, the information providing apparatus 50 provides the meaning of a term as an Internet encyclopedia. For example, when the information providing apparatus 50 receives a search query, the information providing apparatus 50 provides commentary information on terms corresponding to the search query to the transmission source apparatus.

情報処理装置１００は、専門用語の要約を出力するサーバ装置である。具体的には、情報処理装置１００は、まず、音声に関する情報を受信する（ステップＳ１）。より具体的には、情報処理装置１００は、音声に関する情報として、講演者ＳＭの講話の一部分を音声認識した音声認識結果ＳＲ１「間接金融がナッシュ均衡だったからペンディングした」を、話者端末１０から受信する。 The information processing apparatus 100 is a server apparatus that outputs a summary of technical terms. Specifically, the information processing apparatus 100 first receives information related to sound (step S1). More specifically, the information processing apparatus 100 receives from the speaker terminal 10 a speech recognition result SR1 “pending because indirect finance was Nash balanced” as a speech-related information from a part of the lecture of the speaker SM. Receive.

続いて、情報処理装置１００は、受信された音声に関する情報に含まれる用語を抽出する（ステップＳ２）。具体的には、情報処理装置１００は、受信された音声認識結果ＳＲ１に含まれる用語「間接金融」、「ナッシュ均衡」、「ペンディング」を抽出する（ステップＳ２）。 Subsequently, the information processing apparatus 100 extracts terms included in the received information related to the voice (step S2). Specifically, the information processing apparatus 100 extracts the terms “indirect finance”, “Nash equilibrium”, and “pending” included in the received speech recognition result SR1 (step S2).

そして、情報処理装置１００は、抽出した用語の要約を生成する（ステップＳ３）。具体的には、情報処理装置１００は、情報提供装置５０を利用して、「間接金融」、「ナッシュ均衡」、「ペンディング」の要約をそれぞれ生成する。一例としては、情報処理装置１００は、検索クエリとして「間接金融」を情報提供装置５０に送信する。続いて、情報処理装置１００は、情報提供装置５０から検索クエリの応答として「間接金融」の解説情報を受信する。そして、情報処理装置１００は、受信した「間接金融」の解説情報を参照し、「間接金融」の要約Ａｂ１「金融の一形態で融資する側と受ける側の間に間接的に資金を貸し借りする機関が存在する仕組みのこと」を生成する。同様に、情報処理装置１００は、「ナッシュ均衡」の要約Ａｂ２「ゲーム理論における非協力ゲームの解の一種であり、いくつかの解の概念の中で最も基本的な概念である。」を生成する。また、情報処理装置１００は、「ペンディング」の要約Ａｂ３「「未定」、「保留」もしくは「先送り」といった意味の外来語であるが、業界によって微妙にニュアンスが異なる場合がある。」を生成する。 Then, the information processing apparatus 100 generates a summary of the extracted terms (step S3). Specifically, the information processing apparatus 100 uses the information providing apparatus 50 to generate summaries of “indirect finance”, “Nash equilibrium”, and “pending”, respectively. As an example, the information processing apparatus 100 transmits “indirect finance” to the information providing apparatus 50 as a search query. Subsequently, the information processing apparatus 100 receives “indirect finance” commentary information from the information providing apparatus 50 as a response to the search query. Then, the information processing apparatus 100 refers to the received commentary information on “indirect finance” and summarizes Ab1 “indirect finance”, which indirectly lends and borrows funds between the lender and the recipient. It is a mechanism that an institution exists. Similarly, the information processing apparatus 100 generates the summary Nb2 “Nash equilibrium”, “A kind of solution of the non-cooperative game in game theory, and the most basic concept among several solutions”. To do. Further, the information processing apparatus 100 is a foreign word meaning “abnormal”, “pending”, or “deferred” summary Ab3 of “pending”, but the nuance may differ slightly depending on the industry. Is generated.

続いて、情報処理装置１００は、音声に関する情報に含まれる用語として抽出された「間接金融」、「ナッシュ均衡」、「ペンディング」のうち聴衆Ｕ１に応じた専門用語の要約を、聴衆Ｕ１が有する聴衆端末２０Ａに対してリアルタイムに出力する（ステップＳ４）。ここで、聴衆Ｕ１は、「音楽」の専門家であるものとする。言い換えると、聴衆Ｕ１は、ユーザ属性として「音楽」を有するものとする。この場合、情報処理装置１００は、聴衆Ｕ１に応じた専門用語として、「音楽」以外の分野に属する用語である「間接金融」、「ナッシュ均衡」、「ペンディング」の要約Ａｂ１〜Ａｂ３を聴衆Ｕ１が有する聴衆端末２０Ａに出力する。これにより、聴衆端末２０Ａは、図１に示すように、「間接金融」、「ナッシュ均衡」、「ペンディング」の要約Ａｂ１〜Ａｂ３をリアルタイムに画面に表示する。 Subsequently, in the information processing apparatus 100, the audience U1 has a summary of technical terms corresponding to the audience U1 among “indirect finance”, “Nash equilibrium”, and “pending” extracted as terms included in the information related to speech. Output to the audience terminal 20A in real time (step S4). Here, it is assumed that the audience U1 is an expert of “music”. In other words, the audience U1 has “music” as the user attribute. In this case, the information processing apparatus 100 uses the abstracts Ab1 to Ab3 of “indirect finance”, “Nash equilibrium”, and “pending” that are terms belonging to fields other than “music” as technical terms corresponding to the audience U1. Is output to the audience terminal 20A. As a result, the audience terminal 20A displays the sums Ab1 to Ab3 of “indirect finance”, “Nash equilibrium”, and “pending” on the screen in real time as shown in FIG.

また、情報処理装置１００は、音声に関する情報に含まれる用語として抽出された「間接金融」、「ナッシュ均衡」、「ペンディング」のうち聴衆Ｕ２に応じた専門用語の要約を、聴衆Ｕ２が有する聴衆端末２０Ｂに対してリアルタイムに出力する（ステップＳ５）。ここで、聴衆Ｕ２は、「金融」および「経済」の専門家であるものとする。言い換えると、聴衆Ｕ２は、ユーザ属性として「金融」および「経済」を有するものとする。この場合、情報処理装置１００は、聴衆Ｕ２に応じた専門用語として、「金融」および「経済」以外の分野に属する用語である「ペンディング」の要約を聴衆端末２０Ｂに出力する。一方、情報処理装置１００は、「金融」および「経済」の分野に属する用語である「間接金融」、「ナッシュ均衡」の要約を聴衆端末２０Ｂに出力しない。これにより、聴衆端末２０Ｂは、図１に示すように、「間接金融」および「ナッシュ均衡」の要約Ａｂ１〜Ａｂ２を表示せず、「ペンディング」の要約Ａｂ３をリアルタイムに画面に表示する。 In addition, the information processing apparatus 100 has an audience that the audience U2 has a summary of technical terms corresponding to the audience U2 among “indirect finance”, “Nash equilibrium”, and “pending” extracted as terms included in the information related to audio. Output to the terminal 20B in real time (step S5). Here, it is assumed that the audience U2 is an expert of “finance” and “economy”. In other words, the audience U2 has “finance” and “economy” as user attributes. In this case, the information processing apparatus 100 outputs a summary of “pending”, which is a term belonging to a field other than “finance” and “economy”, as technical terms corresponding to the audience U2, to the audience terminal 20B. On the other hand, the information processing apparatus 100 does not output the summary of “indirect finance” and “Nash equilibrium”, which are terms belonging to the fields of “finance” and “economy”, to the audience terminal 20B. Thereby, as shown in FIG. 1, the audience terminal 20B does not display the sums Ab1 to Ab2 of “indirect finance” and “Nash equilibrium”, but displays the summary Ab3 of “pending” on the screen in real time.

このように、実施形態に係る情報処理装置１００は、音声に関する情報を受信する。また、情報処理装置１００は、受信された音声に関する情報に含まれる用語を抽出する。また、情報処理装置１００は、抽出された用語のうち聞き手に応じた専門用語の要約を出力する。 As described above, the information processing apparatus 100 according to the embodiment receives information related to sound. In addition, the information processing apparatus 100 extracts terms included in the received information related to speech. In addition, the information processing apparatus 100 outputs a summary of technical terms corresponding to the listener among the extracted terms.

これにより、情報処理装置１００は、話し手の音声に含まれる用語のうち聞き手に応じた専門用語を出力することができるので、聞き手にとって適切な用語の解説を表示することができる。例えば、情報処理装置１００は、聞き手が専門外とする分野に属する用語の要約を出力することができるので、ユーザが知りたい用語の要約を表示することができる。また、情報処理装置１００は、聞き手が専門とする分野に属する用語の要約を出力しないので、ユーザにとって解説が不要な用語が表示されることを防ぎ見易さを高く保つことができる。 Thus, the information processing apparatus 100 can output technical terms corresponding to the listener among the terms included in the voice of the speaker, and can display an explanation of the term appropriate for the listener. For example, the information processing apparatus 100 can output a summary of terms that belong to a field that the listener does not specialize, and therefore can display a summary of terms that the user wants to know. Further, since the information processing apparatus 100 does not output a summary of terms belonging to the field that the listener specializes in, it is possible to prevent the display of terms that do not require explanation for the user and to keep the visibility high.

なお、図１では、提示システム１に、１台の話者端末１０と、２台の聴衆端末２０Ａ〜２０Ｂと、１台の情報処理装置１００とが含まれる例を示したが、提示システム１には、複数台の話者端末１０や、２台に限らず複数台の聴衆端末２０Ａ〜２０Ｂや、複数台の情報処理装置１００が含まれてもよい。 Although FIG. 1 shows an example in which the presentation system 1 includes one speaker terminal 10, two audience terminals 20A to 20B, and one information processing apparatus 100, the presentation system 1 May include a plurality of speaker terminals 10, a plurality of audience terminals 20 A to 20 B, and a plurality of information processing apparatuses 100.

また、図１では、説明を簡単にするため講演者ＳＭの講話の一部分である「間接金融がナッシュ均衡だったからペンディングした」を例として示したが、実際には講話の一部分に限らず、講話の全部分を対象とし、講話に含まれる専門用語が出現する度にかかる専門用語の要約をリアルタイムに順次表示する。 In addition, in FIG. 1, for the sake of simplicity, an example of “pending because indirect finance was Nash equilibrium”, which is a part of the lecture of the lecturer SM, is shown as an example. A summary of technical terms is displayed sequentially in real time whenever technical terms included in the lecture appear.

〔１−２．実施形態に係る情報処理装置の構成〕
次に、図２を用いて、実施形態に係る情報処理装置１００の構成について説明する。図２は、実施形態に係る情報処理装置１００の構成例を示す図である。図２に示すように、情報処理装置１００は、通信部１１０と、記憶部１２０と、制御部１３０とを有する。なお、情報処理装置１００は、情報処理装置１００を利用する管理者等から各種操作を受け付ける入力部（例えば、キーボードやマウス等）や、各種情報を表示するための表示部（例えば、液晶ディスプレイ等）を有してもよい。 [1-2. Configuration of Information Processing Device According to Embodiment]
Next, the configuration of the information processing apparatus 100 according to the embodiment will be described with reference to FIG. FIG. 2 is a diagram illustrating a configuration example of the information processing apparatus 100 according to the embodiment. As illustrated in FIG. 2, the information processing apparatus 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The information processing apparatus 100 includes an input unit (for example, a keyboard and a mouse) that receives various operations from an administrator who uses the information processing apparatus 100, and a display unit (for example, a liquid crystal display) that displays various types of information. ).

（通信部１１０について）
通信部１１０は、ＮＩＣ（Network Interface Card）等によって実現される。具体的には、通信部１１０は、ネットワークと有線または無線で接続され、ネットワークを介して、話者端末１０や聴衆端末２０、情報提供装置５０との間で情報の送受信を行う。例えば、通信部１１０は、話者端末１０から音声に関する情報の受信を行う。他の例では、通信部１１０は、聴衆端末２０に対して専門用語の要約に関する情報の送信を行う。他の例では、通信部１１０は、情報提供装置５０との間で、用語に関する情報の送信と、用語の解説情報の受信とを行う。 (About the communication unit 110)
The communication unit 110 is realized by a NIC (Network Interface Card) or the like. Specifically, the communication unit 110 is connected to the network by wire or wirelessly, and transmits and receives information to and from the speaker terminal 10, the audience terminal 20, and the information providing device 50 via the network. For example, the communication unit 110 receives information related to voice from the speaker terminal 10. In another example, the communication unit 110 transmits information on the technical term summary to the audience terminal 20. In another example, the communication unit 110 transmits information about terms and receives commentary information on terms with the information providing apparatus 50.

（記憶部１２０について）
記憶部１２０は、例えば、ＲＡＭ（Random Access Memory)、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。記憶部１２０は、音声情報記憶部１２１と、用語情報記憶部１２２と、ユーザ情報記憶部１２３とを有する。 (About the storage unit 120)
The storage unit 120 is realized by, for example, a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk. The storage unit 120 includes a voice information storage unit 121, a term information storage unit 122, and a user information storage unit 123.

（音声情報記憶部１２１について）
音声情報記憶部１２１は、音声に関する情報を記憶する。具体的には、音声情報記憶部１２１は、話し手から発せられる音声を音声認識した音声認識結果に関する情報を記憶する。ここで、図３に、実施形態に係る音声情報記憶部１２１の一例を示す。図３に示すように、音声情報記憶部１２１は、「音声ＩＤ」および「認識結果」といった項目を有する。 (About the voice information storage unit 121)
The voice information storage unit 121 stores information related to voice. Specifically, the voice information storage unit 121 stores information related to a voice recognition result obtained by voice recognition of a voice emitted from a speaker. Here, FIG. 3 shows an example of the audio information storage unit 121 according to the embodiment. As shown in FIG. 3, the voice information storage unit 121 includes items such as “voice ID” and “recognition result”.

「音声ＩＤ」は、音声に関する情報を識別するための識別情報を示す。例えば、「音声ＩＤ」には、音声ごとに個別に割り当てられる英数字等のユニークな文字列などが記憶される。「認識結果」は、音声に対して音声認識を行なった結果を示す。例えば、「認識結果」には、音声から認識された文字列などが記憶される。 “Voice ID” indicates identification information for identifying information related to voice. For example, the “voice ID” stores a unique character string such as an alphanumeric character individually assigned for each voice. “Recognition result” indicates a result of performing speech recognition on speech. For example, the “recognition result” stores a character string recognized from speech.

すなわち、図３では、音声ＩＤ「ＳＲ１」によって識別される音声を音声認識した音声認識結果は、「間接金融がナッシュ均衡だったからペンディングした」である例を示している。また、図３では、音声ＩＤ「ＳＲ２」によって識別される音声を音声認識した音声認識結果は、「米ダウ工業株30種平均、独ＤＡＸ指数は、原油安の影響でともに１％強下がった」である例を示している。 That is, FIG. 3 shows an example in which the voice recognition result obtained by voice recognition of the voice identified by the voice ID “SR1” is “pending because indirect finance was Nash equilibrium”. In addition, in FIG. 3, the speech recognition result obtained by recognizing the speech identified by the speech ID “SR2” shows that “the US Dow Industrial Average of 30 species, and the German DAX index both fell slightly by 1% due to the impact of lower crude oil. An example is shown.

（用語情報記憶部１２２について）
用語情報記憶部１２２は、用語に関する情報を記憶する。具体的には、用語情報記憶部１２２は、音声毎に、音声に関する情報に含まれる用語に関する情報を記憶する。ここで、図４に、実施形態に係る用語情報記憶部１２２の一例を示す。図４に示すように、用語情報記憶部１２２は、「音声ＩＤ」、「用語ＩＤ」および「用語」といった項目を有する。 (Regarding the term information storage unit 122)
The term information storage unit 122 stores information on terms. Specifically, the term information storage part 122 memorize | stores the information regarding the term contained in the information regarding an audio | voice for every audio | voice. Here, FIG. 4 shows an example of the term information storage unit 122 according to the embodiment. As illustrated in FIG. 4, the term information storage unit 122 includes items such as “voice ID”, “term ID”, and “term”.

「音声ＩＤ」は、音声に関する情報を識別するための識別情報を示す。例えば、「音声ＩＤ」には、音声ごとに個別に割り当てられる英数字等のユニークな文字列などが記憶される。「用語ＩＤ」は、音声に関する情報に含まれる用語を識別するための識別情報を示す。例えば、「用語ＩＤ」には、音声に関する情報に含まれる用語ごとに個別に割り当てられるユニークな英数字等の文字列などが記憶される。「用語」は、音声に関する情報に含まれる用語を示す。例えば、「用語」には、音声に関する情報に含まれる単語のうち固有名詞の単語などが記憶される。 “Voice ID” indicates identification information for identifying information related to voice. For example, the “voice ID” stores a unique character string such as an alphanumeric character individually assigned for each voice. “Term ID” indicates identification information for identifying a term included in the information related to voice. For example, the “term ID” stores a character string such as a unique alphanumeric character that is individually assigned to each term included in the information related to speech. “Term” indicates a term included in information related to speech. For example, in the “term”, a proper noun word among words included in information related to speech is stored.

すなわち、図４では、音声ＩＤ「ＳＲ１」によって識別される音声の音声認識結果は、用語「間接金融」、「ナッシュ均衡」および「ペンディング」を含む例を示している。また、「間接金融」の用語ＩＤは、「Ｗ１」である例を示している。また、「ナッシュ均衡」の用語ＩＤは、「Ｗ２」である例を示している。また、「ペンディング」の用語ＩＤは、「Ｗ３」である例を示している。 That is, FIG. 4 shows an example in which the voice recognition result of the voice identified by the voice ID “SR1” includes the terms “indirect finance”, “Nash equilibrium”, and “pending”. Moreover, the term ID of “indirect finance” is “W1”. Moreover, the term ID of “Nash equilibrium” is “W2”. In addition, the term ID of “pending” is “W3”.

（ユーザ情報記憶部１２３について）
ユーザ情報記憶部１２３は、ユーザに関する情報を記憶する。具体的には、ユーザ情報記憶部１２３は、ユーザ毎に、ユーザの特徴を示すユーザ属性に関する情報を記憶する。ここで、図５に、実施形態に係るユーザ情報記憶部１２３の一例を示す。図５に示すように、ユーザ情報記憶部１２３は、「ユーザＩＤ」および「ユーザ属性」といった項目を有する。 (User information storage unit 123)
The user information storage unit 123 stores information about the user. Specifically, the user information storage unit 123 stores information on user attributes indicating user characteristics for each user. Here, FIG. 5 shows an example of the user information storage unit 123 according to the embodiment. As illustrated in FIG. 5, the user information storage unit 123 includes items such as “user ID” and “user attribute”.

「ユーザＩＤ」は、ユーザを識別するための識別情報を示す。例えば、「ユーザＩＤ」には、ユーザごとに個別に割り当てられるユニークな英数字等の文字列などが記憶される。「ユーザ属性」は、ユーザの特徴を表す属性を示す。例えば、「ユーザ属性」には、ユーザの登録情報や検索履歴、Ｗｅｂページの閲覧履歴、商品購入履歴、サービスの利用履歴などといった各種の情報から推定される属性が記憶される。 “User ID” indicates identification information for identifying a user. For example, the “user ID” stores a character string such as a unique alphanumeric character individually assigned to each user. “User attribute” indicates an attribute representing a feature of the user. For example, “user attributes” stores attributes estimated from various information such as user registration information, search history, Web page browsing history, product purchase history, service usage history, and the like.

すなわち、図５では、ユーザＩＤ「Ｕ１」によって識別されるユーザＵ１のユーザ属性は、「音楽」である例を示している。このため、ユーザＵ１は、他の分野と比較して、「音楽」の分野に関する知識を有すると考えられる。また、ユーザＩＤ「Ｕ２」によって識別されるユーザＵ２のユーザ属性は、「金融」および「経済」である例を示している。このため、ユーザＵ２は、他の分野と比較して、「金融」および「経済」の分野に関する知識を有すると考えられる。 That is, FIG. 5 illustrates an example in which the user attribute of the user U1 identified by the user ID “U1” is “music”. For this reason, it is considered that the user U1 has knowledge about the field of “music” as compared to other fields. Further, the user attributes of the user U2 identified by the user ID “U2” are “finance” and “economy”. For this reason, it is considered that the user U2 has knowledge regarding the fields of “finance” and “economy” as compared to other fields.

（制御部１３０について）
制御部１３０は、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等によって、情報処理装置１００内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０は、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現される。 (About the control unit 130)
The control unit 130 includes, for example, a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like that stores various programs (corresponding to an example of an information processing program) stored in a storage device inside the information processing apparatus 100 in a RAM. This is realized by being executed as a work area. The control unit 130 is realized by an integrated circuit such as an application specific integrated circuit (ASIC) or a field programmable gate array (FPGA).

制御部１３０は、図２に示すように、受信部１３１と、抽出部１３２と、生成部１３３と、出力部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０の内部構成は、図２に示した構成に限られず、後述する提示処理を行う構成であれば他の構成であってもよい。また、制御部１３０が有する各処理部の接続関係は、図２に示した接続関係に限られず、他の接続関係であってもよい。 As illustrated in FIG. 2, the control unit 130 includes a reception unit 131, an extraction unit 132, a generation unit 133, and an output unit 134, and implements or executes the information processing functions and operations described below. . Note that the internal configuration of the control unit 130 is not limited to the configuration illustrated in FIG. 2, and may be another configuration as long as it performs a presentation process described later. In addition, the connection relationship between the processing units included in the control unit 130 is not limited to the connection relationship illustrated in FIG. 2, and may be another connection relationship.

（受信部１３１について）
受信部１３１は、音声に関する情報を受信する。具体的には、受信部１３１は、音声に関する情報として、話し手が発する音声を音声認識した音声認識結果を話者端末１０から受信する。また、受信部１３１は、音声に関する情報を受信した場合に、受信した音声に関する情報を音声情報記憶部１２１に格納する。例えば、受信部１３１は、音声認識結果に個別の音声ＩＤを付与し、音声認識結果を音声ＩＤに対応付けて音声情報記憶部１２１に格納する。 (About the receiver 131)
The receiving unit 131 receives information related to audio. Specifically, the receiving unit 131 receives, from the speaker terminal 10, a speech recognition result obtained by recognizing speech generated by a speaker as information related to speech. Further, when receiving information related to sound, the receiving unit 131 stores the received information related to sound in the sound information storage unit 121. For example, the receiving unit 131 assigns an individual voice ID to the voice recognition result, and stores the voice recognition result in the voice information storage unit 121 in association with the voice ID.

また、受信部１３１は、用語の解説情報を受信する。具体的には、受信部１３１は、後述する抽出部１３２によって抽出された用語を検索クエリとして情報提供装置５０に送信した場合に、かかる検索クエリの応答として用語の解説情報を情報提供装置５０から受信する。 The receiving unit 131 receives terminology explanation information. Specifically, when the receiving unit 131 transmits a term extracted by the extracting unit 132 (to be described later) to the information providing apparatus 50 as a search query, the receiving unit 131 sends commentary information on the term from the information providing apparatus 50 as a response to the search query. Receive.

（抽出部１３２について）
抽出部１３２は、受信部１３１によって受信された音声に関する情報に含まれる用語を抽出する。具体的には、抽出部１３２は、音声に関する情報として話者端末１０から受信した音声認識結果に含まれる用語を抽出する。例えば、抽出部１３２は、音声認識結果に含まれる単語のうち名詞を抽出する。一例としては、抽出部１３２は、音声認識結果に含まれる名詞の組み合わせのローマ字読みをインターネット百科事典等に掲載された記事名のローマ字読みと照合することで、音声認識結果に含まれる用語を抽出する。これにより、抽出部１３２は、表記ゆれ（例えば、数字と漢数字）によって用語の抽出を失敗することを防ぐことができる。そして、抽出部１３２は、抽出した用語を用語情報記憶部１２２に格納する。例えば、抽出部１３２は、抽出した用語に個別の用語ＩＤを付与し、用語を音声ＩＤおよび用語ＩＤに対応付けて用語情報記憶部１２２に格納する。 (About the extraction unit 132)
The extraction unit 132 extracts terms included in the information regarding the sound received by the reception unit 131. Specifically, the extraction unit 132 extracts terms included in the speech recognition result received from the speaker terminal 10 as information related to speech. For example, the extraction unit 132 extracts nouns from words included in the speech recognition result. As an example, the extraction unit 132 extracts a term included in the speech recognition result by collating the Roman character reading of the combination of nouns included in the speech recognition result with the Roman character reading of the article name published in the Internet encyclopedia or the like. To do. Thereby, the extraction part 132 can prevent the extraction of a term from failing by notation fluctuations (for example, numbers and Chinese numerals). Then, the extraction unit 132 stores the extracted terms in the term information storage unit 122. For example, the extraction unit 132 assigns an individual term ID to the extracted term, and stores the term in the term information storage unit 122 in association with the voice ID and the term ID.

（生成部１３３について）
生成部１３３は、抽出部１３２によって抽出された用語の要約を生成する。具体的には、生成部１３３は、情報提供装置５０を利用して、音声に関する情報に含まれる用語の要約を生成する。例えば、生成部１３３は、まず、抽出部１３２によって抽出された用語を検索クエリとして情報提供装置５０に送信する。続いて、生成部１３３は、送信した検索クエリの応答として用語の解説情報（例えば、Ｗｉｋｉｐｅｄｉａ（登録商標）において検索クエリを検索した検索結果の記事）を情報提供装置５０から受信する。そして、情報処理装置１００は、受信した用語の解説情報を用いて用語の要約を生成する。 (About the generator 133)
The generation unit 133 generates a summary of terms extracted by the extraction unit 132. Specifically, the generation unit 133 uses the information providing device 50 to generate a summary of terms included in information related to speech. For example, the generation unit 133 first transmits the term extracted by the extraction unit 132 to the information providing apparatus 50 as a search query. Subsequently, the generation unit 133 receives, from the information providing apparatus 50, terminology explanation information (for example, an article of a search result obtained by searching a search query in Wikipedia (registered trademark)) as a response to the transmitted search query. Then, the information processing apparatus 100 generates a summary of terms using the received explanation information of terms.

この点について、図６を用いて詳細に説明する。図６は、用語の要約を生成する生成処理を説明するための説明図である。図６の例では、生成部１３３は、用語「ナッシュ均衡」の要約Ａｂ２を生成する。具体的には、生成部１３３は、図６に示すように、第１パラグラフＰｒ１、第２パラグラフＰｒ２および第３パラグラフＰｒ３によって形成される解説情報Ｃｍのうち第１パラグラフＰｒ１を用いて用語「ナッシュ均衡」の要約Ａｂ２を生成する。より具体的には、生成部１３３は、まず、解説情報Ｃｍから第１パラグラフＰｒ１を抽出する。続いて、生成部１３３は、抽出した解説情報Ｃｍの第１パラグラフＰｒ１から冗長な表現を削除する。例えば、生成部１３３は、解説情報Ｃｍの第１パラグラフＰｒ１の第１文目から「〜は、」に該当する部分を削除する。図６の例では、「〜」は、要約を生成する対象となる用語である「ナッシュ均衡」を意味する。また、生成部１３３は、解説情報Ｃｍの第１パラグラフＰｒ１の全体から「〜」に該当する部分を削除する。そして、生成部１３３は、第１パラグラフＰｒ１の残りの文章の全体の長さを所定の範囲内に調整する。例えば、生成部１３３は、第１パラグラフの残りの文章の長さが所定の範囲内より長い場合には、所定の範囲内に収まるように第１パラグラフの残りの文章の一部を削除する。これにより、生成部１３３は、「ナッシュ均衡」の要約Ａｂ２を生成する。一方、生成部１３３は、第１パラグラフの残りの文章の長さが所定の範囲内より短い場合には、例えば、第１パラグラフの残りの文章と、削除した第１パラグラフの文章とを組み合わせた要約を生成する。他の例では、生成部１３３は、第２パラグラフの内容を用いて第１パラグラフと同様の処理を行ない、第１パラグラフの残りの文章と第２パラグラフの残りの文章とを組み合わせた要約を生成する。そして、生成部１３３は、生成した用語の要約を用語情報記憶部１２２に格納する。例えば、生成部１３３は、生成した用語の要約を用語ＩＤに対応付けて用語情報記憶部１２２に格納する。 This point will be described in detail with reference to FIG. FIG. 6 is an explanatory diagram for explaining a generation process for generating a summary of terms. In the example of FIG. 6, the generation unit 133 generates a summary Ab2 of the term “Nash equilibrium”. Specifically, as shown in FIG. 6, the generation unit 133 uses the first paragraph Pr1 among the explanation information Cm formed by the first paragraph Pr1, the second paragraph Pr2, and the third paragraph Pr3, and uses the term “Nash”. A balance Ab2 summary Ab2 is generated. More specifically, the generation unit 133 first extracts the first paragraph Pr1 from the comment information Cm. Subsequently, the generation unit 133 deletes redundant expressions from the first paragraph Pr1 of the extracted commentary information Cm. For example, the generation unit 133 deletes a portion corresponding to “to” from the first sentence of the first paragraph Pr1 of the comment information Cm. In the example of FIG. 6, “˜” means “Nash equilibrium” which is a term for which a summary is generated. In addition, the generation unit 133 deletes a portion corresponding to “˜” from the entire first paragraph Pr1 of the comment information Cm. Then, the generation unit 133 adjusts the overall length of the remaining sentence of the first paragraph Pr1 within a predetermined range. For example, when the length of the remaining sentence of the first paragraph is longer than the predetermined range, the generation unit 133 deletes a part of the remaining sentence of the first paragraph so as to be within the predetermined range. Accordingly, the generation unit 133 generates a summary Ab2 of “Nash equilibrium”. On the other hand, when the length of the remaining sentence of the first paragraph is shorter than the predetermined range, the generating unit 133 combines, for example, the remaining sentence of the first paragraph and the deleted sentence of the first paragraph. Generate a summary. In another example, the generation unit 133 performs the same process as the first paragraph using the contents of the second paragraph, and generates a summary combining the remaining sentences of the first paragraph and the remaining sentences of the second paragraph. To do. Then, the generation unit 133 stores the generated term summary in the term information storage unit 122. For example, the generation unit 133 stores the generated term summary in the term information storage unit 122 in association with the term ID.

（出力部１３４について）
出力部１３４は、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する。具体的には、出力部１３４は、抽出部１３２によって抽出された用語が属する分野に基づいて聞き手に応じた専門用語の要約を出力する。。例えば、出力部１３４は、聞き手のユーザ属性に基づいて専門用語の要約を出力する。一例としては、出力部１３４は、ユーザ情報記憶部１２３を参照し、生成部１３３によって生成された用語の要約のうちユーザが有するユーザ属性以外の分野に属する用語の要約を、かかるユーザが有する聴衆端末２０にリアルタイムに出力する。 (About the output unit 134)
The output unit 134 outputs a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132. Specifically, the output unit 134 outputs a summary of technical terms corresponding to the listener based on the field to which the term extracted by the extraction unit 132 belongs. . For example, the output unit 134 outputs a summary of technical terms based on the user attributes of the listener. As an example, the output unit 134 refers to the user information storage unit 123, and among the summary of terms generated by the generation unit 133, the audience that the user has a summary of terms belonging to a field other than the user attribute of the user. Output to the terminal 20 in real time.

他の例では、出力部１３４は、音声に関する情報に含まれる用語のうち出現頻度に基づく専門用語度が所定の閾値以上の専門用語の要約を出力する。例えば、出力部１３４は、専門用語度として、以下の式（１）によって表されるＩＤＦ（Inverse Document Frequency）値が所定の閾値以上の用語の要約を出力する。 In another example, the output unit 134 outputs a summary of technical terms whose terminology degree based on the appearance frequency is greater than or equal to a predetermined threshold among terms included in information related to speech. For example, the output unit 134 outputs a summary of terms whose IDF (Inverse Document Frequency) value represented by the following formula (1) is a predetermined threshold or more as the technical term degree.

ここで、式（１）の「Ｎ」は、インターネット百科事典が有する全記事数を示す。また、式（１）の「ｄｆ_ｊ」は、記事名が文書に出現する頻度を示す。したがって、出力部１３４は、出現頻度が低い用語ほど専門的な用語であると判断して要約を優先して出力する。なお、ＩＤＦ値は、一例としては、Ｈａｄｏｏｐで分散処理によって計算される。 Here, “N” in the formula (1) indicates the total number of articles in the Internet encyclopedia. In addition, “df _j ” in Expression (1) indicates the frequency at which the article name appears in the document. Therefore, the output unit 134 determines that a term having a lower appearance frequency is a specialized term and outputs the summary with priority. As an example, the IDF value is calculated by distributed processing using Hadoop.

ここで、出力部１３４は、ユーザ属性に応じて調整された専門用語度の閾値に基づいて専門用語の要約を出力してもよい。例えば、出力部１３４は、専門用語度が、ユーザ属性に応じて専門用語度を算出する算出式の係数の重みを変更することで調整された閾値以上の用語の要約を出力する。 Here, the output unit 134 may output a summary of technical terms based on a threshold value of technical terms adjusted according to user attributes. For example, the output unit 134 outputs a summary of terms having a technical term degree equal to or greater than a threshold adjusted by changing the weight of a coefficient of a calculation formula for calculating the technical term degree according to the user attribute.

〔１−３．実施形態に係る提示処理手順〕
次に、図７を用いて、実施形態に係る提示システム１による処理の手順について説明する。図７は、実施形態に係る提示システム１による提示処理手順を示すシーケンスである。 [1-3. Presentation processing procedure according to embodiment]
Next, the procedure of processing by the presentation system 1 according to the embodiment will be described with reference to FIG. FIG. 7 is a sequence showing a presentation processing procedure by the presentation system 1 according to the embodiment.

図７に示すように、情報処理装置１００は、話者端末１０から音声に関する情報を受信する（ステップＳ１０１）。例えば、情報処理装置１００は、音声に関する情報として、話し手が発する音声を音声認識した音声認識結果を話者端末１０から受信する。そして、情報処理装置１００は、音声に関する情報を受信した場合に、受信した音声に関する情報を音声情報記憶部１２１に格納する。 As illustrated in FIG. 7, the information processing apparatus 100 receives information related to speech from the speaker terminal 10 (step S 101). For example, the information processing apparatus 100 receives from the speaker terminal 10 a speech recognition result obtained by recognizing speech generated by a speaker as information related to speech. When the information processing apparatus 100 receives information related to sound, the information processing apparatus 100 stores the received information related to sound in the sound information storage unit 121.

続いて、情報処理装置１００は、受信された音声に関する情報に含まれる用語を抽出する（ステップＳ１０２）。例えば、情報処理装置１００は、音声に関する情報として話者端末１０から受信した音声認識結果に含まれる用語を抽出する。そして、情報処理装置１００は、抽出した用語を用語情報記憶部１２２に格納する。 Subsequently, the information processing apparatus 100 extracts terms included in the received information related to the voice (step S102). For example, the information processing apparatus 100 extracts terms included in the speech recognition result received from the speaker terminal 10 as information related to speech. Then, the information processing apparatus 100 stores the extracted terms in the term information storage unit 122.

その後、情報処理装置１００は、抽出された用語を検索クエリとして情報提供装置５０に送信する（ステップＳ１０３）。そして、情報処理装置１００は、送信した検索クエリの応答として用語の解説情報を情報提供装置５０から受信する（ステップＳ１０４）。続いて、情報処理装置１００は、受信した用語の解説情報に基づいて用語の要約を生成する（ステップＳ１０５）。そして、情報処理装置１００は、生成した要約を用語と対応付けて用語情報記憶部１２２に格納する。 Thereafter, the information processing apparatus 100 transmits the extracted term as a search query to the information providing apparatus 50 (step S103). Then, the information processing apparatus 100 receives commentary information on terms from the information providing apparatus 50 as a response to the transmitted search query (step S104). Subsequently, the information processing apparatus 100 generates a summary of terms based on the received explanation information of terms (step S105). Then, the information processing apparatus 100 stores the generated summary in the term information storage unit 122 in association with the term.

続いて、情報処理装置１００は、抽出された用語のうち聞き手に応じた専門用語の要約を出力する（ステップＳ１０６）。例えば、情報処理装置１００は、ユーザ情報記憶部１２３を参照し、生成された用語の要約のうちユーザが有するユーザ属性以外の分野に属する用語の要約を、かかるユーザが有する聴衆端末２０に出力する。 Subsequently, the information processing apparatus 100 outputs a summary of technical terms corresponding to the listener among the extracted terms (step S106). For example, the information processing apparatus 100 refers to the user information storage unit 123 and outputs a summary of terms belonging to a field other than the user attribute possessed by the user to the audience terminal 20 possessed by the user. .

〔１−４．実施形態の効果〕
上述してきたように、実施形態に係る情報処理装置１００は、受信部１３１と、抽出部１３２と、出力部１３４とを有する。受信部１３１は、音声に関する情報を受信する。抽出部１３２は、受信部１３１によって受信された音声に関する情報に含まれる用語を抽出する。出力部１３４は、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する。 [1-4. Effects of the embodiment
As described above, the information processing apparatus 100 according to the embodiment includes the reception unit 131, the extraction unit 132, and the output unit 134. The receiving unit 131 receives information related to audio. The extraction unit 132 extracts terms included in the information regarding the sound received by the reception unit 131. The output unit 134 outputs a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132.

また、実施形態に係る情報処理装置１００において、出力部１３４は、抽出部１３２によって抽出された用語が属する分野に基づいて聞き手に応じた専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手にとって知識が浅い分野に属する用語の要約を聞き手に対して提示することができるので、聞き手の聴講をサポートすることができる。例えば、情報処理装置１００は、聞き手に応じた用語の要約を自動的に提示することができるので、聞き手が専門用語を能動的に検索する手間を削減することができる。 In the information processing apparatus 100 according to the embodiment, the output unit 134 outputs a summary of technical terms corresponding to the listener based on the field to which the term extracted by the extraction unit 132 belongs. As a result, the information processing apparatus 100 can present to the listener a summary of terms belonging to a field that is not familiar to the listener, and thus can support the listener's audition. For example, since the information processing apparatus 100 can automatically present a summary of terms according to the listener, it is possible to reduce the effort for the listener to actively search for technical terms.

また、実施形態に係る情報処理装置１００において、出力部１３４は、音声に関する情報に含まれる用語のうち出現頻度に基づく専門用語度が所定の閾値以上の専門用語の要約を出力する。これにより、情報処理装置１００は、出現頻度が低い専門的な用語の要約を出力することができるので、聞き手が知らない可能性が高い用語の要約を提示することができる。例えば、情報処理装置１００は、出願頻度が高い用語の要約は出力しないので、ユーザにとって見慣れた用語の要約が表示されることを防ぎ見易さを高く保つことができる。 Further, in the information processing apparatus 100 according to the embodiment, the output unit 134 outputs a summary of technical terms whose terminology degree based on the appearance frequency is greater than or equal to a predetermined threshold among terms included in the information related to speech. As a result, the information processing apparatus 100 can output a summary of technical terms with a low appearance frequency, and thus can present a summary of terms that are highly likely to be unknown to the listener. For example, the information processing apparatus 100 does not output a summary of terms having a high application frequency, so that it is possible to prevent a summary of terms familiar to the user from being displayed and maintain high visibility.

また、実施形態に係る情報処理装置１００において、出力部１３４は、聞き手のユーザ属性に基づいて専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手にとって知識が浅い分野に属する用語の要約を出力することができるので、聞き手が知らない可能性が高い用語の要約を提示することができる。例えば、情報処理装置１００は、聞き手の専門以外の分野に属する用語の要約を出力することができるので、聞き手の聴講を支援することができる。 In the information processing apparatus 100 according to the embodiment, the output unit 134 outputs a summary of technical terms based on the user attributes of the listener. As a result, the information processing apparatus 100 can output a summary of terms belonging to a field that is not familiar to the listener, and thus can present a summary of terms that the listener is likely not to know. For example, since the information processing apparatus 100 can output a summary of terms belonging to a field other than the listener's specialty, it can support the listener's audition.

〔２．変形例〕
上述した実施形態に係る情報処理装置１００は、上記実施形態以外にも種々の異なる形態にて実施されてよい。そこで、以下では、上記の情報処理装置１００の他の実施形態について説明する。 [2. (Modification)
The information processing apparatus 100 according to the above-described embodiment may be implemented in various different forms other than the above-described embodiment. Therefore, hereinafter, another embodiment of the information processing apparatus 100 will be described.

〔２−１．聞き手の操作に応じた要約〕
上記の実施形態では、情報処理装置１００が、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、聞き手の操作に応じた専門用語の要約を出力してもよい。 [2-1. (Summary according to the operation of the listener)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may output a summary of technical terms corresponding to the operation of the listener.

具体的には、情報処理装置１００の出力部１３４は、聞き手による専門用語の要約に関する操作に基づいて専門用語の要約を出力する。言い換えると、出力部１３４は、専門用語の要約に対する聴衆の操作をフィードバックして専門用語の要約を出力する。例えば、出力部１３４は、専門用語の要約に対する聞き手の選択操作または削除操作に基づいて専門用語の要約を出力する。 Specifically, the output unit 134 of the information processing apparatus 100 outputs a summary of technical terms based on an operation related to the summary of technical terms by a listener. In other words, the output unit 134 feeds back the audience's operation on the technical term summary and outputs the technical term summary. For example, the output unit 134 outputs a summary of technical terms based on a listener selection operation or deletion operation on the technical term summary.

この点について、図８を用いて説明する。図８は、表示画面の一例を示す図である。図８に示すように、聴衆端末２０は、用語の要約Ａｂとともに、かかる用語に対応するリンクボタンＬｉを表示する。ここで、聴衆端末２０は、聞き手によってリンクボタンＬｉが選択された場合に、用語に対応するインターネット百科事典の記事へ遷移する。そして、情報処理装置１００は、例えば、用語に対応するリンクボタンＬｉを選択する選択操作の回数が多いほど専門的な用語であるとして、今後かかる用語が出現した場合に要約Ａｂを優先して出力する。 This point will be described with reference to FIG. FIG. 8 is a diagram illustrating an example of a display screen. As shown in FIG. 8, the audience terminal 20 displays a link button Li corresponding to the term together with the term summary Ab. Here, the audience terminal 20 transitions to an article of the Internet encyclopedia corresponding to the term when the link button Li is selected by the listener. Then, for example, the information processing apparatus 100 preferentially outputs the summary Ab when the term appears in the future, assuming that the term is more specialized as the number of selection operations for selecting the link button Li corresponding to the term increases. To do.

なお、情報処理装置１００は、リンクボタンＬｉを選択する選択操作の回数に限らず、各種の選択操作の回数に基づいて専門用語の要約を出力してもよい。例えば、情報処理装置１００は、用語の要約に対応して表示される図示しない「役立つ」ボタンを選択する選択操作の回数が多いほど専門的な用語であるとして、今後かかる用語が出現した場合にかかる用語の要約を優先して出力してもよい。 The information processing apparatus 100 may output a summary of technical terms based on the number of various selection operations without being limited to the number of selection operations for selecting the link button Li. For example, the information processing apparatus 100 assumes that the more the number of selection operations for selecting a “useful” button (not shown) displayed corresponding to the summary of the term, the more specialized the term is, and the term appears in the future. A summary of such terms may be output with priority.

他の例では、聴衆端末２０は、画面に表示された要約Ａｂ上で指を左右に素早く動かすフリック操作が行われた場合に、要約Ａｂを表示画面から削除する。そして、情報処理装置１００は、例えば、用語の要約Ａｂを画面上から削除する削除操作の回数が多いほど専門的な用語ではないとして、今後かかる用語が出現した場合に、かかる用語の要約Ａｂを優先して出力しない。 In another example, the audience terminal 20 deletes the summary Ab from the display screen when a flick operation is performed in which the finger is quickly moved left and right on the summary Ab displayed on the screen. The information processing apparatus 100, for example, assumes that the term is not a technical term as the number of deletion operations for deleting the term summary Ab from the screen is large. Do not output with priority.

なお、情報処理装置１００は、要約に対するフリック操作による削除操作に限らず、各種の操作によって実行される削除操作の回数に基づいて専門用語の要約を出力してもよい。例えば、情報処理装置１００は、用語の要約に対応して表示される図示しない「削除」ボタンを選択する選択操作の回数が多いほど専門的な用語でないとして、今後かかる用語が出現した場合にかかる用語の要約を優先して出力しないようにしてもよい。 Note that the information processing apparatus 100 may output a summary of technical terms based on the number of deletion operations performed by various operations, not limited to a deletion operation by a flick operation on a summary. For example, the information processing apparatus 100 takes a case where such a term appears in the future, assuming that the term is not a technical term as the number of selection operations for selecting a “delete” button (not shown) displayed corresponding to the term summary is larger. The summary of terms may not be output with priority.

このように、変形例に係る情報処理装置１００は、聞き手による専門用語の要約に関する操作に基づいて専門用語の要約を出力する。これにより、情報処理装置１００は、要約に対する聞き手の反応に応じて用語の要約を出力することができるので、聞き手にとって適切な用語の解説を表示することができる。 As described above, the information processing apparatus 100 according to the modification outputs a summary of technical terms based on an operation related to the summary of technical terms by a listener. As a result, the information processing apparatus 100 can output a summary of terms according to the listener's reaction to the summary, and can display a description of terms appropriate for the listener.

また、変形例に係る情報処理装置１００は、専門用語の要約に対する聞き手の選択操作または削除操作に基づいて専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手が深く調べる傾向にある用語の要約を出力することができるので、聞き手が知りたい可能性が高い用語の解説を表示することができる。また、情報処理装置１００は、聞き手が削除する傾向にある用語の要約を出力することができるので、聞き手にとって解説を表示しなくてもよい常識的な用語の解説が表示されてしまい見易さが損なわれることを防ぐことができる。 In addition, the information processing apparatus 100 according to the modification outputs a summary of technical terms based on a listener's selection operation or deletion operation on the technical term summary. As a result, the information processing apparatus 100 can output a summary of terms that the listener tends to investigate deeply, and can display explanations of terms that the listener is likely to know. Further, since the information processing apparatus 100 can output a summary of terms that the listener tends to delete, explanations of common-sense terms that do not need to be displayed for the listener are displayed and are easy to see. Can be prevented from being damaged.

〔２−２．聞き手の操作状況を話し手に出力〕
上記の変形例では、情報処理装置１００が、要約に対する聞き手の反応に応じて用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、要約に対する聞き手の操作状況を話し手に出力してもよい。 [2-2. (The listener's operation status is output to the speaker.)
In the above modification, the information processing apparatus 100 has been described by taking an example in which a summary of terms is output according to a listener's reaction to the summary. Here, the information processing apparatus 100 may output the operation state of the listener for the summary to the speaker.

具体的には、情報処理装置１００の出力部１３４は、聞き手による専門用語の要約に関する操作状況を音声に関する情報の話し手に出力する。例えば、情報処理装置１００は、要約に関する操作状況として、用語の要約に対して選択操作がなされた選択回数をかかる用語と対応付けて話者端末１０にリアルタイムに出力する。他の例では、情報処理装置１００は、要約に関する操作状況として、用語の要約に対して削除操作がなされた削除回数をかかる用語と対応付けて話者端末１０にリアルタイムに出力する。 Specifically, the output unit 134 of the information processing apparatus 100 outputs the operation status related to the summary of the technical terms by the listener to the speaker of the information related to speech. For example, the information processing apparatus 100 outputs, in real time, the number of times the selection operation has been performed on the summary of terms as the operation status related to the summary in association with the term. In another example, the information processing apparatus 100 outputs, to the speaker terminal 10 in real time, the number of deletions performed on the summary of terms as the operation status related to the summary in association with the term.

このように、変形例に係る情報処理装置１００は、聞き手による専門用語の要約に関する操作状況を音声に関する情報の話し手に出力する。これにより、情報処理装置１００は、要約に対して聞き手が行った操作について話し手に通知することができるので、話し手の講演の質を向上させることができる。例えば、情報処理装置１００は、用語の要約に対して選択操作がなされた回数を話し手に通知することができるので、聞き手が理解していない用語を話し手に把握させることができる。また、情報処理装置１００は、用語の要約に対して削除操作がなされた回数を話し手に通知することができるので、聞き手が理解している用語を話し手に把握させることができる。このため、情報処理装置１００は、用語の要約に対する操作によって効果を測定することができるので、話し手が聞き手の理解度を把握するのに役立つ情報を提供することができる。 As described above, the information processing apparatus 100 according to the modified example outputs the operation status related to the summary of the technical terms by the listener to the speaker of the information related to the voice. As a result, the information processing apparatus 100 can notify the speaker of operations performed by the listener on the summary, thereby improving the quality of the speaker's lecture. For example, the information processing apparatus 100 can notify the speaker of the number of times that the selection operation has been performed on the term summary, and thus allows the speaker to grasp a term that the listener does not understand. Further, since the information processing apparatus 100 can notify the speaker of the number of times the deletion operation has been performed on the term summary, the speaker can grasp the term understood by the listener. For this reason, since the information processing apparatus 100 can measure the effect by an operation on the summary of terms, it is possible to provide information useful for the speaker to grasp the understanding level of the listener.

〔２−３．聞き手に応じたタイミングで出力〕
上記の実施形態では、情報処理装置１００が、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、聞き手に応じたタイミングで要約を出力してもよい。 [2-3. (Output at the timing according to the listener)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may output the summary at a timing according to the listener.

具体的には、情報処理装置１００の出力部１３４は、聞き手の音声に関する情報における分野の知識が浅いほど高い頻度で専門用語の要約を出力する。例えば、出力部１３４は、聞き手が専門用語の要約を参照した回数が多いほど高い頻度で専門用語の要約を出力する。一例としては、情報処理装置１００は、聞き手が用語の要約を選択して用語に対応する記事へ遷移した回数が多いほど高い頻度で用語の要約を聴衆端末２０に出力する。言い換えると、情報処理装置１００は、聞き手の選択回数が所定の回数より多く知識レベルが初級である場合には、専門用語が出現する度に用語の要約を出力する。一方、情報処理装置１００は、聞き手の選択回数が所定の回数より少なく知識レベルが中級以上である場合には、話の最後にまとめて用語の要約を出力する。 Specifically, the output unit 134 of the information processing apparatus 100 outputs a summary of technical terms with a higher frequency as the knowledge of the field in the information regarding the listener's voice is shallower. For example, the output unit 134 outputs the technical term summary with a higher frequency as the number of times the listener refers to the technical term summary increases. As an example, the information processing apparatus 100 outputs the term summary to the audience terminal 20 as the number of times that the listener selects the term summary and changes to the article corresponding to the term increases. In other words, the information processing apparatus 100 outputs a summary of terms each time a technical term appears when the number of listeners selected is greater than a predetermined number and the knowledge level is basic. On the other hand, the information processing apparatus 100 outputs a summary of terms collectively at the end of a story when the number of selections by the listener is less than a predetermined number and the knowledge level is intermediate or higher.

このように、変形例に係る情報処理装置１００は、聞き手の音声に関する情報における分野の知識が浅いほど高い頻度で専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手の知識に応じて要約を出力することができるので、聞き手に合ったタイミングで要約を表示させることができる。例えば、情報処理装置１００は、聞き手が初級者である場合には、専門用語が出現するとすぐに要約を提示することができる。一方、情報処理装置１００は、聞き手が中級者以上である場合には、用語の要約が頻繁に出現する煩わしさを防ぐことができる。 As described above, the information processing apparatus 100 according to the modified example outputs a summary of technical terms with higher frequency as the knowledge of the field in the information related to the voice of the listener is shallower. As a result, the information processing apparatus 100 can output a summary according to the knowledge of the listener, so that the summary can be displayed at a timing suitable for the listener. For example, when the listener is a beginner, the information processing apparatus 100 can present a summary as soon as a technical term appears. On the other hand, the information processing apparatus 100 can prevent the annoyance that term summaries frequently appear when the listener is intermediate or higher.

また、変形例に係る情報処理装置１００は、聞き手が専門用語の要約を参照した回数が多いほど高い頻度で専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手のレベルを高い精度で推定することができるので、聞き手に合ったタイミングで要約を提示することができる。 Further, the information processing apparatus 100 according to the modified example outputs the technical term summary more frequently as the number of times the listener refers to the technical term summary increases. Thereby, since the information processing apparatus 100 can estimate the level of the listener with high accuracy, the summary can be presented at a timing suitable for the listener.

〔２−４．用語ランキング〕
上記の実施形態では、情報処理装置１００が、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、各種の形態で用語に関する情報を出力してもよい。 [2-4. (Term ranking)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may output information on terms in various forms.

具体的には、情報処理装置１００の出力部１３４は、専門用語の要約に対する聞き手の選択回数が多い順または削除回数が少ない順に専門用語を並べた用語ランキングを出力する。例えば、情報処理装置１００は、要約に対する選択回数を用語名と対応付けて昇順に並べた表を聴衆端末２０に対して出力する。他の例では、情報処理装置１００は、要約に対する削除回数を用語名と対応付けて降順に並べた表を聴衆端末２０に対して出力する。 Specifically, the output unit 134 of the information processing apparatus 100 outputs a term ranking in which technical terms are arranged in descending order of the number of listeners selected with respect to the summary of technical terms or in the order of few deletions. For example, the information processing apparatus 100 outputs to the audience terminal 20 a table in which the number of selections for the summary is associated with the term name and arranged in ascending order. In another example, the information processing apparatus 100 outputs to the audience terminal 20 a table in which the number of deletions for the summary is associated with the term name and arranged in descending order.

このように、変形例に係る情報処理装置１００は、専門用語の要約に対する聞き手の選択回数が多い順または削除回数が少ない順に専門用語を並べた用語ランキングを出力する。これにより、情報処理装置１００は、他の用語と比較して聞き手が知らない用語を容易に把握可能な情報を提供することができるので、聞き手の利便性を向上させることができる。 As described above, the information processing apparatus 100 according to the modified example outputs a term ranking in which technical terms are arranged in descending order of the number of listeners selected with respect to the summary of technical terms or in the order of few deletions. As a result, the information processing apparatus 100 can provide information that allows a user to easily grasp a term that the listener does not know compared to other terms, thereby improving the convenience of the listener.

〔２−５．グルーピング〕
上記の実施形態では、情報処理装置１００が、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、各種の形態で専門用語に関する情報を出力してもよい。 [2-5. grouping〕
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may output information on technical terms in various forms.

具体的には、情報処理装置１００の出力部１３４は、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語をかかる専門用語が属するグループ別に出力する。例えば、情報処理装置１００は、専門用語を分野別に分類した表を聴衆端末２０に対して出力する。他の例では、情報処理装置１００は、講話において出現した時間帯ごとに専門用語をまとめて分類した表を聴衆端末２０に対して出力する。一例としては、情報処理装置１００は、講話の質疑応答時間に出現した専門用語をまとめた表を聴衆端末２０に対して出力する。 Specifically, the output unit 134 of the information processing apparatus 100 outputs technical terms corresponding to the listener among the terms extracted by the extracting unit 132 for each group to which the technical terms belong. For example, the information processing apparatus 100 outputs a table in which technical terms are classified by field to the audience terminal 20. In another example, the information processing apparatus 100 outputs, to the audience terminal 20, a table in which technical terms are grouped and classified for each time zone that appears in the lecture. As an example, the information processing apparatus 100 outputs to the audience terminal 20 a table in which technical terms appearing at the question and answer time of the lecture are collected.

このように、変形例に係る情報処理装置１００は、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語をかかる専門用語が属するグループ別に出力する。これにより、情報処理装置１００は、専門用語の傾向を把握させることができるので、話し手や聞き手に役立つ情報を提供することができる。 As described above, the information processing apparatus 100 according to the modification outputs the technical terms corresponding to the listener among the terms extracted by the extraction unit 132 for each group to which the technical terms belong. Thereby, since the information processing apparatus 100 can make it understand the tendency of a technical term, it can provide useful information for a speaker or a listener.

〔２−６．要約の量を補正〕
上記の実施形態では、情報処理装置１００が、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、各種の情報に基づいて要約の量を補正してもよい。 [2-6. Correct the amount of summarization)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may correct the amount of summary based on various types of information.

具体的には、情報処理装置１００の出力部１３４は、聞き手による専門用語の要約に関する操作に基づいて専門用語の要約の量を補正して出力する。言い換えると、情報処理装置１００は、要約に対する聞き手の操作をフィードバックして要約の文字数を補正する。例えば、情報処理装置１００は、用語の記事へ遷移するリンクボタンを選択する選択操作の回数が多いほど専門的な用語であるとして要約の文字数を増やして聴衆端末２０に出力する。一方、情報処理装置１００は、用語の要約を削除する削除操作の回数が多いほど専門的な用語でないとして要約の文字数を減らして聴衆端末２０に出力する。 Specifically, the output unit 134 of the information processing apparatus 100 corrects and outputs the technical term summary amount based on an operation related to the technical term summary by the listener. In other words, the information processing apparatus 100 corrects the number of characters in the summary by feeding back the listener's operation on the summary. For example, the information processing apparatus 100 increases the number of characters in the summary and outputs the result to the audience terminal 20 as the technical term is more specialized as the number of selection operations for selecting a link button that transitions to a term article increases. On the other hand, the information processing apparatus 100 reduces the number of characters in the summary and outputs the result to the audience terminal 20 as the number of deletion operations for deleting the summary of the term increases as the term is not a specialized term.

このように、変形例に係る情報処理装置１００は、聞き手による専門用語の要約に関する操作に基づいて専門用語の要約の量を補正して出力する。これにより、情報処理装置１００は、要約に対する操作結果を反映した要約を出力することができるので、聞き手にとって質の高い要約を提供することができる。 As described above, the information processing apparatus 100 according to the modified example corrects and outputs the technical term summary amount based on the operation related to the technical term summary by the listener. Thus, the information processing apparatus 100 can output a summary reflecting the operation result for the summary, and thus can provide a high-quality summary for the listener.

〔２−７．専門用語度の設定〕
上記の実施形態では、情報処理装置１００が、音声に関する情報に含まれる用語のうち出現頻度に基づく専門用語度が所定の閾値以上の専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、聞き手によって設定された専門用語度の閾値に基づいて専門用語の要約を出力してもよい。 [2-7. (Defining technical terms)
In the above-described embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms whose terminology degree based on the appearance frequency is greater than or equal to a predetermined threshold among terms included in the information related to speech is output. Here, the information processing apparatus 100 may output a summary of technical terms based on a technical term degree threshold set by the listener.

この点について図９を用いて説明する。図９は、設定画面の一例を示す図である。例えば、聴衆端末２０は、聞き手から設定画面へ遷移する操作を受け付けた場合に、図９に示すように、設定バーＢｒ上でつまみＢｕを左右に動かすことで専門用語度の閾値が設定される設定画面を表示する。一例としては、専門用語度の閾値は、設定バーＢｒ上のうちつまみＢｕが左に位置するほど低い値が設定される。一方、専門用語度の閾値は、設定バーＢｒ上のうちつまみＢｕが右に位置するほど高い値が設定される。 This point will be described with reference to FIG. FIG. 9 is a diagram illustrating an example of the setting screen. For example, when the audience terminal 20 receives an operation for transition from the listener to the setting screen, as shown in FIG. 9, the threshold value of the technical term is set by moving the knob Bu left and right on the setting bar Br. Display the setting screen. As an example, the threshold value of the technical term level is set to a lower value as the knob Bu is located on the left of the setting bar Br. On the other hand, the threshold value of the technical term is set higher as the knob Bu is positioned on the right of the setting bar Br.

そして、情報処理装置１００は、設定バーＢｒ上のつまみＢｕの位置によって設定される専門用語度の閾値に基づいて専門用語の要約を聴衆端末２０に出力する。言い換えると、情報処理装置１００は、聞き手によって設定された専門用語度の閾値に基づいて出力する専門用語の要約を調整する。具体的には、情報処理装置１００は、設定バーＢｒ上のうちつまみＢｕが左に位置するほど専門用語度の閾値が低く設定されているので多くの専門用語の要約を聴衆装置２０に出力する。一方、情報処理装置１００は、設定バーＢｒ上のうちつまみＢｕが右に位置するほど専門用語度の閾値が高く設定されているので少なく専門用語の要約を聴衆装置２０に出力する。 Then, the information processing apparatus 100 outputs a summary of technical terms to the audience terminal 20 based on a threshold value of technical terms set by the position of the knob Bu on the setting bar Br. In other words, the information processing apparatus 100 adjusts the summary of technical terms to be output based on the technical term degree threshold set by the listener. Specifically, the information processing device 100 outputs a summary of many technical terms to the audience device 20 because the threshold of the technical term level is set lower as the knob Bu is located on the left of the setting bar Br. . On the other hand, the information processing apparatus 100 outputs a summary of technical terms to the audience device 20 because the threshold value of the technical term level is set higher as the knob Bu is positioned to the right of the setting bar Br.

このように、変形例に係る情報処理装置１００は、聞き手によって設定された専門用語度の閾値に基づいて専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手が所望するレベル以上の専門的な用語の要約を出力することができるので、聞き手の満足度を高めることができる。 As described above, the information processing apparatus 100 according to the modification outputs a summary of technical terms based on the technical term degree threshold set by the listener. As a result, the information processing apparatus 100 can output a summary of technical terms that are higher than the level desired by the listener, so that satisfaction of the listener can be increased.

〔２−８．講演を選択〕
上記の実施形態では、情報処理装置１００が、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、聞き手によって選択された講演の講話に含まれる専門用語の要約を出力してもよい。 [2-8. (Select a lecture)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may output a summary of technical terms included in the lecture lecture selected by the listener.

この点について図１０を用いて説明する。図１０は、選択画面の一例を示す図である。例えば、聴衆端末２０は、図１０に示すように、専門用語の要約を出力可能な講演Ｒｍ１「カルテット第１講演会」、講演Ｒｍ２「カルテット第２セミナー」、講演Ｒｍ３「カルテット第３会議」を掲載したセミナー一覧を画面に表示する。ここで、聴衆端末２０は、講演Ｒｍ１〜Ｒｍ３の中から聞き手によって選択された講演を受け付ける。そして、情報処理装置１００は、聞き手によって選択された講演の講話に含まれる用語のうち聞き手に応じた専門用語の要約を聴衆端末２０に出力する。 This point will be described with reference to FIG. FIG. 10 is a diagram illustrating an example of the selection screen. For example, as shown in FIG. 10, the audience terminal 20 provides a lecture Rm1 “quartet first lecture”, a lecture Rm2 “quartet second seminar”, and a lecture Rm3 “quartet third meeting” that can output a summary of technical terms. Display the list of posted seminars on the screen. Here, the audience terminal 20 accepts the lecture selected by the listener from the lectures Rm1 to Rm3. Then, the information processing apparatus 100 outputs to the audience terminal 20 a summary of technical terms corresponding to the listener among the terms included in the lecture lecture selected by the listener.

このように、変形例に係る情報処理装置１００は、聞き手によって選択された講演の講話に含まれる専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手が所望する講演における専門用語の要約を出力することができるので、複数の講演が同時に行われている場合でも聞き手が所望する専門用語の要約を提供することができる。 In this way, the information processing apparatus 100 according to the modification outputs a summary of technical terms included in the lecture lecture selected by the listener. Accordingly, the information processing apparatus 100 can output a summary of technical terms in a lecture desired by the listener, and therefore provides a summary of technical terms desired by the listener even when a plurality of lectures are performed simultaneously. Can do.

〔２−９．パーソナライズ〕
上記の実施形態では、情報処理装置１００が、聞き手のユーザ属性に基づいて専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、各種の情報に基づいて聞き手に応じた専門用語の要約を出力してもよい。 [2-9. (Personalize)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms is output based on the user attribute of the listener. Here, the information processing apparatus 100 may output a summary of technical terms corresponding to the listener based on various types of information.

具体的には、情報処理装置１００は、他の聞き手との間の類似性に基づいて専門用語の要約を出力する。例えば、情報処理装置１００は、他のサービスなどにおける利用履歴が類似する他の聞き手が要約に対して行った選択操作に基づいて、専門用語の要約を出力する。一例としては、情報処理装置１００は、類似する他の聞き手によって要約に対する選択操作が多く行われた用語の要約ほど優先して出力する。 Specifically, the information processing apparatus 100 outputs a summary of technical terms based on the similarity with other listeners. For example, the information processing apparatus 100 outputs a summary of technical terms based on a selection operation performed on the summary by another listener who has a similar usage history in another service or the like. As an example, the information processing apparatus 100 preferentially outputs a summary of a term that has been subjected to many selection operations on the summary by other similar listeners.

他の例では、情報処理装置１００は、聞き手に対して過去に出力した専門用語の要約の履歴に基づいて、専門用語の要約を出力する。例えば、情報処理装置１００は、過去に１度要約を聴衆端末２０に出力したことがある用語が出現した場合には、かかる用語の要約を同一の聴衆端末２０に対して出力しない。言い換えると、情報処理装置１００は、同一の聞き手に対して同一の用語の要約を出力しない。 In another example, the information processing apparatus 100 outputs a summary of technical terms based on a history of technical term summaries output to the listener in the past. For example, when a term appears that has been output to the audience terminal 20 once in the past, the information processing apparatus 100 does not output the summary of the term to the same audience terminal 20. In other words, the information processing apparatus 100 does not output the same term summary for the same listener.

このように、変形例に係る情報処理装置１００は、各種の情報に基づいて聞き手に応じた専門用語の要約を出力する。これにより、情報処理装置１００は、聞き手に特化した用語の要約を提供することができるので、聞き手における利便性を高めることができる。 Thus, the information processing apparatus 100 according to the modification outputs a summary of technical terms corresponding to the listener based on various types of information. As a result, the information processing apparatus 100 can provide a summary of terms specific to the listener, thereby improving convenience for the listener.

〔２−１０．適用対象〕
上記の実施形態では、情報処理装置１００が、講演者の講話に含まれる専門用語の要約を聴衆に対して出力する例を挙げて説明した。ここで、情報処理装置１００は、講演に限らず、各種の発話を適用対象にしてもよい。具体的には、情報処理装置１００は、知識レベルの異なる話し手と聞き手の会話に含まれる専門用語の要約を出力する。例えば、情報処理装置１００は、先生が生徒に対して行う授業に含まれる専門用語の要約を出力する。他の例では、情報処理装置１００は、医者が患者に対して行う診察に含まれる専門用語の要約を出力する。 [2-10. (Applicable)
In the above embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms included in the lecturer's lecture is output to the audience. Here, the information processing apparatus 100 may apply not only a lecture but also various utterances. Specifically, the information processing apparatus 100 outputs a summary of technical terms included in a conversation between a speaker and a listener having different knowledge levels. For example, the information processing apparatus 100 outputs a summary of technical terms included in a class conducted by a teacher for a student. In another example, the information processing apparatus 100 outputs a summary of technical terms included in a diagnosis performed by a doctor on a patient.

また、情報処理装置１００は、語学学習を用途として適用してもよい。例えば、情報処理装置１００は、まず、英語によってなされる発話を音声認識した音声認識結果を話者端末１０から受信する。続いて、情報処理装置１００は、受信した音声認識結果を英語から日本語に翻訳する。そして、情報処理装置１００は、日本語に翻訳した音声認識結果を日本人の聴衆が有する聴衆端末２０に送信する。 The information processing apparatus 100 may apply language learning as an application. For example, the information processing apparatus 100 first receives a speech recognition result obtained by recognizing speech made in English from the speaker terminal 10. Subsequently, the information processing apparatus 100 translates the received speech recognition result from English to Japanese. Then, the information processing apparatus 100 transmits the speech recognition result translated into Japanese to the audience terminal 20 of the Japanese audience.

また、上記の実施形態では、情報処理装置１００は、スマートフォンに対して専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、スマートフォンに限らず、タブレット端末やＰＣなど各種の端末装置に対して専門用語の要約を出力してもよい。 Further, in the above-described embodiment, the information processing apparatus 100 has been described with an example in which a summary of technical terms is output to a smartphone. Here, the information processing apparatus 100 may output a summary of technical terms to various terminal devices such as a tablet terminal and a PC, not limited to a smartphone.

このように、変形例に係る情報処理装置１００は、各種の発話を適用対象にする。これにより、情報処理装置１００は、講演に限らず各種の発話に含まれる専門用語の要約や日本語訳を聞き手に提供することができる。 As described above, the information processing apparatus 100 according to the modified example targets various utterances. Thereby, the information processing apparatus 100 can provide the listener with a summary of technical terms and Japanese translations included in various utterances as well as lectures.

〔２−１１．話し手と聞き手との間の知識差に基づいて専門用語の要約を出力〕
上記の実施形態では、抽出部１３２によって抽出された用語のうち聞き手に応じた専門用語の要約を出力する例を挙げて説明した。ここで、情報処理装置１００は、話し手と聞き手との間の知識差を判断基準として採用してもよい。 [2-11. A summary of terminology is output based on the knowledge difference between the speaker and the listener)
In the above embodiment, an example has been described in which a summary of technical terms corresponding to the listener among the terms extracted by the extraction unit 132 is output. Here, the information processing apparatus 100 may employ a knowledge difference between the speaker and the listener as a determination criterion.

具体的には、情報処理装置１００の出力部１３４は、音声に関する情報の話し手と聞き手との間の知識差に基づいて専門用語の要約を出力する。例えば、情報処理装置１００は、話し手が話す講話のテーマや分野などにおける聞き手と話し手の知識レベルをそれぞれ設定する。一態様としては、知識レベルは、話し手や聞き手から受け付けたレベルが設定されてもよいし、行動履歴やプロフィールに基づいて設定されてもよい。そして、情報処理装置１００は、聞き手と話し手の知識レベルの差分に基づいて専門用語の要約を出力する。一例としては、情報処理装置１００は、知識レベルの差分が高いほど専門用語度の所定の閾値を低く設定することで専門用語の要約を相対的に多く出力する。 Specifically, the output unit 134 of the information processing apparatus 100 outputs a summary of technical terms based on a knowledge difference between a speaker and a listener of information related to speech. For example, the information processing apparatus 100 sets the knowledge level of the listener and the speaker in the lecture theme and field spoken by the speaker. As one aspect, the level accepted from a speaker or listener may be set as the knowledge level, or may be set based on an action history or a profile. The information processing apparatus 100 then outputs a summary of technical terms based on the difference between the knowledge levels of the listener and the speaker. As an example, the information processing apparatus 100 outputs a relatively large number of technical term summaries by setting a predetermined threshold value of the technical term level lower as the knowledge level difference is higher.

他の例では、情報処理装置１００は、話し手のユーザ属性と聞き手のユーザ属性とに基づいて専門用語の要約を出力する。一例としては、情報処理装置１００は、話し手のユーザ属性と聞き手のユーザ属性とが異なる場合に、専門用語度の所定の閾値を調整して専門用語の要約を出力する。一態様としては、情報処理装置１００は、話し手のユーザ属性と聞き手のユーザ属性との間の類似度が低いほど専門用語度の所定の閾値を低く設定することで専門用語の要約を相対的に多く出力する。 In another example, the information processing apparatus 100 outputs a summary of technical terms based on the user attributes of the speaker and the user attributes of the listener. As an example, when the user attribute of the speaker is different from the user attribute of the listener, the information processing apparatus 100 adjusts a predetermined threshold of technical terms and outputs a technical term summary. As one aspect, the information processing apparatus 100 sets the predetermined terminology degree threshold value lower as the similarity between the user attribute of the speaker and the user attribute of the listener is lower. Output a lot.

このように、変形例に係る情報処理装置１００は、音声に関する情報の話し手と聞き手との間の知識差に基づいて専門用語の要約を出力する。これにより、情報処理装置１００は、話し手と聞き手の知識差を考慮して専門用語の要約を出力することができるので、聞き手の聴講を支援することができる。例えば、情報処理装置１００は、話し手の専門分野と聞き手の専門分野が異なるほど多くの要約を出力することができるので、聞き手が用語の理解不足で話についていけなくなることを防ぐことができる。 As described above, the information processing apparatus 100 according to the modification outputs a summary of technical terms based on a knowledge difference between a speaker and a listener of information related to speech. Thus, the information processing apparatus 100 can output a summary of technical terms in consideration of the knowledge difference between the speaker and the listener, and thus can support the listener's listening. For example, the information processing apparatus 100 can output more summaries as the subject area of the speaker is different from the subject area of the listener, so that it is possible to prevent the listener from being unable to follow the story due to insufficient understanding of terms.

〔３．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。 [3. Others]
In addition, among the processes described in the above embodiment, all or part of the processes described as being automatically performed can be performed manually, or the processes described as being performed manually can be performed. All or a part can be automatically performed by a known method. In addition, the processing procedures, specific names, and information including various data and parameters shown in the document and drawings can be arbitrarily changed unless otherwise specified.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Further, each component of each illustrated apparatus is functionally conceptual, and does not necessarily need to be physically configured as illustrated. In other words, the specific form of distribution / integration of each device is not limited to that shown in the figure, and all or a part thereof may be functionally or physically distributed or arbitrarily distributed in arbitrary units according to various loads or usage conditions. Can be integrated and configured.

例えば、図２に示した音声情報記憶部１２１や用語情報記憶部１２２、ユーザ情報記憶部１２３は、情報処理装置１００が保持せずに、ストレージサーバ等に保持されてもよい。この場合、情報処理装置１００は、ストレージサーバにアクセスすることで、音声に関する情報や、用語に関する情報、ユーザに関する情報を取得する。 For example, the audio information storage unit 121, the term information storage unit 122, and the user information storage unit 123 illustrated in FIG. 2 may be stored in a storage server or the like without being stored in the information processing apparatus 100. In this case, the information processing apparatus 100 acquires information related to voice, information related to terms, and information related to users by accessing the storage server.

また、情報処理装置１００は、出力処理は行わず、抽出処理や生成処理のみを行う情報処理装置であってもよい。この場合、情報処理装置は、出力部１３４を有しない。そして、出力部１３４を有する出力装置が、情報処理装置１００によって生成された専門用語の要約を聴衆端末２０に対して出力する。 Further, the information processing apparatus 100 may be an information processing apparatus that performs only an extraction process or a generation process without performing an output process. In this case, the information processing apparatus does not have the output unit 134. Then, the output device having the output unit 134 outputs the technical term summary generated by the information processing device 100 to the audience terminal 20.

また、上記の実施形態では、サーバ装置である情報処理装置１００が用語の要約を生成して聴衆端末２０に出力する例を挙げたが、話者端末１０が用語の要約を生成して出力してもよい。この場合、例えば、話者端末１０は、受信部１３１、抽出部１３２、生成部１３３および出力部１３４に相当する機能を有する。そして、話者端末１０は、まず、音声に関する情報を受信する。続いて、話者端末１０は、受信された音声に関する情報に含まれる用語を抽出する。そして、話者端末１０は、抽出された用語のうち聞き手に応じた専門用語の要約を聴衆端末２０に出力する。 In the above embodiment, the information processing apparatus 100 that is a server device generates a summary of terms and outputs the summary to the audience terminal 20. However, the speaker terminal 10 generates and outputs a summary of terms. May be. In this case, for example, the speaker terminal 10 has functions corresponding to the reception unit 131, the extraction unit 132, the generation unit 133, and the output unit 134. And the speaker terminal 10 receives the information regarding an audio | voice first. Subsequently, the speaker terminal 10 extracts terms included in the received information related to speech. Then, the speaker terminal 10 outputs a summary of technical terms corresponding to the listener among the extracted terms to the audience terminal 20.

また、上記の実施形態では、聴衆端末２０が用語の要約を生成して出力してもよい。この場合、例えば、聴衆端末２０は、受信部１３１、抽出部１３２、生成部１３３および出力部１３４に相当する機能を有する。そして、聴衆端末２０は、まず、音声に関する情報を受信する。続いて、聴衆端末２０は、受信された音声に関する情報に含まれる用語を抽出する。そして、聴衆端末２０は、抽出された用語のうち聞き手に応じた専門用語の要約を画面に表示する。 In the above embodiment, the audience terminal 20 may generate and output a summary of terms. In this case, for example, the audience terminal 20 has functions corresponding to the reception unit 131, the extraction unit 132, the generation unit 133, and the output unit 134. And the audience terminal 20 receives the information regarding an audio | voice first. Subsequently, the audience terminal 20 extracts terms included in the received information related to voice. Then, the audience terminal 20 displays a summary of technical terms corresponding to the listener among the extracted terms on the screen.

また、上記の実施形態では、話者端末１０が講演者の講話の音声認識を行なう例を示したが、話者端末１０に限らずサーバ（例えば、情報処理装置１００）が音声認識を行なってもよい。この場合、例えば、情報処理装置１００は、講演者の講話を録音した音声データ等を話者端末１０から取得する。続いて、情報処理装置１００は、取得した音声データの音声認識を実行する。その後、情報処理装置１００は、音声認識結果に含まれる用語を抽出する。そして、情報処理装置１００は、抽出した用語のうち聞き手に応じた専門用語の要約を出力する。 In the above embodiment, the speaker terminal 10 performs speech recognition of the lecturer's lecture, but the server (for example, the information processing apparatus 100) performs speech recognition without being limited to the speaker terminal 10. Also good. In this case, for example, the information processing apparatus 100 acquires voice data or the like recorded from the lecturer's lecture from the speaker terminal 10. Subsequently, the information processing apparatus 100 performs voice recognition of the acquired voice data. Thereafter, the information processing apparatus 100 extracts terms included in the speech recognition result. Then, the information processing apparatus 100 outputs a summary of technical terms corresponding to the listener among the extracted terms.

また、上述してきた実施形態に係る情報処理装置１００は、例えば図１１に示すような構成のコンピュータ１０００によって実現される。以下、情報処理装置１００を例に挙げて説明する。図１１は、情報処理装置１００の機能を実現するコンピュータ１０００の一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ１１００、ＲＡＭ１２００、ＲＯＭ１３００、ＨＤＤ１４００、通信インターフェイス（Ｉ／Ｆ）１５００、入出力インターフェイス（Ｉ／Ｆ）１６００、およびメディアインターフェイス（Ｉ／Ｆ）１７００を有する。 Further, the information processing apparatus 100 according to the embodiment described above is realized by a computer 1000 having a configuration as shown in FIG. 11, for example. Hereinafter, the information processing apparatus 100 will be described as an example. FIG. 11 is a hardware configuration diagram illustrating an example of a computer 1000 that implements the functions of the information processing apparatus 100. The computer 1000 includes a CPU 1100, RAM 1200, ROM 1300, HDD 1400, communication interface (I / F) 1500, input / output interface (I / F) 1600, and media interface (I / F) 1700.

ＣＰＵ１１００は、ＲＯＭ１３００またはＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１３００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 The CPU 1100 operates based on a program stored in the ROM 1300 or the HDD 1400 and controls each unit. The ROM 1300 stores a boot program executed by the CPU 1100 when the computer 1000 is started up, a program depending on the hardware of the computer 1000, and the like.

ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラム、および、かかるプログラムによって使用されるデータ等を格納する。通信インターフェイス１５００は、ネットワークＮを介して他の機器からデータを受信してＣＰＵ１１００へ送り、ＣＰＵ１１００が生成したデータを、ネットワークＮを介して他の機器へ送信する。 The HDD 1400 stores programs executed by the CPU 1100, data used by the programs, and the like. The communication interface 1500 receives data from other devices via the network N and sends the data to the CPU 1100, and transmits data generated by the CPU 1100 to other devices via the network N.

ＣＰＵ１１００は、入出力インターフェイス１６００を介して、ディスプレイやプリンタ等の出力装置、および、キーボードやマウス等の入力装置を制御する。ＣＰＵ１１００は、入出力インターフェイス１６００を介して、入力装置からデータを取得する。また、ＣＰＵ１１００は、生成したデータを、入出力インターフェイス１６００を介して出力装置へ出力する。 The CPU 1100 controls an output device such as a display and a printer and an input device such as a keyboard and a mouse via the input / output interface 1600. The CPU 1100 acquires data from the input device via the input / output interface 1600. Further, the CPU 1100 outputs the generated data to the output device via the input / output interface 1600.

メディアインターフェイス１７００は、記録媒体１８００に格納されたプログラムまたはデータを読み取り、ＲＡＭ１２００を介してＣＰＵ１１００に提供する。ＣＰＵ１１００は、かかるプログラムを、メディアインターフェイス１７００を介して記録媒体１８００からＲＡＭ１２００上にロードし、ロードしたプログラムを実行する。記録媒体１８００は、例えばＤＶＤ（Digital Versatile Disk）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。 The media interface 1700 reads a program or data stored in the recording medium 1800 and provides it to the CPU 1100 via the RAM 1200. The CPU 1100 loads such a program from the recording medium 1800 onto the RAM 1200 via the media interface 1700, and executes the loaded program. The recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disk) or PD (Phase change rewritable disk), a magneto-optical recording medium such as an MO (Magneto Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory. It is.

例えば、コンピュータ１０００が実施形態に係る情報処理装置１００として機能する場合、コンピュータ１０００のＣＰＵ１１００は、ＲＡＭ１２００上にロードされたプログラムを実行することにより、制御部１３０の機能を実現する。また、ＨＤＤ１４００には、記憶部１２０内のデータが格納される。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムを記録媒体１８００から読み取って実行するが、他の例として、他の装置からネットワークＮを介してこれらのプログラムを取得してもよい。 For example, when the computer 1000 functions as the information processing apparatus 100 according to the embodiment, the CPU 1100 of the computer 1000 implements the function of the control unit 130 by executing a program loaded on the RAM 1200. The HDD 1400 stores data in the storage unit 120. The CPU 1100 of the computer 1000 reads these programs from the recording medium 1800 and executes them. However, as another example, these programs may be acquired from other devices via the network N.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の概要の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 As described above, some of the embodiments of the present application have been described in detail with reference to the drawings. It is possible to implement the present invention in other forms with improvements.

また、上述した情報処理装置１００は、複数のサーバコンピュータで実現してもよく、また、機能によっては外部のプラットフォーム等をＡＰＩ（Application Programming Interface）やネットワークコンピューティングなどで呼び出して実現するなど、構成は柔軟に変更できる。 In addition, the information processing apparatus 100 described above may be realized by a plurality of server computers, and depending on the function, an external platform or the like may be realized by calling an API (Application Programming Interface) or network computing. Can be changed flexibly.

また、特許請求の範囲に記載した「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、受信部は、受信手段や受信回路に読み替えることができる。 Further, “section (module, unit)” described in the claims can be read as “means”, “circuit”, and the like. For example, the receiving unit can be read as receiving means or a receiving circuit.

１提示システム
１０話者端末
２０聴衆端末
５０情報提供装置
１００情報処理装置
１２１音声情報記憶部
１２２用語情報記憶部
１２３ユーザ情報記憶部
１３１受信部
１３２抽出部
１３３生成部
１３４出力部 DESCRIPTION OF SYMBOLS 1 Presentation system 10 Speaker terminal 20 Audience terminal 50 Information provision apparatus 100 Information processing apparatus 121 Speech information storage part 122 Term information storage part 123 User information storage part 131 Receiving part 132 Extraction part 133 Generation part 134 Output part

Claims

A receiver that receives information about the speech that is the speaker's utterance ;
An extraction unit that extracts terms included in the information about the voice received by the reception unit;
A summary of technical terms corresponding to the listener among the terms extracted by the extraction unit is output at a timing according to the listener, and an operation status related to the summary of the technical terms by the listener is output while the speaker is speaking. An information processing apparatus, comprising:

The output unit is
The information processing apparatus according to claim 1, wherein a summary of technical terms corresponding to the listener is output based on a field to which the term extracted by the extraction unit belongs.

The output unit is
The information processing apparatus according to claim 1 or 2, wherein a summary of technical terms having a technical term degree based on an appearance frequency among terms included in the information related to the speech is a predetermined threshold or more is output.

The output unit is
The information processing apparatus according to claim 1, wherein a summary of the technical term is output based on a user attribute of the listener.

The output unit is
The information processing apparatus according to claim 1, wherein the summary of the technical terms is output based on an operation related to the summary of the technical terms by the listener.

The output unit is
The information processing apparatus according to claim 1, wherein the technical term summary is output based on a selection operation or a deletion operation of the listener with respect to the technical term summary.

The output unit is
The information processing apparatus according to any one of claims 1 to 6 , wherein a summary of the technical term is output at a higher frequency as the field knowledge in the information about the voice of the listener is shallower.

The output unit is
The information processing apparatus according to any one of claims 1-7, characterized in that the listener is a summary of the technical terms frequently as the number of times of referring to the summary of the terminology.

The output unit is
The information according to any one of claims 1 to 8 , wherein a term ranking in which the technical terms are arranged in descending order of selection frequency or deletion frequency in the technical term summary is output. Processing equipment.

The output unit is
The information processing apparatus according to any one of claims 1 to 9 , wherein a technical term corresponding to a listener among the terms extracted by the extraction unit is output for each group to which the technical term belongs.

The output unit is
The information processing apparatus according to any one of claims 1-10, characterized in that outputs the corrected amount of summary of the terminology based on operation relating to summary of the technical terms by the listener.

The output unit is
The information processing apparatus according to any one of claims 1 to 11, characterized that a summary of the terminology based on knowledge difference between the listener and the speaker of the information about the voice.

An information processing method executed by an information processing apparatus,
A receiving process for receiving information about the speech that is the speaker's utterance ;
An extraction step of extracting terms included in the information about the voice received by the reception step;
A summary of technical terms corresponding to the listener among the terms extracted in the extraction step is output at a timing according to the listener, and an operation status related to the summary of the technical terms by the listener is output while the speaker is speaking. An information processing method characterized by including an output step.

A receiving procedure for receiving information about the voice of the speaker ,
An extraction procedure for extracting terms included in the information about the voice received by the reception procedure;
A summary of technical terms corresponding to the listener among the terms extracted by the extraction procedure is output at a timing according to the listener, and an operation status regarding the summary of the technical terms by the listener is output while the speaker is speaking. An information processing program for causing a computer to execute an output procedure.