JP2016218995A

JP2016218995A - Machine translation method, machine translation system and program

Info

Publication number: JP2016218995A
Application number: JP2016039350A
Authority: JP
Inventors: 菜々美藤原; Nanami Fujiwara; 山内　真樹; Maki Yamauchi; 真樹山内
Original assignee: Panasonic Intellectual Property Corp of America
Current assignee: Panasonic Intellectual Property Corp of America
Priority date: 2015-05-25
Filing date: 2016-03-01
Publication date: 2016-12-22
Anticipated expiration: 2036-03-01
Also published as: JP6649124B2

Abstract

PROBLEM TO BE SOLVED: To provide a machine translation system capable of outputting a translation text desirable for a user with few correction on an input text or re-inputting thereof.SOLUTION: The machine translation system generates plural different forward translation texts in which translation object text of a first language received is translated into a second language, and generates plural reverse translation texts in which each of the plural different forward translation texts are reverse translated into the first language. When receiving an operation to select a reverse translation text from the plural reverse translation texts while an information output unit outputs plural reverse translation texts, the forward translation text corresponding to the reverse translation text is output.SELECTED DRAWING: Figure 3

Description

本発明は、複数言語間の翻訳を行う装置、複数言語間の翻訳を行う方法、及び、複数言語間の翻訳を行うシステムに関する。 The present invention relates to an apparatus for performing translation between a plurality of languages, a method for performing translation between a plurality of languages, and a system for performing translation between a plurality of languages.

近年のグローバル化に伴い、異なる言語を母国語とするユーザ同士のコミュニケーションを可能にする機械翻訳装置、機械翻訳システムの開発が行われている。また、機械翻訳機能を提供するサービスの運用も開始されており、例えば旅行会話などの場面で実際に利用されつつある。 With globalization in recent years, machine translation devices and machine translation systems that enable communication between users whose native languages are different languages have been developed. In addition, the operation of a service that provides a machine translation function has been started, and it is actually being used in scenes such as travel conversations.

特許第５６５３３９２号公報Japanese Patent No. 5653392 特開２００５−７８３１８号公報JP 2005-78318 A 特許第５０９７３４０号公報Japanese Patent No. 5097340 国際公開第２０１３／０１４８７７号International Publication No. 2013/014877

Ｐａｐｉｎｅｎｉ，Ｋ．，Ｒｏｕｋｏｓ，Ｓ．，Ｗａｒｄ，Ｔ．，ａｎｄＺｈｕ，Ｗ．Ｊ．「ＢＬＥＵ：ａｍｅｔｈｏｄｆｏｒａｕｔｏｍａｔｉｃｅｖａｌｕａｔｉｏｎｏｆｍａｃｈｉｎｅｔｒａｎｓｌａｔｉｏｎ」，Ｐｒｏｃ．ｏｆｔｈｅＡｎｎｕａｌＭｅｅｔｉｎｇｏｆｔｈｅＡｓｓｏｃｉａｔｉｏｎｏｆＣｏｍｐｕｔａｔｉｏｎａｌＬｉｎｇｕｉｓｔｉｃｓ（ＡＣＬ），ｐｐ．３１１−３１８，２００２Papineni, K .; Roukos, S .; , Ward, T .; , And Zhu, W .; J. et al. “BLEU: a method for automatic evaluation of machine translation”, Proc. of the Annual Meeting of the Association of Computational Linguistics (ACL), pp. 311-318, 2002 高地なつめ、磯崎秀樹、「スクランブリングを考慮した和訳の自動評価法のＮＴＣＩＲ−９データによる検証」、第２１９回自然言語処理研究発表会、岡山県立大、２０１４Natsume Takachi, Hideki Amagasaki, “Verification of automatic translation method considering scrambling with NTCIR-9 data”, 219th Natural Language Processing Research Presentation, Okayama Prefectural University, 2014 ＩｌｙａＳｕｔｓｋｅｖｅｒＰａｐｉｎｅｎｉ他、「ＳｅｑｕｅｎｃｅｔｏＳｅｑｕｅｎｃｅＬｅａｒｎｉｎｇｗｉｔｈＮｅｕｒａｌＮｅｔｗｏｒｋｓ」，ＡｄｖａｎｃｅｓｉｎＮｅｕｒａｌＩｎｆｏｒｍａｔｉｏｎＰｒｏｃｅｓｓｉｎｇＳｙｓｔｅｍｓ２７，ｐｐ．３１０４−３１１２，２０１４Ilya Suskever Papineni et al., “Sequence to Sequence Learning with Neural Networks”, Advances in Neural Information Processing Systems 27, p. 3104-3112, 2014 ＤｚｍｉｔｒｙＢａｈｄａｎａｕ他、「ＮｅｕｒａｌＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎｂｙＪｏｉｎｔｌｙＬｅａｒｎｉｎｇｔｏＡｌｉｇｎａｎｄＴｒａｎｓｌａｔｅ」，ａｒＸｉｖ：１４０９．０４７３ｖ５，ＩＣＬＲ２０１５Dzmitry Bahdanau et al., “Neural Machine Translation by Joining Learning to Alignment and Translation”, arXiv: 1409.04733, ICLR 2015

しかし、上記の機械翻訳装置、機械翻訳システムには、更なる改善が必要であった。 However, the machine translation apparatus and machine translation system described above require further improvement.

上記課題を解決するための、本発明の一態様は、言語情報を出力する情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行なう機械翻訳システムにおける機械翻訳方法であって、前記第１言語の翻訳対象文を受信し、受信した前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成し、前記複数の異なる前記順翻訳文の各々について前記第１言語へ逆翻訳した複数の逆翻訳文を生成し、前記情報出力装置において前記複数の逆翻訳文を出力しているときに、前記複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文に対応する前記順翻訳文を出力する。 One aspect of the present invention for solving the above problem is a machine translation method in a machine translation system that is connected to an information output device that outputs language information and performs a translation process between a first language and a second language. And receiving the translation target sentence in the first language, generating a plurality of different forward translation sentences obtained by translating the received translation target sentence into the second language, and each of the plurality of different forward translation sentences. Generating a plurality of back-translated sentences back-translated into the first language, and selecting one back-translated sentence from the plurality of back-translated sentences when the information output device outputs the plurality of back-translated sentences When the operation to perform is accepted, the forward translation sentence corresponding to the one reverse translation sentence is output.

上記態様により、更なる改善が実現できた。 According to the above aspect, further improvement can be realized.

本実施の形態における、機械翻訳システムの全体構成の一例を示す図である。It is a figure which shows an example of the whole structure of the machine translation system in this Embodiment. 本実施の形態における、情報表示端末構成を示すブロック図である。It is a block diagram which shows the information display terminal structure in this Embodiment. 本実施の形態における、翻訳サーバの構成を示すブロック図である。It is a block diagram which shows the structure of the translation server in this Embodiment. 本実施の形態における、機械翻訳システムのハードウェア構成を示す図である。It is a figure which shows the hardware constitutions of the machine translation system in this Embodiment. 本実施の形態における、機械翻訳システムの動作を示すフローチャートである。It is a flowchart which shows operation | movement of the machine translation system in this Embodiment. 本実施の形態における、翻訳文選択処理の具体的な動作を示すフローチャートである。It is a flowchart which shows the specific operation | movement of the translation selection process in this Embodiment. 本実施の形態における、逆翻訳文選択処理の具体的な動作を示すフローチャートである。It is a flowchart which shows the specific operation | movement of the back translation sentence selection process in this Embodiment. 本実施の形態における、フレーズ評価処理の具体的な動作を示すフローチャートである。It is a flowchart which shows the specific operation | movement of a phrase evaluation process in this Embodiment. 本実施の形態における、学習処理の具体的な動作を示すフローチャートである。It is a flowchart which shows the specific operation | movement of a learning process in this Embodiment. 本実施の形態における、学習処理の具体的な動作を示すフローチャートである。It is a flowchart which shows the specific operation | movement of a learning process in this Embodiment. 本実施の形態における、一般的なフレーズテーブルの例である。It is an example of the general phrase table in this Embodiment. 本実施の形態における、フレーズ分割の概要を表わす説明図である。It is explanatory drawing showing the outline | summary of the phrase division | segmentation in this Embodiment. （Ａ）、（Ｂ）、（Ｃ）は、それぞれ、本実施の形態における、表示画面の一例を示す図である。(A), (B), (C) is a figure which shows an example of the display screen in this Embodiment, respectively. 本実施の形態における、表示画面の一例を示す図である。It is a figure which shows an example of the display screen in this Embodiment.

以下で説明する実施の形態は、いずれも本発明の一具体例を示すものである。以下の実施の形態で示される数値、形状、構成要素、ステップ、ステップの順序などは、一例であり、本発明を限定する主旨ではない。また、以下の実施の形態における構成要素のうち、最上位概念を示す独立請求項に記載されていない構成要素については、任意の構成要素として説明される。また全ての実施の形態において、各々の内容を組み合わせることも出来る。 Each of the embodiments described below shows a specific example of the present invention. Numerical values, shapes, components, steps, order of steps and the like shown in the following embodiments are merely examples, and are not intended to limit the present invention. In addition, among the constituent elements in the following embodiments, constituent elements that are not described in the independent claims indicating the highest concept are described as optional constituent elements. In all the embodiments, the contents can be combined.

（本発明に至った知見）
翻訳機器の開発が盛んに行われている状況において、異なる言語での円滑なコミュニケーションを実現するには、機械翻訳の精度が完全であることが望まれる。しかし、現状の機械翻訳においては、任意の文を誤りなく翻訳することは極めて困難であり、翻訳できる分野（ドメイン）を旅行会話といった形で限定することで翻訳精度を高めているが、完全には程遠い状況である。 (Knowledge that led to the present invention)
In the situation where translation equipment is actively developed, it is desired that the accuracy of machine translation is perfect in order to realize smooth communication in different languages. However, in the current machine translation, it is extremely difficult to translate an arbitrary sentence without error, and the translation accuracy is improved by limiting the fields (domains) that can be translated in the form of travel conversations. Is far away.

ここで、機械翻訳技術は大きく３種類に分類される。本編ではそれぞれ、１）ルールベース機械翻訳（ＲＢＭＴ：Ｒｕｌｅ−ＢａｓｅｄＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎ）、２）統計的機械翻訳（ＳＭＴ：ＳｔａｓｔｉｓｔｉｃａｌＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎ）、３）ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ：ＤｅｅｐＮｅｕｒａｌＮｅｔｗｏｒｋＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎ）と呼ぶことにする。 Here, the machine translation technology is roughly classified into three types. In each of the main sections, 1) Rule-based machine translation (RBMT), 2) Statistical machine translation (SMT), 3) Model acquisition type machine translation (DNNNMT: Deep) It will be referred to as “Neural Network Machine Translation”.

１）ルールベース機械翻訳（ＲＢＭＴ）
ルールベース機械翻訳（ＲＢＭＴ）は、人手によって構築された変換規則（訳語の対をデータベースとして記憶したもの）を元に翻訳を行うシステムであり、原文と訳文のデータベースは翻訳メモリと表現されることもある。 1) Rule-based machine translation (RBMT)
Rule-based machine translation (RBMT) is a system that performs translation based on manual conversion rules (stored translation pairs as a database), and the database of the original text and the translated text is expressed as a translation memory. There is also.

長所として、「規則（翻訳するパターン）を厳密に定義できるため、原文（の一部）が当該規則内に存在する場合には、対応する部分の翻訳の精度が高くなり、規則内の翻訳表現を予め揃えておくことで、翻訳出力の翻訳表現についても一貫性も保ちやすい」ことが挙げられる。 As an advantage, “Rules (patterns to be translated) can be defined precisely, so if the original text (part) exists in the rules, the translation accuracy of the corresponding parts will be high, and the translation expression in the rules. , It is easy to maintain consistency in the translation expression of the translation output.

短所は、「規則が存在しない場合には翻訳精度が非常に低くなる、もしくは全く翻訳できず、想定以外の業界・分野への適応性も極めて低い」ことが挙げられる。また、基本的には人手によって規則を構築・拡充するため、開発に掛かるコストも高くなる。更に、翻訳性能を向上させるためには規則を追加していく必要があるが、ユーザ側で規則を構築して翻訳システムをカスタマイズするには、規則の設計ルールについて相当の知識が要求されるため、一般ユーザが気軽に使えるものとはなっていない。このため、ルールベース翻訳（ＲＢＭＴ）の主な展開先は、業務用翻訳（特許翻訳など）となっており、市販の業務用翻訳ソフトは、このルールベース翻訳や翻訳メモリを用いているものが多い。 Disadvantages include: “If there are no rules, translation accuracy will be very low, or translation will not be possible at all, and adaptability to other industries and fields will be extremely low”. In addition, since the rules are basically constructed and expanded manually, the development costs are also high. Furthermore, it is necessary to add rules in order to improve translation performance. However, in order to customize the translation system by building rules on the user side, considerable knowledge of the rule design rules is required. It is not easy for general users to use. For this reason, the main development of rule-based translation (RBMT) is business translation (patent translation, etc.), and commercially available business translation software uses this rule-based translation or translation memory. Many.

２）統計的機械翻訳（ＳＭＴ）
統計的機械翻訳（ＳＭＴ）は、ＲＢＭＴのような規則（ルール）を作成せず、対訳コーパスと呼ばれる「原言語と目的言語双方での訳文対」を大量に準備し、この対訳コーパスから統計的に翻訳確率を計算し訳文を生成する手法である。グーグル（登録商標）やマイクロソフト（登録商標）等がウェブ上で提供する翻訳エンジンに広く用いられている。 2) Statistical machine translation (SMT)
Statistical machine translation (SMT) does not create rules (rules) like RBMT, but prepares a large number of “translation pairs in both source and target languages” called bilingual corpus, and statistically translates from this bilingual corpus This is a method of generating a translation by calculating the translation probability. It is widely used for translation engines provided on the web by Google (registered trademark), Microsoft (registered trademark), and the like.

この手法は、翻訳確率を計算するにあたり、言語モデル（単言語における単語間の出現確率をｎ−ｇｒａｍモデルで表現するものが一般的）、及び、翻訳モデル（双言語間での、単語もしくはフレーズ単位での対応関係をアライメントモデルとして表現し、各単語間の語彙レベルでの訳語対関係を語彙モデルとして表現するものが一般的である）の２つのモデルについて、それぞれ対訳コーパスから統計的に確率を計算し、各モデルのパラメータ（確率値）を得ることで、統計的な翻訳を実現する手法である。一般的にＩＢＭモデルと呼ばれている。（日本語を含む場合は、「並び替えモデル」と呼ばれるモデルを構文解析等により追加する場合がある）。 In this method, in calculating the translation probability, a language model (generally expressing the appearance probability between words in a single language by an n-gram model) and a translation model (word or phrase between bilinguals). It is statistically probable from the bilingual corpus for each of the two models that express the correspondence in units as an alignment model and the translational relationship between words at the vocabulary level as a vocabulary model. This is a technique that realizes statistical translation by obtaining the parameters (probability values) of each model. It is generally called an IBM model. (If Japanese is included, a model called “reorder model” may be added by parsing etc.).

このＳＭＴの長所として、ＲＢＭＴのようなルールを用いないことから、「『規則が存在しない場合に翻訳精度が非常に低くなる』と言った問題を未然に回避でき、汎化能力のある翻訳エンジンを構築できる」点が挙げられる。 The advantage of this SMT is that it does not use rules like RBMT, so it can avoid the problem of “translation accuracy is very low when no rules exist” and has a generalization ability. Can be built ".

一方、短所として、「確率的な表現に基づくため、確率計算の基となる対訳コーパスを大量に準備する必要がある」点がある。また、汎化性能が高くなる一方で、翻訳結果は「確率的なスコアが高い」出力に過ぎなくなるため、ＲＢＭＴで上手く翻訳できた事例についてＳＭＴでも同等の翻訳精度が得られる保証は無くなり、「全体／平均としては、比較的良い」翻訳では有るが、個別の翻訳ケースでは確実性に劣る場面も発生する。 On the other hand, there is a disadvantage that “because it is based on a probabilistic expression, it is necessary to prepare a large number of parallel corpora as a basis for probability calculation”. In addition, while the generalization performance is improved, the translation result is only an output with a “high probability score”, and therefore, there is no guarantee that the same translation accuracy can be obtained even in SMT for the case where the translation was successful in RBMT. Although there is a translation that is "relatively good as a whole / average", there are cases where individual translation cases are less reliable.

更に、確率計算を行う際には、各内部モデル（言語モデル、翻訳モデル等）が出力する確率値を掛け合わせる必要がある。この際、ＳＭＴの翻訳エンジン全体をチューニングするため、各モデルからの確率値に重み付けをして重みをパラメータとした機械学習を行うが、ここで用いられる機械評価値が文字通り「機械的な評価値（例えば、ＢＬＥＵと呼ばれる評価値）」となるため、必ずしもユーザの主観評価と一致しないことが報告されている（例えば、非特許文献２参照）。つまり、「機械評価値は高いがユーザの実際の評価とは結びつかない」と言った構造的な欠点も有している。 Furthermore, when performing the probability calculation, it is necessary to multiply the probability values output by the internal models (language model, translation model, etc.). At this time, in order to tune the entire SMT translation engine, the machine learning is performed by weighting the probability value from each model and using the weight as a parameter. The machine evaluation value used here is literally “mechanical evaluation value”. (For example, an evaluation value called BLEU) ", it has been reported that it does not necessarily coincide with the subjective evaluation of the user (for example, see Non-Patent Document 2). That is, it also has a structural defect that “the machine evaluation value is high but is not linked to the user's actual evaluation”.

また、ユーザ側やシステム側で性能向上を図ろうとして、仮に対訳コーパスを数文追加したとしても、翻訳出力は、対訳コーパス全体を統計処理した上での確率的な振る舞いに左右されるため、性能向上に直結しない（性能が下がることも有り得る）と言った点も別の構造的な欠点として挙げられる。 In addition, even if you add several sentences of the bilingual corpus to improve performance on the user side or system side, the translation output depends on the stochastic behavior after statistical processing of the entire bilingual corpus, Another structural drawback is that it does not directly improve performance (it can also degrade performance).

３）ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）
ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）は、ディープラーニング（ＤＬ：ＤｅｅｐＬｅａｒｎｉｎｇ）技術を機械翻訳に適用した比較的新しい翻訳技術である（例えば、非特許文献３，非特許文献４参照）。 3) Model acquisition type machine translation (DNNMT) by deep neural network
Model acquisition type machine translation (DNNNMT) using a deep neural network is a relatively new translation technology in which deep learning (DL: Deep Learning) technology is applied to machine translation (for example, see Non-Patent Document 3 and Non-Patent Document 4). ).

ＲＢＭＴやＳＭＴのような「人によって設計されたルールやモデル」に入力文や対訳コーパスを当てはめ／統計処理する翻訳手法とは異なり、システムが適切なモデル自体を学習することを特徴としている。現時点では黎明期の技術であり、機械翻訳としては実用化には至っていないが、類似技術として音声認識では既に、アップル社のＳｉｒｉ（登録商標）などで実用化されている。 Unlike a translation technique that applies / statistically processes an input sentence or a bilingual corpus to “rules and models designed by a person” such as RBMT and SMT, the system is characterized by learning an appropriate model itself. At present, this technology is in its early days and has not been put into practical use as machine translation. However, as a similar technology, it has already been put into practical use in speech recognition such as Siri (registered trademark) of Apple Inc.

長所として、適切なモデルの学習に成功した場合には、翻訳性能の向上が期待されており、特にＳＭＴを超える汎化性能や、ＲＢＭＴやＳＭＴが不得手とする比較的長い文（例えば４０以上の単語で構成される文）について翻訳性能をそれほど劣化させずに翻訳結果を出力することが期待されている。 As an advantage, if an appropriate model is successfully learned, an improvement in translation performance is expected. In particular, a generalization performance that exceeds SMT, and a relatively long sentence (eg, 40 or more) that RBMT and SMT are not good at. It is expected that the translation result will be output without degrading the translation performance so much.

短所としては、ＤＮＮＭＴ自身がどのようにモデルを学習するかについて、外部制御が困難な点が挙げられる。いわゆる「パラメータを用いたチューニング」に相当するような「パラメータ」が明示的に存在せず、内部の変数は非常に多いものの、どの変数が翻訳性能と直結しているかが不明である。つまり、ニューラルネットワークの構成と入力データ・教師データを決めた後は、どうシステムが学習するか・どのような性能が出るか制御困難であり、原理的にどうすれば性能向上が図れるのかが不明である。一般的なチューニング手法が適用困難であり、ＳＭＴ以上にユーザ側やシステム側で性能向上を図ろうとしても対応が困難である。仮に対訳コーパスを数文追加したとしても、翻訳出力はニューラルネットの学習に左右され、何が出てくるかはニューラルネットワークの内部変数が決まらないと分からない、という構造的な欠点が存在する。 Disadvantages include the difficulty of external control of how DNNNMT itself learns the model. There is no explicit “parameter” corresponding to so-called “tuning using parameters”, and although there are a large number of internal variables, it is unclear which variables are directly linked to translation performance. In other words, it is difficult to control how the system learns and what kind of performance comes out after deciding the neural network configuration and input data / teacher data, and it is unclear how the performance can be improved in principle. . It is difficult to apply a general tuning method, and it is difficult to cope with an attempt to improve performance on the user side or system side beyond SMT. Even if several sentences are added to the bilingual corpus, the translation output depends on the learning of the neural network, and there is a structural defect that what will come out cannot be known unless the internal variables of the neural network are determined.

以上が、機械翻訳技術を大きく３種類に分類したときの、それぞれの特徴・長所・短所である。 The above are the features, advantages, and disadvantages of machine translation technology when classified into three main types.

一方でユーザ観点から機械翻訳技術を見ると、翻訳したい言語である目的言語について、ユーザが見識を持っていれば、機械翻訳の精度があまり高くない場合でも、翻訳出力結果を元に会話を進めることができる可能性がある。つまり、機械翻訳の出力について、ユーザが「正しい／正しくない」を判断でき、有用な翻訳部分についてはそれを利用する、といった使い方が期待できる。 On the other hand, when looking at machine translation technology from the user's perspective, if the user has insight into the target language, which is the language he wants to translate, even if the accuracy of machine translation is not very high, the conversation will proceed based on the translation output result Could be possible. In other words, it can be expected that the user can determine whether the output of machine translation is “correct / incorrect” and use a useful translation part.

しかし実際には、機械翻訳の出力結果をユーザが参照したとしても、その翻訳結果が示している内容を理解できないケースも容易に想像される。例えば、ユーザを日本人と想定した場合に、日本語をメジャーな言語（英語など）へ翻訳したいケースも有れば、比較的マイナーな言語（例えば、マレー語やベトナム語など）へ翻訳したいケースも有る。例えば、目的言語が英語である場合、英語に関して知見が有るユーザは、機械翻訳の結果を参考にして自らの英語での会話に役立てることが可能かも知れない。一方で、目的言語がマイナーな言語である場合には、ユーザに目的言語についての知見が全く無いことが多く、ユーザは機械翻訳結果の内容が正しいかどうかについて、何の情報も得ることが出来ない。つまり、目的言語の訳文が機械翻訳によって提示されたとしても、その内容を全く理解することが出来ないケースが発生する。言語は数千種類も存在するとされており、ほとんどのユーザにとって、知見のない言語の方が大多数であるとさえ言える。このように、目的言語の翻訳結果について、その意味内容を理解できない場合には、相手に伝わる意図を確認できないまま訳文を提示してしまうことにもつながり、コミュニケーションが破綻する恐れがある。 However, in practice, even if the user refers to the output result of the machine translation, it is easily imagined that the contents indicated by the translation result cannot be understood. For example, assuming that the user is Japanese, there are cases where you want to translate Japanese into a major language (English, etc.), while you want to translate into a relatively minor language (eg Malay, Vietnamese, etc.) There is also. For example, when the target language is English, a user who has knowledge about English may be able to use it for his own English conversation with reference to the result of machine translation. On the other hand, if the target language is a minor language, the user often has no knowledge of the target language, and the user can obtain any information as to whether the content of the machine translation result is correct. Absent. In other words, even if the target language translation is presented by machine translation, the content cannot be understood at all. Thousands of languages are said to exist, and for most users it can be said that the majority of languages are unknown. As described above, if the meaning of the translation result of the target language cannot be understood, the translation may be presented without confirming the intention transmitted to the other party, and communication may be broken.

このような現状の機械翻訳精度を少しでも向上させるためには、機械翻訳システムそのものが自動的に学習し、性能向上していくことが望まれる。機械翻訳システムが自動的に学習する際には、ユーザ（原言語話者）に負担をかけないことが期待され、その上で、ユーザが相手に伝わる意図を簡便な方法で確認しながら翻訳文を生成・利用できるように機械翻訳システムを構築する必要がある。機械翻訳システムには、ユーザの利用結果を用いて、自動学習することが要求される。また、当然のこととして、システムそのものに要求される計算機資源や開発コストを極力少なくすることも同時に求められる。 In order to improve the current machine translation accuracy as much as possible, it is desired that the machine translation system itself automatically learns and improves its performance. When a machine translation system automatically learns, it is expected that it will not put a burden on the user (source language speaker), and then the translated sentence while confirming the intention transmitted to the other party by a simple method. It is necessary to build a machine translation system so that can be generated and used. The machine translation system is required to automatically learn using the use result of the user. Naturally, it is also required to reduce the computer resources and development costs required for the system itself as much as possible.

すなわち、機械翻訳システムに求められることは、「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」、「（要件３）要件１・要件２の実現と同時に、計算機資源や開発コストを下げること」の３要件である。これらの３要件を同時に満たした機械翻訳システムの実現が課題となっている。 That is, what is required of a machine translation system is “(Requirement 1) that a user can generate and use a translated sentence while confirming the intention transmitted to the other party in a simple manner without burdening the user”, “(Requirement 2) Can be automatically learned by using the user's usage results (without newly burdening the user) "," (Requirement 3) Simultaneously realizing requirement 1 and requirement 2, simultaneously reducing computer resources and development costs 3 requirements. The realization of a machine translation system that satisfies these three requirements simultaneously has been an issue.

この課題を解決する手段として、例えば、ＲＢＭＴのように、原言語用例と目的言語用例を対応づけた用例をデータベースや規則・ルールとして持っておき、発話された入力文に対して類似する用例を取得し、原言語・目的言語両方で提示するもの（例えば、特許文献１）、入力文と逆翻訳（目的言語から原言語への翻訳）結果の距離を翻訳信頼度として算出し、必要に応じて信頼度と逆翻訳結果をあわせて提示し、ユーザに言い直しや言い換えを行わせるもの（例えば、特許文献４）、統計的機械翻訳（ＳＭＴ）システムの性能を評価する手法であるＢＬＥＵ（ＢｉＬｉｎｇｕａｌＥｖａｌｕａｔｉｏｎＵｎｄｅｒｓｔｕｄｙ）を用いるもの（例えば、非特許文献１）、入力文に対し、逆翻訳文をＮ個取得し、入力文と逆翻訳文の比較評価を行うもの（例えば、特許文献２）、対訳語句を対訳辞書に登録する際、対象とする翻訳装置に有効であるか否かを判別した結果を機械学習するもの（例えば、特許文献３）などがある。 As a means for solving this problem, for example, as in RBMT, an example that associates an example of the source language with an example of the target language is provided as a database or a rule / rule, and an example similar to the spoken input sentence. Obtain and present in both source and target languages (for example, Patent Document 1), calculate the distance between the input sentence and the result of reverse translation (translation from the target language to the source language) as translation reliability, and if necessary BLE (BiLingual, which is a method for evaluating the performance of a statistical machine translation (SMT) system, which presents the reliability and reverse translation results together, and allows the user to rephrase and paraphrase (for example, Patent Document 4). E (under Non-Patent Literature 1) using N (Evaluation Under study), N reverse translation sentences are acquired for the input sentence, and the comparative evaluation of the input sentence and the reverse translation sentence What performs valency (for example, Patent Document 2), and performs machine learning on the result of determining whether or not it is effective for the target translation device when registering the bilingual phrase in the bilingual dictionary (for example, Patent Document 3) and so on.

特許文献１には、予め、原言語の用例と、原言語の用例の翻訳文である目的言語の用例とが対応づけられて用例格納部に格納されている内容が開示されている。 Patent Document 1 discloses contents stored in an example storage unit in advance in association with a source language example and a target language example that is a translation of the source language example.

また、特許文献１には、前述の用例格納部に格納された情報に基づいて、入力された原言語の文字列と、原言語の文字列に対応する目的言語の文字列とが、表示部において、それぞれ異なる表示エリアに表示される内容が開示されている。 Patent Document 1 discloses that an input source language character string and a target language character string corresponding to the source language character string are displayed on the display unit based on the information stored in the example storage unit. The contents displayed in different display areas are disclosed.

具体的には、入力された原言語の文字列に対して、その原言語の文字列に類似する複数の類似用例の文字列が同一の表示エリアに表示され、これに対して、入力された原言語の文字列および複数の類似用例の文字列の各々に対応する目的言語の文字列が、異なる表示エリアに表示される。 Specifically, for the input source language character string, a plurality of similar example character strings similar to the source language character string are displayed in the same display area. A character string in the target language corresponding to each of the character string in the source language and a plurality of similar example character strings is displayed in different display areas.

原言語話者および目的言語話者は、入力文に対して意味を確認したい時に、用例を確認することができる。また、目的言語話者、または原言語話者が選択した類似文用例が相手の話者にもハイライト等で表示される。 When the source language speaker and the target language speaker want to confirm the meaning of the input sentence, they can confirm the example. In addition, the similar sentence example selected by the target language speaker or the source language speaker is also displayed in highlight or the like on the partner speaker.

特許文献１では、入力文である原言語の文字列を取得し、データベース内の対訳例中にある類似用例を検索し、類似度が閾値以上の用例であれば類似用例であると判断して、入力文に対する類似用例として出力する。このとき、出力された類似用例に対するユーザの選択動作が行なわれた場合には、選択された用例をハイライト表示する内容も開示されている。 In Patent Document 1, a source language character string as an input sentence is acquired, a similar example in a parallel translation example in the database is searched, and if the example has a similarity equal to or greater than a threshold, it is determined that the example is a similar example. Is output as a similar example for the input sentence. At this time, when a user's selection operation is performed on the output similar example, the content of highlighting the selected example is also disclosed.

特許文献２には、原文に対して順方向機械翻訳部１１で出力された順方向翻訳文を受け、逆方向翻訳部１２ａ、１２ｂ、１２ｃにおいて、逆方向翻訳文Ａ、Ｂ、Ｃを評価部１３で評価する内容が開示されている。 In Patent Document 2, the forward translation sentence output from the forward machine translation unit 11 is received with respect to the original sentence, and the backward translations A, B, and C are evaluated by the backward translation units 12a, 12b, and 12c. The content to be evaluated at 13 is disclosed.

このときの、評価手法としては、非特許文献１に開示されている内容が一般的に知られている。非特許文献１においては、参照訳（人手で作成した正解訳）と機械翻訳で出力された訳との間のＮ−ｇｒａｍが一致した数を計算した上で、参照訳の長さの影響を加味した補正を行ってＢＬＥＵ値を取得する。ＢＬＥＵは翻訳精度の評価手法としてよく用いられているが、特に、日英・英日のような語順が大きく異なる言語間の翻訳においては、人手評価と相関が低いことが知られている（例えば、非特許文献２参照）。 As an evaluation method at this time, the contents disclosed in Non-Patent Document 1 are generally known. In Non-Patent Document 1, after calculating the number of matching N-grams between the reference translation (correct translation created manually) and the translation output by machine translation, the influence of the length of the reference translation is calculated. The BLEU value is acquired by performing correction with consideration. BLEU is often used as an evaluation method for translation accuracy, but it is known that human translation and correlation are low particularly in translation between languages such as Japanese-English and English-Japanese that have very different word orders (for example, Non-Patent Document 2).

特許文献２には、さらに、逆翻訳文Ａ、Ｂ、Ｃと原文のＤＰマッチングを行なった後に、最大スコアを有する逆方向翻訳文と原文を出力する内容が開示されている。評価者がこれらを比較することによって順方向翻訳文の主観評価が可能となる。 Patent Document 2 further discloses the contents of outputting the reverse translation sentence having the maximum score and the original sentence after DP matching between the reverse translation sentences A, B, and C and the original sentence. The evaluator compares these to enable subjective evaluation of the forward translation.

以上で述べたように、機械翻訳精度が完全でない状況下において翻訳を行う場合、ユーザ（原言語話者）に繰り返し入力などの負担をかけることなく、簡単にかつ相手に伝わる意図を確認しながら翻訳発話文を生成し、ユーザの選択を評価としてシステムそのものが評価／学習することが必要であった。 As described above, when translation is performed under a situation where machine translation accuracy is not perfect, the user (source language speaker) can easily confirm the intention transmitted to the other party without burdening the user repeatedly. It was necessary for the system itself to generate translation utterances and evaluate / learn the user's selection as an evaluation.

機械翻訳システムに求められることを再掲すると、「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」、「（要件３）要件１・要件２の実現と同時に、計算機資源や開発コストを下げること」、の３要件を同時に満たすことである。 Reprinting what is required of a machine translation system, “(Requirement 1) A user can generate and use a translated sentence while checking the intention transmitted to the other party in a simple manner without burdening the user”, “(Requirement 2) Can be automatically learned by using the user's usage results (without newly burdening the user) "," (Requirement 3) Simultaneously realizing requirement 1 and requirement 2, simultaneously reducing computer resources and development costs It is to satisfy the three requirements at the same time.

この課題を解決する手段として、先に挙げた３種類の技術では次のようなアプローチが取られている。 As means for solving this problem, the following approaches are taken in the three types of technologies mentioned above.

１）ルールベース機械翻訳（ＲＢＭＴ）
前述のように、特許文献１では、入力された原言語の文字列から、類似する原言語の類似文を出力し、出力された原言語の類似文に対しても対応する翻訳文を選択可能に出力する。これによって、例えば、原言語の文字列が音声入力される場合などには、発話入力時の音声認識誤りなどによる入力ミスの影響を減少させ、伝えたい意図を簡便にユーザが選択できる。 1) Rule-based machine translation (RBMT)
As described above, in Patent Document 1, a similar sentence in a similar source language can be output from the input source language character string, and a corresponding translation sentence can be selected for the output similar sentence in the source language. Output to. As a result, for example, when a character string in the source language is input by voice, the influence of an input error due to a voice recognition error at the time of speech input is reduced, and the user can easily select an intention to be transmitted.

特許文献１の段落番号［００１２］には、「用例格納部１０５は、原言語の用例（以下、原言語用例ともいう）と目的言語の用例（以下、目的言語用例ともいう）とを関連づけて格納する。」との記述があり、用例格納部１０５が用例検索を行うデータベースとなって原言語と目的言語の対を格納している。この部分がルールベース翻訳のデータベースに相当している。 In paragraph [0012] of Patent Document 1, “the example storage unit 105 associates a source language example (hereinafter also referred to as a source language example) with a target language example (hereinafter also referred to as a target language example). The example storage unit 105 stores a pair of the source language and the target language as a database for performing an example search. This part corresponds to a rule-based translation database.

なお、特許文献１の段落番号［００１１］には「機械翻訳部１０３は、音声認識部１０２から原言語文字列を受け取り、原言語文字列を目的言語（第２言語ともいう）の文字列に機械翻訳し、翻訳結果の文字列である目的言語文字列を得る。機械翻訳の具体的な処理は、一般的な処理を行えばよいため、ここでの説明を省略する。」との記述があり、ここで機械翻訳部１０３について、ＳＭＴやＤＮＮＭＴを用いることによる利点や課題は特許文献１で開示されておらず、特許文献１における音声翻訳システム全体としては、用例格納部１０５によるＲＢＭＴとしての働きが特徴的に開示されているに過ぎない。 The paragraph number [0011] in Patent Document 1 states that “the machine translation unit 103 receives a source language character string from the speech recognition unit 102 and converts the source language character string into a character string of a target language (also referred to as a second language). Machine translation is performed to obtain a target language character string that is a character string of the translation result.Since the concrete process of the machine translation may be a general process, the description is omitted here. " Yes, the advantages and problems of using the SMT and DNNNMT for the machine translation unit 103 are not disclosed in Patent Document 1, and the entire speech translation system in Patent Document 1 is an RBMT by the example storage unit 105. The work is only disclosed characteristically.

先に挙げたように、ＲＢＭＴは規則（ルール）に記述されていない入力文や、対応していない業界・分野の入力が有った場合に翻訳精度が著しく低下する、もしくは全く翻訳が出来ないという問題を有している。特許文献１においても、発話入力に対して用例格納部１０５に類似例が記述されていない場合や分野が異なる場合、翻訳精度が極めて低い翻訳文を提示する可能性が有り、目的言語に関する知識が無いユーザは相手に伝わる意図を確認できないまま訳文を提示してしまう（もしくは相手に伝わる意図を誤解したまま提示してしまう）ことに繋がりかねないが、こういった課題は開示されておらず解決策も記されていない。 As mentioned earlier, RBMT has a significant decrease in translation accuracy or cannot be translated at all if there is an input sentence not described in the rules (rules) or an input from an industry or field that is not supported. Has the problem. Also in Patent Document 1, when a similar example is not described in the example storage unit 105 with respect to an utterance input, or when the field is different, there is a possibility that a translated sentence with extremely low translation accuracy may be presented, and there is knowledge about the target language. It may lead to a user who does not have the intention to communicate with the other party presenting the translation (or presenting the intention transmitted to the other party misunderstanding), but these issues are not disclosed and resolved There are no measures.

更に、用例格納部１０５に類似例が存在しなかった場合は、類似例そのものの提示が為されず、ユーザは音声認識結果の（誤りの多い）入力文を用いる他無い状況となる。翻訳結果についても、音声認識結果による入力文に対する機械翻訳部１０３の翻訳結果のみとなる。この場合、ユーザは「適切に伝えたい意図に則した文章」を「簡便に選択」することが出来ず、この課題についての解決策も特許文献１では開示されていない。 Further, when there is no similar example in the example storage unit 105, the similar example itself is not presented, and the user has no choice but to use an input sentence (a lot of errors) of the speech recognition result. The translation result is only the translation result of the machine translation unit 103 for the input sentence based on the speech recognition result. In this case, the user cannot “simplely select” “sentence in accordance with the intention to be conveyed appropriately”, and a solution to this problem is not disclosed in Patent Document 1.

すなわち、（要件１）について、「ユーザが（文章を）簡便に選択」する手段は、複数の類似文検索結果を提示し選択する手段を提供することで部分的に開示されているとも言えるが、ＲＢＭＴに相当する用例格納部１０５に類似例が存在しなかった場合は、選択肢そのものが失われる結果となり、完全には「ユーザが（文章を）簡便に選択」する手段は提供されていない。 That is, for (Requirement 1), the means for “simply selecting (sentence) by the user” is partially disclosed by providing means for presenting and selecting a plurality of similar sentence search results. When there is no similar example in the example storage unit 105 corresponding to RBMT, the option itself is lost, and a means for “a user can easily select (sentence)” is not provided completely.

更に「相手に伝わる意図の確認」についても、あくまで用例格納部１０５が保持している内容に検索がヒットした場合においてのみ限定的に解決されているに過ぎず、機械翻訳部１０３の翻訳結果出力には触れられていない。すなわち、機械翻訳部１０３の翻訳結果が、元の原言語文字列の内容に沿っているかどうかについては言及されておらず、（特許文献１の段落番号［００１１］に「機械翻訳部１０３は、音声認識部１０２から原言語文字列を受け取り、原言語文字列を目的言語（第２言語ともいう）の文字列に機械翻訳し、翻訳結果の文字列である目的言語文字列を得る。機械翻訳の具体的な処理は、一般的な処理を行えばよいため、ここでの説明を省略する。」との記載があり、機械翻訳部１０３の出力の正当性は担保されていない）、原言語文字列の内容とかけ離れた内容が機械翻訳部１０３の出力として出てきていたとしても、ユーザはそれを知る由もない。つまり、先の「目的言語の知識がないユーザは相手に伝わる意図を確認できないまま訳文を提示してしまう」という課題について、開示も解決もなされていない。 Furthermore, “confirmation of intention transmitted to the other party” is only limitedly solved when the search hits the content held in the example storage unit 105, and the translation result output by the machine translation unit 103 is output. Is not touched. That is, no mention is made as to whether or not the translation result of the machine translation unit 103 is in accordance with the contents of the original source language character string (in the paragraph number [0011] of Patent Document 1, “the machine translation unit 103 A source language character string is received from the speech recognition unit 102, the source language character string is machine-translated into a target language (also referred to as second language) character string, and a target language character string that is a translation result character string is obtained. The specific processing in FIG. 4 is not described here because general processing may be performed, and the correctness of the output of the machine translation unit 103 is not guaranteed), the source language Even if content that is far from the content of the character string appears as the output of the machine translation unit 103, the user has no reason to know it. That is, the above-mentioned problem that “a user without knowledge of the target language presents a translation without confirming the intention transmitted to the other party” has not been disclosed or solved.

また、（要件２）・（要件３）について、開示も示唆もされていない。特に要件３については、システム全体として軽量化するどころか、機械翻訳部１０３に加え、新たにＲＢＭＴに相当する用例格納部１０５が必要となっているため、計算機資源的にも開発工数的にも増大している。 Further, (Requirement 2) and (Requirement 3) are not disclosed or suggested. Especially for requirement 3, in addition to reducing the weight of the entire system, in addition to the machine translation unit 103, an example storage unit 105 corresponding to RBMT is newly required, which increases both in terms of computer resources and development man-hours. doing.

ＲＢＭＴにおける「（要件２）ユーザの利用結果を用いて、新たにユーザに負担をかけること無く、自動的に学習できること」への対応については、特許文献３のような、入力文と翻訳文をそれぞれ形態素解析し、対訳語句を対訳辞書に登録する際に、対象とする翻訳装置に有効であるか否かを判別した結果を機械学習するアプローチが開示されている。 Regarding the correspondence to “(Requirement 2) that a user can automatically learn without newly burdening the user with the use result of the user” in RBMT, an input sentence and a translation sentence as in Patent Document 3 are used. An approach is disclosed in which machine learning is performed on the result of determining whether or not each translation device is effective when performing morphological analysis and registering the bilingual phrase in the bilingual dictionary.

特許文献３の段落番号［００２０］では、「５は機械翻訳エンジンであって、例えば規則主導型機械翻訳では、」として、規則主導型機械翻訳という名称でＲＢＭＴを翻訳システムとして例示し、対訳語句を対訳辞書に登録する際の有効性の判別方法を開示している。ここでは、ルールベースの対訳辞書に新たな語句を登録する場合に、その語句を登録することがシステム上有効かどうかを、それまでに登録された対訳対を形態素解析した上で、サポートベクトルマシンを用いて識別した判定空間に照らし合わせることで判断を行っている。すなわち、「ルールベースへの新たな語句の登録」は、ユーザの翻訳利用とは関係なく行われており、ユーザの利用結果を用いた自動学習については課題の開示も解決もなされていない。 In paragraph [0020] of Patent Document 3, “5 is a machine translation engine, for example, in rule-driven machine translation”, RBMT is exemplified as a translation system under the name of rule-driven machine translation. Is disclosed as a method for determining the effectiveness of registering a word in a bilingual dictionary. Here, when registering a new word / phrase in the rule-based bilingual dictionary, whether or not the word / phrase registration is effective in the system is analyzed, and after analyzing the bilingual pair registered so far, a support vector machine Judgment is made by checking against the judgment space identified using. In other words, “registration of a new phrase to the rule base” is performed regardless of the user's use of translation, and no problem is disclosed or solved for automatic learning using the user's use result.

一般にＲＢＭＴは、「規則が存在しない場合には翻訳精度が非常に低くなる、もしくは全く翻訳できず、想定以外の業界・分野への適応性も極めて低い」という短所を有しており、この短所は本質的に解決されていない（例えば特許文献１）。 In general, RBMT has the disadvantage that “if there is no rule, translation accuracy will be very low, or translation will not be possible at all, and adaptability to other industries and fields will be extremely low”. Is not essentially solved (for example, Patent Document 1).

また翻訳性能を向上させるためには規則を追加していく必要があるが、これには開発コストが必要となる。更にユーザ側で規則を構築して翻訳システムをカスタマイズするには、規則の設計ルールについて相当の知識が要求されるため、一般ユーザが気軽に使えるものとはなっていない。すなわち、「ＲＢＭＴの持つルールが増加するに従い、ルール同士の干渉・副作用が発生する可能性が飛躍的に増大し、ある文例で有効なルールを追記することで、他の文例で不都合が生じるといった現象が頻発するようになる」が、これを回避するためには、「システムが有している全てのルールを把握した上で、新たなルール追記が必要となる」ため、「ユーザの利用結果を用いた自動学習」はおろか、簡便なルール追加すら困難である。例えば、先に挙げた特許文献３では、ＲＢＭＴが持つ翻訳ルールのうち、単語訳対のみについて、その有効性を自動で判断しようとしているに過ぎず、構文的な翻訳ルールについての干渉回避には至っていない。 In order to improve translation performance, it is necessary to add rules, but this requires development costs. Furthermore, in order to customize the translation system by constructing rules on the user side, considerable knowledge about the rule design rules is required, so that it is not easy for general users to use. That is, as the rules of RBMT increase, the possibility of interference and side effects between rules increases dramatically, and adding a valid rule in one sentence example causes inconvenience in another sentence example. The phenomenon will occur frequently ", but in order to avoid this, it is necessary to add new rules after understanding all the rules of the system. In addition to “automatic learning using”, it is difficult to add simple rules. For example, in the above-mentioned Patent Document 3, only the word translation pairs of the translation rules possessed by the RBMT are automatically determined to be effective, and interference with syntactic translation rules is avoided. Not reached.

ＲＢＭＴはこのような原理的な短所を有しているため、「ユーザによるＲＢＭＴのデータベースへの単語・用例登録」の効率化・簡便化がＲＢＭＴの課題として開示されている。これらは、ユーザに対して明示的に学習用データの入力や判断を求めるものであり、このような要求をユーザに対して行わないこと、すなわち、先の課題である「ユーザの利用結果を用いた自動学習」については、課題の開示も解決もなされていない状況である。 Since RBMT has such a fundamental disadvantage, the efficiency and simplification of “word / example registration in the RBMT database by the user” is disclosed as an issue of RBMT. These require users to input learning data and make decisions, and do not make such requests to the users. “Automatic learning” has not been disclosed or solved.

２）統計的機械翻訳（ＳＭＴ）
ＳＭＴでは、入力文と逆翻訳（入力文を目的言語に一旦翻訳し、更に目的言語から原言語に再度翻訳したもの）の結果の距離を翻訳信頼度として算出し、必要に応じて信頼度と逆翻訳結果をあわせて提示し、ユーザに言い直しや言い換えを行わせるもの（例えば、特許文献４参照。ただし特許文献４ではＳＭＴを前提としない場合についても言及されており、ＲＢＭＴも機械翻訳エンジンとして想定している。ＲＢＭＴについて、特許文献４では段落番号［０００９］にて、「文法規則型翻訳」としている。）、ＳＭＴの翻訳精度を評価する手法として、単語ｎ−ｇｒａｍ単位での類似度（ＢＬＥＵ値）を用いて自動学習をおこなうもの（非特許文献１参照）、入力文に対して得られた翻訳文に対する逆翻訳文をＮ個生成し、入力文と逆翻訳文の類似度から翻訳文の良否を比較評価するもの（例えば、特許文献２）、などがある。 2) Statistical machine translation (SMT)
In SMT, the distance of the result of the input sentence and reverse translation (the input sentence is once translated into the target language and then re-translated from the target language to the source language) is calculated as the translation reliability. Presenting the reverse translation result together and letting the user rephrase or paraphrase (see, for example, Patent Document 4. However, Patent Document 4 also mentions a case where SMT is not premised, and RBMT is also a machine translation engine) As for RBMT, in Patent Document 4, paragraph number [0009] is referred to as “grammatical rule type translation”.) As a technique for evaluating the translation accuracy of SMT, similarity in units of word n-grams is assumed. Automatic learning using degree (BLEU value) (see Non-Patent Document 1), N back-translated sentences for the translated sentence obtained for the input sentence are generated, and the input sentence and the reverse translation are generated Those comparative evaluation the quality of the translation from the similarity (e.g., Patent Document 2), and the like.

ここで、「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」について、これらの例では、例えば、ユーザに逆翻訳結果を提示すること（特許文献４）で、「相手に伝わる意図」をユーザに確認する手法を開示している。しかしながら、「簡便な形で翻訳文を生成・利用できること」については、逆翻訳結果に応じてユーザに再入力や言い換えを要求しており、課題解決には至っていない。 Here, with regard to “(Requirement 1) that the translation can be generated and used while confirming the intention transmitted to the other party in a simple manner without imposing a burden on the user”, in these examples, for example, the user can By presenting the translation result (Patent Document 4), a method for confirming the “intention transmitted to the other party” to the user is disclosed. However, regarding “being able to generate and use a translated sentence in a simple form”, the user is requested to input again or paraphrase according to the result of reverse translation, and the problem has not been solved.

具体的には、特許文献４の段落番号［００１３］では、「適切に翻訳結果に対する信頼度を得ることができ、かつ低信頼度であった場合に入力側ユーザに適切に再入力を促すことができる信頼度算出装置、翻訳信頼度算出利用方法および翻訳エンジン用プログラムを提供する」と開示されている。すなわち、翻訳結果に対する信頼度が低いとシステムが判断した場合、ユーザは原文の再入力を強いられるのみならず、充分に高い信頼度を持つ翻訳文が出力されるまで、原文の表現を変えるなどの試行錯誤を行いながら翻訳システムへの入力と訳文出力の確認作業を実行し続ける必要がある。更にこの時に留意しなければならないことは、使用するユーザにとって、当該翻訳システムが「どういった内部動作で翻訳文を生成」し、「どういった基準で信頼度を算出」し、「どうすれば信頼度が高い翻訳文を得ることができるのか」という点について、何の知見も有していないということである。ユーザは指針も無く、ただ信頼度の高い翻訳結果を得るために、文章を様々に言い換えながらの入力を強いられることとなり、これは実用上の観点から見ると、極めて使いにくいシステムとなってしまう。特許文献４では、これらについての課題提起も解決法も提示されておらず、「簡便な形で翻訳文を生成・利用できること」という点において不十分であると言える。 Specifically, in paragraph [0013] of Patent Document 4, “the reliability of the translation result can be appropriately obtained, and if the reliability is low, the input side user is appropriately prompted to input again. Providing a reliability calculation device, a translation reliability calculation utilization method, and a translation engine program that can perform the above-mentioned ". In other words, when the system determines that the reliability of the translation result is low, the user is not only forced to re-input the original text, but also changes the expression of the original text until a translated text with sufficiently high reliability is output. It is necessary to continue to check the input to the translation system and the translation output while performing trial and error. In addition, it is important to note that the translation system “generates translated texts with what internal operation”, “calculates reliability based on what criteria”, and “how to trust” It means that they have no knowledge about the point of “whether a highly translated sentence can be obtained”. The user has no guidelines and is forced to input text in various ways in order to obtain highly reliable translation results. This is a system that is extremely difficult to use from a practical point of view. . Patent Document 4 does not present a problem or a solution for these problems, and it can be said that it is insufficient in that “translated sentences can be generated and used in a simple form”.

また、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」については、これまで「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」に関して述べたように、ＲＢＭＴ・ＳＭＴ双方において、ユーザが負担を受けずに「相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」が実現されておらず、そのため、（要件１）を満たすような「ユーザの利用結果」を得る手法が開示されていないため、（要件１）を満たす「ユーザの利用結果」を用いた自動的な学習についても、課題の開示や解決法の提示は為されていない。 In addition, regarding “(Requirement 2) to be able to automatically learn using the user's usage result (without newly burdening the user)”, “(Requirement 1) As described in regard to the ability to generate and use translated sentences while confirming the intention transmitted to the other party in a simple form, both the RBMT and the SMT are able to generate a simple form of the intention transmitted to the other party without burdening the user. "A translation sentence can be generated and used while confirming in" is not realized, and therefore a method for obtaining a "user usage result" that satisfies (Requirement 1) is not disclosed (Requirement 1) Regarding automatic learning using the “user usage result” that satisfies the above, no problem is disclosed or a solution is not presented.

この「（要件１）を満たすようなユーザの利用結果を得る手法が無い」ことを前提とした上で、何らかの別の学習用データが与えられた場合の翻訳システムの自動評価・学習法という観点で、先に述べた、「統計的機械翻訳システムの性能を評価する手法であるＢＬＥＵ値による自動学習をおこなうもの（非特許文献１参照）」、「入力文に対し、逆翻訳文をＮ個取得し、入力文と逆翻訳文の類似度から比較評価を行うもの（例えば、特許文献２）、「対訳語句を対訳辞書に登録する際、対象とする翻訳装置に有効であるか否かを判別した結果を機械学習するもの（例えば、特許文献３参照）」等が存在する。 On the premise that there is no method for obtaining user usage results that satisfy (Requirement 1), the viewpoint of an automatic evaluation / learning method for a translation system when some other learning data is given As described above, “Those that perform automatic learning using BLEU values, which is a method for evaluating the performance of a statistical machine translation system (see Non-Patent Document 1)”, “N back-translated sentences for an input sentence Acquired and compared and evaluated from the similarity between the input sentence and the reverse translation sentence (for example, Patent Document 2), “When registering a bilingual phrase in the bilingual dictionary, whether or not it is effective for the target translation device There are those that machine-learn the determined results (see, for example, Patent Document 3).

非特許文献１は、事前に入力文に対する参照訳（正解データ）を準備し、翻訳エンジンの翻訳出力結果と正解データとを単言語でのｎ−ｇｒａｍをベースに比較することで、その比較結果を値（ＢＬＥＵスコア）として機械的に算出し、その値が高くなるようにシステムをチューニングする、というものである。事前に入力文と、正解データとなる参照訳（翻訳文）を準備する必要があり、本質的に翻訳エンジンの内部モデルはこのチューニングによって何ら変化せず単に重みが変わるだけのため、モデルそのものを学習したい場合や、正解データが与えられない・正解データが一意に決定されない場合には、本手法は適用できない。 Non-Patent Document 1 prepares a reference translation (correct data) for an input sentence in advance, and compares the translation output result of the translation engine with the correct data based on n-gram in a single language. Is mechanically calculated as a value (BLEU score), and the system is tuned to increase the value. It is necessary to prepare an input sentence and a reference translation (translated sentence) as correct data in advance. Essentially, the internal model of the translation engine does not change at all by this tuning, and the weight itself changes. This method cannot be applied when learning is desired, or when correct data is not given or correct data is not uniquely determined.

仮に「要件１を満たすユーザの利用結果」が得られたとしても、その結果は「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用した」結果である。この「結果」のうち、「どれが正解データであるかどうか」、「どう一意に正解データとするか」、「その後どうモデルの学習を行うか」について、何ら課題の開示も解決も示されていない。 Even if the “use result of the user satisfying the requirement 1” is obtained, the result is “the user has generated and used the translated sentence while confirming the intention transmitted to the other party in a simple manner”. Of these “results”, disclosure of problems and solutions for “whether it is correct answer data”, “how to make the correct answer data uniquely”, and “how to learn the model afterwards” are also shown. Not.

特許文献２には、翻訳結果を複数の逆方向の翻訳機で原言語に戻し、入力文と複数の逆翻訳文との間で文の類似度を機械的に算出することで、元の翻訳結果の良否を評価する内容が開示されている。特許文献２では、複数の逆翻訳文を生成することで、元の翻訳結果の評価を行うことが開示されているが、特許文献２において解決を図っている課題は、翻訳結果についての何らかの自動的な評価を行うことであり、この点において本質的に先の非特許文献１との差は存在しない。すなわち、特許文献２には、何らかの正解文（非特許文献１では参照訳、特許文献２では入力文に相当）と翻訳結果文（非特許文献１では翻訳文、特許文献２では翻訳結果を逆翻訳した逆翻訳文に相当）との間で、一致もしくは類似度を表すスコア（非特許文献１のＢＬＥＵ値に代表されるスコア）を算出し、その値で翻訳結果の良否を評価することのみが開示されている。 In Patent Document 2, the translation result is returned to the original language by a plurality of reverse-direction translators, and the similarity of the sentence between the input sentence and the plurality of reverse-translated sentences is calculated mechanically. The contents for evaluating the quality of the results are disclosed. In Patent Document 2, it is disclosed that the original translation result is evaluated by generating a plurality of back-translated sentences. However, the problem solved in Patent Document 2 is that some automatic processing of the translation result is performed. In this respect, there is essentially no difference from the previous Non-Patent Document 1. That is, Patent Document 2 includes some correct sentence (reference translation in Non-Patent Document 1, equivalent to input sentence in Patent Document 2) and translation result sentence (translation sentence in Non-Patent Document 1, translation result reverse in Patent Document 2). Only a score (score represented by the BLEU value of Non-Patent Document 1) representing the degree of coincidence or similarity with the translated back-translated sentence is calculated, and the quality of the translation result is evaluated with that value. Is disclosed.

なお、特許文献２では翻訳システム自体の学習には言及されていないが、特許文献２の段落番号［００４８］にて、「・・・３つの逆方向翻訳文と原文とのＤＰマッチングを行ない、最大スコアを順方向翻訳文の自動評価の結果とすることができるので、例文の翻訳文だけでなくすべての翻訳文に対する評価を可能とし、評価の労力を少なくし、かつ評価の信頼性を高くすることできる」と記載されており、その目的を、段落番号［０００９］にて、「例文の翻訳文だけでなくすべての翻訳文に対する評価が可能で、評価の信頼性が高く、かつ労力が少ない機械翻訳文の評価方法、および機械翻訳文の評価装置」の構築としている。非特許文献１では事前に必要とされていた正解文（参照訳）が無い場合においても、特許文献２では、逆翻訳文と入力原文との機械的なマッチングスコアによる評価ができることを開示しており、特許文献２では学習について言及されていないが、このスコアを学習に用いることは、非特許文献１との組み合わせにおいて示唆される。 Although Patent Document 2 does not mention learning of the translation system itself, in Paragraph No. [0048] of Patent Document 2, “... Performs DP matching between three backward translated sentences and the original sentence, Since the maximum score can be the result of automatic evaluation of forward translations, it is possible to evaluate not only translations of example sentences but also all translations, reducing the labor of evaluation and increasing the reliability of the evaluation. The purpose of this is described in paragraph No. [0009], and it is possible to evaluate not only translated sentences of example sentences but also all translated sentences. This is a construction of an evaluation method for machine translation sentences and an evaluation apparatus for machine translation sentences ”. Even when there is no correct sentence (reference translation) required in advance in Non-Patent Document 1, Patent Document 2 discloses that evaluation can be performed based on a mechanical matching score between a back-translated sentence and an input original sentence. However, although Patent Document 2 does not mention learning, use of this score for learning is suggested in combination with Non-Patent Document 1.

しかしながら、特許文献２と非特許文献１を組み合わせた場合においても、モデルそのものを学習したい場合や、マッチングスコアが不正な場合や一意に決定されない場合には、依然としてチューニングすることができない。 However, even when Patent Literature 2 and Non-Patent Literature 1 are combined, tuning is still impossible when it is desired to learn the model itself, when the matching score is incorrect, or when the matching score is not uniquely determined.

また、非特許文献１と同様に、仮に「要件１を満たすユーザの利用結果」が得られたとしても、その結果は「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用した」結果である。このような結果をどう評価値として判断するか、さらにどのようにその評価値でモデルの学習を行うかについては、何ら課題の開示も解決も示されていない。 In addition, as in Non-Patent Document 1, even if a “use result of a user satisfying Requirement 1” is obtained, the result is “translated text while confirming the user's intention transmitted to the other party in a simple manner. This is the result of generating and using As for how to judge such a result as an evaluation value and how to learn a model with the evaluation value, there is no disclosure or solution of any problem.

すなわち、「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用」することについては、課題の開示も解法の開示もなされておらず、それを用いた学習についても、先の非特許文献１と同様に何ら課題の開示も解決も示されていない。 In other words, regarding “generation and use of translated sentences while confirming the intention to be communicated to the other party in a simple manner”, neither assignments nor solution methods have been disclosed, and learning using them However, as with the previous Non-Patent Document 1, neither disclosure nor solution of the problem is shown.

また、「（要件３）要件１・要件２の実現と同時に、計算機資源や開発コストを下げること」については、非特許文献１ではＢＬＥＵ値の算出、特許文献２ではマッチングスコアの算出が新たに必要となり、計算機資源・開発コストは増大している。 In addition, regarding “(Requirement 3) Realization of Requirement 1 and Requirement 2 and simultaneously reducing computer resources and development costs”, the calculation of BLEU values in Non-Patent Document 1 and the calculation of matching scores in Patent Document 2 are newly performed. It becomes necessary, and computer resources and development costs are increasing.

３）ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）
非特許文献３及び非特許文献４は、ニューラルネットによる機械翻訳として、ディープニューラルネット（ＤＮＮ：ＤｅｅｐＮｕｅｒａｌＮｅｔ）のうちＲＮＮ（ＲｅｃｕｒｒｅｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）およびＲＮＮの一種とされるＬＳＴＭ（ＬｏｎｇＳｈｏｒｔＴｅｒｍＭｅｍｏｒｙ）を用いたＤＮＮＭＴを例示している。いずれの手法も、対訳コーパスをニューラルネットの入力層および出力層に対する正解データ（正例や負例）として用い、ニューラルネットの中間層を直接学習させることにより、ニューラルネットの内部に翻訳モデルを直接構築するものである。ＤＮＮが内部にどのような形で翻訳モデルを持つかは、学習データの種類や与え方、学習回数、ＤＮＮ自身のネットワーク構成などに依存する。いずれのケースにおいても、どのように内部状態を変更すると、どのように翻訳性能が変化するのかについては開示されていない（学術的にも解明されていない）。本来、ニューラルネットの特性として、非線形な出力を学習できるという点があるが、ＤＮＮとなることで非線形性は飛躍的に増しており、何らかの内部パラメータと出力性能との線形的な因果関係は見いだせていないというのが現状である。 3) Model acquisition type machine translation (DNNMT) by deep neural network
Non-Patent Document 3 and Non-Patent Document 4 are LSTM (Long Short Term Memory), which is a kind of RNN (Recurrent Natural Network) and RNN among deep neural networks (DNN) as machine translation by neural networks. This is an example of DNNNMT using Both methods use the bilingual corpus as correct data (positive and negative examples) for the input and output layers of the neural network, and directly learn the intermediate layer of the neural network, so that the translation model is directly inside the neural network. To build. How a DNN has a translation model inside depends on the type and manner of learning data, the number of times of learning, the network configuration of the DNN itself, and the like. In either case, it is not disclosed how the translation performance changes when the internal state is changed (not clarified academically). Originally, the characteristic of neural networks is that non-linear output can be learned. However, the non-linearity has increased dramatically by becoming DNN, and a linear causal relationship between some internal parameters and output performance can be found. The current situation is not.

言い換えると、ＤＮＮＭＴは「入力文に対して何らかの翻訳結果を返す」という意味では、先のＲＢＭＴやＳＭＴと同じである。しかし、「なぜその翻訳結果が得られたのか」に点においては、ＲＢＭＴはそのルールを記述したデータベースを参照すれば翻訳結果が得られた理由（元となったルール）が分かり、ＳＭＴでは翻訳モデル（各単語・句の発生確率、アライメント確率等）と言語モデル（ｎ−ｇｒａｍ確率）から最大確率だったものが選ばれた翻訳結果であることが分かるのに対し、ＤＮＮＭＴでは、ルールやモデルに相当するものをニューラルネットが自ら構築するため、ニューラルネットの出力層が出してきた結果が翻訳結果の文であった、ということ以上に内部モデル・動作についての知見は得られない。 In other words, DNNNMT is the same as the previous RBMT and SMT in the sense of “returning some translation result for the input sentence”. However, in terms of “why the translation result was obtained”, RBMT can understand the reason why the translation result was obtained by referring to the database describing the rule (the original rule). While DNNNMT shows that the translation result is the one with the highest probability selected from the model (probability of occurrence of each word / phrase, alignment probability, etc.) and the language model (n-gram probability), DNNNMT uses rules and models. Since the neural network constructs the equivalent of the above, the result of the output layer of the neural network was the sentence of the translation result, so we cannot obtain any knowledge about the internal model / operation.

このため、ＤＮＮＭＴは学術研究が中心となっており、実用フェーズには至っておらず「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」のような、実利用面からの課題にまでは至っていない。 For this reason, DNNNMT is mainly in academic research and has not yet reached the practical phase. “(Requirement 1) Translated sentences while checking the user's intention in a simple form without placing a burden on the user. It has not yet reached a problem in terms of actual use, such as “to be able to generate and use“.

また、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」についても、仮に「（要件１）を満たすユーザの利用結果」が得られたとしても、その結果を用いて「その後どうモデルの学習を行うか」については、内部動作の解明が必要であり、当然ながら何ら課題の開示も解決も示されていない。 In addition, “(Requirement 2) User learning result can be automatically learned using the user's usage result (without newly burdening the user)” is also obtained as “User usage result satisfying (Requirement 1)”. Even if the result is used, it is necessary to clarify the internal operation of “how to learn the model after that” using the result, and of course, no disclosure or solution of the problem is shown.

以上をまとめると、従来技術では以下の点で課題を有していた。 In summary, the prior art has the following problems.

・対訳データベースに類似例や訳文が記述されていない場合、及び、分野が異なる場合に、翻訳精度が極めて低い翻訳文を提示する可能性、もしくは全く翻訳できない可能性が有る。 -If similar examples and translations are not described in the bilingual database, and if the fields are different, there is a possibility that a translation with extremely low translation accuracy may be presented, or that translation may not be possible at all.

・翻訳文の内容をユーザが簡便に確認し選択する方法がないため、入力文に対する逆翻訳文の提示や翻訳品質（信頼度）の提示と言った方法では、提示内容の品質・信頼度が低かった場合に、ユーザは再入力を強いられるが、再入力によって品質が向上する保証はなく、ユーザは入力を試行錯誤する他ない。 -Since there is no way for the user to easily check and select the content of the translated text, the quality and reliability of the presented content is not good with the method of presenting the reverse translation text or the translation quality (reliability) for the input text. If it is low, the user is forced to re-input, but there is no guarantee that the quality will be improved by re-input, and the user will have to try and input the input.

・ＢＬＵＥ値など、何らかの方法で機械的に算出したスコアに基づいた翻訳システムの自動チューニングは先行例において開示されているが、「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用した」場合、その結果に基づく評価、及び、学習をどのように行なうかについては、いずれも課題が開示されておらず、解決法も未開示である。 -Although the automatic tuning of a translation system based on a score mechanically calculated by some method such as a BLUE value is disclosed in the previous example, "translation while confirming the intention transmitted to the other party in a simple manner by the user" In the case of “generation / use of sentence”, no problem is disclosed about how to perform evaluation and learning based on the result, and no solution is disclosed.

・類似用例やチューニング用のデータ（評価スコアを算出等）を生成する計算機資源が必要である。また類似用例を作成する開発・人的コストも必要となる。 -Computer resources that generate similar examples and tuning data (e.g., evaluation score calculation) are required. In addition, development and human costs for creating similar examples are also required.

そこで、機械翻訳システムの機能向上のため、以下の改善策を検討した。 Therefore, the following improvement measures were examined to improve the functions of the machine translation system.

機械翻訳システムの機械翻訳方法の一態様は、言語情報を出力する情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行なう機械翻訳システムにおける機械翻訳方法であって、前記第１言語の翻訳対象文を受信し、受信した前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成し、前記複数の異なる前記順翻訳文の各々について前記第１言語へ逆翻訳した複数の逆翻訳文を生成し、前記情報出力装置において前記複数の逆翻訳文を出力しているときに、前記複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文に対応する前記順翻訳文を出力する。 One aspect of the machine translation method of the machine translation system is a machine translation method in a machine translation system that is connected to an information output device that outputs language information and performs a translation process between a first language and a second language, The translation target sentence in the first language is received, a plurality of different forward translation sentences are generated by translating the received translation target sentence into the second language, and each of the different forward translation sentences includes the first translation sentence. An operation of generating a plurality of back-translated sentences back-translated into a language and selecting one back-translated sentence from the plurality of back-translated sentences when the information output device outputs the plurality of back-translated sentences. If accepted, the forward translation sentence corresponding to the one reverse translation sentence is output.

上記態様によると、第１言語の翻訳対象文を第２言語へ翻訳した複数の異なる順翻訳文を生成し、複数の順翻訳文の各々について第１言語へ逆翻訳した複数の逆翻訳文を生成し、情報出力装置において複数の逆翻訳文を出力しているときに、複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文に対応する順翻訳文を出力する。 According to the above aspect, a plurality of different forward-translated sentences obtained by translating the sentence to be translated in the first language into the second language are generated, and the plurality of reverse-translated sentences reversely translated into the first language for each of the plurality of forward-translated sentences. When an operation for selecting one reverse translation sentence from a plurality of reverse translation sentences is received when generating and outputting a plurality of reverse translation sentences in the information output device, the order corresponding to the one reverse translation sentence Output translated text.

例えば、受信した翻訳対象文に対応する複数の逆翻訳文をユーザへ提示し、その中からユーザに選択された逆翻訳文に対応する順翻訳文を提示する。従って、ユーザは、複数の逆翻訳文の中から、自身が入力した翻訳対象文の意図に一番近い逆翻訳文を選択することとなるため、例えば、第１言語で入力された翻訳対象文を第２言語へ翻訳した一の翻訳文、および当該翻訳文に対応する一の逆翻訳文のみが提示されるシステムと比較すると、逆翻訳文が翻訳対象文の意図する内容と異なることで、翻訳対象文の修正または入力のやり直しなどが求められる場面が少なくなる。 For example, a plurality of reverse translation sentences corresponding to the received translation target sentence are presented to the user, and a forward translation sentence corresponding to the reverse translation sentence selected by the user is presented. Accordingly, the user selects the reverse translation sentence closest to the intention of the translation target sentence input by the user from the plurality of reverse translation sentences. For example, the translation target sentence input in the first language is selected. Compared with a system in which only one translated sentence translated into the second language and one reverse translated sentence corresponding to the translated sentence are presented, the reverse translated sentence is different from the intended content of the translation target sentence. The number of scenes where correction of the translation target sentence or re-input is required is reduced.

また、例えば、ユーザによって複数の逆翻訳文の中から一の逆翻訳文が選択されるため、機械翻訳システムは、入力された翻訳対象文の意図する内容として、提示した複数の逆翻訳文の中ではどの逆翻訳文が一番妥当であるか、もしくはユーザの好みの表現に合うか、などのフィードバックを得ることが可能である。そのため、例えば、上記態様における機械翻訳システムに機械学習を適用する場合には、入力された翻訳対象文に対して提示した逆翻訳文が妥当であるか否かという評価に加えて、提示した複数の逆翻訳文の中ではいずれの逆翻訳文が妥当であるかという評価も得られる。このとき、機械翻訳システムにおける一度の翻訳動作によって、複数の逆翻訳文に対するフィードバックが得られるので、機械翻訳システムの高い学習効率を実現できる。 In addition, for example, since one reverse translation sentence is selected from a plurality of reverse translation sentences by the user, the machine translation system uses the plurality of presented reverse translation sentences as the intended contents of the input translation target sentence. Among them, it is possible to obtain feedback such as which back-translated sentence is most appropriate or whether it matches the user's favorite expression. Therefore, for example, when machine learning is applied to the machine translation system in the above aspect, in addition to the evaluation of whether or not the reverse translation sentence presented for the input translation target sentence is valid, It is possible to obtain an evaluation as to which of the reverse translated sentences is valid. At this time, feedback for a plurality of reverse translation sentences can be obtained by a single translation operation in the machine translation system, so that high learning efficiency of the machine translation system can be realized.

さらに、上記態様によると、機械翻訳システムにおける機械学習について、ＢＬＥＵ値の算出による翻訳精度の評価、または入力された翻訳対象文と逆翻訳文との機械的なマッチングスコアによる翻訳精度の評価などが必要なく、ユーザによる逆翻訳文の選択によって学習用データが生成される。そのため、学習用データの生成に新たな計算機資源を必要とせず、開発コストを抑えることもできる。 Furthermore, according to the above aspect, for machine learning in the machine translation system, evaluation of translation accuracy by calculating a BLEU value, or evaluation of translation accuracy by a mechanical matching score between an input translation target sentence and a reverse translation sentence, etc. There is no need to generate learning data by selecting a reverse translation sentence by the user. Therefore, new computer resources are not required for generating learning data, and development costs can be reduced.

上記態様において、例えば、前記機械翻訳システムは、さらに、ユーザによる音声入力を受け付ける音声入力装置、およびユーザによるテキスト入力を受け付けるテキスト入力装置と接続し、前記翻訳対象文は、当該翻訳文を表す音声情報、またはテキスト情報の形態で受信し、前記音声情報、または前記テキスト情報のいずれの形態で前記翻訳対象文を受信したかに応じて、前記一の逆翻訳文に対応する前記順翻訳文の出力の形態を変更するとしてもよい。 In the above aspect, for example, the machine translation system is further connected to a voice input device that accepts a voice input by a user and a text input device that accepts a text input by a user, and the translation target sentence is a voice representing the translation sentence Information or text information, and depending on whether the speech information or text information is received as the translation target sentence, the forward translation sentence corresponding to the one reverse translation sentence The output form may be changed.

上記態様によると、翻訳対象文を、音声情報、またはテキスト情報のいずれの形態で受信したかに応じて、順翻訳文の出力の形態を変更する。これによって、例えば、入力モーダルに応じて出力のモーダルが決定されるため、ユーザは入力の形態を変えることで出力の形態を自由に決定できる。 According to the above aspect, the output form of the forward translation sentence is changed according to whether the sentence to be translated is received in the form of audio information or text information. As a result, for example, the output modal is determined in accordance with the input modal. Therefore, the user can freely determine the output form by changing the input form.

上記態様において、例えば、前記情報出力装置は、音声出力装置およびディスプレイを有し、前記翻訳対象文を音声情報の形態で受信した場合は、前記一の逆翻訳文に対応する前記順翻訳文を、前記音声出力装置を介して出力し、前記翻訳対象文をテキスト情報の形態で受信した場合は、前記一の逆翻訳文に対応する前記順翻訳文を、前記ディスプレイを介して出力するとしてもよい。 In the above aspect, for example, the information output device includes an audio output device and a display, and when the translation target sentence is received in the form of audio information, the forward translation sentence corresponding to the one reverse translation sentence is displayed. If the sentence to be translated is received in the form of text information, the forward translation sentence corresponding to the one reverse translation sentence may be outputted via the display. Good.

これによると、入力の形式と出力の形式がそれぞれ同じモーダルで対応しているため、ユーザは、自身の希望する出力形態で翻訳対象文を入力すれば良く、いずれの入力形態で入力すれば希望する出力形態で翻訳文が出力されるかの混乱が生じない。 According to this, since the input format and the output format correspond to each other in the same modal, the user only has to input the sentence to be translated in his / her desired output format, and any input format is desired. There is no confusion as to whether the translated sentence is output in the output form.

上記態様において、例えば、前記翻訳対象文は、当該翻訳対象文を示すテキスト情報で受信し、前記テキスト情報に基づいて、前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成するとしてもよい。 In the above aspect, for example, the translation target sentence is received as text information indicating the translation target sentence, and a plurality of different forward-translated sentences obtained by translating the translation target sentence into the second language based on the text information. It may be generated.

上記態様において、例えば、前記機械翻訳システムは、さらに、ユーザによるテキスト入力を受け付けるテキスト入力装置と接続し、前記翻訳対象文は、前記テキスト入力装置から、当該翻訳対象文を示すテキスト情報で受信するとしてもよい。 In the above aspect, for example, the machine translation system is further connected to a text input device that accepts text input by a user, and the translation target sentence is received from the text input device as text information indicating the translation target sentence. It is good.

上記態様において、例えば、前記翻訳対象文は、当該翻訳対象文を表す音声情報で受信し、受信した前記音声情報に対して音声認識処理を行って前記翻訳対象文を示すテキスト情報を生成し、前記テキスト情報に基づいて、前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成するとしてもよい。 In the above aspect, for example, the translation target sentence is received as voice information representing the translation target sentence, and the received voice information is subjected to voice recognition processing to generate text information indicating the translation target sentence. A plurality of different forward-translated sentences obtained by translating the sentence to be translated into the second language may be generated based on the text information.

これによって、音声を用いた翻訳対象文の入力が可能となるので、例えば、翻訳対象文をキーボード、タッチディスプレイなどを用いて入力する必要が無いので、ユーザは簡単に翻訳対象文を入力できる。 As a result, it becomes possible to input a translation target sentence using speech. For example, it is not necessary to input the translation target sentence using a keyboard, a touch display, or the like, so that the user can easily input the translation target sentence.

上記態様において、例えば、前記機械翻訳システムは、さらに、ユーザの音声の入力を受け付ける音声入力装置と接続し、前記翻訳対象文は、前記音声入力装置から、当該翻訳対象文を表す音声情報で受信するとしてもよい。 In the above aspect, for example, the machine translation system is further connected to a voice input device that receives an input of a user's voice, and the translation target sentence is received as voice information representing the translation target sentence from the voice input device. You may do that.

上記態様において、例えば、前記情報出力装置はディスプレイを有し、前記複数の逆翻訳文は、前記ディスプレイの第１領域に表示され、前記ディスプレイの第１領域とは異なる第２領域に、前記翻訳対象文が表示されるとしてもよい。 In the above aspect, for example, the information output device includes a display, and the plurality of reverse translation sentences are displayed in a first area of the display, and the translation is performed in a second area different from the first area of the display. The target sentence may be displayed.

上記態様によると、逆翻訳文と翻訳対象文とで表示する領域を分ける。逆翻訳文と翻訳対象文とは同じ言語の文章であるため、ユーザは、いずれが逆翻訳文でいずれが翻訳対象文であるかを簡単に見分けることができ、混同することがない。 According to the said aspect, the area | region displayed by a reverse translation sentence and a translation object sentence is divided. Since the reverse translation sentence and the translation target sentence are sentences in the same language, the user can easily distinguish which is the reverse translation sentence and which is the translation target sentence, and there is no confusion.

上記態様において、例えば、前記ディスプレイの第３領域に、前記一の逆翻訳文に対応する前記順翻訳文が表示されるとしてもよい。 In the above aspect, for example, the forward translation sentence corresponding to the one reverse translation sentence may be displayed in the third area of the display.

これによって、逆翻訳文、翻訳対象文、順翻訳文のそれぞれが異なる領域に表示されるため、それぞれいずれの文章であるか、ユーザにとって分かりやすい。 As a result, the reverse translation sentence, the translation target sentence, and the forward translation sentence are displayed in different areas, so that it is easy for the user to determine which sentence each is.

上記態様において、例えば、前記情報出力装置に対する操作に応じて、前記一の逆翻訳文に対応する前記順翻訳文の表示の向きが変更されるとしてもよい。 In the above aspect, for example, the display direction of the forward translation sentence corresponding to the one reverse translation sentence may be changed according to an operation on the information output device.

これによって、例えば、それぞれ異なる言語を話す、ユーザＡおよびユーザＢが向かい合って会話をする場合、ユーザＡが話す言語で入力した翻訳対象文に対応してユーザＢが話す言語で出力された順翻訳文が情報出力装置に表示されているときに順翻訳文の向きを、例えば、逆向きに変更できれば、ユーザＡが順翻訳文を読み上げたり、情報出力装置自体の向きを変更したりして、ユーザＢに順翻訳文の内容を伝える必要が無く、向かい合った二人のユーザが情報出力装置を上から覗き込むようにして異なる言語間のコミュニケーションを図ることが可能である。 Thus, for example, when the user A and the user B speak in different languages, the forward translation is output in the language spoken by the user B corresponding to the translation target sentence input in the language spoken by the user A. When the sentence is displayed on the information output device, the direction of the forward-translated sentence can be changed, for example, in the reverse direction, the user A reads out the forward-translated sentence, or changes the direction of the information output apparatus itself, There is no need to convey the contents of the forward translation sentence to the user B, and communication between different languages can be achieved by two users facing each other looking into the information output device from above.

上記態様において、例えば、前記順翻訳文の表示の向きは、前記第１領域に表示される前記複数の逆翻訳文の表示の向きとは異なる向きへ変更されるとしてもよい。 In the above aspect, for example, the display direction of the forward translation sentence may be changed to a direction different from the display direction of the plurality of reverse translation sentences displayed in the first area.

上記態様において、例えば、前記順翻訳文の表示の向きは、前記第１領域に表示される前記複数の逆翻訳文の表示の向きと同じ向きへ変更されるとしてもよい。 In the above aspect, for example, the display direction of the forward translation sentence may be changed to the same direction as the display direction of the plurality of reverse translation sentences displayed in the first area.

上記態様において、例えば、前記一の逆翻訳文に対応する前記順翻訳文は、前記第１領域に表示される前記複数の逆翻訳文とは異なる向きで表示されるとしてもよい。 In the above aspect, for example, the forward translation sentence corresponding to the one reverse translation sentence may be displayed in a direction different from the plurality of reverse translation sentences displayed in the first area.

上記態様において、例えば、前記機械翻訳システムは、受信した前記翻訳対象文を前記第２言語へ翻訳した前記順翻訳文の集合であって、前記複数の異なる順翻訳文を含む順翻訳文群を生成し、前記順翻訳文群に含まれる前記順翻訳文の各々について、疑問文、肯定文、否定文、命令文の中のいずれの形態に分類されるかを判断し、前記複数の異なる順翻訳文は、分類された前記形態に基づいて、前記順翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of the forward translation sentences obtained by translating the received sentence to be translated into the second language, and includes a forward translation sentence group including the plurality of different forward translation sentences. Generating and determining whether each of the forward translation sentences included in the forward translation sentence group is classified into a question sentence, a positive sentence, a negative sentence, or a command sentence, and the plurality of different orders A translation sentence may be selected from the group of forward translation sentences based on the classified form.

上記態様によると、順翻訳文群の中から、文章の形態に基づいて複数の異なる順翻訳文が選択されるため、例えば、翻訳対象文に基づいて機械的に生成された順翻訳文群から、翻訳対象文の形態と同じ形態である順翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文の形態と異なる形態である順翻訳文を含んで複数の順翻訳文を選択してもよく、これによって、順翻訳文に基づいて生成され、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different forward-translated sentences are selected from the forward-translated sentence group based on the form of the sentence, for example, from the forward-translated sentence group mechanically generated based on the translation target sentence Only the forward translation sentence having the same form as the translation target sentence can be selected, and the final translation accuracy can be improved. Further, for example, a plurality of forward-translated sentences including a forward-translated sentence that is different from the form of the sentence to be translated may be selected, thereby being generated based on the forward-translated sentence and presented to the user. You can increase the variation of multiple back-translated sentences. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記複数の異なる順翻訳文は、各々異なる前記形態に分類された少なくとも２以上の前記順翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of different forward-translated sentences may include at least two or more forward-translated sentences classified into the different forms.

これによって、順翻訳文に基づいて生成され、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 Thereby, the variation of the several reverse translation sentence produced | generated based on a forward translation sentence and shown to a user can be increased. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記機械翻訳システムは、受信した前記翻訳対象文を前記第２言語へ翻訳した前記順翻訳文の集合である順翻訳文群を生成し、前記順翻訳文群は、前記複数の異なる順翻訳文を含み、前記順翻訳文群に含まれる前記順翻訳文各々の主語、または述語を判断し、前記複数の異なる順翻訳文は、判断された前記主語または前記述語に基づいて、前記順翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system generates a forward translation sentence group that is a set of the forward translation sentences obtained by translating the received translation target sentence into the second language, and the forward translation sentence group includes the forward translation sentence group, A plurality of different forward-translated sentences, a subject or a predicate of each of the forward-translated sentences included in the forward-translated sentence group is determined, and the plurality of different forward-translated sentences are included in the determined subject or previous description word Based on the forward translation sentence group, the forward translation sentence group may be selected.

上記態様によると、順翻訳文群の中から、主語または述語に基づいて複数の異なる順翻訳文が選択されるため、例えば、翻訳対象文に基づいて機械的に生成された順翻訳文群から、翻訳対象文と同じ主語または述語を有する順翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文と異なる主語または述語を有する順翻訳文を含んで複数の順翻訳文を選択してもよく、これによって、順翻訳文に基づいて生成され、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different forward translation sentences are selected from the forward translation sentence group based on the subject or predicate, for example, from the forward translation sentence group mechanically generated based on the translation target sentence Only the forward translation sentence having the same subject or predicate as the translation object sentence can be selected, and the final translation accuracy can be improved. In addition, for example, a plurality of forward translation sentences including a forward translation sentence having a subject or predicate different from the translation target sentence may be selected, thereby being generated based on the forward translation sentence and presented to the user. You can increase the variation of multiple back-translated sentences. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記複数の異なる順翻訳文は、各々異なる主語、または述語を含むと判断された少なくとも２以上の順翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of different forward-translated sentences may include at least two or more forward-translated sentences that are determined to include different subjects or predicates.

上記態様において、例えば、前記複数の異なる順翻訳文は、各々同一の主語、または述語を含むと判断された順翻訳文であるとしてもよい。 In the above aspect, for example, the plurality of different forward-translated sentences may be forward-translated sentences that are determined to include the same subject or predicate.

これによって、例えば、翻訳対象文に基づいて機械的に生成された順翻訳文群から、翻訳対象文と同じ主語または述語を有する順翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。 As a result, for example, only forward translation sentences having the same subject or predicate as the translation target sentence can be selected from a group of forward translation sentences mechanically generated based on the translation target sentence. Can be improved.

上記態様において、例えば、前記機械翻訳システムは、前記複数の異なる前記順翻訳文の各々に対して少なくとも一以上生成した前記逆翻訳文の集合であって、前記複数の逆翻訳文を含む逆翻訳文群を生成し、前記逆翻訳群に含まれる前記逆翻訳文の各々について、前記翻訳対象文との類似度を評価した評価値を算出し、前記複数の逆翻訳文は、前記評価値に基づいて、前記逆翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of at least one reverse translation sentence generated for each of the plurality of different forward translation sentences, and includes the plurality of reverse translation sentences. A sentence group is generated, and for each of the reverse translation sentences included in the reverse translation group, an evaluation value that evaluates the similarity with the translation target sentence is calculated, and the plurality of reverse translation sentences are calculated based on the evaluation value. Based on the reverse translation sentence group, it may be selected.

上記態様によると、逆翻訳文群の中から、翻訳対象文との類似度に基づいて複数の異なる逆翻訳文が選択されるため、例えば、翻訳対象文との類似度が高い逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文との類似度が低い逆翻訳文を含んで複数の逆翻訳文を選択してもよく、これによって、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different back-translated sentences are selected from the back-translated sentence group based on the similarity to the translation target sentence, for example, only the reverse translation sentence having a high similarity to the translation target sentence Can be selected, and the final translation accuracy can be improved. In addition, for example, a plurality of back-translated sentences may be selected including a back-translated sentence having a low degree of similarity to the translation target sentence, thereby increasing variations of the plurality of back-translated sentences presented to the user. Can do. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記機械翻訳システムは、前記複数の異なる前記順翻訳文の各々に対して少なくとも一以上生成した前記逆翻訳文の集合であって、前記複数の逆翻訳文を含む逆翻訳文群を生成し、前記逆翻訳文群に含まれる前記逆翻訳文の各々について、疑問文、肯定文、否定文、命令文の中のいずれの形態に分類されるかを判断し、前記複数の逆翻訳文は、分類された前記形態に基づいて、前記逆翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of at least one reverse translation sentence generated for each of the plurality of different forward translation sentences, and includes the plurality of reverse translation sentences. Generating a sentence group, determining whether each of the reverse-translated sentences included in the reverse-translated sentence group is classified into a question sentence, an affirmative sentence, a negative sentence, or an imperative sentence; The reverse translation sentence may be selected from the reverse translation sentence group based on the classified form.

上記態様によると、逆翻訳文群の中から、文章の形態に基づいて複数の異なる逆翻訳文が選択されるため、例えば、逆翻訳文群から、翻訳対象文の形態と同じ形態である逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文の形態と異なる形態である逆翻訳文を含んで複数の逆翻訳文を選択してもよく、これによって、ユーザに提示される複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different back-translated sentences are selected from the back-translated sentence group based on the form of the sentence, for example, from the back-translated sentence group, the reverse is the same form as the form of the translation target sentence. Only the translation can be selected, and the final translation accuracy can be improved. In addition, for example, a plurality of back-translated sentences may be selected including a back-translated sentence that is different from the form of the sentence to be translated, thereby increasing variations of the plurality of back-translated sentences presented to the user. Can do. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記複数の逆翻訳文は、各々異なる前記形態に分類された少なくとも２以上の前記逆翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of reverse translation sentences may include at least two or more reverse translation sentences classified into different forms.

これによって、ユーザに提示される複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 Thereby, the variation of the some reverse translation sentence shown to a user can be increased. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記機械翻訳システムは、前記複数の異なる前記順翻訳文の各々に対して少なくとも一以上生成した前記逆翻訳文の集合であって、前記複数の逆翻訳文を含む逆翻訳文群を生成し、前記逆翻訳文群に含まれる前記逆翻訳文各々の主語、または述語を判断し、前記複数の逆翻訳文は、判断された前記主語または前記述語に基づいて、前記逆翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of at least one reverse translation sentence generated for each of the plurality of different forward translation sentences, and includes the plurality of reverse translation sentences. Generating a sentence group, determining a subject or predicate of each of the reverse-translated sentences included in the reverse-translated sentence group, and the plurality of reverse-translated sentences are based on the determined subject or predescription word, The reverse translation sentence group may be selected.

上記態様によると、逆翻訳文群の中から、主語または述語に基づいて複数の異なる逆翻訳文が選択されるため、例えば、逆翻訳文群から、翻訳対象文と同じ主語または述語を有する逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文と異なる主語または述語を有する逆翻訳文を含んで複数の逆翻訳文を選択してもよく、これによって、ユーザに提示される複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different back-translated sentences are selected based on the subject or predicate from the back-translated sentence group, for example, a reverse having the same subject or predicate as the translation target sentence from the back-translated sentence group. Only the translation can be selected, and the final translation accuracy can be improved. In addition, for example, a plurality of back-translated sentences including a back-translated sentence having a subject or predicate different from the translation target sentence may be selected, thereby increasing variations of the plurality of back-translated sentences presented to the user. Can do. Therefore, for example, when applying machine learning to a machine translation system, if a user selects one reverse translation sentence from a plurality of reverse translation sentences having similar contents, the user inputs even a reverse translation sentence that has not been selected. When the intention indicated by the translated sentence is expressed, it is possible to prevent the machine translation system from learning that the translated sentence is wrong because it is not selected.

上記態様において、例えば、前記複数の逆翻訳文は、各々異なる主語、または述語を含むと判断された少なくとも２以上の逆翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of back-translated sentences may include at least two or more back-translated sentences determined to include different subjects or predicates.

上記態様において、例えば、前記複数の逆翻訳文は、各々同一の主語、または述語を含むと判断された逆翻訳文であるとしてもよい。 In the above aspect, for example, the plurality of back-translated sentences may be back-translated sentences that are determined to include the same subject or predicate.

これによって、例えば、逆翻訳文群から、翻訳対象文と同じ主語または述語を有する逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。 Thereby, for example, only a reverse translation sentence having the same subject or predicate as the translation target sentence can be selected from the reverse translation sentence group, and the final translation accuracy can be improved.

上記態様において、例えば、前記機械翻訳システムは、前記翻訳処理において参照する確率モデルを管理し、前記翻訳処理において、機械学習を適応し、前記複数の逆翻訳文の中のいずれの前記逆翻訳文が前記一の逆翻訳文として選択されたかを示す情報に基づいて、前記機械学習を行なって、前記確率モデルのパラメータを更新するとしてもよい。 In the above aspect, for example, the machine translation system manages a probability model to be referred to in the translation process, applies machine learning in the translation process, and selects any one of the plurality of reverse translation sentences. The machine learning may be performed based on information indicating whether is selected as the one reverse translation sentence, and the parameters of the probability model may be updated.

上記態様によると、前記複数の逆翻訳文の中のいずれの前記逆翻訳文が前記一の逆翻訳文として選択されたかを示す情報に基づいて、前記機械学習を行なって、前記確率モデルのパラメータを更新する。これによって、翻訳対象文に対して提示された複数の逆翻訳文の中のいずれの逆翻訳文が選択されたかを示す情報がシステムに反映されるため、機械翻訳システムが使用されることで翻訳精度を向上させることが可能である。 According to the above aspect, the machine learning is performed based on information indicating which one of the plurality of back-translated sentences is selected as the one back-translated sentence. Update. As a result, information indicating which reverse-translated sentence is selected from the multiple reverse-translated sentences presented for the translation target sentence is reflected in the system. The accuracy can be improved.

上記態様において、例えば、前記確率モデルは、前記翻訳処理に用いられる単語またはフレーズ毎に付与される重み値を含み、前記機械翻訳システムは、前記一の逆翻訳文に対応する順翻訳文である選択順翻訳文に含まれる単語またはフレーズと、前記一の逆翻訳文以外の逆翻訳文に対応する順翻訳文である非選択翻訳文に含まれる単語またはフレーズと、を比較し、前記選択順翻訳文にのみ含まれる単語またはフレーズと、前記非選択順翻訳文にのみ含まれる単語またはフレーズと、前記選択順翻訳文と前記非選択順翻訳文の双方に含まれる単語またはフレーズとに対して、各々異なる前記重み値の更新方法を適応して前記重み値を更新し、更新された前記重み値と、更新された前記重み値に対応する前記単語または前記フレーズを教師データとして用いて前記機械学習を行なうとしてもよい。 In the above aspect, for example, the probability model includes a weight value assigned to each word or phrase used in the translation process, and the machine translation system is a forward translation sentence corresponding to the one reverse translation sentence. A word or phrase included in the selected order translation sentence is compared with a word or phrase included in a non-selected translation sentence that is a forward translation sentence corresponding to a reverse translation sentence other than the one reverse translation sentence. For a word or phrase included only in a translated sentence, a word or phrase included only in the non-selected ordered translated sentence, and a word or phrase included in both the selected ordered translated sentence and the non-selected ordered translated sentence The weight value is updated by applying different weight value update methods, and the updated weight value and the word or the phrase corresponding to the updated weight value are updated as a teacher data. May perform the machine learning is used as data.

これによって、例えば、選択順翻訳文に含まれる単語またはフレーズと、選択順翻訳文に含まれない単語またはフレーズとで、スコアに差を付けて機械学習を行うことができるので、非選択順翻訳文の中に含まれる単語またはフレーズであっても、重み値の更新においてプラスの評価が行なわれる場合がある。そのため、非選択順翻訳文において、部分的に正しい翻訳が行なわれていた場合に、その部分を正しく評価可能であり、ユーザの評価結果を反映できる。 Thus, for example, machine learning can be performed with a difference in score between a word or phrase included in the selected order translation sentence and a word or phrase not included in the selection order translation sentence. Even a word or phrase included in a sentence may be positively evaluated in updating the weight value. Therefore, in a non-selection order translation sentence, when a partly correct translation is performed, the part can be correctly evaluated, and a user's evaluation result can be reflected.

さらに、機械学習によって、確率モデルに対して、逐次、単語またはフレーズ単位でユーザの選択結果を反映させながら、学習させることが可能となり、翻訳精度を向上させることができる。 Furthermore, it is possible to perform learning while reflecting the user's selection result in units of words or phrases sequentially with respect to the probability model by machine learning, thereby improving translation accuracy.

上記態様において、例えば、前記確率モデルは、前記翻訳処理に用いられる単語またはフレーズ毎に付与される重み値を含み、前記機械翻訳システムは、前記一の逆翻訳文に含まれる単語またはフレーズと、前記一の逆翻訳文以外の逆翻訳文である非選択逆翻訳文に含まれる単語またはフレーズと、を比較し、前記一の逆翻訳文にのみ含まれる単語またはフレーズと、前記非選択逆翻訳文にのみ含まれる単語またはフレーズと、前記一の逆翻訳文と前記非選択逆翻訳文の双方に含まれる単語またはフレーズとに対して、各々異なる前記重み値の更新方法を適応して前記重み値を更新し、更新された前記重み値と、更新された前記重み値に対応する前記単語または前記フレーズを教師データとして用いて前記機械学習を行なうとしてもよい。 In the above aspect, for example, the probability model includes a weight value assigned to each word or phrase used in the translation process, and the machine translation system includes a word or phrase included in the one reverse translation sentence, A word or phrase included in a non-selected back-translated sentence that is a back-translated sentence other than the one back-translated sentence, and a word or phrase included only in the one back-translated sentence and the non-select back-translated sentence Applying different weight value update methods to the words or phrases included only in the sentence and the words or phrases included in both the one back-translated sentence and the non-selected back-translated sentence, the weights are applied. A value may be updated, and the machine learning may be performed using the updated weight value and the word or the phrase corresponding to the updated weight value as teacher data.

これによって、例えば、選択された一の逆翻訳文に含まれる単語またはフレーズと、選択された一の逆翻訳文に含まれない単語またはフレーズとで、スコアに差を付けて機械学習を行うことができるので、非選択逆翻訳文の中に含まれる単語またはフレーズであっても、重み値の更新においてプラスの評価が行なわれる場合がある。そのため、非選択逆翻訳文において、部分的に正しい翻訳が行なわれていた場合に、その部分を正しく評価可能であり、ユーザの評価結果を反映できる。 Thus, for example, machine learning is performed with a difference in score between a word or phrase included in one selected reverse translation sentence and a word or phrase not included in one selected reverse translation sentence. Therefore, there is a case where a positive evaluation is performed in updating the weight value even for a word or a phrase included in the non-selected back-translated sentence. Therefore, in a non-selection reverse translation sentence, when a partly correct translation is performed, the part can be evaluated correctly and the evaluation result of the user can be reflected.

上記態様において、例えば、前記機械翻訳システムは、前記一の逆翻訳文に含まれる単語またはフレーズと、前記一の逆翻訳文以外の逆翻訳文である非選択逆翻訳文に含まれる単語またはフレーズと、を比較し、前記一の逆翻訳文にのみ含まれる単語またはフレーズと、前記非選択逆翻訳文にのみ含まれる単語またはフレーズと、前記一の逆翻訳文と前記非選択逆翻訳文の双方に含まれる単語またはフレーズとに対して、各々異なる前記重み値の更新方法を適応して前記重み値を更新し、更新された前記重み値と、更新された前記重み値に対応する前記単語または前記フレーズを教師データとして用いて前記機械学習を行なうとしてもよい。 In the above aspect, for example, the machine translation system includes a word or phrase included in the one reverse translation sentence and a word or phrase included in a non-selected reverse translation sentence that is a reverse translation sentence other than the one reverse translation sentence. And the word or phrase included only in the one reverse-translated sentence, the word or phrase included only in the non-selected reverse-translated sentence, the one reverse-translated sentence and the non-selected reverse-translated sentence The weight value is updated by applying different weight value update methods to words or phrases included in both, and the updated weight value and the word corresponding to the updated weight value Alternatively, the machine learning may be performed using the phrase as teacher data.

上記態様において、例えば、前記重み値は、前記一の逆翻訳文のみに対応する前記単語について、正例としての値であり、前記一の逆翻訳文以外の文のみに対応する前記単語について、負例としての値であるとしてもよい。 In the above aspect, for example, the weight value is a value as a positive example for the word corresponding to only the one reverse translation sentence, and the word corresponding to only a sentence other than the one reverse translation sentence, It may be a negative example value.

これによって、前記重み値に対して、プラスの評価とマイナスの評価の双方を反映できる。 Thereby, both positive evaluation and negative evaluation can be reflected on the weight value.

上記態様において、例えば、前記機械学習は、強化学習、識別学習、ニューラルネット学習の中の少なくとも一を用いた学習であるとしてもよい。 In the above aspect, for example, the machine learning may be learning using at least one of reinforcement learning, identification learning, and neural network learning.

また、他の態様において、第１言語と第２言語との間の翻訳処理を行う機械翻訳装置であって、前記第１言語の翻訳対象文の入力を受け付ける入力部と、前記翻訳対象文を前記第２言語へ翻訳した順翻訳文、前記順翻訳文を前記第１言語へ逆翻訳した逆翻訳文、を生成する翻訳部と、前記逆翻訳文および前記一の逆翻訳文に対応する前記順翻訳文を出力する出力部と、ユーザの入力を受け付けるユーザ入力部と、を備え、前記翻訳部は、前記翻訳対象文について複数の異なる前記順翻訳文を生成し、前記複数の異なる前記順翻訳文の各々に対応する複数の逆翻訳文を生成し、前記出力部は、前記複数の逆翻訳文を出力しているときに、前記ユーザ入力部において、前記複数の逆翻訳文から一の逆翻訳文を選択する入力を受け付けた場合、前記一の逆翻訳文に対応する前記順翻訳文を出力する。 In another aspect, the machine translation apparatus performs a translation process between a first language and a second language, and includes an input unit that receives an input of a translation target sentence in the first language, and the translation target sentence. A translation unit that generates a forward translation sentence translated into the second language, a reverse translation sentence obtained by reverse-translating the forward translation sentence into the first language, and the reverse translation sentence and the one reverse translation sentence An output unit that outputs a forward-translated sentence; and a user input unit that accepts a user input, wherein the translation unit generates a plurality of different forward-translated sentences for the translation target sentence, and A plurality of back-translated sentences corresponding to each of the translated sentences are generated, and the output unit outputs one of the plurality of back-translated sentences from the plurality of back-translated sentences when outputting the plurality of back-translated sentences. If an input to select a reverse translation is received, To output the forward translation sentence corresponding to the reverse translation of.

また、第２の他の態様において、情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行う機械翻訳装置の動作を制御するプログラムであって、前記機械翻訳装置のコンピュータに対して、前記第１言語の翻訳対象文を受信させ、受信した前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成させ、前記複数の異なる前記順翻訳文の各々について前記第１言語へ逆翻訳した複数の逆翻訳文を生成させ、前記情報出力装置において、前記複数の逆翻訳文を表示させているときに、前記複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文を前記第２言語へ翻訳した順翻訳文を出力させる。 According to a second other aspect, there is provided a program for controlling an operation of a machine translation device connected to an information output device and performing a translation process between a first language and a second language. The computer receives the translation target sentence in the first language, generates a plurality of different forward translation sentences obtained by translating the received translation target sentence into the second language, and outputs the plurality of different forward translation sentences. When a plurality of reverse translation sentences reversely translated into the first language are generated for each, and the plurality of reverse translation sentences are displayed in the information output device, one reverse translation from the plurality of reverse translation sentences When an operation for selecting a sentence is accepted, a forward-translated sentence obtained by translating the one reverse-translated sentence into the second language is output.

（実施の形態）
以下本発明の実施の形態について、図面を参照しながら説明する。 (Embodiment)
Embodiments of the present invention will be described below with reference to the drawings.

なお以下の実施の形態では、翻訳前の言語である原言語を日本語、翻訳後の言語である目的言語を英語として説明している箇所があるが、これらは一例であり、原言語と目的語の対はどのような組み合わせの言語対であっても構わない。 In the following embodiment, there are places where the source language, which is the language before translation, is described as Japanese, and the target language, which is the language after translation, is described as English. The word pair may be any combination of language pairs.

原言語から目的言語への翻訳で得られた翻訳文を順翻訳文、目的言語から原言語への翻訳で得られた翻訳文を逆翻訳文と表記する。 A translation sentence obtained by translation from the source language to the target language is denoted as a forward translation sentence, and a translation sentence obtained by translation from the target language to the source language is denoted as a reverse translation sentence.

また、ユーザに提示される逆翻訳文を、ユーザ提示文と表記し、ユーザが選択した文を（ユーザ）選択文、選択しなかった逆翻訳文を（ユーザ）非選択文と表記する。 Further, the reverse translation sentence presented to the user is referred to as a user presentation sentence, the sentence selected by the user is referred to as a (user) selected sentence, and the reverse translation sentence that is not selected is referred to as a (user) non-selected sentence.

図１は、本実施の形態における、システムの全体構成の一例を示す図である。情報表示端末１００、ネットワーク２００、翻訳サーバ３００、マイク４００、スピーカー５００を備える。情報表示端末１００の例としては、スマートフォンやタブレット端末、専用表示機器端末、パーソナルコンピュータ（ＰＣ）などが挙げられる。ここに挙げたもの以外でも、ユーザと情報のやりとりができる端末であれば何でもよい。 FIG. 1 is a diagram showing an example of the overall configuration of the system in the present embodiment. An information display terminal 100, a network 200, a translation server 300, a microphone 400, and a speaker 500 are provided. Examples of the information display terminal 100 include a smartphone, a tablet terminal, a dedicated display device terminal, and a personal computer (PC). Other than those listed here, any terminal that can exchange information with the user may be used.

また、情報表示端末１００におけるユーザの入力操作は、テキストでの入力、音声による入力などが想定される。テキストでの入力においては、例えば、タッチパネルによる入力や、キーボードによる入力が考えられる。また、音声による入力の場合、例えば、マイクによる入力が考えられる。この他にも、例えば、ジェスチャによる入力などを用いるとしてもよい。 In addition, the user's input operation on the information display terminal 100 may be input by text, input by voice, or the like. In text input, for example, input using a touch panel or input using a keyboard can be considered. In the case of input by voice, for example, input by a microphone can be considered. In addition to this, for example, an input by a gesture may be used.

情報表示端末１００において、機械翻訳結果等を出力する場合、ディスプレイを介して結果を出力してもよいし、音声を用いて結果を出力するとしてもよい。 When outputting a machine translation result or the like in the information display terminal 100, the result may be output via a display, or the result may be output using voice.

ネットワーク２００は、情報表示端末１００、翻訳サーバ３００、マイク４００、スピーカー５００が接続される。接続方法の一例として、有線、無線によるＬＡＮ接続などが挙げられるが、各構成要素を通信可能に接続するものであれば、これに限らない。 The network 200 is connected to the information display terminal 100, the translation server 300, the microphone 400, and the speaker 500. An example of the connection method is wired or wireless LAN connection, but is not limited to this as long as each component is connected to be communicable.

翻訳サーバ３００は、情報表示端末１００から受信した翻訳対象文に対して機械翻訳処理を行う。例えば、情報表示端末１００から入力された原言語の文字列を受信し、機械翻訳処理を行う。また、機械翻訳結果についてユーザからのフィードバックを受けて、機械学習を行なう機能も有する。翻訳サーバ３００の詳細な構成は後述する。 The translation server 300 performs machine translation processing on the translation target sentence received from the information display terminal 100. For example, a character string in the source language input from the information display terminal 100 is received and machine translation processing is performed. It also has a function of performing machine learning in response to feedback from the user about the machine translation result. The detailed configuration of the translation server 300 will be described later.

なお、例えば、情報表示端末１００と翻訳サーバ３００が一体となって実現されてもよい。 For example, the information display terminal 100 and the translation server 300 may be realized integrally.

マイク４００は、機械翻訳システムに対して、音声による入力を行なう。マイク４００は、情報表示端末１００に付属していてもよいし、単独でネットワーク２００に接続する機能を備えているとしてもよい。また、機械翻訳システムに対して、音声による入力が行われない場合は、マイク４００の構成は必須ではない。 The microphone 400 performs voice input to the machine translation system. The microphone 400 may be attached to the information display terminal 100 or may have a function of connecting to the network 200 alone. In addition, the configuration of the microphone 400 is not essential when no voice input is performed for the machine translation system.

スピーカー５００は、機械翻訳システムにおいて、音声による出力を行なう。スピーカー５００は、情報表示端末１００に付属していてもよいし、単独でネットワーク２００に接続する機能を備えているとしてもよい。また、機械翻訳システムにおいて、音声による出力が行われない場合は、スピーカー５００の構成は必須ではない。 The speaker 500 performs audio output in the machine translation system. The speaker 500 may be attached to the information display terminal 100 or may have a function of connecting to the network 200 alone. In addition, in the machine translation system, the configuration of the speaker 500 is not essential when audio output is not performed.

機械翻訳システムの入力・出力モダリティは、音声による入出力、またはテキストでの入出力のいずれか一方のみを備えていてもよいし、両方を備えていてもよい。ユーザから、機械翻訳システムに対して音声による入力が行われた場合、音声による出力を行う。また、ユーザからテキスト形式で入力された場合は、テキスト（画面表示）による出力を行う。 The input / output modality of the machine translation system may be provided with either voice input / output or text input / output, or both. When the user inputs voice to the machine translation system, the voice is output. Further, when input is made in a text format by the user, output is performed in text (screen display).

図２は、本実施の形態における、情報表示端末１００の構成を示すブロック図である。 FIG. 2 is a block diagram showing a configuration of information display terminal 100 in the present embodiment.

情報表示端末１００は、通信部１０１、入力部１０２、出力部１０３、制御部１０４、選択文検出部１０５、記憶部１０６を備える。 The information display terminal 100 includes a communication unit 101, an input unit 102, an output unit 103, a control unit 104, a selected sentence detection unit 105, and a storage unit 106.

通信部１０１は、翻訳サーバ３００との通信を行い、情報表示端末１００において入力された翻訳対象文の送信、後述する翻訳文および逆翻訳文の受信などを行なう。また、これらの情報に限らず、翻訳サーバ３００と各種の情報の送受信を行なう。 The communication unit 101 communicates with the translation server 300 to transmit a translation target sentence input at the information display terminal 100, receive a translated sentence and a reverse translated sentence, which will be described later, and the like. Moreover, not only these information but various information is transmitted / received with the translation server 300. FIG.

入力部１０２は、ユーザからの入力を受け付ける。入力部１０２は、翻訳対象文の入力、後述する逆翻訳文の選択入力などの入力を受け付ける。入力の形態としては、音声入力、テキスト形式での入力が考えられる。音声入力が用いられる場合、音声によって入力された翻訳対象文に対して音声認識処理が行われ、音声認識処理の結果出力される文字列が入力文として機械翻訳システムに入力される。テキスト形式での入力が用いられる場合、キーボード、マウス、タッチパネルなどによる文字列の入力を受け付ける。 The input unit 102 receives input from the user. The input unit 102 receives input such as input of a translation target sentence and selection input of a reverse translation sentence described later. As an input form, speech input or text input is conceivable. When speech input is used, speech recognition processing is performed on a translation target sentence input by speech, and a character string output as a result of the speech recognition processing is input as an input sentence to the machine translation system. When input in text format is used, input of a character string by a keyboard, mouse, touch panel, etc. is accepted.

出力部１０３は、入力部１０２において入力された翻訳対象文、通信部１０１を介して受信した複数の逆翻訳文、翻訳結果などを出力する。なお、出力部１０３は、ディスプレイなど、画面表示を実行する表示部として実現されてもよく、例えば、スマートフォン、タブレット端末などに用いられるタッチパネル式のディスプレイまたはモニタが想定される。また、スピーカーなど、音声を出力する音声出力部として実現されてもよい。制御部１０４は、通信部１０１、入力部１０２、出力部１０３、制御部１０４、選択文検出部１０５、記憶部１０６の動作を制御する。 The output unit 103 outputs a translation target sentence input through the input unit 102, a plurality of reverse translation sentences received via the communication unit 101, translation results, and the like. The output unit 103 may be realized as a display unit that performs screen display, such as a display. For example, a touch panel display or monitor used for a smartphone, a tablet terminal, or the like is assumed. Moreover, you may implement | achieve as an audio | voice output part which outputs audio | voices, such as a speaker. The control unit 104 controls operations of the communication unit 101, the input unit 102, the output unit 103, the control unit 104, the selected sentence detection unit 105, and the storage unit 106.

選択文検出部１０５は、出力部１０３によって出力された複数の逆翻訳文に対して、ユーザがどの逆翻訳文を選択したかを検出する。例えば、入力部１０２において、複数の逆翻訳文から一の逆翻訳文を選択する旨の入力が行なわれた場合、どの逆翻訳文が選択されたかを示すユーザ選択情報が選択文検出部１０５において検出される。検出したユーザ選択情報は、通信部１０１を介して翻訳サーバ３００へ送信される。また、ユーザ選択情報に基づいて、出力部１０３の出力内容を制御してもよい。例えば、出力部１０３がディスプレイによって実現されている場合、ユーザが選択した逆翻訳文を強調表示する、もしくは、ユーザが選択しなかった逆翻訳文を表示画面から消去する制御を行なうとしてもよい。 The selected sentence detection unit 105 detects which reverse translation sentence the user has selected for the plurality of reverse translation sentences output by the output unit 103. For example, when the input unit 102 inputs that one reverse translation sentence is selected from a plurality of reverse translation sentences, user selection information indicating which reverse translation sentence is selected is selected by the selected sentence detection unit 105. Detected. The detected user selection information is transmitted to the translation server 300 via the communication unit 101. Further, the output content of the output unit 103 may be controlled based on the user selection information. For example, when the output unit 103 is realized by a display, the reverse translation sentence selected by the user may be highlighted, or the reverse translation sentence not selected by the user may be controlled to be deleted from the display screen.

ここで、ユーザによって選択された逆翻訳文、およびその逆翻訳文に対応する順翻訳文をユーザ選択文とする。また、ユーザが選択しなかった逆翻訳文、およびその逆翻訳文に対応する順翻訳文をユーザ非選択文とする。 Here, the reverse translation sentence selected by the user and the forward translation sentence corresponding to the reverse translation sentence are set as the user selection sentence. Moreover, the reverse translation sentence which the user did not select and the forward translation sentence corresponding to the reverse translation sentence are made into a user non-selection sentence.

記憶部１０６は、翻訳サーバ３００から受信した情報の一時的な記憶、情報表示端末１００において実行される各種のアプリケーションプログラムの記憶などを行なう。 The storage unit 106 temporarily stores information received from the translation server 300, stores various application programs executed in the information display terminal 100, and the like.

図３は、本実施の形態における、翻訳サーバ３００の構成を示すブロック図である。翻訳サーバ３００は、通信部２１０、制御部２２０、機械翻訳部２３０、記憶部２４０を備える。さらに、機械翻訳部２３０は、順翻訳部２３１、順翻訳文選択部２３２、逆翻訳部２３３、逆翻訳文選択部２３４、選択文判断部２３５、フレーズ分割部２３６、選択結果評価部２３７、学習部２３８を有する。 FIG. 3 is a block diagram showing a configuration of translation server 300 in the present embodiment. The translation server 300 includes a communication unit 210, a control unit 220, a machine translation unit 230, and a storage unit 240. Further, the machine translation unit 230 includes a forward translation unit 231, a forward translation selection unit 232, a reverse translation unit 233, a reverse translation selection unit 234, a selection sentence determination unit 235, a phrase division unit 236, a selection result evaluation unit 237, learning Part 238.

通信部２１０は、情報表示端末１００との通信を行い、情報表示端末１００において入力された翻訳対象文の受信、後述する翻訳文および逆翻訳文の送信などを行なう。また、これらの情報に限らず、情報表示端末１００と各種の情報の送受信を行なう。 The communication unit 210 communicates with the information display terminal 100 to receive a translation target sentence input at the information display terminal 100, transmit a translated sentence and a reverse translated sentence, which will be described later, and the like. Moreover, not only these information but various information are transmitted / received with the information display terminal 100. FIG.

制御部２２０は、通信部２１０、機械翻訳部２３０、記憶部２４０の各種の動作を制御する。 The control unit 220 controls various operations of the communication unit 210, the machine translation unit 230, and the storage unit 240.

記憶部２４０は、機械翻訳部２３０が各種の翻訳処理、フレーズ分割処理などにおいて参照するフレーズテーブルを格納する。フレーズテーブルについては後述する。 The storage unit 240 stores a phrase table that the machine translation unit 230 refers to in various translation processes, phrase division processes, and the like. The phrase table will be described later.

機械翻訳部２３０は、通信部を介して受信した翻訳対象文に対して機械翻訳処理を実行する。機械翻訳部２３０では、ルールベース機械翻訳（ＲＢＭＴ）、統計的機械翻訳（ＳＭＴ）、ディープニューラルネットワークによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）などによって機械翻訳が行われる。機械翻訳部２３０は、翻訳結果を評価して、自動評価スコア（ＢＬＥＵなど）、内部スコア（人手による評価など）などのスコアを取得する。 The machine translation unit 230 executes machine translation processing on the translation target sentence received via the communication unit. The machine translation unit 230 performs machine translation by rule-based machine translation (RBMT), statistical machine translation (SMT), model acquisition type machine translation (DNNMT) using a deep neural network, or the like. The machine translation unit 230 evaluates the translation result and obtains a score such as an automatic evaluation score (such as BLEU) and an internal score (such as manual evaluation).

また、ユーザによる選択結果を機械学習へ反映させるため、翻訳手法によっては必要に応じて図１１に示すような、フレーズの対を予め示した、フレーズテーブルを用意する。 Moreover, in order to reflect the selection result by the user in machine learning, a phrase table as shown in FIG. 11 is prepared as necessary depending on the translation method.

ルールベース機械翻訳（ＲＢＭＴ）は、人手によって構築された変換規則（訳語の対をデータベースとして記憶したもの）を元に翻訳を行うため、図１１のようなフレーズテーブルを保持していない可能性がある。ただし、句または単語単位での対訳データベースが存在する場合は、対訳データベースに学習結果を反映してもよいし、フレーズテーブルを別途用意してもよい。 Since rule-based machine translation (RBMT) performs translation based on conversion rules (stored translation pairs as a database) constructed manually, there is a possibility that the phrase table as shown in FIG. 11 is not held. is there. However, when there is a parallel translation database in units of phrases or words, the learning result may be reflected in the parallel translation database, or a phrase table may be prepared separately.

統計的機械翻訳（ＳＭＴ）では、図１１のようなフレーズテーブルを予め保持しているため、これを使用すればよい。 In statistical machine translation (SMT), since a phrase table as shown in FIG. 11 is held in advance, it may be used.

ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）では、モデル自体を自動で構築するため、フレーズテーブルを保持していないことが多い。よって、別途フレーズテーブルを用意してもよい。 In model acquisition type machine translation (DNNNMT) using a deep neural network, the model itself is automatically constructed, and therefore, the phrase table is often not held. Therefore, a separate phrase table may be prepared.

さらに、ユーザの選択を学習結果に反映する対象はこれだけに限らず、例えば、原言語同士の言い換えの対を表すようなデータベースを持っていてもよい。なお、機械翻訳処理の詳細については、後述する。 Furthermore, the target for reflecting the user's selection in the learning result is not limited to this. For example, a database that represents a paraphrase pair of the source languages may be provided. Details of the machine translation process will be described later.

順翻訳部２３１は、通信部２１０を介して受信した翻訳対象文の言語（原言語）から、翻訳対象文を翻訳した結果出力される言語（目的言語）への機械翻訳処理を実行する。ここで、原言語から目的言語への翻訳を、順翻訳とし、順翻訳によって得られる翻訳文を順翻訳文とする。このとき、順翻訳処理によって、翻訳対象文に対して複数の順翻訳文が生成される。また、順翻訳部２３１は、記憶部２４０に格納されているフレーズテーブルを参照して機械翻訳処理を行う。順翻訳部２３１において生成された複数の順翻訳文を、順翻訳文群とする。順翻訳部２３１は、生成した順翻訳文群を、順翻訳文選択部２３２へ出力する。 The forward translation unit 231 executes machine translation processing from the language (source language) of the translation target sentence received via the communication unit 210 to the language (target language) output as a result of translating the translation target sentence. Here, a translation from the source language to the target language is a forward translation, and a translation sentence obtained by the forward translation is a forward translation sentence. At this time, a plurality of forward translation sentences are generated for the translation target sentence by the forward translation process. Further, the forward translation unit 231 performs machine translation processing with reference to the phrase table stored in the storage unit 240. A plurality of forward translation sentences generated in the forward translation unit 231 are defined as a forward translation sentence group. The forward translation unit 231 outputs the generated forward translation sentence group to the forward translation sentence selection unit 232.

順翻訳文選択部２３２は、順翻訳部２３１によって生成された順翻訳文群の中から、Ｎ個の順翻訳文を選択する順翻訳文選択処理を行う。この順翻訳文選択処理の詳細については後述する。順翻訳文選択部２３２は、選択したＮ個の順翻訳文を逆翻訳部２３３へ出力する。 The forward translation sentence selection unit 232 performs a forward translation sentence selection process of selecting N forward translation sentences from the forward translation sentence group generated by the forward translation part 231. Details of the forward translation selection process will be described later. The forward translation sentence selection unit 232 outputs the selected N forward translation sentences to the reverse translation unit 233.

逆翻訳部２３３は、順翻訳文選択部２３２において選択されたＮ個の順翻訳文の各々について、順翻訳文の言語（目的言語）から、翻訳対象文の言語（原言語）への機械翻訳処理を実行する。ここで、目的言語から原言語への翻訳を、逆翻訳とし、逆翻訳によって得られる翻訳文を逆翻訳文とする。このとき、逆翻訳処理によって、各々の順翻訳文に対して一以上の逆翻訳文が生成される。そのため、結果として、複数の逆翻訳文が生成される。また、逆翻訳部２３３は、記憶部２４０に格納されているフレーズテーブルを参照して機械翻訳処理を行う。逆翻訳部２３３において生成された複数の逆翻訳文を逆翻訳文群とする。逆翻訳部２３３は、生成した逆翻訳文群を逆翻訳文選択部２３４へ出力する。 The reverse translation unit 233 performs machine translation from the language of the forward translation sentence (target language) to the language of the translation target sentence (source language) for each of the N forward translation sentences selected by the forward translation sentence selection unit 232. Execute the process. Here, the translation from the target language to the source language is the reverse translation, and the translation sentence obtained by the reverse translation is the reverse translation sentence. At this time, one or more reverse translation sentences are generated for each forward translation sentence by the reverse translation process. Therefore, as a result, a plurality of reverse translation sentences are generated. Further, the reverse translation unit 233 performs machine translation processing with reference to the phrase table stored in the storage unit 240. A plurality of reverse translation sentences generated in the reverse translation unit 233 are set as a reverse translation sentence group. The reverse translation unit 233 outputs the generated reverse translation sentence group to the reverse translation sentence selection unit 234.

逆翻訳文選択部２３４は、逆翻訳部２３３によって生成された逆翻訳文群の中から、Ｍ個の逆翻訳文を選択する逆翻訳文選択処理を行う。この逆翻訳文選択処理については、後述する。逆翻訳文選択部２３４は、通信部２１０を介して、選択したＭ個の逆翻訳文を情報表示端末１００へ送信する。情報表示端末１００の出力部１０３において、Ｍ個の逆翻訳文は選択可能に出力される。 The reverse translation sentence selection unit 234 performs reverse translation sentence selection processing for selecting M reverse translation sentences from the reverse translation sentence group generated by the reverse translation part 233. This reverse translation sentence selection process will be described later. The reverse translation sentence selection unit 234 transmits the selected M reverse translation sentences to the information display terminal 100 via the communication unit 210. In the output unit 103 of the information display terminal 100, M back-translated sentences are output so as to be selectable.

選択文判断部２３５は、通信部２１０を介して情報表示端末１００から受信したユーザ選択情報に基づいて、逆翻訳文選択部２３４において選択されたＭ個の逆翻訳文の中からユーザがどの逆翻訳文を選択したかを判断し、判断した情報をフレーズ分割部２３６へ出力する。 Based on the user selection information received from the information display terminal 100 via the communication unit 210, the selected sentence determination unit 235 determines which reverse user the M of the reverse translation sentences selected by the reverse translation sentence selection unit 234 has. It is determined whether a translated sentence is selected, and the determined information is output to the phrase dividing unit 236.

フレーズ分割部２３６は、逆翻訳文選択部２３４から入力された複数の逆翻訳文に基づいて、複数の逆翻訳文の各々について、逆翻訳文を句または単語単位に分割する。また、逆翻訳文に対応する順翻訳分についても、句または単語単位に分割する。このとき、選択文判断部２３５から入力された複数の逆翻訳文からいずれの逆翻訳文が選択されたかを示す情報もあわせて用いるとしてもよい。また、記憶部２４０に格納されているフレーズテーブルを用いるとしてもよい。逆翻訳文および順翻訳文を句または単語単位に分割した情報と、ユーザ選択情報とが選択結果評価部２３７へ出力される。 The phrase division unit 236 divides the reverse translation sentence into phrases or words for each of the plurality of reverse translation sentences based on the plurality of reverse translation sentences input from the reverse translation sentence selection unit 234. Also, the forward translation corresponding to the reverse translation sentence is also divided into phrases or words. At this time, information indicating which reverse translation sentence is selected from a plurality of reverse translation sentences input from the selected sentence determination unit 235 may also be used. Further, a phrase table stored in the storage unit 240 may be used. Information obtained by dividing the reverse translation sentence and the forward translation sentence into phrases or words and the user selection information are output to the selection result evaluation unit 237.

フレーズ分割には、統計的機械翻訳（ＳＭＴ）に示されるような、双言語間での、句または単語単位での対応関係を示すフレーズテーブルを用いることが多いが、必ずしも決まったフレーズテーブルを用いる必要はなく、それに類するものであってもよい。機械翻訳においてフレーズテーブルが用いられている場合は、そのフレーズテーブルを用いて分割を行ってもよい。または、別途用意したフレーズテーブルなどを用いてもよいし、対訳辞書などがあればそれを用いてもよい。 Phrase division often uses a phrase table that shows the correspondence between phrases or words, as shown in Statistical Machine Translation (SMT), but it always uses a fixed phrase table. It is not necessary and may be similar. When a phrase table is used in machine translation, division may be performed using the phrase table. Alternatively, a separately prepared phrase table or the like may be used, or a bilingual dictionary or the like may be used.

選択結果評価部２３７では、フレーズ分割部２３６から入力される情報に基づいて、順翻訳文および逆翻訳文に対して評価を行なう。このとき、ユーザ選択情報に基づいて、ユーザ選択文とユーザ非選択文とで異なる評価を行うとしてもよい。詳細な評価方法については、後述する。選択結果評価部２３７は、順翻訳文および逆翻訳文を評価した評価情報を学習部２３８へ出力する。 The selection result evaluation unit 237 evaluates the forward translated sentence and the reverse translated sentence based on the information input from the phrase dividing unit 236. At this time, different evaluations may be performed for the user selection sentence and the user non-selection sentence based on the user selection information. A detailed evaluation method will be described later. The selection result evaluation unit 237 outputs evaluation information obtained by evaluating the forward translation sentence and the reverse translation sentence to the learning unit 238.

学習部２３８は、選択結果評価部２３７から入力された評価情報に基づいて、記憶部２４０に格納されているフレーズテーブルを更新することで機械翻訳処理における機械学習を行なう。すなわち、評価情報をフレーズテーブルに反映する。機械学習の対象としては、順翻訳部２３１が参照するフレーズテーブルでもよいし、逆翻訳部２３３が参照するフレーズテーブルでもよい。また、必ずしも評価情報をフレーズテーブルに反映する必要はなく、例えば、言い換えの辞書や単語の辞書などに結果を反映し、機械翻訳処理における機械学習を行なうとしてもよい。フレーズテーブルに対する評価情報の詳細な反映方法については、後述する。 The learning unit 238 performs machine learning in the machine translation process by updating the phrase table stored in the storage unit 240 based on the evaluation information input from the selection result evaluation unit 237. That is, the evaluation information is reflected in the phrase table. The machine learning target may be a phrase table referred to by the forward translation unit 231 or a phrase table referred to by the reverse translation unit 233. Further, it is not always necessary to reflect the evaluation information in the phrase table. For example, the result may be reflected in a paraphrase dictionary or a word dictionary to perform machine learning in machine translation processing. A detailed reflection method of the evaluation information with respect to the phrase table will be described later.

図４は、情報表示端末の各部の機能をプログラムにより実現するコンピュータのハードウェア構成を示す図である。このコンピュータ１０００は、入力ボタン、タッチパッドなどの入力装置１００１、ディスプレイ、スピーカーなどの出力装置１００２、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１００３、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１００４、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１００５などを備える。また、コンピュータ１０００は、ハードディスク装置、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）などの記憶装置１００６、ＤＶＤ−ＲＯＭ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリなどの記録媒体から情報を読み取る読取装置１００７、ネットワークを介して通信を行う送受信装置１００８を備えるとしてもよい。上述した各部は、バス１００９により接続される。 FIG. 4 is a diagram illustrating a hardware configuration of a computer that realizes the functions of the respective units of the information display terminal by a program. The computer 1000 includes an input device 1001 such as an input button and a touch pad, an output device 1002 such as a display and a speaker, a CPU (Central Processing Unit) 1003, a ROM (Read Only Memory) 1004, a RAM (Random Access Memory) 1005, and the like. Prepare. Further, the computer 1000 reads information from a recording medium such as a hard disk device, a storage device 1006 such as an SSD (Solid State Drive), a DVD-ROM (Digital Versatile Disk Read Only Memory), or a USB (Universal Serial Bus) memory. 1007 may include a transmission / reception device 1008 that performs communication via a network. Each unit described above is connected by a bus 1009.

そして、読取装置１００７は、上記各部の機能を実現するためのプログラムを記録した記録媒体からそのプログラムを読み取り、記憶装置１００６に記憶させる。あるいは、送受信装置１００８が、ネットワークに接続されたサーバ装置と通信を行い、サーバ装置からダウンロードした上記各部の機能を実現するためのプログラムを記憶装置１００６に記憶させる。 Then, the reading device 1007 reads the program from a recording medium on which a program for realizing the functions of the above-described units is recorded, and stores the program in the storage device 1006. Alternatively, the transmission / reception device 1008 communicates with a server device connected to the network, and causes the storage device 1006 to store a program for realizing the function of each unit downloaded from the server device.

そして、ＣＰＵ１００３が、記憶装置１００６に記憶されたプログラムをＲＡＭ１００５にコピーし、そのプログラムに含まれる命令をＲＡＭ１００５から順次読み出して実行することにより、上記各部の機能が実現される。また、プログラムを実行する際、ＲＡＭ１００５または記憶装置１００６には、各実施の形態で述べた各種処理で得られた情報が記憶され、適宜利用される。 Then, the CPU 1003 copies the program stored in the storage device 1006 to the RAM 1005, and sequentially reads and executes the instructions included in the program from the RAM 1005, thereby realizing the functions of the above-described units. Further, when executing the program, the RAM 1005 or the storage device 1006 stores information obtained by various processes described in each embodiment, and is used as appropriate.

図５は、本実施の形態における、機械翻訳システムの動作を示すフローチャートである。簡単のため、本フローチャートでは、情報表示端末１００の入力部１０２および出力部１０３は、タッチパネル式ディスプレイによって実現されるとする。情報表示端末１００において、タッチパネル式ディスプレイを介して、ユーザの入力および翻訳結果等の出力が行なわれるとするが、例えば、キーボードとディスプレイなど、入力部１０２と出力部１０３がそれぞれ独立している構成であってもよい。また、音声による入出力が行なわれてもよい。また説明のため、原言語（母国語）を日本語、目的言語を英語として説明を行っている箇所がある。ただしこれらは一例であり、原言語および目的言語はどのような組合せであっても構わない。 FIG. 5 is a flowchart showing the operation of the machine translation system in the present embodiment. For simplicity, in this flowchart, it is assumed that the input unit 102 and the output unit 103 of the information display terminal 100 are realized by a touch panel display. In the information display terminal 100, user input and output of translation results and the like are performed via a touch panel display. For example, a configuration in which the input unit 102 and the output unit 103 are independent, such as a keyboard and a display. It may be. Further, voice input / output may be performed. For explanation, there are places where the source language (native language) is Japanese and the target language is English. However, these are only examples, and the source language and the target language may be in any combination.

まず、ステップＳ４０１において、ユーザによって入力された翻訳対象文を取得する。ステップＳ４０２において、翻訳対象文に対する機械翻訳処理を行う。ここでの機械翻訳処理は、原言語の翻訳対象文を目的言語の文字列（順翻訳文）に翻訳する順翻訳処理である。このとき、例えば、統計的機械翻訳（ＳＭＴ）は、翻訳モデルと言語モデルから翻訳としての確からしさをスコア化する。翻訳モデルとは、訳語の尤もらしさを規定する統計モデルであり、言語モデルとは、出力言語の単語の並びの尤もらしさを規定する統計モデルである。これら２つのモデルから、翻訳としての確からしさをスコア化し、そのスコア順に翻訳結果を出力することにより、複数の順翻訳文が生成されるが、この複数の順翻訳文を、説明の便宜上、順翻訳文群とする。 First, in step S401, a translation target sentence input by the user is acquired. In step S402, machine translation processing is performed on the translation target sentence. The machine translation process here is a forward translation process for translating a source language translation target sentence into a target language character string (forward translation sentence). At this time, for example, statistical machine translation (SMT) scores the certainty as a translation from the translation model and the language model. The translation model is a statistical model that defines the likelihood of the translated word, and the language model is a statistical model that defines the likelihood of the arrangement of words in the output language. From these two models, the probability of translation is scored, and the translation results are output in the order of the scores, so that a plurality of forward translation sentences are generated. A translation group.

本実施の形態における機械翻訳の具体的な処理には、一般的な機械翻訳の処理であるため、ここでの説明は省略する。 Since the specific process of machine translation in the present embodiment is a general machine translation process, the description thereof is omitted here.

ステップＳ４０３において、順翻訳文群から、所定の基準に基づいてＮ個の順翻訳文が選択される。例えば、順翻訳文群に含まれる複数の順翻訳文の各々に対して評価スコアを付与し、評価スコアの高いものからＮ個選択するとしてもよい。また、評価スコアに関係なく、ランダムにＮ個選択するとしてもよい。などである。また、順翻訳文群に含まれる複数の順翻訳文が示す内容を考慮して、選択したＮ個の順翻訳文の中に同趣旨のものが含まれないように選択するとしてもよい。また、順翻訳文群に含まれる複数の順翻訳文の中に異なる趣旨のものが少ない場合には、異なる趣旨を持つ順翻訳文を追加する処理を、必要に応じて行ってもよい。順翻訳文の詳細な選択方法については、図６を用いて後述する。 In step S403, N forward-translated sentences are selected from the forward-translated sentence group based on a predetermined criterion. For example, an evaluation score may be assigned to each of a plurality of forward translation sentences included in the forward translation sentence group, and N may be selected from those having a high evaluation score. Further, N may be selected at random regardless of the evaluation score. Etc. In addition, in consideration of the contents indicated by a plurality of forward translation sentences included in the forward translation sentence group, the selected N forward translation sentences may be selected so as not to include the same meaning. In addition, when a plurality of forward translation sentences included in the forward translation sentence group have few different meanings, a process of adding forward translation sentences having different purposes may be performed as necessary. A detailed method for selecting a forward translation will be described later with reference to FIG.

ステップＳ４０３の処理が行われたのちに、順翻訳文を増やす必要があると判断された場合（Ｓ４０４のＹＥＳ）、ステップＳ４０２に戻り、再び順翻訳処理を実行する。この時、すでに得られている順翻訳文とは異なる順翻訳文を得るために、先ほどよりもスコアの低いものを選択する。また、別の翻訳指標を用いても良い（例えば、ＲＩＢＥＳ：ｒａｎｋ−ｂａｓｅｄｉｎｔｕｉｔｉｖｅｂｉｌｉｎｇｕａｌｅｖａｌｕａｔｉｏｎｓｃｏｒｅ）。また、原言語の言い換えのデータベースを保持していれば、入力文に対してそれらを適用することで類似文を作成し、再び順翻訳処理を実行することもできる。これにより、表層的には異なるが、入力文と同じ意味を持った文が入力されるため、異なる順翻訳文を得ることができる。 If it is determined that it is necessary to increase the number of forward translation sentences after the process of step S403 (YES in S404), the process returns to step S402, and the forward translation process is executed again. At this time, in order to obtain a forward translated sentence different from the already obtained forward translated sentence, a sentence having a lower score than before is selected. Moreover, you may use another translation parameter | index (for example, RIBES: rank-based intuitive biliary evaluation score). In addition, if a database of paraphrases in the source language is held, similar sentences can be created by applying them to the input sentence, and the forward translation process can be executed again. As a result, a sentence having the same meaning as that of the input sentence is inputted although it is different in the surface layer, so that a different forward translation sentence can be obtained.

再び順翻訳処理が実行されたのちに、ステップＳ４０３において順翻訳文選択処理が実行される場合には、前回と異なる基準で順翻訳文を選択してもよいし、同じ基準で順翻訳文を選択してもよい。 If the forward translation process is executed in step S403 after the forward translation process is performed again, the forward translation sentence may be selected based on a different standard from the previous one, or the forward translation sentence may be selected based on the same standard. You may choose.

ステップＳ４０３の処理が行われたのちに、順翻訳文を増やす必要がないと判断された場合（Ｓ４０４のＮｏ）、ステップＳ４０５の逆翻訳処理へフローを進める。ステップＳ４０５では、ステップＳ４０３で得られたＮ個の順翻訳文に対して逆翻訳を行う逆翻訳処理が実行される。原言語から目的言語への翻訳を順方向の翻訳とすると、逆翻訳とは、目的言語から原言語への逆方向の翻訳である。Ｎ個の順翻訳文それぞれにおいて、任意の逆翻訳文を生成する逆翻訳処理を行う。任意の逆翻訳文を生成する逆翻訳処理とは、Ｎ個の順翻訳文の各々について一対一で対応する逆翻訳文を生成する逆翻訳処理、Ｎ個の順翻訳文の中に逆翻訳処理が行われない順翻訳文がある逆翻訳処理、一つの順翻訳文に対して複数の逆翻訳文を生成する逆翻訳処理、などを意味する。この逆翻訳処理によって、複数の逆翻訳文が生成される。この複数の逆翻訳文を、説明の便宜上、逆翻訳文群とする。 If it is determined that it is not necessary to increase the number of forward translation sentences after the process of step S403 is performed (No in S404), the flow proceeds to the reverse translation process of step S405. In step S405, reverse translation processing for performing reverse translation on the N forward-translated sentences obtained in step S403 is executed. If the translation from the source language to the target language is a forward translation, the reverse translation is a backward translation from the target language to the source language. In each of the N forward-translated sentences, a reverse translation process for generating an arbitrary reverse-translated sentence is performed. The reverse translation process for generating an arbitrary reverse translation sentence is a reverse translation process for generating a corresponding one-to-one reverse translation sentence for each of the N forward translation sentences, and a reverse translation process for the N forward translation sentences. This means reverse translation processing with a forward translation sentence that is not performed, reverse translation processing for generating a plurality of reverse translation sentences for one forward translation sentence, and the like. By this reverse translation process, a plurality of reverse translation sentences are generated. The plurality of reverse translation sentences are set as a reverse translation sentence group for convenience of explanation.

また、どのような基準で逆翻訳文を出力するかに対しては、何らかのシステムの基準で決めてもよいし、ユーザがそれらを決定してもよい。ここでのシステムの基準とは、ＢＬＥＵなどの評価や人手評価などを用いて順翻訳文のスコアを算出し、例えば、スコアが低い順翻訳文に関しては逆翻訳処理を行わない（ある順翻訳文に関しては生成する逆翻訳文が０個）、スコアが高い順翻訳文は任意の数の逆翻訳文を得る（ある順翻訳文に関しては逆翻訳文が複数個生成される）などである。ユーザが逆翻訳文の数を決定する場合には、一例として、一つの順翻訳文に対して逆翻訳文をいくつ生成するかを設定する、などが考えられるが、これに限らない。 In addition, the criteria for outputting the back-translated text may be determined based on some system criteria, or the user may determine them. The standard of the system here is to calculate the score of the forward translation sentence using evaluation such as BLEU or manual evaluation, for example, the reverse translation processing is not performed for the forward translation sentence having a low score (a certain forward translation sentence) The number of reverse translation sentences to be generated is 0), and an arbitrary number of reverse translation sentences having a high score are obtained (a plurality of reverse translation sentences are generated for a certain forward translation sentence). When the user determines the number of back-translated sentences, for example, setting how many back-translated sentences are generated for one forward-translated sentence can be considered, but this is not restrictive.

ステップＳ４０６は、ステップＳ４０５で得られた逆翻訳文群の中からＭ個の逆翻訳文を選択する逆翻訳文選択処理である。逆翻訳文選択処理では、ステップＳ４０３の順翻訳文選択処理とほぼ同様の処理を行う。詳細な選択方法については、図６、図７を用いて後述する。 Step S406 is a reverse translation sentence selection process for selecting M reverse translation sentences from the reverse translation sentence group obtained in step S405. In the reverse translation sentence selection process, a process substantially similar to the forward translation sentence selection process in step S403 is performed. A detailed selection method will be described later with reference to FIGS.

ステップＳ４０６の逆翻訳文選択処理が実行されたのちに、逆翻訳文を増やす必要があると判断された場合（Ｓ４０７のＹＥＳ）、ステップＳ４０２に戻り、再び順翻訳処理を行う。再び順翻訳処理が実行されたのちに、ステップＳ４０３において順翻訳文選択処理が実行される場合には、前回と異なる基準で順翻訳文を選択してもよいし、同じ基準で順翻訳文を選択してもよい。 If it is determined that it is necessary to increase the number of reverse translation sentences after the reverse translation sentence selection process in step S406 is executed (YES in S407), the process returns to step S402 and the forward translation process is performed again. If the forward translation process is executed in step S403 after the forward translation process is performed again, the forward translation sentence may be selected based on a different standard from the previous one, or the forward translation sentence may be selected based on the same standard. You may choose.

ステップＳ４０６の逆翻訳処理が行われたのちに、逆翻訳文を増やす必要がないと判断された場合（Ｓ４０７のＮｏ）、逆翻訳文群から選択されたＭ個の逆翻訳文がユーザ提示文として次のステップで情報表示端末１００において出力される。 If it is determined that it is not necessary to increase the number of reverse translation sentences after the reverse translation processing in step S406 (No in S407), the M reverse translation sentences selected from the reverse translation sentence group are the user-provided sentences. Is output at the information display terminal 100 in the next step.

ステップＳ４０８において、Ｍ個の逆翻訳文が情報表示端末１００へ送信され、タッチパネル式ディスプレイに表示される。 In step S408, M back-translated sentences are transmitted to the information display terminal 100 and displayed on the touch panel display.

ステップＳ４０９では、選択文検出部１０５において、情報表示端末１００のタッチパネル式ディスプレイに表示されたＭ個の逆翻訳文から一の逆翻訳文が選択されたか否かを検出する。 In step S409, the selected sentence detection unit 105 detects whether one reverse translation sentence is selected from the M reverse translation sentences displayed on the touch panel display of the information display terminal 100.

一定期間、逆翻訳文の選択がないと判断された場合、機械翻訳システムは初期状態に戻り、ユーザの入力文を受け付ける（Ｓ４０９のＮｏ）。このとき、タッチパネル式ディスプレイの表示画面がリセットされる。 If it is determined that there is no selection of the reverse translation sentence for a certain period, the machine translation system returns to the initial state and accepts the user's input sentence (No in S409). At this time, the display screen of the touch panel display is reset.

また、ユーザが何らかのリセット操作を実行した場合も、機械翻訳システムは同様に初期状態へ戻ってユーザの入力を受け付ける。 Also, when the user performs any reset operation, the machine translation system similarly returns to the initial state and accepts the user's input.

選択文検出部１０５において、いずれかの逆翻訳文が選択されたことが検出された場合（Ｓ４０９のＹｅｓ）、いずれの逆翻訳文が選択されたかを示すユーザ選択情報が翻訳サーバ３００へ送信される。 When the selected sentence detection unit 105 detects that any reverse translation sentence is selected (Yes in S409), user selection information indicating which reverse translation sentence is selected is transmitted to the translation server 300. The

ステップＳ４１１において、選択文判断部２３５は、逆翻訳文選択部２３４からＭ個の逆翻訳文を取得し、情報表示端末１００から受信したユーザ選択情報に基づいて、情報表示端末１００においてＭ個の逆翻訳文の中のいずれの逆翻訳文が選択されたかを判断する。Ｍ個の逆翻訳文の中の選択された逆翻訳文を選択翻訳文、選択されなかった逆翻訳文を非選択逆翻訳文として、以降のステップを説明する。選択文判断部２３５は、選択逆翻訳文および非選択逆翻訳文をフレーズ分割部２３６へ出力する。 In step S411, the selected sentence determination unit 235 acquires M pieces of reverse translation sentences from the reverse translation sentence selection unit 234, and M pieces of information are displayed on the information display terminal 100 based on the user selection information received from the information display terminal 100. It is determined which of the reverse translation sentences has been selected. Subsequent steps will be described assuming that the selected back-translated sentence among the M back-translated sentences is a selected translated sentence, and the unselected back-translated sentence is a non-selected back-translated sentence. The selected sentence determination unit 235 outputs the selected back-translated sentence and the non-selected back-translated sentence to the phrase dividing unit 236.

ステップＳ４１２において、選択逆翻訳文と非選択逆翻訳文、および選択逆翻訳文と非選択逆翻訳文の各々に対応する順翻訳文に対して、フレーズ分割処理を行う。 In step S <b> 412, phrase division processing is performed on the forward translated sentence corresponding to each of the selected reverse translated sentence and the non-selected reverse translated sentence, and the selected reverse translated sentence and the non-selected reverse translated sentence.

フレーズ分割とは、対象となる文を、より短い単位である句または単語に分割することである。フレーズ分割処理の具体例は後述する。 Phrase division is to divide a target sentence into phrases or words that are shorter units. A specific example of the phrase dividing process will be described later.

図１１は、本実施の形態における、一般的なフレーズテーブルの例である。フレーズテーブルとは、原言語と目的言語の双言語間において、句または単語単位での対応関係を表したテーブルである。 FIG. 11 is an example of a general phrase table in the present embodiment. The phrase table is a table that represents a correspondence relationship in units of phrases or words between the bilingual of the source language and the target language.

図１１では、原言語を日本語、目的言語を英語とした場合、左から、日本語のフレーズ、英語のフレーズ、フレーズの英日翻訳確率（英語のフレーズが日本語のフレーズに翻訳される確率）、英日方向の単語の翻訳確率の積（英語が日本語に翻訳される時の、フレーズ内の単語ごとの翻訳確率の積）、フレーズの日英翻訳確率（日本語のフレーズが英語のフレーズに翻訳される確率）、日英方向の単語の翻訳確率の積（日本語が英語に翻訳される時の、フレーズ内の単語ごとの翻訳確率の積）を表している。ただし、フレーズテーブルは、これらすべての情報を必ずしも含んでいる必要はなく、この表記方法に限らない。このフレーズテーブルは、翻訳確率を含むため、広義では確率モデルとも呼ばれる。 In FIG. 11, when the source language is Japanese and the target language is English, from the left, the Japanese phrase, the English phrase, the English-Japanese translation probability of the phrase (the probability that the English phrase is translated into the Japanese phrase) ), English-Japanese word translation probabilities (the product of translation probabilities for each word in a phrase when English is translated into Japanese), phrase Japanese-English translation probabilities (Japanese phrases are in English (Probability of translation into a phrase) and product of translation probabilities of words in Japanese and English directions (product of translation probabilities for each word in the phrase when Japanese is translated into English). However, the phrase table does not necessarily include all these pieces of information, and is not limited to this notation method. Since this phrase table includes translation probabilities, it is also called a probability model in a broad sense.

例えば、図１１に示すフレーズテーブルでは、フレーズＰＨ１が、フレーズＰＨ１へ翻訳される確率が０．３８、単語ＳＤ３が単語ＳＤ１に翻訳される確率と、単語ＳＤ４が単語ＳＤ４へ翻訳される確率との積が０．０４、フレーズＰＨ１がフレーズＰＨ２へ翻訳される確率が０．０５、単語ＳＤ１が単語ＳＤ３へ翻訳される確率と、単語ＳＤ２が単語ＳＤ４へ翻訳される確率との積が０．０２であることを表している。 For example, in the phrase table shown in FIG. 11, the probability that the phrase PH1 is translated into the phrase PH1 is 0.38, the probability that the word SD3 is translated into the word SD1, and the probability that the word SD4 is translated into the word SD4. The product is 0.04, the probability that the phrase PH1 is translated into the phrase PH2 is 0.05, the product of the probability that the word SD1 is translated into the word SD3 and the probability that the word SD2 is translated into the word SD4 is 0.02. It represents that.

このようなフレーズテーブルを用いて、選択逆翻訳文、非選択逆翻訳文、およびこれらの逆翻訳文の各々に対応する順翻訳文に対してフレーズ分割処理を行なう。 Using such a phrase table, the phrase division processing is performed on the selected back-translated sentence, the non-selected back-translated sentence, and the forward-translated sentence corresponding to each of these back-translated sentences.

図１２は、フレーズ分割の概要を表す説明図である。 FIG. 12 is an explanatory diagram showing an outline of phrase division.

図１２には、原言語で表された逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０と、これらの逆翻訳文の各々に対応する目的言語の順翻訳文ＴＳ１０、ＴＳ２０、ＴＳ３０が示されている。原言語の逆翻訳文に対するフレーズ分割処理が実行される場合、例えば、逆翻訳文ＲＳ１０をフレーズ分割すると、フレーズＰＨ１１、ＰＨ１２、ＰＨ１２という３つのフレーズに分割される。また、目的言語の順翻訳文に対するフレーズ分割処理が実行される場合、例えば、順翻訳文ＴＳ１０をフレーズ分割すると、フレーズＰＨ１４、ＰＨ１５、ＰＨ１６という３つのフレーズ（単語）に分割される。 FIG. 12 shows reverse-translated sentences RS10, RS20, and RS30 expressed in the source language, and forward-translated sentences TS10, TS20, and TS30 of the target language corresponding to each of these reverse-translated sentences. When the phrase splitting process is performed on the reverse translation sentence in the source language, for example, when the reverse translation sentence RS10 is split into phrases, the phrase is divided into three phrases PH11, PH12, and PH12. Further, when the phrase dividing process is performed on the target language forward translation sentence, for example, when the forward translation sentence TS10 is divided into phrases, the phrase PH14, PH15, and PH16 are divided into three phrases (words).

フレーズ分割処理を行なう対象文がどのようなフレーズに分割されるかについては、フレーズテーブルに表記されている原言語および目的言語の文字列に依存するため、一意には決まらないことがある。 The phrase into which the target sentence to be subjected to the phrase division process is divided depends on the character strings of the source language and the target language described in the phrase table, and may not be uniquely determined.

ステップＳ４１３は、フレーズ分割処理によって出力される各フレーズに対して、所定の基準に従ってスコアを評価するフレーズ評価処理である。このフレーズ評価処理の詳細については、後述する。 Step S413 is a phrase evaluation process for evaluating a score according to a predetermined standard for each phrase output by the phrase division process. Details of the phrase evaluation process will be described later.

ステップＳ４１４では、Ｓ４１１で判断された選択逆翻訳文に対応する順翻訳文を情報表示端末１００へ送信し、翻訳結果としてタッチパネル式ディスプレイに表示する。このとき、タッチパネル式ディスプレイに表示されている選択逆翻訳文を強調表示してもよい。また、タッチパネル式ディスプレイにおいて非選択逆翻訳文の表示を消去してもよく、表示された翻訳結果がユーザの選択した選択逆翻訳文に対応する順翻訳文であるということを明示できれば、どのような表示を行っても構わない。 In step S414, the forward translation corresponding to the selected reverse translation sentence determined in S411 is transmitted to the information display terminal 100 and displayed on the touch panel display as the translation result. At this time, the selected reverse translation sentence displayed on the touch panel display may be highlighted. In addition, the display of the non-selected reverse translation sentence may be erased on the touch panel display, and if the displayed translation result is a forward translation sentence corresponding to the selected reverse translation sentence selected by the user, what will be done? May be displayed.

なお、ステップＳ４１４とステップＳ４１２〜Ｓ４１４の一連の処理は、並列的に動作可能であるため、ステップＳ４１４はステップＳ４１２のフレーズ分割処理の前からステップＳ４１５の学習処理の後の間であれば、いずれのタイミングで実行されるとしてもよい。 In addition, since a series of processes of step S414 and steps S412 to S414 can be performed in parallel, step S414 can be any time before the phrase dividing process of step S412 and after the learning process of step S415. It may be executed at the timing.

ステップＳ４１５において、ステップＳ４１３で得られたフレーズ毎のスコアに基づいて、強化学習、識別学習、ニューラルネット学習などの機械学習を行う。この処理の詳細に関しては、図８、９を用いて説明する。 In step S415, machine learning such as reinforcement learning, identification learning, and neural network learning is performed based on the score for each phrase obtained in step S413. Details of this processing will be described with reference to FIGS.

図６は、本実施の形態における、翻訳文選択処理の具体的な動作を示すフローチャートである。 FIG. 6 is a flowchart showing a specific operation of the translated sentence selection process in the present embodiment.

図６を用いて、ステップＳ４０３において順翻訳文選択部２３２で実行される順翻訳文選択処理と、ステップＳ４０６において逆翻訳文選択部２３４で実行される逆翻訳文選択処理の具体的な処理を説明する。説明の便宜上、順翻訳文選択処理および逆翻訳文選択処理の２つの処理をまとめて翻訳文選択処理とする。 With reference to FIG. 6, the specific processes of the forward translation sentence selection process executed by the forward translation sentence selection unit 232 in step S403 and the reverse translation sentence selection process executed by the reverse translation sentence selection unit 234 in step S406 are described. explain. For convenience of explanation, the two processes of the forward translation sentence selection process and the reverse translation sentence selection process are collectively referred to as a translation sentence selection process.

順翻訳文選択部２３２では、順翻訳部２３１で生成された順翻訳文群から、Ｎ個の順翻訳文を選択する順翻訳文選択処理が実行され、逆翻訳文選択部２３４では、逆翻訳部２３３で生成された逆翻訳文群から、Ｍ個の逆翻訳文を選択する逆翻訳文選択処理が実行される。順翻訳文群に含まれる複数の順翻訳文の各々の評価スコアに基づいて、Ｎ個の順翻訳文が選択され、逆翻訳文群に含まれる複数の逆翻訳文の各々の評価スコアに基づいて、Ｍ個の逆翻訳文が選択される。 The forward translation sentence selection unit 232 executes a forward translation sentence selection process for selecting N forward translation sentences from the forward translation sentence group generated by the forward translation part 231, and the reverse translation sentence selection part 234 performs reverse translation. A reverse translation sentence selection process for selecting M reverse translation sentences from the reverse translation sentence group generated by the unit 233 is executed. Based on the evaluation score of each of the plurality of forward translation sentences included in the forward translation sentence group, N forward translation sentences are selected, and based on the evaluation score of each of the plurality of reverse translation sentences included in the reverse translation sentence group. Thus, M back-translated sentences are selected.

以下の説明は、順翻訳文選択処理、逆翻訳文選択処理のどちらの場合でもあてはまる内容であるため、翻訳文選択処理として説明し、順翻訳文と逆翻訳文とをまとめて翻訳文と表記する。また、順翻訳文群と逆翻訳文群は翻訳文群と表記する。さらに、実際には、順翻訳文はＮ個選択され、逆翻訳文はＭ個選択されるが、以下の説明では区別なくＮ個として表記して説明する。 The following explanation applies to both the forward translation sentence selection process and the reverse translation sentence selection process, so it is explained as the translation sentence selection process, and the forward translation sentence and the reverse translation sentence are collectively expressed as a translation sentence. To do. A forward translation sentence group and a reverse translation sentence group are referred to as a translation sentence group. Furthermore, in practice, N forward-translated sentences are selected and M reverse-translated sentences are selected, but in the following description, they are described as N without distinction.

ステップＳ５０１において、翻訳文群の中から評価スコアの高い翻訳文をＮ−ｋ個（１＜＝Ｎ、０＜＝ｋ＜＝Ｎ）選択する。ここで、評価スコアの例としては、翻訳精度を評価する手法としてよく用いられるＢＬＥＵが挙げられる。他にも、翻訳精度を評価する手法として、ＷＥＲ（ＷｏｒｄＥｒｒｏｒＲａｔｅ）、ＭＥＴＥＯＲ（ＭｅｔｒｉｃｆｏｒＥｖａｌｕａｔｉｏｎｏｆＴｒａｎｓｌａｔｉｏｎｗｉｔｈＥｘｐｌｉｃｉｔＯＲｄｅｒｉｎｇ）、ＲＩＢＥＳ（Ｒａｎｋ−ｂａｓｅｄＩｎｔｕｉｔｉｖｅＢｉｌｉｎｇｕａｌＥｖａｌｕａｔｉｏｎＳｃｏｒｅ）などがあるが、評価の手法はこれらのいずれを用いてもよいし、これらに限らず他の手法を用いてもよい。 In step S501, Nk translation sentences having a high evaluation score (1 <= N, 0 <= k <= N) are selected from the translation sentence group. Here, as an example of the evaluation score, BLEU which is often used as a method for evaluating the translation accuracy can be cited. Other methods for evaluating translation accuracy include WER (Word Error Rate), METEOR (Metric for Evaluation of Translation Ordering), and RIBES (Rank-based Intrinsic Evaluation). Any of these may be used, and other methods may be used without being limited thereto.

翻訳文群の中でステップＳ５０１において選択されなかった翻訳文から、残りのｋ個を選択する（Ｓ５０２）。 From the translation sentences, the remaining k sentences are selected from the translation sentences not selected in step S501 (S502).

ｋ個の翻訳文を選択するために、評価スコアが所定の閾値内の翻訳文を抽出し、抽出した翻訳文の中からランダムにｋ個選択するとしてよい。また、評価スコアが所定の閾値内の翻訳文を抽出し、抽出した翻訳文の中でスコアの低いものから順番にｋ個を選択するとしてもよい。Ｎ−ｋ個の翻訳文を選択する場合は、評価スコアの高い翻訳文を選択したが、ｋ個の翻訳文を選択する場合は、必ずしも評価スコアの高い翻訳文を選択するのではない。特定の評価基準によって機械的に付与される評価スコアの高い翻訳文ばかりを選択すると、選択した翻訳文は全て似通った内容の文章である可能性が高い。情報表示端末１００において複数の逆翻訳文をユーザに提示し、そこからユーザに一の逆翻訳文を選択させることを考慮すると、ある程度異なる観点で選択された複数の逆翻訳文を提示することが好ましい。 In order to select k translations, translations having an evaluation score within a predetermined threshold may be extracted, and k translations may be selected at random from the extracted translations. Alternatively, translation sentences having an evaluation score within a predetermined threshold may be extracted, and k pieces may be selected in order from the extracted translation sentences having the lowest score. When selecting Nk translations, a translation with a high evaluation score is selected. However, when selecting k translations, a translation with a high evaluation score is not necessarily selected. If only translated sentences having high evaluation scores that are mechanically given according to a specific evaluation criterion are selected, there is a high possibility that the selected translated sentences are all sentences having similar contents. In consideration of presenting a plurality of reverse translation sentences to the user on the information display terminal 100 and allowing the user to select one reverse translation sentence therefrom, a plurality of reverse translation sentences selected from a somewhat different viewpoint may be presented. preferable.

似通った逆翻訳文ばかりを提示して、その中からユーザに選択させると、後述する機械翻訳システムに対する機械学習処理において高い学習効果を得ることが出来ない懸念が生じる。ユーザによって選択された選択逆翻訳文と、ユーザによって選択されなかった非選択逆翻訳文とを教師データとして用いて機械学習処理を実行させる場合に、機械翻訳システムに対して正例（正解）として学習させた選択逆翻訳文と、負例（不正解）として学習させた非選択逆翻訳文とが似通った文章となるため、機械翻訳システムに対して正例と負例の顕著な差を示すことができず、学習の効果を期待することができない。そのため、本実施の形態で説明するように、ある程度異なる観点で複数の逆翻訳文を選択することが好ましい。 If only similar reverse translation sentences are presented and the user selects them, there is a concern that a high learning effect cannot be obtained in a machine learning process for a machine translation system described later. As a correct example (correct answer) for a machine translation system when machine learning processing is executed by using selected back-translated sentences selected by the user and non-selected back-translated sentences not selected by the user as teacher data Since the selected back-translated sentence learned and the non-selected back-translated sentence learned as a negative example (incorrect answer) are similar, the machine translation system shows a significant difference between the positive and negative examples. I can not expect the effect of learning. For this reason, as described in the present embodiment, it is preferable to select a plurality of back-translated sentences from a somewhat different viewpoint.

また、似通った逆翻訳文ばかりを提示すると、ユーザはそれらの逆翻訳文の細かな差異を考慮して一の逆翻訳文を選択する必要が生じるため、直感的に一の逆翻訳文を選択することができず、逆翻訳文の選択に時間がかかってしまうことも懸念される。本実施の形態のように、ある程度異なる観点で選択された複数の逆翻訳文を提示すれば、ユーザはその中から自身の意図する翻訳内容を直感的に選択可能となる。 In addition, when only similar reverse translation sentences are presented, the user needs to select one reverse translation sentence in consideration of fine differences between the reverse translation sentences, so one reverse translation sentence is selected intuitively. There is also a concern that it may take time to select a reverse translation sentence. If a plurality of reverse translation sentences selected from a somewhat different viewpoint are presented as in the present embodiment, the user can intuitively select the translation content intended by the user.

また、逆翻訳文は順翻訳文から生成されるため、順翻訳文を選択する段階においても、同様にある程度異なる観念で複数の順翻訳文を選択しておくことが好ましいと考えられる。 In addition, since the reverse translation sentence is generated from the forward translation sentence, it is considered preferable to select a plurality of forward translation sentences with a somewhat different idea in the same manner at the stage of selecting the forward translation sentence.

また、ｋ個の翻訳文を選択するための異なる手法として、過去のユーザ選択情報に基づいてｋ個の翻訳文を選択するとしてもよい。例えば、翻訳文毎に、ユーザによって過去に選択された回数などを記憶（逆翻訳文については直接選択された回数を記憶し、順翻訳文については対応する逆翻訳文が選択された回数を記憶する）しておいて、翻訳文群の中で記憶された回数が高いものから順番にｋ個の翻訳文を選択するとしてもよい。また、このような直接的な回数に基づいて選択するのではなく、ユーザの過去の機械翻訳システムの利用履歴に基づいて、翻訳対象文に対してユーザが選択しやすい翻訳文の傾向を分析し、分析した傾向に基づいてｋ個の翻訳文を選択するとしてもよい。なお、Ｎ−ｋ個の翻訳文を選択する選択基準と異なる選択基準を用いれば、ｋ個の翻訳文を選択する選択手法はこれらに限らない。 Further, as a different method for selecting k translated sentences, k translated sentences may be selected based on past user selection information. For example, for each translated sentence, the number of times selected by the user in the past is stored (for the back-translated sentence, the number of directly selected is stored, and for the forward-translated sentence, the number of times the corresponding back-translated sentence is selected is stored. In other words, k translations may be selected in order from the highest stored number of translations in the translation group. Also, instead of selecting based on such direct number of times, based on the user's past use history of the machine translation system, the tendency of the translated sentence that the user can easily select for the sentence to be translated is analyzed. , K translations may be selected based on the analyzed tendency. Note that the selection method for selecting k translated sentences is not limited to this as long as a selection criterion different from the selection criterion for selecting N−k translated sentences is used.

さらに、これらｋ個の翻訳文の選択方法においては、Ｎ−ｋ個の翻訳文と同じ趣旨の翻訳文を除く処理を行なってもよい。また、ｋ個の翻訳文の中でも同じ趣旨の翻訳文を除く処理を行なってもよい。または、Ｎ−ｋ個の翻訳文と異なる趣旨の翻訳文がｋ個の翻訳文に含まれていない、またはｋ個の翻訳文の中に少ない場合には、異なる趣旨の翻訳文を追加する処理を行なってもよい。また、逆翻訳文選択処理に関しては、翻訳対象文と比較して、同じ趣旨を有する逆翻訳文を選択するなどの処理を行なってもよい。 Furthermore, in the method for selecting these k translated sentences, a process of removing a translated sentence having the same meaning as the Nk translated sentences may be performed. Moreover, you may perform the process which excludes the translation sentence of the same meaning in k translation sentences. Alternatively, when k translations are not included in k translations or there are few translations having a different meaning from the Nk translations, a process of adding a translation having a different meaning is included. May be performed. Further, regarding the reverse translation sentence selection process, a process such as selecting a reverse translation sentence having the same purpose as compared with the translation target sentence may be performed.

例えば、翻訳文群に、疑問文・肯定文・否定文・命令文のそれぞれの形態の翻訳文がどれだけ含まれているか（文の異なり数）をカウントし、閾値以下であれば、再度、順翻訳処理または逆翻訳処理を行う。 For example, it counts how many translated sentences (question sentences, positive sentences, negative sentences, and imperative sentences) are included in the translated sentence group (the number of different sentences). Perform forward or reverse translation.

また、翻訳文に対して構文解析を行うなどの手法もある。翻訳文群に含まれる複数の翻訳文それぞれに対して構文解析を行い、それぞれ主語を示す単語が何であるかを判断し、主語を示す単語が何種類あったかを示す主語の異なり数が閾値以下であれば、再度、順翻訳処理または逆翻訳処理を行う。このとき、主語ではなく、述語について異なり数を算出するとしてもよい。また、その両方であってもよい。 There are also techniques such as performing syntax analysis on translated sentences. Parse multiple translated sentences included in the translation sentence group, determine what each word indicates the subject, and the number of different subjects indicating the number of different words indicating the subject is less than or equal to the threshold If there is, the forward translation process or the reverse translation process is performed again. At this time, different numbers may be calculated for the predicates rather than the subject. Moreover, both may be sufficient.

ここで、文の異なり数、主語の異なり数、動詞または目的語の異なり数などを評価スコアとして、異なり数が所定の数だけ含まれるように、順翻訳文選択処理および／または逆翻訳文選択処理を行うとしてもよい。 Here, the number of different sentences, the number of different subjects, the number of different verbs or objects, etc. are used as the evaluation score, and the forward translation sentence selection process and / or the reverse translation sentence selection so that the predetermined number of different numbers are included. Processing may be performed.

順翻訳文選択処理を実行する場合に、例えば、「文の異なり数を２以上とする」という評価基準を設け、文の異なり数を評価スコアとして順翻訳文を選択することができる。順翻訳文群に含まれる複数の順翻訳文の各々に対して、順番に構文解析および／または意味解析を用いて解析を行い、疑問文・肯定文・否定文・命令文のうち、いくつの種類が出現したかを文の異なり数としてスコア化し、２種類以上が含まれるｌ個の小集合を作成しても良い。 When executing the forward-translated sentence selection process, for example, an evaluation criterion “make the number of sentence differences 2 or more” can be provided, and the forward-translated sentence can be selected using the number of sentence differences as an evaluation score. For each of a plurality of forward-translated sentences included in the forward-translated sentence group, analysis is performed sequentially using syntax analysis and / or semantic analysis, and any number of question sentences, affirmative sentences, negative sentences, and imperative sentences is selected. Whether a type appears may be scored as the number of different sentences, and l small sets including two or more types may be created.

また例えば、逆翻訳文選択処理を実行する場合に、「主語の異なり数を所定数以下にする」という評価基準を設け、主語の異なり数を評価スコアとして逆翻訳文を選択することもできる。 Further, for example, when the reverse translation sentence selection process is executed, an evaluation criterion “make the number of different subject numbers equal to or less than a predetermined number” can be provided, and the reverse translation sentence can be selected using the number of different subject words as an evaluation score.

なお、順翻訳文選択処理または逆翻訳文選択処理は、これらの例に限定するものではなく任意の順翻訳文選択または逆翻訳文選択の手法を用いるとしてよい。 The forward translation sentence selection process or the reverse translation sentence selection process is not limited to these examples, and any forward translation sentence selection method or reverse translation sentence selection method may be used.

また、逆翻訳文選択処理に関しては、翻訳対象文との比較評価を行い、比較評価によって得られた値を評価スコアとしてもよい。例えば、翻訳対象文、および逆翻訳文群の中の複数の逆翻訳文に対して構文解析を行い、翻訳対象文と複数の逆翻訳文の各々との類似度を判断し、判断した類似度に基づいて、ｋ個の逆翻訳文を選択するとしてもよい。 Regarding the reverse translation sentence selection process, a comparison evaluation with the translation target sentence may be performed, and a value obtained by the comparison evaluation may be used as the evaluation score. For example, syntax analysis is performed on the translation target sentence and a plurality of reverse translation sentences in the reverse translation sentence group, and the similarity between the translation target sentence and each of the plurality of reverse translation sentences is determined, and the determined similarity degree Based on the above, k back-translated sentences may be selected.

また、これらの例に限らず、任意の評価スコアを組み合わせて用いても良い。さらに、Ｎを所定の数として予め設定し、更にそこに含まれる異なり数が所望の数となるようにスコアを組み合わせて用いても良い。 Moreover, you may use combining not only these examples but arbitrary evaluation scores. Furthermore, N may be set in advance as a predetermined number, and the score may be combined and used so that the different number included therein becomes a desired number.

上述の翻訳文の選択方法は一例であり、これらに限らない。 The above-described translation sentence selection method is an example, and is not limited thereto.

なお、ｋ＝０の場合は、Ｎ個すべてが評価スコアの高い翻訳文から順に選択されるものとなる。また、ｋ＝Ｎの場合は、Ｎ個の翻訳文すべてが評価スコアの高い翻訳文から順に選択される以外の選択方法によって選択される。 When k = 0, all N sentences are selected in order from the translation sentence with the highest evaluation score. When k = N, all the N translated sentences are selected by a selection method other than selecting in order from the translated sentence having the highest evaluation score.

ここでのシステムの基準とは、ＢＬＥＵなどの評価や人手評価などを用いて順翻訳文のスコアを算出し、例えば、スコアが低い順翻訳文に関しては逆翻訳処理を行わない（ある順翻訳文に関しては生成する逆翻訳文が０個）、スコアが高い順翻訳文は任意の数の逆翻訳文を得る（ある順翻訳文に関しては逆翻訳文が複数個生成される）などである。ユーザが逆翻訳文の数を決定する場合には、一例として、一つの順翻訳文に対して逆翻訳文をいくつ生成するかを設定する、などが考えられるが、これに限らない。 The standard of the system here is to calculate the score of the forward translation sentence using evaluation such as BLEU or manual evaluation, for example, the reverse translation processing is not performed for the forward translation sentence having a low score (a certain forward translation sentence) The number of reverse translation sentences to be generated is 0), and an arbitrary number of reverse translation sentences having a high score are obtained (a plurality of reverse translation sentences are generated for a certain forward translation sentence). When the user determines the number of back-translated sentences, for example, setting how many back-translated sentences are generated for one forward-translated sentence can be considered, but this is not restrictive.

図７は、本実施の形態における、逆翻訳文選択処理の具体的な動作を示すフローチャートである。 FIG. 7 is a flowchart showing a specific operation of the reverse translation sentence selection process in the present embodiment.

図５を用いた逆翻訳処理（ステップＳ４０５）の説明にて、逆翻訳文が生成されない順翻訳文が存在してもよいこと、一つの順翻訳文から複数の逆翻訳文が生成されるとしてもよいことなどを述べた。ここでは、一つの順翻訳文から得られた複数の逆翻訳文の中から、一つの逆翻訳文を選択する処理について説明する。 In the description of the reverse translation processing (step S405) using FIG. 5, it is assumed that there may be a forward translation sentence in which no reverse translation sentence is generated, and a plurality of reverse translation sentences are generated from one forward translation sentence. It was also good. Here, a process of selecting one reverse translation sentence from a plurality of reverse translation sentences obtained from one forward translation sentence will be described.

順翻訳文選択処理によって選択されたＮ個の順翻訳文すべてに対して、以下の処理が行われる。 The following processing is performed on all N forward-translated sentences selected by the forward-translated sentence selecting process.

ステップＳ６０１において、Ｎ個の順翻訳文の中の順翻訳文Ａに対して生成された逆翻訳文を抽出する。 In step S601, the reverse translation sentence generated for the forward translation sentence A among the N forward translation sentences is extracted.

ステップＳ６０２において、抽出した逆翻訳文の数を判断する。順翻訳文Ａに対して、逆翻訳文が生成されていなかったとき、つまり、順翻訳文Ａに対して生成された逆翻訳文が０個であったとき（ステップＳ６０２の０個）、順翻訳文Ａはユーザに提示されることがないため、削除する（ステップＳ６０３）。 In step S602, the number of extracted reverse translation sentences is determined. When no reverse translation sentence has been generated for forward translation sentence A, that is, when zero reverse translation sentences have been generated for forward translation sentence A (0 in step S602), Since the translated sentence A is not presented to the user, it is deleted (step S603).

次に、順翻訳文Ａに対して逆翻訳文が一個であるとき（ステップＳ６０２の１個）、その逆翻訳文は順翻訳文Ａに対応する逆翻訳文として決定される（ステップＳ６０４）。 Next, when there is one reverse translation sentence for the forward translation sentence A (one in step S602), the reverse translation sentence is determined as a reverse translation sentence corresponding to the forward translation sentence A (step S604).

最後に、順翻訳文Ａに対して逆翻訳文が二個以上生成されているとき（ステップＳ６０２の２個以上）、その中から順翻訳文Ａに対応する逆翻訳文として最適なものを一つ決定する（ステップＳ６０５）。決定方法としては、例えば、自動評価スコアや人手評価によるスコアを参照する方法を用いることによって選ばれる。これらを、順翻訳文Ｎ個すべてに対し繰り返す。 Finally, when two or more reverse-translated sentences are generated for the forward-translated sentence A (two or more in step S602), one of the optimum reverse-translated sentences corresponding to the forward-translated sentence A is selected. Are determined (step S605). The determination method is selected, for example, by using a method of referring to an automatic evaluation score or a score by manual evaluation. These are repeated for all N forward-translated sentences.

最後に、これらの処理で得られた逆翻訳文のうち、同趣旨の逆翻訳文を除く、または、異なる趣旨の逆翻訳文が少ない場合に異なる趣旨を持つ翻訳文を追加する、などの処理を必要に応じて行ってもよい（ステップＳ６０６）。ここでの処理は、図５の説明にて述べた処理と同じである。 Finally, from the back-translated sentences obtained by these processes, remove the back-translated sentence with the same meaning, or add a translated sentence with a different purpose when there are few back-translated sentences with a different purpose. May be performed as necessary (step S606). The processing here is the same as the processing described in the description of FIG.

なお、上述の説明では、図７のステップＳ６０５において、一つの順翻訳文に対して一つの逆翻訳文を選択するものとしていた。しかし、一つの順翻訳文に対して、複数個の逆翻訳文が選択される場合があってもよい。この場合、ユーザに提示される複数の逆翻訳文の中に、対応する順翻訳文が同一の逆翻訳文が存在することになる。 In the above description, one reverse translation sentence is selected for one forward translation sentence in step S605 of FIG. However, a plurality of back-translated sentences may be selected for one forward-translated sentence. In this case, among the plurality of reverse translation sentences presented to the user, there exists a reverse translation sentence having the same corresponding forward translation sentence.

図８は、本実施の形態における、フレーズ評価処理の具体的な動作を示すフローチャートである。 FIG. 8 is a flowchart showing a specific operation of the phrase evaluation process in the present embodiment.

フローの初期状態において、事前にフレーズに分割された、選択逆翻訳文、非選択逆翻訳文、および選択逆翻訳文と非選択逆翻訳文の各々に対応する順翻訳文を取得しているとする。また、これらの逆翻訳文および順翻訳文をフレーズ分割して得られたフレーズに対応するフレーズテーブルも同様に取得しているとする。このフレーズ評価処理は、逆翻訳文および順翻訳文それぞれについて行なわれるが、説明の便宜上、逆翻訳文に対するフレーズ分割処理を例に挙げて説明する。 In the initial state of the flow, when the selected translated sentence, the non-selected reverse translated sentence, and the forward translated sentence corresponding to each of the selected reverse translated sentence and the non-selected reverse translated sentence that have been divided into phrases in advance are acquired. To do. In addition, it is assumed that a phrase table corresponding to a phrase obtained by dividing the reverse translation sentence and the forward translation sentence into phrases is acquired in the same manner. This phrase evaluation process is performed for each of the reverse translation sentence and the forward translation sentence. For convenience of explanation, a phrase division process for the reverse translation sentence will be described as an example.

ステップＳ７０１において、選択逆翻訳文、および非選択逆翻訳文に対して、これらの逆翻訳文に含まれるフレーズを逆翻訳文毎に比較し、選択逆翻訳文のみに存在するフレーズの有無を確認する。 In step S701, with respect to the selected back-translated sentence and the non-selected back-translated sentence, the phrases included in these back-translated sentences are compared for each back-translated sentence, and the presence or absence of a phrase that exists only in the selected back-translated sentence is confirmed To do.

選択逆翻訳文のみに存在するフレーズが有る場合（ステップＳ７０１のＹｅｓ）は、選択逆翻訳文のみに存在するフレーズについては、ユーザ選択スコアを加点する（ステップＳ７０２）。このユーザ選択スコアは、選択文のみに現れるフレーズを良いものとするスコアであり、最終的に図９の処理において、順翻訳文（フレーズ評価処理の対象が順翻訳文の場合は、翻訳対象文）の対応するフレーズとともに、フレーズテーブルの日英翻訳確率または英日翻訳確率に反映される。このときのスコアの加点方法は、どのような方法を用いてもよい。例えば、該当するフレーズに対して一律に加点する、該当するフレーズの長さに依存して加点する、などの方法が想定される。スコアの加点が完了すると、フローをステップＳ７０３に進める。選択逆翻訳文のみに存在するフレーズが無い場合（ステップＳ７０１のＮｏ）は、特段の処理をせずに、フローをそのままステップＳ７０３に進める。 When there is a phrase that exists only in the selected reverse translation sentence (Yes in step S701), a user selection score is added to the phrase that exists only in the selected reverse translation sentence (step S702). This user-selected score is a score that favors phrases that appear only in the selected sentence. Finally, in the process of FIG. 9, in the case of a forward translation sentence (if the phrase evaluation process target is a forward translation sentence, the translation target sentence) ) Are reflected in the phrase table Japanese-English translation probability or English-Japanese translation probability. Any method may be used as the score addition method at this time. For example, a method of adding points uniformly to the corresponding phrase or adding points depending on the length of the corresponding phrase is assumed. When the score addition is completed, the flow proceeds to step S703. If there is no phrase that exists only in the selected reverse translation sentence (No in step S701), the flow proceeds directly to step S703 without performing any special processing.

同様に、ステップＳ７０３において、選択逆翻訳文、および非選択逆翻訳文に対して、これらの逆翻訳文に含まれるフレーズを逆翻訳文毎に比較し、非選択逆翻訳文のみに存在するフレーズの有無を確認する。 Similarly, in step S703, the phrase included in these reverse translation sentences is compared for each reverse translation sentence with respect to the selected reverse translation sentence and the non-selection reverse translation sentence, and the phrase exists only in the non-selection reverse translation sentence. Check if there is any.

非選択逆翻訳文のみに存在するフレーズが有る場合（ステップＳ７０３のＹｅｓ）、非選択逆翻訳文のみに存在するフレーズについては、ユーザ選択スコアを減点する（ステップＳ７０４）。このときのスコアの減点方法は、どのような方法を用いてもよい。例えば、該当するフレーズに対して一律に減点する、該当するフレーズの長さに依存して減点する、などの方法が想定される。 If there is a phrase that exists only in the non-selected reverse translation sentence (Yes in step S703), the user selection score is deducted for the phrase that exists only in the non-selected reverse translation sentence (step S704). Any method may be used as a score deduction method at this time. For example, a method of deducting points uniformly for the corresponding phrase or deducting points depending on the length of the corresponding phrase is assumed.

なお、ステップＳ７０２とＳ７０４において、ユーザ選択スコアの加点、減点は必ずしも必須ではない。つまり、図８のフローチャートにおいて、選択逆翻訳文のみに存在するフレーズに対してスコアの加点も減点も行わず、非選択逆翻訳文のみに存在するフレーズに対してユーザ選択スコアの減点を行うとしてもよい。または、選択逆翻訳文のみに存在するフレーズに対してユーザ選択スコアの加点を行い、非選択逆翻訳文のみに存在するフレーズに対してユーザ選択スコアの加点も減点も行わないとしてもよい。 In steps S702 and S704, addition or deduction of the user selection score is not always essential. That is, in the flowchart of FIG. 8, it is assumed that score addition or deduction is not performed for a phrase that exists only in a selected reverse translation sentence, and a user selection score is deducted for a phrase that exists only in a non-selected reverse translation sentence. Also good. Alternatively, the user selection score may be added to a phrase that exists only in the selected reverse translation sentence, and the user selection score may not be added or reduced to a phrase that exists only in the non-selected reverse translation sentence.

また、選択逆翻訳文と一部の非選択逆翻訳文に存在するフレーズに関して、スコアの加点を行なっても構わない。この場合、例えば、選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアと、非選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアを考慮した値（例えば、選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアと、非選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアの平均値）を加点することが考えられる。これらのスコアの加点方法については一例であり、これらに限らない。 In addition, scores may be added for phrases existing in the selected back-translated sentence and some non-selected back-translated sentences. In this case, for example, a value considering the user selection score of a phrase included only in the selected reverse translation sentence and a user selection score of a phrase included only in the non-selected reverse translation sentence (for example, a phrase included only in the selected reverse translation sentence) It is conceivable to add a user selection score of (2) and an average value of user selection scores of phrases included only in the non-selected reverse translation sentence. These score addition methods are merely examples, and are not limited thereto.

以下、図１２を用いて、具体例を挙げて説明する。例えば、情報表示端末１００に３つの逆翻訳文が提示され、その中の１つの逆翻訳文をユーザが選択した場合について説明する。このとき、情報表示端末１００に提示される３つの逆翻訳文は、逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０として、ユーザによって選択された逆翻訳文は、逆翻訳文ＲＳ１０であるとする。また、説明の便宜上、ユーザによって選択された逆翻訳文を選択逆翻訳文、ユーザによって選択されなかった逆翻訳文を非選択逆翻訳文と定義する。フローの初期状態において、選択逆翻訳文である逆翻訳文ＲＳ１０は、フレーズＰＨ１１、ＰＨ１２、ＰＨ１３に分割されている。また、非選択逆翻訳文についても同様に、逆翻訳文ＲＳ２０は、フレーズＰＨ２１、ＰＨ２２、ＰＨ２３に分割されており、逆翻訳文ＲＳ３０は、フレーズＰＨ３１、ＰＨ３２、ＰＨ３３に分割されている。 Hereinafter, a specific example will be described with reference to FIG. For example, a case will be described in which three reverse translation sentences are presented on the information display terminal 100 and the user selects one of the reverse translation sentences. At this time, the three reverse translation sentences presented on the information display terminal 100 are the reverse translation sentences RS10, RS20, and RS30, and the reverse translation sentence selected by the user is the reverse translation sentence RS10. For convenience of explanation, a reverse translation sentence selected by the user is defined as a selected reverse translation sentence, and a reverse translation sentence not selected by the user is defined as a non-selected reverse translation sentence. In the initial state of the flow, the reverse translation sentence RS10 that is the selected reverse translation sentence is divided into phrases PH11, PH12, and PH13. Similarly, for the non-selected reverse translation sentence, the reverse translation sentence RS20 is divided into phrases PH21, PH22, and PH23, and the reverse translation sentence RS30 is divided into phrases PH31, PH32, and PH33.

Ｓ７０１において、逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０の中で、選択逆翻訳文である逆翻訳文ＲＳ１０のみに存在するフレーズの有無を確認する。すると、選択逆翻訳文である逆翻訳文ＲＳ１０にのみフレーズＰＨ１２が含まれるため、フレーズＰＨ１２のユーザ選択スコアを加点して、「＋１」とする。 In S701, the presence / absence of a phrase that exists only in the reverse translation sentence RS10 that is the selected reverse translation sentence among the reverse translation sentences RS10, RS20, and RS30 is confirmed. Then, since the phrase PH12 is included only in the reverse translation sentence RS10 that is the selected reverse translation sentence, the user selection score of the phrase PH12 is added to “+1”.

同様に、Ｓ７０２において、非選択逆翻訳文である逆翻訳文ＲＳ２０、ＲＳ３０のみに存在するフレーズの有無を確認する。すると、非選択逆翻訳文にのみフレーズＰＨ２２（ＰＨ３２）とフレーズＰＨ３１が含まれるため、フレーズＰＨ２２（ＰＨ３２）とフレーズＰＨ３１のユーザ選択スコアをそれぞれ減点して、「−１」とする。 Similarly, in S702, the presence / absence of a phrase that exists only in the reverse translation sentences RS20 and RS30, which are non-selected reverse translation sentences, is confirmed. Then, since the phrase PH22 (PH32) and the phrase PH31 are included only in the non-selected reverse translation sentence, the user selection scores of the phrase PH22 (PH32) and the phrase PH31 are decremented to “−1”.

ここで、選択逆翻訳文、非選択逆翻訳文の両方に含まれるフレーズＰＨ１１（ＰＨ２１）、ＰＨ１３（ＰＨ２３、ＰＨ３３）に関しては、ユーザ選択スコアの加点または減点を行なわない。 Here, for the phrases PH11 (PH21) and PH13 (PH23, PH33) included in both the selected back-translated sentence and the non-selected back-translated sentence, the user-selected score is not added or reduced.

上述の処理によって、フレーズ毎の最終的なユーザ選択スコアの加点量または減点量は次の通りである。フレーズＰＨ１１（ＰＨ２１）に対しては「±０」、フレーズＰＨ３１に対しては「−１」、フレーズＰＨ２２（ＰＨ３２）に対しては「−１」、フレーズＰＨ１２に対しては「＋１」、フレーズＰＨ１３（ＰＨ２３、ＰＨ３３）に対しては「±０」といった加点量または減点量となる。ここでのスコアの加点量および減点量は一例であるため、これよりも大きいオーダーで加点または減点を行なってもよいし、これよりも小さいオーダーで加点または減点を行なってもよい。 With the above-described processing, the amount of points added or deducted for the final user-selected score for each phrase is as follows. “± 0” for phrase PH11 (PH21), “−1” for phrase PH31, “−1” for phrase PH22 (PH32), “+1” for phrase PH12, phrase For PH13 (PH23, PH33), the added amount or deducted amount is “± 0”. Since the score addition and deduction points here are only examples, points may be added or deducted in an order larger than this, or points may be added or deducted in an order smaller than this.

また、目的言語である順翻訳文に対するフレーズ評価処理について、図１２を用いて、以下に具体例を挙げて説明する。前述した逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０には、それぞれ順翻訳文ＴＳ１０、ＴＳ２０、ＴＳ３０が対応している。フローの初期状態において、順翻訳文ＴＳ１０は、フレーズＰＨ１４、ＰＨ１５、ＰＨ１６に分割されている。また、順翻訳文ＴＳ２０は、フレーズＰＨ２４、ＰＨ２５、ＰＨ２６に分割されており、ＴＳ３０は、フレーズＰＨ３４、ＰＨ３５、ＰＨ３６に分割されている。 Moreover, the phrase evaluation process with respect to the forward-translated sentence which is a target language is demonstrated with a specific example below using FIG. The above-described reverse translation sentences RS10, RS20, and RS30 correspond to the forward translation sentences TS10, TS20, and TS30, respectively. In the initial state of the flow, the forward translation TS10 is divided into phrases PH14, PH15, and PH16. Further, the forward translation TS20 is divided into phrases PH24, PH25, and PH26, and TS30 is divided into phrases PH34, PH35, and PH36.

Ｓ７０１において、選択逆翻訳文１に対応する順翻訳文１にのみ存在するフレーズの有無を確認する。すると、順翻訳文ＴＳ１０にのみフレーズＰＨ１６が含まれるため、フレーズＰＨ１６のユーザ選択スコアを加点して、「＋１」とする。 In S701, the presence or absence of a phrase that exists only in the forward translation sentence 1 corresponding to the selected reverse translation sentence 1 is confirmed. Then, since the phrase PH16 is included only in the forward translation sentence TS10, the user selection score of the phrase PH16 is added to “+1”.

同様に、Ｓ７０２において、非選択逆翻訳文である逆翻訳文ＲＳ２０、ＲＳ３０に対応する順翻訳文ＴＳ２０、ＴＳ３０のいずれかにのみ存在するフレーズの有無を確認する。すると、順翻訳文ＴＳ２０またはＴＳ３０にのみ、フレーズＰＨ２６（ＰＨ３６）とフレーズＰＨ３４が含まれるため、フレーズＰＨ２６（ＰＨ３６）とフレーズＰＨ３４のユーザ選択スコアをそれぞれ減点して、「−１」とする。 Similarly, in S702, the presence / absence of a phrase that exists only in one of the forward-translated sentences TS20 and TS30 corresponding to the reverse-translated sentences RS20 and RS30 that are non-selected reverse-translated sentences is confirmed. Then, since the phrase PH26 (PH36) and the phrase PH34 are included only in the forward translation TS20 or TS30, the user-selected scores of the phrase PH26 (PH36) and the phrase PH34 are decremented to “−1”.

選択逆翻訳文に対応する順翻訳文ＴＳ１０に含まれ、非選択逆翻訳文に対応する順翻訳文ＴＳ２０または順翻訳文ＴＳ３０のいずれかにも含まれる、フレーズＰＨ２４とフレーズＰＨ１５（ＰＨ２５、ＰＨ３５）に関しては、ユーザ選択スコアの加点または減点を行なわない。 Phrase PH24 and phrase PH15 (PH25, PH35) included in the forward translation sentence TS10 corresponding to the selected reverse translation sentence and included in either the forward translation sentence TS20 or the forward translation sentence TS30 corresponding to the non-selected reverse translation sentence With respect to, the user-selected score is not added or reduced.

上述の処理によって、フレーズ毎の最終的なスコアの加点量または減点量は、フレーズＰＨ２４（ＰＨ３４）に対しては「±０」、フレーズＰＨ３４に対しては「−１」、フレーズＰＨ２６（ＰＨ３６）に対しては「−１」、フレーズＰＨ１６に対しては「＋１」、フレーズＰＨ１５（ＰＨ２５、ＰＨ３５）に対しては「±０」となる。ここでのユーザ選択スコアの加点量および減点量は一例であるため、これよりも大きいオーダーで加点または減点を行なってもよいし、これよりも小さいオーダーで加点または減点を行なってもよい。 With the above-described processing, the final score addition or deduction amount for each phrase is “± 0” for the phrase PH24 (PH34), “−1” for the phrase PH34, and the phrase PH26 (PH36). Is "-1" for the phrase PH16, "+1" for the phrase PH16, and "± 0" for the phrase PH15 (PH25, PH35). Since the points added and deducted for the user-selected score here are examples, points may be added or deducted in an order larger than this, or points added or deducted in an order smaller than this.

図９は、本実施の形態における、学習処理の具体的な動作を示すフローチャートである。 FIG. 9 is a flowchart showing a specific operation of the learning process in the present embodiment.

ステップＳ８０１において、選択逆翻訳文に対応する順翻訳文に含まれるフレーズと、選択逆翻訳文に含まれるフレーズとのフレーズ対、もしくは選択逆翻訳文に対応する翻訳対象文に含まれるフレーズと、選択逆翻訳文に対応する順翻訳文に含まれるフレーズとのフレーズ対を取得する。フレーズ対とは、機械翻訳の際、原言語、目的言語の両言語間でそれぞれ対応が取られた（同じ意味を持つ）２つのフレーズのことである。さらに、図８の処理で得たユーザ選択スコアも同時に取得する。 In step S801, a phrase pair of a phrase included in the forward translation sentence corresponding to the selected reverse translation sentence and a phrase included in the selected reverse translation sentence, or a phrase included in the translation target sentence corresponding to the selected reverse translation sentence; A phrase pair with a phrase included in the forward translation corresponding to the selected reverse translation is acquired. A phrase pair is two phrases that have correspondence (having the same meaning) between the source language and the target language during machine translation. Furthermore, the user selection score obtained by the process of FIG.

このフレーズ対において、原言語から目的言語へ翻訳される場合に参照されるフレーズテーブルの値に対するユーザ選択スコアを、図１２に示す例を用いて定義すると、例えば、フレーズＰＨ３１→フレーズＰＨ３４：−１／フレーズＰＨ２２（ＰＨ３２）→フレーズＰＨ２６（ＰＨ３５）：−１／フレーズＰＨ１１（ＰＨ２１）→フレーズＰＨ１４（ＰＨ２４）：０／フレーズＰＨ１３（ＰＨ２３、ＰＨ３３）→フレーズＰＨ１５（ＰＨ２５、ＰＨ３５）：０／フレーズＰＨ１２→フレーズＰＨ１６：＋１のようになる。 In this phrase pair, if the user selection score for the value of the phrase table that is referred to when translated from the source language to the target language is defined using the example shown in FIG. 12, for example, phrase PH31 → phrase PH34: −1 / Phrase PH22 (PH32) → Phrase PH26 (PH35):-1 / Phrase PH11 (PH21) → Phrase PH14 (PH24): 0 / Phrase PH13 (PH23, PH33) → Phrase PH15 (PH25, PH35): 0 / Phrase PH12 → Phrase PH16: +1.

また、フレーズ対において、目的言語から原言語へ翻訳される場合に参照されるフレーズテーブルの値に対するユーザ選択スコアを定義すると、例えば、フレーズＰＨ３４→フレーズＰＨ３１：−１／フレーズＰＨ２６（ＰＨ３６）→フレーズＰＨ２２（ＰＨ３２）：−１／フレーズＰＨ１４（ＰＨ２４）→フレーズＰＨ１１（ＰＨ２１）：０／フレーズＰＨ１５（ＰＨ２５、ＰＨ３５）→フレーズＰＨ１３（ＰＨ２３、ＰＨ３３）：０／フレーズＰＨ１６→フレーズＰＨ１２：＋１のようになる。 Further, when a user selection score is defined for a phrase table value that is referred to when a target language is translated into a source language in a phrase pair, for example, phrase PH34 → phrase PH31: −1 / phrase PH26 (PH36) → phrase PH22 (PH32): -1 / phrase PH14 (PH24)-> phrase PH11 (PH21): 0 / phrase PH15 (PH25, PH35)-> phrase PH13 (PH23, PH33): 0 / phrase PH16-> phrase PH12: +1 Become.

ステップＳ８０２において、記憶部２４０に格納されているフレーズテーブルの英日翻訳確率または日英翻訳確率に、上述のユーザ選択スコアを反映する。またユーザ選択スコアに一定の値を乗算してからフレーズテーブルに反映するなど、ユーザ選択スコアに傾斜や重み付けを与えても構わない。 In step S802, the above-mentioned user selection score is reflected on the English-Japanese translation probability or the Japanese-English translation probability of the phrase table stored in the storage unit 240. In addition, the user selection score may be given a slope or weight, for example, by multiplying the user selection score by a certain value and reflecting the result in the phrase table.

これらを用いて、機械翻訳部２３０や逆翻訳部２３３において、強化学習や識別学習、ニューラルネット学習などの機械学習を行う。 Using these, the machine translation unit 230 and the reverse translation unit 233 perform machine learning such as reinforcement learning, identification learning, and neural network learning.

これまでの機械翻訳においては、図１１に示すようなフレーズテーブルの確率値を、対訳コーパス（異なる２つの言語間で、お互いが翻訳となっている文の対を集めたデータ）を元にチューニングすることは行われていたが、ユーザ選択文に含まれるものと含まれないものでスコアに差を付けて機械学習を行う手法はこれまでになく、本開示の機械翻訳システムは、よりユーザの評価結果を反映できる。 In conventional machine translation, the probability value of the phrase table as shown in FIG. 11 is tuned based on the bilingual corpus (data that collects pairs of sentences translated between two different languages). However, there has never been a method for performing machine learning with a difference in score between what is included in a user-selected sentence and what is not included in a user-selected sentence. Evaluation results can be reflected.

さらに、機械学習によって、予め用意された対訳コーパスから作成された翻訳モデルや言語モデルなどに対して、逐次、フレーズ単位でユーザの選択結果を混ぜ込みながら、翻訳モデルや言語モデルを学習させることができるため、精度を向上させることができる。 Furthermore, it is possible to train a translation model or a language model by mixing user selection results in phrase units sequentially with a translation model or a language model created from a pre-prepared bilingual corpus by machine learning. Therefore, the accuracy can be improved.

さらに、機械学習を行うことで、データを元に最適なパラメータが選択される。これらにより、人（ユーザ）の選択結果が翻訳システム反映されるため、人が使いやすい翻訳システムを構築することができる。 Further, by performing machine learning, an optimum parameter is selected based on the data. As a result, since the selection result of the person (user) is reflected in the translation system, it is possible to construct a translation system that is easy for a person to use.

なお、これらのような機械学習を行うだけではなく、得られたフレーズから新規コーパスを生成して、翻訳エンジン学習のための対訳コーパスとして利用することもできる。 In addition to performing machine learning like these, it is also possible to generate a new corpus from the obtained phrases and use it as a parallel corpus for translation engine learning.

図１０は、本実施の形態における、学習部２３８の具体的な処理を示すフローチャートである。図１２に示す逆翻訳文および順翻訳文を用いて、図１０に示すフローチャートの内容を説明する。 FIG. 10 is a flowchart showing specific processing of the learning unit 238 in the present embodiment. The contents of the flowchart shown in FIG. 10 will be described using the reverse translation sentence and the forward translation sentence shown in FIG.

ステップＳ９０１において、ユーザ選択文とその順翻訳文のフレーズ対を取得する。 In step S901, a phrase pair of a user-selected sentence and its forward translation sentence is acquired.

例えば、ユーザ提示文表示エリア１１０２に、逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０が表示された状態で、ユーザによって逆翻訳文ＲＳ１０が選択された場合（逆翻訳文ＲＳ１０がユーザ選択文である場合）を説明する。逆翻訳文ＲＳ１０がユーザによって選択されたため、フレーズＰＨ１１とフレーズＰＨ１４、フレーズＰＨ１２とフレーズＰＨ１６、フレーズＰＨ１３とフレーズＰＨ１５といったフレーズ対が取得される。 For example, when the reverse translation sentence RS10 is selected by the user in a state where the reverse translation sentences RS10, RS20, and RS30 are displayed in the user presentation sentence display area 1102 (when the reverse translation sentence RS10 is a user selection sentence). explain. Since the reverse translation RS10 is selected by the user, phrase pairs such as the phrase PH11 and the phrase PH14, the phrase PH12 and the phrase PH16, and the phrase PH13 and the phrase PH15 are acquired.

ステップＳ９０２において、機械翻訳部２３０での機械翻訳の際に使われた、入力文とその順翻訳文におけるフレーズ対を取得する。 In step S902, a phrase pair in the input sentence and the forward translation sentence used in the machine translation in the machine translation unit 230 is acquired.

例えば、逆翻訳文ＲＳ３０の示す内容が入力文である場合には、フレーズＰＨ３１とフレーズＰＨ３４、フレーズＰＨ３２とフレーズＰＨ３６、フレーズＰＨ３３とフレーズＰＨ３５といったフレーズ対が取得される。 For example, when the content indicated by the reverse translation RS30 is an input sentence, phrase pairs such as a phrase PH31 and a phrase PH34, a phrase PH32 and a phrase PH36, and a phrase PH33 and a phrase PH35 are acquired.

ステップＳ９０３において、入力文とユーザ選択文で取得されたフレーズに対し、目的言語の文字列が同一であるフレーズを取得する。例えば、ユーザ選択文におけるフレーズ対が次の通りであったとする。ユーザ選択文におけるフレーズ対は、フレーズＰＨ１１とフレーズＰＨ１４、フレーズＰＨ１２とフレーズＰＨ１６、フレーズＰＨ１３とフレーズＰＨ１５である。これに対し、入力文のフレーズ対が次の通りであったとする。入力文のフレーズ対は、フレーズＰＨ３１とフレーズＰＨ３４、フレーズＰＨ３２とＰＨ３６、フレーズＰＨ４１とＰＨ４２である。 In step S903, a phrase having the same target language character string is acquired for the phrase acquired in the input sentence and the user-selected sentence. For example, it is assumed that a phrase pair in the user-selected sentence is as follows. The phrase pairs in the user-selected sentence are phrase PH11 and phrase PH14, phrase PH12 and phrase PH16, phrase PH13 and phrase PH15. On the other hand, it is assumed that the phrase pair of the input sentence is as follows. The phrase pairs of the input sentence are phrase PH31 and phrase PH34, phrases PH32 and PH36, and phrases PH41 and PH42.

このとき、ユーザ選択文のフレーズと入力文のフレーズにおいて、原言語のフレーズＰＨ３３とフレーズＰＨ４１は、同じ意味を持つが表現の異なるフレーズである。また、フレーズＰＨ３３とフレーズＰＨ４１とは、ユーザ選択文のフレーズと入力文のフレーズの間で、それぞれ対応するフレーズである。 At this time, in the phrase of the user selected sentence and the phrase of the input sentence, the phrase PH33 and the phrase PH41 in the source language are phrases having the same meaning but different expressions. The phrase PH33 and the phrase PH41 are phrases corresponding to each other between the phrase of the user-selected sentence and the phrase of the input sentence.

最後に、目的言語が同一で原言語が異なるかどうかをチェックし、異なるものをパラフレーズ（言い換え）として保持する（Ｓ９０４のＹｅｓ、Ｓ９０５）。つまり、目的言語が同一でも原言語が異なるということから、これらは原言語における言い換えであるとみなすことができる。 Finally, it is checked whether the target languages are the same and the source languages are different, and different ones are held as paraphrases (paraphrases) (Yes in S904, S905). That is, since the source language is different even if the target language is the same, these can be regarded as paraphrases in the source language.

例えば、フレーズＰＨ３３とフレーズＰＨ４１は言い換えとみなすことができ、これらを原言語同士のパラフレーズとして保持する。 For example, the phrase PH33 and the phrase PH41 can be regarded as paraphrases, and these are held as paraphrases between the source languages.

このパラフレーズは、機械翻訳部２３０において、機械翻訳をする際に参照する、もしくは翻訳を行う前に、原言語側での言い換えをとして参照することできる。 This paraphrase can be referred to when performing machine translation in the machine translation unit 230, or can be referred to as paraphrasing on the source language side before translation.

図１３は、本実施の形態における、表示画面の一例を示す図である。 FIG. 13 is a diagram illustrating an example of a display screen in the present embodiment.

例えば、図１３（Ａ）に示すように、ユーザから翻訳対象である原文の入力を受け付けると、入力された原文ＯＳ１内容が入力文表示エリア１１０１に表示される。 For example, as shown in FIG. 13A, when the input of the original text to be translated is received from the user, the input original text OS1 content is displayed in the input text display area 1101.

次に、図１３（Ｂ）に示すように、原文ＯＳ１を翻訳した翻訳文に対する逆翻訳結果がユーザ提示文表示エリア１１０２に表示される。 Next, as shown in FIG. 13B, the reverse translation result for the translated sentence obtained by translating the original sentence OS1 is displayed in the user-presented sentence display area 1102.

一例としてここでは、３つの逆翻訳文を出力する形態について説明する。逆翻訳結果として、例えば逆翻訳文ＲＳ１、逆翻訳文ＲＳ２、逆翻訳文ＲＳ３が出力され、ユーザ提示文表示エリア１１０２に表示される。このとき、図１３（Ｂ）のユーザ提示文表示エリア１１０２に表示されている逆翻訳文ＲＳ１〜ＲＳ３は、原言語において、それぞれ同様の意味を有する類似文である。逆翻訳処理の特性上、これらは同様の意味を有する類似文であることが期待されるが、それぞれ異なる意味を有する文章を出力するよう実装されてもよい。 As an example, a mode in which three reverse translation sentences are output will be described here. As the reverse translation result, for example, the reverse translation sentence RS1, the reverse translation sentence RS2, and the reverse translation sentence RS3 are output and displayed in the user presentation sentence display area 1102. At this time, the reverse translation sentences RS1 to RS3 displayed in the user presented sentence display area 1102 in FIG. 13B are similar sentences having the same meanings in the source language. Although these are expected to be similar sentences having the same meaning due to the characteristics of the reverse translation processing, they may be implemented so as to output sentences having different meanings.

次に、図１３（Ｃ）に示すように、ユーザは、ユーザ提示文表示エリア１１０２に表示された逆翻訳結果を確認して、自分の意図した入力内容に一番近い逆翻訳文を選択する。ここでは、原文ＯＳ１に対して、例えば、逆翻訳文ＲＳ１が選択される。 Next, as shown in FIG. 13C, the user confirms the reverse translation result displayed in the user-presented sentence display area 1102, and selects the reverse translation sentence closest to the input content intended by the user. . Here, for example, a reverse translation sentence RS1 is selected for the original sentence OS1.

ユーザが逆翻訳文を選択すると、選択された逆翻訳文に対応する翻訳文が、翻訳結果表示エリア１１０３に表示される。ここでは、逆翻訳文ＲＳ１に対応する翻訳文である、翻訳文ＴＳ１が表示される。 When the user selects a reverse translation sentence, a translation sentence corresponding to the selected reverse translation sentence is displayed in the translation result display area 1103. Here, the translated sentence TS1, which is a translated sentence corresponding to the reverse translation sentence RS1, is displayed.

なお、画面表示に関しては、図１３（Ａ）、（Ｂ）、（Ｃ）に示す様なレイアウトに限らない。必要に応じて各種のボタンが配置されてもよく、例えば、翻訳対象の原文を入力した後に、ボタンに対する操作が行なわれると、翻訳処理が実行されるとしてもよい。また、ボタンに対する操作が行なわれることによって、ユーザ提示文表示エリア１１０２に逆翻訳文が表示されるとしてもよい。また、入力文表示エリア１１０１、ユーザ提示文表示エリア１１０２、翻訳結果表示エリア１１０３の配置や表示される内容、向きは上述の内容に限らない。 The screen display is not limited to the layout shown in FIGS. 13A, 13B, and 13C. Various buttons may be arranged as necessary. For example, after an original text to be translated is input, a translation process may be executed when an operation on the button is performed. In addition, a reverse translation sentence may be displayed in the user-sentence sentence display area 1102 by performing an operation on the button. Further, the arrangement, displayed contents, and orientation of the input sentence display area 1101, the user presentation sentence display area 1102, and the translation result display area 1103 are not limited to the above-described contents.

図１４は、本実施の形態における、表示画面の一例を示す図である。 FIG. 14 is a diagram showing an example of a display screen in the present embodiment.

図１３（Ｃ）とは表示が一部異なる。ここでは、翻訳結果表示エリア１２０１に表示されている文章の向きと、入力文表示エリア１２０２およびユーザ提示文表示エリア１２０３に表示されている文章の向きとが異なる。これは、２人のユーザ（原言語話者と目的言語話者）が情報表示端末を挟んで向かい合ってコミュニケーションを行なっている場面を想定している。すなわち、入力文表示エリア１２０２およびユーザ提示文表示エリア１２０３に表示されている文章は、原言語話者に合わせた向きで表示され、翻訳結果表示エリア１２０１に表示されている文章は、目的言語話者に合わせた向きで表示されている。これによって、原言語話者は、入力文に対して出力された翻訳文を目的言語話者に対して読み上げたり、目的言語話者が翻訳文を確認しやすいように情報表示端末の向きを変えたりする必要がないので、情報表示端末などを介して異なる言語を話すユーザ間で円滑なコミュニケーションが可能となる。なお、翻訳結果表示エリア１２０１の向きは、ユーザによる任意の操作で変更することが可能である。また、図１３同様、各エリアの配置や表示される内容、向きはこれらに限らない。 A part of the display is different from FIG. Here, the direction of the text displayed in the translation result display area 1201 is different from the direction of the text displayed in the input text display area 1202 and the user presented text display area 1203. This assumes a scene in which two users (a source language speaker and a target language speaker) communicate with each other across an information display terminal. That is, the sentences displayed in the input sentence display area 1202 and the user-provided sentence display area 1203 are displayed in a direction suitable for the source language speaker, and the sentences displayed in the translation result display area 1201 are the target language spoken words. It is displayed in the direction that suits the person. As a result, the source language speaker reads the translated sentence output for the input sentence to the target language speaker, or changes the direction of the information display terminal so that the target language speaker can easily confirm the translated sentence. Therefore, smooth communication is possible between users who speak different languages via an information display terminal or the like. The orientation of the translation result display area 1201 can be changed by an arbitrary operation by the user. Further, as in FIG. 13, the arrangement of each area, the displayed contents, and the orientation are not limited to these.

以上、本発明の一態様に係る翻訳方法について、実施の形態に基づいて説明したが、本発明はこれらの実施の形態に限定されるものではない。本発明の主旨を逸脱しない限り、当業者が思いつく各種変形を本実施の形態に施したもの、あるいは異なる実施の形態における構成要素を組み合わせて構築される形態も、本発明の範囲内に含まれる。 The translation method according to one aspect of the present invention has been described based on the embodiments. However, the present invention is not limited to these embodiments. Unless it deviates from the main point of this invention, the form which carried out the various deformation | transformation which those skilled in the art can think to this embodiment, or the structure constructed | assembled combining the component in a different embodiment is also contained in the scope of the present invention. .

例えば、上述の説明では、情報表示端末１００に提示される複数の逆翻訳文の中からユーザによって一つの逆翻訳文が選択されるものとしたが、複数の逆翻訳文が選択されてもよい。例えば、一つはユーザの選択として選ばれ、順翻訳文が提示されるが、その他のユーザ非選択文に対しても評価を行い、その結果を学習結果としてシステムに反映するものでもよい。ここでの評価の方法として、例えば、ユーザ非選択文に対して、ユーザが良い順、悪い順に順位付けを行う、ユーザ選択文と同程度に許容できる非選択文をユーザが選択する、明らかに許容できない非選択文をユーザが選択する、などといった方法で評価を行う。これらを行うことで、選択されなかった文に対しても評価を行うことができ、これらをシステムに反映することで、システムの学習に繋がる。 For example, in the above description, one reverse translation sentence is selected by the user from a plurality of reverse translation sentences presented on the information display terminal 100, but a plurality of reverse translation sentences may be selected. . For example, one is selected as a user's selection and a forward translation sentence is presented, but other user non-selected sentences may also be evaluated, and the result may be reflected in the system as a learning result. As an evaluation method here, for example, the user selects a non-selected sentence that is allowed in the same order as the user-selected sentence, in which the user ranks the user-non-selected sentence in the order of good and bad. Evaluation is performed by a method such as the user selecting an unselected sentence that is not acceptable. By doing these, it is possible to evaluate a sentence that was not selected, and reflecting these in the system leads to learning of the system.

また、上記の説明では、図１３のように入力文に対してテキストによりユーザ提示文や結果の翻訳文が出力されるものとしたが、これらに対して、テキストと音声、または音声のみで提示しても構わない。その場合、ユーザはユーザ提示文に対して、マイクロフォンを通じてユーザ提示文から一つを選択する、という方法で選択しても構わない。 Further, in the above description, as shown in FIG. 13, it is assumed that the user-provided sentence and the resulting translated sentence are output as text with respect to the input sentence. It doesn't matter. In that case, the user may select the user presented sentence by a method of selecting one from the user presented sentences through the microphone.

本発明にかかる機械翻訳方法は、言語情報を出力する情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行なう機械翻訳システムにおいて有用である。 The machine translation method according to the present invention is useful in a machine translation system that is connected to an information output device that outputs language information and performs a translation process between a first language and a second language.

１００情報表示端末
１０１通信部
１０２入力部
１０３出力部
１０４制御部
１０５選択文検出部
１０６記憶部
２００ネットワーク
２１０通信部
２２０制御部
２３０機械翻訳部
２３１順翻訳部
２３２順翻訳文選択部
２３３逆翻訳部
２３４逆翻訳文選択部
２３５選択文判断部
２３６フレーズ分割部
２３７選択結果評価部
２３８学習部
２４０記憶部
３００翻訳サーバ
４００マイク
５００スピーカー
１０００コンピュータ
１００１入力装置
１００２出力装置
１００３ＣＰＵ
１００４ＲＯＭ
１００５ＲＡＭ
１００６記憶装置
１００７読取装置
１００８送受信装置
１００９バス DESCRIPTION OF SYMBOLS 100 Information display terminal 101 Communication part 102 Input part 103 Output part 104 Control part 105 Selected sentence detection part 106 Storage part 200 Network 210 Communication part 220 Control part 230 Machine translation part 231 Forward translation part 232 Forward translation sentence selection part 233 Reverse translation part 234 Reverse translation selection unit 235 Selection sentence determination unit 236 Phrase division unit 237 Selection result evaluation unit 238 Learning unit 240 Storage unit 300 Translation server 400 Microphone 500 Speaker 1000 Computer 1001 Input device 1002 Output device 1003 CPU
1004 ROM
1005 RAM
1006 Storage device 1007 Reading device 1008 Transmission / reception device 1009 Bus

Claims

A machine translation method in a machine translation system that performs translation between a first language and a second language by connecting to an information output device that outputs language information,
Receiving the sentence to be translated in the first language;
Generating a plurality of different forward-translated sentences obtained by translating the received sentence to be translated into the second language;
Generating a plurality of back-translated sentences back-translated into the first language for each of the plurality of different forward-translated sentences;
When an operation for selecting one reverse translation sentence from the plurality of reverse translation sentences is received when outputting the plurality of reverse translation sentences in the information output device, the one corresponding to the one reverse translation sentence Output forward translation,
Machine translation method.

The machine translation system is further connected to a speech input device that accepts speech input by a user, and a text input device that accepts text input by a user,
The translation target sentence is received in the form of audio information representing the translation sentence, or text information,
Changing the output form of the forward-translated sentence corresponding to the one reverse-translated sentence, depending on whether the sentence to be translated is received in the form of the audio information or the text information;
The machine translation method according to claim 1.

The information output device has an audio output device and a display,
When the translation target sentence is received in the form of voice information, the forward translation sentence corresponding to the one reverse translation sentence is output via the voice output device,
When the translation target sentence is received in the form of text information, the forward translation sentence corresponding to the one reverse translation sentence is output via the display.
The machine translation method according to claim 2.

The machine translation system is further connected to a text input device that accepts text input by a user,
The translation target sentence is received as text information indicating the translation target sentence from the text input device,
Based on the text information, generate a plurality of different forward-translated sentences obtained by translating the sentence to be translated into the second language.
The machine translation method according to claim 1.

The machine translation system is further connected to a voice input device that accepts user voice input,
The translation target sentence is received as voice information representing the translation target sentence from the voice input device;
Generate text information indicating the translation target sentence by performing a speech recognition process on the received speech information,
Based on the text information, generate a plurality of different forward-translated sentences obtained by translating the sentence to be translated into the second language.
The machine translation method according to claim 1.

The information output device has a display;
The plurality of reverse translation sentences are displayed in a first area of the display,
The translation target sentence is displayed in a second area different from the first area of the display;
The machine translation method according to claim 1.

In the third area of the display, the forward translation sentence corresponding to the one reverse translation sentence is displayed.
The machine translation method according to claim 6.

The forward translation sentence corresponding to the one reverse translation sentence is displayed in a different direction from the plurality of reverse translation sentences displayed in the first region.
The machine translation method according to claim 7.

The machine translation system includes:
A set of forward translation sentences obtained by translating the received translation target sentence into the second language, and generating a forward translation sentence group including the plurality of different forward translation sentences;
For each of the forward-translated sentences included in the forward-translated sentence group, determine whether it is classified into a question sentence, an affirmative sentence, a negative sentence, or a command sentence,
The plurality of different forward-translated sentences are determined from the forward-translated sentence group based on the classified form.
The machine translation method according to claim 1.

The plurality of different forward-translated sentences include at least two or more forward-translated sentences each classified into the different forms.
The machine translation method according to claim 9.

The machine translation system includes:
Generating a forward translation sentence group that is a set of the forward translation sentences obtained by translating the received translation target sentence into the second language;
The forward translation sentence group includes the plurality of different forward translation sentences,
Determining the subject or predicate of each of the forward-translated sentences included in the forward-translated sentence group;
The plurality of different forward-translated sentences are determined from the forward-translated sentence group based on the determined subject or previous description word.
The machine translation method according to claim 1.

The machine translation system includes:
A set of at least one reverse translation sentence generated for each of the plurality of different forward translation sentences, and generating a reverse translation sentence group including the plurality of reverse translation sentences;
For each of the reverse translation sentences included in the reverse translation group, calculate an evaluation value that evaluates the similarity with the translation target sentence,
The plurality of reverse translation sentences are selected from the group of reverse translation sentences based on the evaluation value.
The machine translation method according to claim 1.

The machine translation system includes:
A set of at least one reverse translation sentence generated for each of the plurality of different forward translation sentences, and generating a reverse translation sentence group including the plurality of reverse translation sentences;
For each of the back-translated sentences included in the back-translated sentence group, determine whether it is classified into a question sentence, a positive sentence, a negative sentence, or a command sentence,
The plurality of reverse translation sentences are selected from the group of reverse translation sentences based on the classified form.
The machine translation method according to claim 1.

The plurality of reverse translation sentences include at least two or more reverse translation sentences classified into the different forms.
The machine translation method according to claim 13.

The machine translation system includes:
A set of at least one reverse translation sentence generated for each of the plurality of different forward translation sentences, and generating a reverse translation sentence group including the plurality of reverse translation sentences;
Determining the subject or predicate of each of the reverse translation sentences included in the reverse translation sentence group;
The plurality of back-translated sentences are selected from the back-translated sentence group based on the determined subject or predescription word.
The machine translation method according to claim 1.

The plurality of back-translated sentences include at least two or more back-translated sentences that are determined to include different subjects or predicates.
The machine translation method according to claim 15.

The plurality of back-translated sentences are back-translated sentences determined to include the same subject or predicate,
The machine translation method according to claim 15.

The probability model includes a weight value given to each word or phrase used in the translation process,
The machine translation system includes:
A word or phrase included in a selected translated sentence that is a forward translated sentence corresponding to the one reverse translated sentence, and an unselected translated sentence that is a forward translated sentence corresponding to a reverse translated sentence other than the one reverse translated sentence Compare with the word or phrase it contains
A word or phrase included only in the selected translated sentence;
A word or phrase included only in the non-selected order translation, and
The weight value is updated by applying a different updating method of the weight value to words or phrases included in both the selected order translated sentence and the non-selected order translated sentence, and the updated weight value And performing the machine learning using the word or the phrase corresponding to the updated weight value as teacher data,
The machine translation method according to claim 15.

The probability model includes a weight value given to each word or phrase used in the translation process,
The machine translation system includes:
Comparing a word or phrase included in the one reverse translation sentence with a word or phrase included in a non-selected reverse translation sentence that is a reverse translation sentence other than the one reverse translation sentence,
A word or phrase included only in the one reverse translation sentence;
A word or phrase included only in the unselected back-translated sentence;
For each word or phrase included in both the one back-translated sentence and the non-selected back-translated sentence, the weight value is updated by applying different weight value updating methods,
Performing the machine learning using the updated weight value and the word or the phrase corresponding to the updated weight value as teacher data;
The machine translation method according to claim 15.

A machine translation device that performs a translation process between a first language and a second language,
An input unit for receiving input of the translation target sentence in the first language;
A translation unit that generates a forward translation sentence obtained by translating the sentence to be translated into the second language, and a reverse translation sentence obtained by reverse translating the forward translation sentence into the first language;
An output unit that outputs the forward-translated sentence corresponding to the reverse-translated sentence and the one reverse-translated sentence;
A user input unit that accepts a user input, wherein the translation unit generates a plurality of different forward-translated sentences for the translation target sentence, and a plurality of reverse translations corresponding to each of the plurality of different forward-translated sentences Generate statements,
When the output unit receives the input for selecting one reverse translation sentence from the plurality of reverse translation sentences in the user input unit while outputting the plurality of reverse translation sentences, Outputting the forward translation corresponding to the translation,
Machine translation device.

A program for connecting to an information output device and controlling the operation of a machine translation device that performs translation processing between a first language and a second language,
For the computer of the machine translation device,
Receiving a sentence to be translated in the first language;
Generating a plurality of different forward-translated sentences obtained by translating the received sentence to be translated into the second language;
Generating a plurality of back-translated sentences back-translated into the first language for each of the plurality of different forward-translated sentences;
In the information output device, when an operation for selecting one reverse translation sentence from the plurality of reverse translation sentences is received while the plurality of reverse translation sentences are displayed, the one reverse translation sentence is changed to the first reverse translation sentence. To output forward-translated sentences translated into two languages,
program.