A preliminary evaluation of metadata records machine translation
Abstract
Purpose
The purpose of this study is to evaluate freely available machine translation (MT) services' performance in translating metadata records.
Design/methodology/approach
Randomly selected metadata records were translated from English into Chinese using Google, Bing, and SYSTRAN MT systems. These translations were then evaluated using a five point scale for both fluency and adequacy. Missing count (words not translated) and incorrect count (words incorrectly translated) were also recorded.
Findings
Concerning both fluency and adequacy, Google and Bing's translations of more than 70 percent of test data received scores equal to or greater than three, representative of “non‐native Chinese” and “much coverage,” respectively. SYSTRAN scored lowest in both measures. However, these differences were not statistically significant. A Pearson correlation analysis demonstrated a strong relationship (r=0.86) between fluency and adequacy. Missing count and incorrect count strongly correlated with fluency and adequacy.
Originality/value
Most existing digital collections can be accessed in English alone. Few digital collections in the USA support multilingual information access (MLIA) that enables users of differing languages to search, browse, recognize and use information in the collections. Human translation is one solution, but it is neither time nor cost effective for most libraries. This study serves as a first step to understand the performance of current MT systems and to design effective and efficient MLIA services for digital collections.
Keywords
Citation
Chen, J., Ding, R., Jiang, S. and Knudson, R. (2012), "A preliminary evaluation of metadata records machine translation", The Electronic Library, Vol. 30 No. 2, pp. 264-277. https://doi.org/10.1108/02640471211221377
Publisher
:Emerald Group Publishing Limited
Copyright © 2012, Emerald Group Publishing Limited