Abstract
This work presents the evaluation results of a novel technique for word order errors correction, using non native English speakers’ corpus. This technique, which is language independent, repairs word order errors in sentences using the probabilities of most typical trigrams and bigrams extracted from a large text corpus such as the British National Corpus (BNC). A good indicator of whether a person really knows a language is the ability to use the appropriate words in a sentence in correct word order. The “scrambled” words in a sentence produce a meaningless sentence. Most languages have a fairly fixed word order. For non-native speakers and writers, word order errors are more frequent in English as a Second Language. These errors come from the student if he is translating (thinking in his/her native language and trying to translate it into English). For this reason, the experimentation task involves a test set of 50 sentences translated from Greek to English. The purpose of this experiment is to determine how the system performs on real data, produced by non native English speakers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Michaud, L., McCoy, K.F., Pennington, C.A.: An Intelligent Tutoring System for Deaf Learners of Written English. In: Proc. 4th International ACM SIGCAPH Conference on Assistive Technologies (ASSETS), Arlington, pp. 92–100 (2000)
Bender, E.M., Flickinger, D., Oepen, S., Walsh, A., Baldwin, T.: Arboretum: Using a Precision Grammar for Grammar Checking in CALL. In: Proc. InSTIL/ICALL Symposium on Computer Assisted Learning, Venice, Italy, pp. 83-86 (2004)
Naber, D.: A rule based style and grammar checker. Master Thesis, Bielefeld University (2003)
Fouvry, F.: Constraint Relaxation with Weighted Feature Structures. In: Proc. 8th International Workshop on Parsing Technologies, Nancy, France, pp. 23-25 (2003)
Vogel, C., Cooper, R.: Robust Chart Parsing with Mildly Inconsistent Feature Structures. Nonclassical Feature Systems 10, 127–136 (1995)
Lee, J.: Seneff. S.: Automatic Grammar Error Correction for Second-Language Learners. In: Interspeech, paper 1299-Wed3A3O.1 (2006)
Atwell, E.S.: How to detect grammatical errors in a text without parsing it. In: Proceedings of the 3rd EACL, pp. 38–45 (1987)
Chodorow, M., Leacock, C.: An unsupervised method for detecting grammatical errors. In: Proceedings of NAACL 2000, pp. 140–147 (2000)
Izumi, E., Uchimoto, K., Saiga, T., Supnithi, T., Isahara, H.: Automatic Error Detection in the Japanese Learners’ English Spoken Data. In: Proc. ACL, Sapporo, Japan, pp. 145–148 (2003)
Costa-Jussà, M.R., Fonollosa, J.A.R.: An Ngram-based reordering model. Computer Speech & Language 23(3), 362–375 (2009)
Sun, G., Liu, X., Cong, G., Zhou, M., Xiong, Z., Lee, J., Lin, C.: Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. In: Proceedings of the ACL, Prague, pp. 81–88 (2007)
More, J.: A grammar Checker based on Web searching. Digithum [online article]. Iss. 8. UOC (2006)
Heift, T.: Intelligent Language Tutoring Systems for Grammar Practice. Zeitschrift für Interkulturellen Fremdsprachenunterricht (Online) 6(2), 15 (2001)
Bigert, J., Knutsson, O.: Robust error detection: A hybrid approach combining unsupervised error detection and linguistic knowledge. In: Proceedings of Robust Methods in Analysis of Natural language Data (ROMAND 2002), pp. 10–19 (2002)
Sjöbergh, J., Knutsson, O.: Faking errors to avoid making errors: Very weakly supervised learning for error detection in writing. In: the Proceedings of RANLP 2005, Borovets, Bulgaria (2005)
Young, S.J.: Large Vocabulary Continuous Speech Recognition. IEEE Signal Processing Magazine 13(5), 45–57 (1996)
Good, I.J.: The population frequencies of species and the estimation of population parameters. Biometrika 40(3 and 4), 237–264 (1953)
Katz, S.M.: Estimation of probabilities from sparse data for the language model component of a speech recogniser. IEEE Transactions on Acoustics, Speech and Signal Processing 35(3), 400–401 (1987)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Athanaselis, T., Bakamidis, S., Dologlou, I. (2011). Performance Evaluation of a Novel Technique for Word Order Errors Correction Applied to Non Native English Speakers’ Corpus. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19437-5_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-19437-5_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19436-8
Online ISBN: 978-3-642-19437-5
eBook Packages: Computer ScienceComputer Science (R0)