[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/974358.974372dlproceedingsArticle/Chapter ViewAbstractPublication PagesanlcConference Proceedingsconference-collections
Article
Free access

Improving language models by clustering training sentences

Published: 13 October 1994 Publication History

Abstract

Many of the kinds of language model used in speech understanding suffer from imperfect modeling of intra-sentential contextual influences. I argue that this problem can be addressed by clustering the sentences in a training corpus automatically into subcorpora on the criterion of entropy reduction, and calculating separate language model parameters for each cluster. This kind of clustering offers a way to represent important contextual effects and can therefore significantly improve the performance of a model. It also offers a reasonably automatic means to gather evidence on whether a more complex, context-sensitive model using the same general kind of linguistic information is likely to reward the effort that would be required to develop it: if clustering improves the performance of a model, this proves the existence of further context dependencies, not exploited by the unclustered model. As evidence for these claims, I present results showing that clustering improves some models but not others for the ATIS domain. These results are consistent with other findings for such models, suggesting that the existence or otherwise of an improvement brought about by clustering is indeed a good pointer to whether it is worth developing further the unclustered model.

References

[1]
Agnäs, M-S., et al (1994). Spoken Language Translator First Year Report. SRI International Cambridge Technical Report CRC-043.
[2]
Alshawi, H., and D. M. Carter (1994). "Training and Scaling Preference Functions for Disambiguation". Computational Linguistics (to appear).
[3]
Briscoe, T., and J. Carroll (1993). "Generalized Probabilistic LR Parsing of Natural Language (Corpora) with Unification-Based Grammars", Computational Linguistics, Vol 19:1, 25--60.
[4]
Cover, T. M., and J. A. Thomas (1991). Elements of Information Theory. New York: Wiley.
[5]
Everitt, B. S. (1993). Cluster Analysis, Third Edition. London: Edward Arnold.
[6]
Iyer, R., M. Ostendorf and J. R. Rohlicek (1994). "Language Modeling with Sentence-Level Mixtures". Proceedings of the ARPA Workshop on Human Language Technology.
[7]
Jelinek, F., B. Merialdo, S. Roukos and M. Strauss (1991). "A Dynamic Language Model for Speech Recognition", Proceedings of the Speech and Natural Language DARPA Workshop, Feb. 1991, 293--295.
[8]
Katz, S. M. (1987). "Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer", IEEE Transactions on Acoustics, Speech and Signal Processing, Vol ASSP-35:3.
[9]
Lewin, I., D. M. Carter, S. Pulman, S. Browning, K. Ponting and M. Russell (1993). "A Speech-Based Route Enquiry System Built From General-Purpose Components", Proceedings of Eurospeech-93.
[10]
Murveit, H., J. Butzberger, V. Digalakis and M. Weintraub (1993). "Large Vocabulary Dictation using SRI's DECIPHER(TM) Speech Recognition System: Progressive Search Techniques", Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Minneapolis, Minnesota.
[11]
Ney, H., U. Essen and R. Kneser (1994). "On Structuring Probabilistic Dependencies in Stochastic Language Modeling". Computer Speech and Language, vol 8:1, 1--38.
[12]
Pereira, F., N. Tishby and L. Lee (1993). "Distributional Clustering of English Words". Proceedings of ACL-93, 183--190.
[13]
Rayner, M., D. Carter, V. Digalakis and P. Price (1994). "Combining Knowledge Sources to Reorder N-best Speech Hypothesis Lists". Proceedings of the ARPA Workshop on Human Language Technology.
[14]
Rosenfeld, R. (1994). "A Hybrid Approach to Adaptive Statistical Language Modeling". Proceedings of the ARPA Workshop on Human Language Technology.

Cited By

View all
  • (2018)Bilingual Cluster Based Models for Statistical Machine TranslationIEICE - Transactions on Information and Systems10.1093/ietisy/e91-d.3.588E91-D:3(588-597)Online publication date: 16-Dec-2018
  • (2008)Dynamic model interpolation for statistical machine translationProceedings of the Third Workshop on Statistical Machine Translation10.5555/1626394.1626428(208-215)Online publication date: 19-Jun-2008
  • (1998)Text segmentation with multiple surface linguistic cuesProceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 210.3115/980691.980714(881-885)Online publication date: 10-Aug-1998

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ANLC '94: Proceedings of the fourth conference on Applied natural language processing
October 1994
226 pages

Sponsors

  • ACL: Association for Computational Linguistics
  • Gesellschaft ffir Informatik

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 13 October 1994

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)3
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Bilingual Cluster Based Models for Statistical Machine TranslationIEICE - Transactions on Information and Systems10.1093/ietisy/e91-d.3.588E91-D:3(588-597)Online publication date: 16-Dec-2018
  • (2008)Dynamic model interpolation for statistical machine translationProceedings of the Third Workshop on Statistical Machine Translation10.5555/1626394.1626428(208-215)Online publication date: 19-Jun-2008
  • (1998)Text segmentation with multiple surface linguistic cuesProceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 210.3115/980691.980714(881-885)Online publication date: 10-Aug-1998

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media