[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1075812.1075828dlproceedingsArticle/Chapter ViewAbstractPublication PageshltConference Proceedingsconference-collections
Article
Free access

Language modeling with sentence-level mixtures

Published: 08 March 1994 Publication History

Abstract

This paper introduces a simple mixture language model that attempts to capture long distance constraints in a sentence or paragraph. The model is an m-component mixture of trigram models. The models were constructed using a 5K vocabulary and trained using a 76 million word Wall Street Journal text corpus. Using the BU recognition system, experiments show a 7% improvement in recognition accuracy with the mixture trigram models as compared to using a trigram model.

References

[1]
F. Jelinek, B. Merialdo, S. Roukos and M. Strauss, "A Dynamic LM for Speech Recognition," Proc. DARPA Workshop on Speech and Natural Language, pp. 293--295, 1991.
[2]
R. Lau, R. Rosenfeld and S. Roukos, "Trigger-Based Language Models: a Maximum Entropy Approach," Proc. Int'l. Conf. on Acoust., Speech and Signal Proc., Vol. II, pp. 45--48, 1993.
[3]
R. Rosenfeld, "A Hybrid Approach to Adaptive Statistical Language Modeling," this proceedings.
[4]
L. R. Bahl, P. F. Brown, P. V. deSouza and R. L. Mercer, "A Tree-Based Statistical Language Model for Natural Language Speech Recognition," IEEE Trans. on Acoust., Speech, and Signal Proc., Vol. 37, No. 7, pp. 1001--1008, 1989.
[5]
J. H. Wright, G. J. F. Jones and H. Lloyd-Thomas, "A Consolidated Language Model For Speech Recognition," Proc. EuroSpeech, Vol. 2, pp. 977--980, 1993.
[6]
M. Meteer and J. R. Rohlicek, "Statistical Language Modeling Combining n-gram and Context Free Grammars," Proc. Int'l. Conf. on Acoust., Speech and Signal Proc., Vol. 2, pp. 37--40, 1993.
[7]
J. Lafferty, "Integrating Probabilistic Finite-State and Context-Free Models of Language," presentation at the IEEE ASR Workshop, December 1993.
[8]
R. Kneser and V. Steinbiss, "On the Dynamic Adaptation Of Stochastic LM," Proc. Int'l. Conf. on Acoust., Speech and Signal Proc., Vol. 2, pp. 586--589, 1993.
[9]
R. Kuhn and R. de Mori, "A Cache Based Natural Language Model for Speech Recognition." IEEE Trans. PAMI, Vol. 14, pp. 570--583, 1992.
[10]
M. Ostendorf, A. Kannan, S. Austin, O. Kimball, R. Schwartz and J. R. Rohlicek, "Integration of Diverse Recognition Methodologies Through Reevaluation of N-Best Sentence Hypotheses." Proc. DARPA Workshop on Speech and Natural Language, pp. 83--87, February 1991.
[11]
H. Witten and T. C. Bell, "The Zero Frequency Estimation of Probabilities of Novel Events in Adaptive Text Compression," IEEE Trans. Information Theory, Vol. IT-37, No. 4, pp. 1085--1094, 1991.
[12]
P. Placeway and R. Schwartz, "Estimation Of Powerful LM from Small and Large Corpora," Proc. Int'l. Conf. on Acoust., Speech and Signal Proc., Vol. 2, pp. 33--36, 1993.
[13]
A. P. Dempster, N. M. Laird and D. B. Rubin, "Maximum Likelihood Estimation from Incomplete Data," Journal of the Royal Statistical Society (B), Vol. 39, No. 1, pp. 1--38, 1977.
[14]
BBN Byblos November 1993 WSJ Benchmark system.
[15]
R. Schwartz et al., "On Using Written Language Training Data for Spoken Language Modeling," this proceedings.
[16]
M. Elbeze and A.-M. Derouault, "A Morphological Model for Large Vocabulary Speech Recognition," Proc. Int'l. Conf. on Acoust., Speech and Signal Proc., Vol. 1, pp. 577--580, 1990.
[17]
D. Pallett, J. Fiscus, W. Fisher, J. Garofolo, B. Lund and M. Pryzbocki, "1993 Benchmark Tests for the ARPA spoken Language Program," this proceedings.

Cited By

View all
  • (2015)Recurrent neural network language model adaptation with curriculum learningComputer Speech and Language10.1016/j.csl.2014.11.00433:1(136-154)Online publication date: 1-Sep-2015
  • (2013)On the dynamic adaptation of language models based on dialogue informationExpert Systems with Applications: An International Journal10.1016/j.eswa.2012.08.02940:4(1069-1085)Online publication date: 1-Mar-2013
  • (2011)Combining topic specific language modelsProceedings of the 14th international conference on Text, speech and dialogue10.5555/2040037.2040052(99-106)Online publication date: 1-Sep-2011
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
HLT '94: Proceedings of the workshop on Human Language Technology
March 1994
479 pages
ISBN:1558603573

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 08 March 1994

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 240 of 768 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)42
  • Downloads (Last 6 weeks)10
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2015)Recurrent neural network language model adaptation with curriculum learningComputer Speech and Language10.1016/j.csl.2014.11.00433:1(136-154)Online publication date: 1-Sep-2015
  • (2013)On the dynamic adaptation of language models based on dialogue informationExpert Systems with Applications: An International Journal10.1016/j.eswa.2012.08.02940:4(1069-1085)Online publication date: 1-Mar-2013
  • (2011)Combining topic specific language modelsProceedings of the 14th international conference on Text, speech and dialogue10.5555/2040037.2040052(99-106)Online publication date: 1-Sep-2011
  • (2010)Topic-Dependent Language Model with Voting on Noun HistoryACM Transactions on Asian Language Information Processing10.1145/1781134.17811379:2(1-31)Online publication date: 1-Jun-2010
  • (2009)Combining Topic Information and Structure Information in a Dynamic Language ModelProceedings of the 12th International Conference on Text, Speech and Dialogue10.1007/978-3-642-04208-9_32(218-225)Online publication date: 25-Aug-2009
  • (2002)Language model adaptation with additional text generated by machine translationProceedings of the 19th international conference on Computational linguistics - Volume 110.3115/1072228.1072392(1-7)Online publication date: 24-Aug-2002
  • (2000)Dialogue act modeling for automatic tagging and recognition of conversational speechComputational Linguistics10.1162/08912010056173726:3(339-373)Online publication date: 1-Sep-2000
  • (1999)Dynamic nonlocal language modeling via hierarchical topic-based adaptationProceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics10.3115/1034678.1034711(167-174)Online publication date: 20-Jun-1999
  • (1998)Text segmentation with multiple surface linguistic cuesProceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 210.3115/980691.980714(881-885)Online publication date: 10-Aug-1998
  • (1994)Improving language models by clustering training sentencesProceedings of the fourth conference on Applied natural language processing10.3115/974358.974372(59-64)Online publication date: 13-Oct-1994

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media