[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1073083.1073117dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Entropy rate constancy in text

Published: 06 July 2002 Publication History

Abstract

We present a constancy rate principle governing language generation. We show that this principle implies that local measures of entropy (ignoring context) should increase with the sentence number. We demonstrate that this is indeed the case by measuring entropy in three different ways. We also show that this effect has both lexical (which words are used) and non-lexical (how the words are used) causes.

References

[1]
M. P. Aylett. 1999. Stochastic suprasegmentals: Relationships between redundancy, prosodic structure and syllabic duration. In Proceedings of ICPhS--99, San Francisco.
[2]
E. Charniak. 2001. A maximum-entropy-inspired parser. In Proceedings of ACL--2001, Toulouse.
[3]
J. T. Goodman. 2001. A bit of progress in language modeling. Computer Speech and Language, 15:403--434.
[4]
I. Kontoyiannis, P. H. Algoet, Yu. M. Suhov, and A. J. Wyner. 1998. Nonparametric entropy estimation for stationary processes and random fields, with applications to English text. IEEE Trans. Inform. Theory, 44:1319--1327, May.
[5]
I. Kontoyiannis. 1996. The complexity and entropy of literary styles. NSF Technical Report No. 97, Department of Statistics, Stanford University, June. {unpublished, can be found at the author's web page}.
[6]
R. Kuhn and R. De Mori. 1990. A cache-based natural language model for speech reproduction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 12(6):570--583.
[7]
M. P. Marcus, B. Santorini, and M. A. Marcinkiewicz. 1993. Building a large annotated corpus of English: the Penn treebank. Computational Linguistics, 19:313--330.
[8]
J. B. Plotkin and M. A. Nowak. 2000. Language evolution and information theory. Journal of Theoretical Biology, pages 147--159.
[9]
C. E. Shannon. 1948. A mathematical theory of communication. The Bell System Technical Journal, 27:379--423, 623--656, July, October.

Cited By

View all
  • (2024)ARTiST: Automated Text Simplification for Task Guidance in Augmented RealityProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642772(1-24)Online publication date: 11-May-2024
  • (2023)FACEProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3666867(17038-17056)Online publication date: 10-Dec-2023
  • (2022)Predicting Backchannel Signaling in Child-Caregiver Multimodal ConversationsCompanion Publication of the 2022 International Conference on Multimodal Interaction10.1145/3536220.3563372(196-200)Online publication date: 7-Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
July 2002
543 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 06 July 2002

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)9
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)ARTiST: Automated Text Simplification for Task Guidance in Augmented RealityProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642772(1-24)Online publication date: 11-May-2024
  • (2023)FACEProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3666867(17038-17056)Online publication date: 10-Dec-2023
  • (2022)Predicting Backchannel Signaling in Child-Caregiver Multimodal ConversationsCompanion Publication of the 2022 International Conference on Multimodal Interaction10.1145/3536220.3563372(196-200)Online publication date: 7-Nov-2022
  • (2020)Calibration, entropy rates, and memory in language modelsProceedings of the 37th International Conference on Machine Learning10.5555/3524938.3525040(1089-1099)Online publication date: 13-Jul-2020
  • (2017)Automatic evaluation of learning objects based on cross-entropy of eye fixations minimizationProceedings of the XVIII International Conference on Human Computer Interaction10.1145/3123818.3123872(1-4)Online publication date: 25-Sep-2017
  • (2016)Information density and overlap in spoken dialogueComputer Speech and Language10.1016/j.csl.2015.11.00137:C(82-97)Online publication date: 1-May-2016
  • (2014)Representatively memorableProceedings of the SIGCHI Conference on Human Factors in Computing Systems10.1145/2556288.2557024(1709-1712)Online publication date: 26-Apr-2014
  • (2014)Readability Classification of Bangla TextsProceedings of the 15th International Conference on Computational Linguistics and Intelligent Text Processing - Volume 840410.1007/978-3-642-54903-8_42(507-518)Online publication date: 6-Apr-2014
  • (2012)Optimising incremental dialogue decisions using information density for interactive systemsProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2390959(82-93)Online publication date: 12-Jul-2012
  • (2010)Why are some word orders more common than others? A uniform information density accountProceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 210.5555/2997046.2997073(1585-1593)Online publication date: 6-Dec-2010
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media