[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1075096.1075158dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Learning to predict pitch accents and prosodic boundaries in Dutch

Published: 07 July 2003 Publication History

Abstract

We train a decision tree inducer (CART) and a memory-based classifier (MBL) on predicting prosodic pitch accents and breaks in Dutch text, on the basis of shallow, easy-to-compute features. We train the algorithms on both tasks individually and on the two tasks simultaneously. The parameters of both algorithms and the selection of features are optimized per task with iterative deepening, an efficient wrapper procedure that uses progressive sampling of training data. Results show a consistent significant advantage of MBL over CART, and also indicate that task combination can be done at the cost of little generalization score loss. Tests on cross-validated data and on held-out data yield F-scores of MBL on accent placement of 84 and 87, respectively, and on breaks of 88 and 91, respectively. Accent placement is shown to outperform an informed baseline rule; reliably predicting breaks other than those already indicated by intra-sentential punctuation, however, appears to be more challenging.

References

[1]
D. W. Aha, D. Kibler, and M. Albert. 1991. Instance-based learning algorithms. Machine Learning, 6:37--66.
[2]
A. W. Black. 1995. Comparison of algorithms for predicting pitch accent placement in English speech synthesis. In Proceedings of the Spring Meeting of the Acoustical Society of Japan.
[3]
L. Breiman, J. Friedman, R. Ohlsen, and C. Stone. 1984. Classification and regression trees. Wadsworth International Group, Belmont, CA.
[4]
C. J. van Rijsbergen. 1979. Information Retrieval. Buttersworth, London.
[5]
T. M. Cover and P. E. Hart. 1967. Nearest neighbor pattern classification. Institute of Electrical and Electronics Engineers Transactions on Information Theory, 13:21--27.
[6]
A. Cutler, D. Dahan, and W. A. Van Donselaar. 1997. Prosody in the comprehension of spoken language: A literature review. Language and Speech, 40(2):141--202.
[7]
W. Daelemans and V. Hoste. 2002. Evaluation of machine learning methods for natural language processing tasks. In Proceedings of LREC-2002, the third International Conference on Language Resources and Evaluation, pages 755--760.
[8]
W. Daelemans, J. Zavrel, P. Berck, and S. Gillis. 1996. MBT: A memory-based part of speech tagger generator. In E. Ejerhed and I. Dagan, editors, Proc. of Fourth Workshop on Very Large Corpora, pages 14--27. ACL SIGDAT.
[9]
W. Daelemans, A. van den Bosch, and J. Zavrel. 1999. Forgetting exceptions is harmful in language learning. Machine Learning, Special issue on Natural Language Learning, 34:11--41.
[10]
W. Daelemans, J. Zavrel, K. van der Sloot, and A. van den Bosch. 2002. TiMBL: Tilburg Memory Based Learner, version 4.3, reference guide. Technical Report ILK-0210, ILK, Tilburg University.
[11]
E. Fix and J. L. Hodges. 1951. Discriminatory analysis---nonparametric discrimination; consistency properties. Technical Report Project 21-49-004, Report No. 4, USAF School of Aviation Medicine.
[12]
J. Hirschberg. 1993. Pitch accent in context: Predicting intonational prominence from text. Artificial Intelligence, 63:305--340.
[13]
P. Koehn, S. Abney, J. Hirschberg, and M. Collins. 2000. Improving intonational phrasing with syntactic information. In ICASSP, pages 1289--1290.
[14]
R. Kohavi and G. John. 1997. Wrappers for feature subset selection. Artificial Intelligence Journal, 97(1--2):273--324.
[15]
D. R. Ladd. 1996. Intonational phonology. Cambridge University Press.
[16]
E. Marsi, G. J. Busser, W. Daelemans, V. Hoste, M. Reynaert, and A. van den Bosch. 2002. Combining information sources for memory-based pitch accent placement. In Proceedings of the International Conference on Spoken Language Processing, ICSLP-2002, pages 1273--1276.
[17]
S. Pan and J. Hirschberg. 2000. Modeling local context for pitch accent prediction. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, Hong Kong.
[18]
S. Pan and K. McKeown. 1999. Word informativeness and automatic pitch accent modeling. In Proceedings of EMNLP/VLC'99, New Brunswick, NJ, USA. ACL.
[19]
F. Provost, D. Jensen, and T. Oates. 1999. Efficient progressive sampling. In Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining, pages 23--32.
[20]
G. Salton. 1989. Automatic text processing: The transformation, analysis, and retrieval of information by computer. Addison--Wesley, Reading, MA, USA.
[21]
C. Stanfill and D. Waltz. 1986. Toward memory-based reasoning. Communications of the ACM, 29(12):1213--1228, December.
[22]
P. Taylor and A. Black. 1998. Assigning phrase breaks from part-of-speech sequences. Computer Speech and Language, 12:99--117.
[23]
P. Taylor, R. Caley, A. W. Black, and S. King, 1999. Edinburgh Speech Tools Library, System Documentation Edition 1.2. CSTR, University of Edinburgh.
[24]
O. van Herwijnen and J. Terken. 2001. Evaluation of pros-3 for the assignment of prosodic structure, compared to assignment by human experts. In Proceedings Eurospeech 2001 Scandinavia, Vol.1, pages 529--532.
[25]
M. Q. Wang and J. Hirschberg. 1997. Automatic classification of intonational phrasing boundaries. Computer Speech and Language, 6(2):175--196.

Cited By

View all
  • (2009)Frequency mattersProceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics10.5555/1609067.1609148(728-736)Online publication date: 30-Mar-2009
  • (2004)New statistical methods for phrase break predictionProceedings of the 20th international conference on Computational Linguistics10.3115/1220355.1220450(659-es)Online publication date: 23-Aug-2004
  • (2004)Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systemsProceedings of the 42nd Annual Meeting on Association for Computational Linguistics10.3115/1218955.1218999(343-es)Online publication date: 21-Jul-2004

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
July 2003
571 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 07 July 2003

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)47
  • Downloads (Last 6 weeks)12
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2009)Frequency mattersProceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics10.5555/1609067.1609148(728-736)Online publication date: 30-Mar-2009
  • (2004)New statistical methods for phrase break predictionProceedings of the 20th international conference on Computational Linguistics10.3115/1220355.1220450(659-es)Online publication date: 23-Aug-2004
  • (2004)Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systemsProceedings of the 42nd Annual Meeting on Association for Computational Linguistics10.3115/1218955.1218999(343-es)Online publication date: 21-Jul-2004

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media