[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1883891.1883903guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A machine learning approach to speech act classification using function words

Published: 23 June 2010 Publication History

Abstract

This paper presents a novel technique for the classification of sentences as Dialogue Acts, based on structural information contained in function words. It focuses on classifying questions or non-questions as a generally useful task in agent-based systems. The proposed technique extracts salient features by replacing function words with numeric tokens and replacing each content word with a standard numeric wildcard token. The Decision Tree, which is a well-established classification technique, has been chosen for this work. Experiments provide evidence of potential for highly effective classification, with a significant achievement on a challenging dataset, before any optimisation of feature extraction has taken place.

References

[1]
Keizer, S.: A Bayesian Approach to Dialogue Act Classification. In: BI-DIALOG 2001 the 5th Workshop on Formal Semantics and Pragmatics of Dialogue, pp. 88-94. ZiF, Bielefeld (2001)
[2]
Webb, N., Hepple, M., Wilks, Y.: Dialogue Act Classification Based on Intra-Utterance Features. In: AAAI 2005. AAAI Press, Pittsburgh (2005)
[3]
Verbree, D., Rienks, R., Heylen, D.: Dialogue-Act Tagging Using Smart Feature Selection; Results On Multiple Corpora. In: IEEE Spoken Language Technology Workshop, pp. 70-73. IEEE Press, New York (2006)
[4]
Venkataraman, A., Stolcke, A., Shriberg, E.: Automatic Dialog Act Labeling With Minimal Supervision. In: 9th Australian International Conference on Speech Science and Technology (2002)
[5]
Serafin, R., Di Eugenio, B., Glass, M.: Latent Semantic Analysis for dialogue act classification. In: The 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (2003)
[6]
Wermter, S., Lochel, M.: Learning Dialog Act Processing. In: COLING 1996, 16th International Conference on Computational Linguistics (1996)
[7]
Li, Y., Bandar, Z., McLean, D., O'Shea, J.: A method for measuring sentence similarity and its application to conversational agents. In: The 17th International Florida Artificial Intelligence Research Society Conference (FLAIRS 2004), pp. 820-825. AAAI Press, Menlo Park (2004)
[8]
Längle, T., Lüth, T., Stopp, E., Herzog, G., Kamstrup, G.: KANTRA - A Natural Language Interface for Intelligent Robots. In: Intelligent Autonomous Systems (IAS 4), pp. 357-364 (1995)
[9]
Bickmore, T., Giorgino, T.: Health dialog systems for patients and consumers. J. Biomed. Inform. 39(5), 556-571 (2006)
[10]
Keizer, S., op den Akker, R., Nijholt, A.: Dialogue Act Recognition with Bayesian Networks for Dutch Dialogues. In: Third SIGdial Workshop on Discourse and Dialogue, pp. 88-94 (2002)
[11]
Crockett, K., Bandar, Z., O'Shea, J., McLean, D.: Bullying and Debt: Developing Novel Applications of Dialogue Systems. In: Knowledge and Reasoning in Practical Dialogue Systems (IJCAI), Pasadena (2009)
[12]
van Rijsbergen, C.: Information Retrieval. Butterworths, Boston (1980)
[13]
Sanderson, M.: http://ftp.dcs.glasgow.ac.uk/idom/ir_resources/ linguistic_utils/stop_words
[14]
Spärck-Jones, K.: A Statistical Interpretation of Term Specificity and its Application in Retrieval. Journal of Documentation 28, 11-21 (1972)
[15]
Salton, G., Wong, A., Yang, C.: A Vector Space Model for Automatic Indexing. Communications of the ACM 18(11), 613-620 (1975)
[16]
Deerwester, S., Dumais, S., Furnas, G., Harshman, R., Landauer, T., Lochbaum, K., Streeter, L.: Computer information retrieval using Latent Semantic Structure. Bell Communications Research Inc. U.S.P. Office (1989)
[17]
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by Latent Semantic Analysis. Journal of the American Society of Information Science 41(6), 391-407 (1990)
[18]
Bollacker, K., Lawrence, S., Giles, C.: CiteSeer: An Autonomous Web Agent for Automatic Retrieval and Identification of Interesting Publications. In: 2nd International ACM Conference on Autonomous Agents, pp. 116-123. ACM Press, New York (1998)
[19]
Li, Y., Bandar, Z., McLean, D., O'Shea, J.: Sentence Similarity Based on Semantic Nets and Corpus Statistics. IEEE Transactions on Knowledge and Data Engineering 18(8), 1138-1150 (2006)
[20]
Islam, A., Inkpen, D.: Semantic Text Similarity using Corpus-Based Word Similarity and String Similarity. ACM Transactions on Knowledge Discovery from Data 2(2), 1-25 (2008)
[21]
BBC, http://news.bbc.co.uk/1/hi/health/8166444.stm
[22]
Quinlan, J.: C4.5: programs for machine learning. Morgan Kaufmann Publishers, San Mateo (1993)
[23]
Witten, I., Eibe, F.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Elsevier, San Francisco (2005)
[24]
Aleksander, I., Morton, H.: Introduction to Neural Computing. International Thomson Computer Press (1995)
[25]
Quinlan, J.: Induction of Decision Trees. Machine Learning 1(1), 81-106 (1986)
[26]
Fong, T., Grange, S., Thorpe, C., Baur, C.: Multi-robot remote driving with collaborative control. In: IEEE International Workshop on Robot-Human Interactive Communication (2001)

Cited By

View all
  • (2018)Arabic Speech Act Recognition TechniquesACM Transactions on Asian and Low-Resource Language Information Processing10.1145/317057617:3(1-12)Online publication date: 13-Feb-2018
  • (2014)A new benchmark dataset with production methodology for short text semantic similarity algorithmsACM Transactions on Speech and Language Processing (TSLP)10.1145/253704610:4(1-63)Online publication date: 3-Jan-2014
  • (2012)A multi-classifier approach to dialogue act classification using function wordsTransactions on Computational Collective Intelligence VII10.5555/2363384.2363390(119-143)Online publication date: 1-Jan-2012
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
KES-AMSTA'10: Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
June 2010
411 pages
ISBN:3642135404

Sponsors

  • Gdynia Maritime University

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 23 June 2010

Author Tags

  1. classification
  2. decision tree
  3. dialogue act
  4. semantic similarity
  5. speech act

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Arabic Speech Act Recognition TechniquesACM Transactions on Asian and Low-Resource Language Information Processing10.1145/317057617:3(1-12)Online publication date: 13-Feb-2018
  • (2014)A new benchmark dataset with production methodology for short text semantic similarity algorithmsACM Transactions on Speech and Language Processing (TSLP)10.1145/253704610:4(1-63)Online publication date: 3-Jan-2014
  • (2012)A multi-classifier approach to dialogue act classification using function wordsTransactions on Computational Collective Intelligence VII10.5555/2363384.2363390(119-143)Online publication date: 1-Jan-2012
  • (2011)Using a slim function word classifier to recognise instruction dialogue actsProceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications10.5555/2023144.2023150(26-34)Online publication date: 29-Jun-2011

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media