Abstract
A natural language interface (NLI) enables efficient and effective interaction by allowing a user to submit a single natural-language phrase to the system. Freehand gestures can be added to an NLI to specify the referents of deictic terms in speech. By combining an NLI with other modalities into a multimodal user interface, speech utterances can be shortened, and users need not specify referents explicitly in words. Integrating deictic terms with deictic gestures is therefore a critical function of a multimodal user interface. This paper presents a novel approach that extends the chart parsing used in natural language processing (NLP) to integrate multimodal input based on speech and manual deictic gestures. The effectiveness of the technique has been validated through experiments using a traffic incident management scenario, in which an operator interacts with a map on a large display at a distance and issues multimodal commands through speech and manual gestures. A preliminary experiment with the proposed algorithm shows encouraging results.
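The paper's full chart-parsing extension is not reproduced in this abstract. As a rough illustration of the integration problem it addresses, the sketch below shows one simplified ingredient: binding a spoken deictic term to the temporally closest pointing gesture. This is a minimal sketch under assumed data structures; the names (SpeechToken, GestureEvent, bind_deictics) and the max_gap threshold are illustrative assumptions, not the authors' algorithm.

```python
from dataclasses import dataclass

@dataclass
class SpeechToken:
    word: str
    time: float  # seconds from utterance start (assumed timestamping)

@dataclass
class GestureEvent:
    referent: str  # object id resolved from the pointing location
    time: float

DEICTIC_TERMS = {"this", "that", "here", "there", "these", "those"}

def bind_deictics(tokens, gestures, max_gap=1.5):
    """Attach the temporally closest gesture referent to each deictic term.

    max_gap is an assumed threshold (seconds) beyond which a gesture is
    not treated as co-occurring with the spoken deictic.
    """
    bindings = {}
    for i, tok in enumerate(tokens):
        if tok.word.lower() not in DEICTIC_TERMS or not gestures:
            continue
        # Pick the gesture whose timestamp is nearest to the deictic word.
        nearest = min(gestures, key=lambda g: abs(g.time - tok.time))
        if abs(nearest.time - tok.time) <= max_gap:
            bindings[i] = nearest.referent
    return bindings

# Example: "close that incident" with a pointing gesture near "that".
tokens = [SpeechToken("close", 0.2), SpeechToken("that", 0.7),
          SpeechToken("incident", 1.1)]
gestures = [GestureEvent("incident_17", 0.8)]
print(bind_deictics(tokens, gestures))  # {1: 'incident_17'}
```

In the paper's approach this kind of binding is carried out inside an extended chart parser, so that gesture referents fill deictic slots during parsing rather than in a separate post-processing pass as sketched here.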
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
Cite this paper
Sun, Y., Chen, F., Shi, Y., Chung, V. (2007). An Input-Parsing Algorithm Supporting Integration of Deictic Gesture in Natural Language Interface. In: Jacko, J.A. (eds) Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments. HCI 2007. Lecture Notes in Computer Science, vol 4552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73110-8_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73108-5
Online ISBN: 978-3-540-73110-8