Voice activated command and control with Java-enabled speech recognition over Wifi
Pages 114 - 119
Abstract
In this paper we present recent research in the development of a voice enabled command and control application, using Java, in which a remote robotic device, with a JVM, is controlled over a Wifi/WLAN/TCPIP network from a mobile handheld PC (iPaq) or a PC workstation. The research considers three different scenario configurations. A recognition grammar for command and control of the robot has been created and implemented in Java, in part in the recognition engine and in part on the robot. The robot is a proxy for one of a number of JVM enabled intelligent devices that can be controlled in this manner. The physical topology involves Java at each node endpoint, that is, at the handheld PC (iPaq), the PC workstation, the Linux server and onboard the robot (including its Java based Lejos OS). Network communications is primarily WLAN with an element of IR and the possibility of BlueTooth (subject to the resolution of certain important Java and Bluetooth issues). For placement of a speech recognition engine on Linux we are initially using Sphinx through the Java Native Interface but plan to develop our own speech recognition engine, in Java, for deployment on Linux. Traditionally speech processing in software has resided in the domain of C software development. Recent advances in Java with the Java Speech API, along with the Java Media Framework, have facilitated the consideration of Java for speech and language engineering activities. Our hypothesis is that Java is now a suitable platform for speech and language engineering in software. The evidence provided by our initial results, for Java speech recognition, achieved using the Cloudgarden implementation of JSAPI, validates this hypothesis. We outline our future plans to move our framework more fully into the ubiquitous computing arena and extend the reach of Java and speech processing in support of command and control a structured way, including, for example, camera feedback to base.
References
[1]
Gartner Group, Research on PDA and mobile phone sales by 2005, {online at} http://www.info-edge.com/samples/EM-2058sam.pdf, (15/1/2004)
[2]
Sun Microsystems Ltd, Java Speech API, {online at} http://java.sun.com/products/java-media/speech/
[3]
Sun Microsystems Ltd, Java Speech Markup Langauge Specification, {online at} http://java.sun.com/products/java-media/speech/forDevelopers/JSML/(21/1/2004)
[4]
Sun Microsystems Ltd, Java Speech Grammar Format Specification, {online at} http://java.sun.com/products/java-media/speech/forDevelopers/JSGF/ (21/1/2004)
[5]
Kinnersley J, Cloudgarden Java Speech Api Implementation, {online at} http://www.cloudgarden.com (20/1/2004)
[6]
Microsoft Corporation, Microsoft Speech and SAPI 5, {online at} http://www.microsoft.com/speech/
[7]
IBM, Via Voice {online at} http://www-306.ibm.com/software/voice/viavoice/ (15/1/2004)
[8]
Phillips, Phillips Speech SDK 2.0, {online at} http://www.speech.philips.com/ (20/1/2004)
[9]
ScanSoft, Dragon Naturally Speaking, {online at} http://www.scansoft.com/naturally speaking/ (16/3/2004)
[10]
Sun Microsystems Ltd, Java 2 Micro Edition: Personal Java {online at} http://java.sun.com/products/cdc/index.jsp
[11]
IEEE, 802.11 Standard, {online at} http://grouper.ieee.org/groups/802/11/main.html (15/1/2004)
[12]
Lenzo K, CMU Sphinx {online at} http://www.speech.cs.cmu.edu/sphinx/index.html (15/1/2004)
[13]
Huang X, Alleva F, Hon H, Hwang M, Rosenfeld R, The Sphinx II Speech Recognition System: An Overview, Computer Speech and Language, 7(2):137--148, 1993.
[14]
Lejos, Lejos, {online at} http:///lejos.sourceforge.net (15/1/2004)
[15]
Technologic Systems, TS-5500, {online at} http://www.embeddedx86.com/epc/ts5500-spec.php
[16]
Acunia, Wonka Embedded JVM, {online at} http://wonka.acunia.com (28/1/2004)
Index Terms
- Voice activated command and control with Java-enabled speech recognition over Wifi
Index terms have been assigned to the content through auto-classification.
Recommendations
Voice activated command and control with speech recognition over WiFi
Special issue: Principles and practices of programming in Java (PPPJ 2004)This paper presents work conducted to date on the development of a voice activated command and control framework specifically for the control of remote devices in a ubiquitous computing environment. The prototype device is a Java controlled Lego ...
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In
Publisher
Trinity College Dublin
Publication History
Published: 16 June 2004
Check for updates
Qualifiers
- Article
Conference
PPPJ04
PPPJ04: International Conference on Principle and Practice of Programming in Java
June 16 - 18, 2004
Nevada, Las Vegas, USA
Acceptance Rates
Overall Acceptance Rate 29 of 58 submissions, 50%
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 1,540Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Reflects downloads up to 01 Jan 2025
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in