[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1071565.1071586acmotherconferencesArticle/Chapter ViewAbstractPublication PagespppjConference Proceedingsconference-collections
Article

Voice activated command and control with Java-enabled speech recognition over Wifi

Published: 16 June 2004 Publication History

Abstract

In this paper we present recent research in the development of a voice enabled command and control application, using Java, in which a remote robotic device, with a JVM, is controlled over a Wifi/WLAN/TCPIP network from a mobile handheld PC (iPaq) or a PC workstation. The research considers three different scenario configurations. A recognition grammar for command and control of the robot has been created and implemented in Java, in part in the recognition engine and in part on the robot. The robot is a proxy for one of a number of JVM enabled intelligent devices that can be controlled in this manner. The physical topology involves Java at each node endpoint, that is, at the handheld PC (iPaq), the PC workstation, the Linux server and onboard the robot (including its Java based Lejos OS). Network communications is primarily WLAN with an element of IR and the possibility of BlueTooth (subject to the resolution of certain important Java and Bluetooth issues). For placement of a speech recognition engine on Linux we are initially using Sphinx through the Java Native Interface but plan to develop our own speech recognition engine, in Java, for deployment on Linux. Traditionally speech processing in software has resided in the domain of C software development. Recent advances in Java with the Java Speech API, along with the Java Media Framework, have facilitated the consideration of Java for speech and language engineering activities. Our hypothesis is that Java is now a suitable platform for speech and language engineering in software. The evidence provided by our initial results, for Java speech recognition, achieved using the Cloudgarden implementation of JSAPI, validates this hypothesis. We outline our future plans to move our framework more fully into the ubiquitous computing arena and extend the reach of Java and speech processing in support of command and control a structured way, including, for example, camera feedback to base.

References

[1]
Gartner Group, Research on PDA and mobile phone sales by 2005, {online at} http://www.info-edge.com/samples/EM-2058sam.pdf, (15/1/2004)
[2]
Sun Microsystems Ltd, Java Speech API, {online at} http://java.sun.com/products/java-media/speech/
[3]
Sun Microsystems Ltd, Java Speech Markup Langauge Specification, {online at} http://java.sun.com/products/java-media/speech/forDevelopers/JSML/(21/1/2004)
[4]
Sun Microsystems Ltd, Java Speech Grammar Format Specification, {online at} http://java.sun.com/products/java-media/speech/forDevelopers/JSGF/ (21/1/2004)
[5]
Kinnersley J, Cloudgarden Java Speech Api Implementation, {online at} http://www.cloudgarden.com (20/1/2004)
[6]
Microsoft Corporation, Microsoft Speech and SAPI 5, {online at} http://www.microsoft.com/speech/
[7]
IBM, Via Voice {online at} http://www-306.ibm.com/software/voice/viavoice/ (15/1/2004)
[8]
Phillips, Phillips Speech SDK 2.0, {online at} http://www.speech.philips.com/ (20/1/2004)
[9]
ScanSoft, Dragon Naturally Speaking, {online at} http://www.scansoft.com/naturally speaking/ (16/3/2004)
[10]
Sun Microsystems Ltd, Java 2 Micro Edition: Personal Java {online at} http://java.sun.com/products/cdc/index.jsp
[11]
IEEE, 802.11 Standard, {online at} http://grouper.ieee.org/groups/802/11/main.html (15/1/2004)
[12]
Lenzo K, CMU Sphinx {online at} http://www.speech.cs.cmu.edu/sphinx/index.html (15/1/2004)
[13]
Huang X, Alleva F, Hon H, Hwang M, Rosenfeld R, The Sphinx II Speech Recognition System: An Overview, Computer Speech and Language, 7(2):137--148, 1993.
[14]
Lejos, Lejos, {online at} http:///lejos.sourceforge.net (15/1/2004)
[15]
Technologic Systems, TS-5500, {online at} http://www.embeddedx86.com/epc/ts5500-spec.php
[16]
Acunia, Wonka Embedded JVM, {online at} http://wonka.acunia.com (28/1/2004)

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
PPPJ '04: Proceedings of the 3rd international symposium on Principles and practice of programming in Java
June 2004
166 pages
ISBN:1595931716

Publisher

Trinity College Dublin

Publication History

Published: 16 June 2004

Check for updates

Qualifiers

  • Article

Conference

PPPJ04

Acceptance Rates

Overall Acceptance Rate 29 of 58 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 1,540
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media