[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1743384.1743453acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
demonstration

Controlling your TV with gestures

Published: 29 March 2010 Publication History

Abstract

Vision-based user interfaces enable natural interaction modalities such as gestures. Such interfaces require computationally intensive video processing at low latency. We demonstrate an application that recognizes gestures to control TV operations. Accurate recognition is achieved by using a new descriptor called MoSIFT, which explicitly encodes optical flow with appearance features. MoSIFT is computationally expensive - a sequential implementation runs 100 times slower than real time. To reduce latency sufficiently for interaction, the application is implemented on a runtime system that exploits the parallelism inherent in video understanding applications.

References

[1]
D. J. Abadi, Y. Ahmad, M. Balazinska, U. Çetintemel, M. Cherniack, J. Hwang, W. Lindner, A. S. Maskey, A. Rasin, E. Ryvkina, N. Tatbul, Y. Xing, and S. Zdonik. The design of the Borealis stream processing engine. In Proc. Innovative Data Systems Research, 2005.
[2]
J. K. Aggarwal and Q. Cai. Human motion analysis: a review. In Proc. Nonrigid and Articulated Motion Workshop, 1997.
[3]
L. Amini, H. Andrade, R. Bhagwan, F. Eskesen, R. King, P. Selo, Y. Park, and C. Venkatramani. SPC: A distributed, scalable platform for data mining. Workshop on Data Mining Standards, Services, and Platforms, 2006.
[4]
J. Brady. A theory of productivity in the creative process. IEEE Computer Graphics and Applications, 6(5):25--34, May 1986.
[5]
S. K. Card, G. G. Robertson, and J. D. Mackinlay. The information visualizer, an information workspace. In CHI '91: Human factors in computing systems, 181--186, 1991.
[6]
M.-Y. Chen and A. Hauptmann. Mosift: Recognizing human actions in surveillance videos. In CMU-CS-09-161, 2009.
[7]
M.-Y. Chen; L. Mummert; P. Pillai; A. Hauptmann; R. Sukthankar, Exploiting Multi-level Parallelism for Low-latency Activity Recognition in Streaming Video; Proc. ACM Multimedia Systems (MMSys) Conference, 2010,
[8]
M. Cherniack, H. Balakrishnan, M. Balazinska, D. Carney, U. Çetintemel, Y. Xing, and S. Zdonik. Scalable distributed stream processing. In Proc. Innovative Data Systems Research, 2003.
[9]
J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. CACM, 51(1), 2008.
[10]
P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In IEEE Workshop on PETS, 2005.
[11]
M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: distributed data-parallel programs from sequential building blocks. Proc. European Conference on Computer Systems, 2007.
[12]
Y. Ke, R. Sukthankar, and M. Hebert. Efficient visual event detection using volumetric features. Proc. Int'l Conference on Computer Vision, 2005.
[13]
I. Laptev and T. Lindeberg. Space-time interest points. In Proc. Int'l Conference on Computer Vision, 2003.
[14]
I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld. Learning realistic human actions from movies. In Proc. Computer Vision and Pattern Recognition, 2008.
[15]
D. Lowe. Distinctive image features form scale-invariant keypoints. Int'l Journal on Computer Vision, 60(2), 2004.
[16]
Microsoft, "Project Natal in detail". Microsoft. June 2009. http://www.xbox.com/en-GB/news-features/news/Project-Natal-in-detail-050609.htm. Retrieved Jan 26, 2010.
[17]
R. B. Miller. Response time in man-computer conversational transactions. In AFIPS '68: Proc. of the Dec. 9-11, 1968, joint computer conference (Fall, part I), pages 267--277, 1968.
[18]
J. C. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. In Proc. British Machine Vision Conference, 2006.
[19]
P. Pillai, L. Mummert, S. Schlosser, R. Sukthankar, and C. Helfrich. SLIPStream: scalable low-latency interactive perception on streaming data. In Proc. NOSSDAV, 2009.
[20]
K. Schindler and L. Van Gool. Action snippets: How many frames does human action recognition require? In Proc. Computer Vision and Pattern Recognition, 2008.
[21]
C. Schuldt, I. Laptev, and B. Caputo. Recognizing human actions: A local SVM approach. In Proc. ICPR, 2004.
[22]
J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid. Local features and kernels for classification of texture and object categories: A comprehensive study. Int'l Journal on Computer Vision, 73(2), 2007.

Cited By

View all
  • (2023)Dance Gestures Recognition for Wheelchair Control2023 8th International Conference on Control and Robotics Engineering (ICCRE)10.1109/ICCRE57112.2023.10155605(84-90)Online publication date: 21-Apr-2023
  • (2020)Mid-Air Gesture Control of Multiple Home Devices in Spatial Augmented Reality PrototypeMultimodal Technologies and Interaction10.3390/mti40300614:3(61)Online publication date: 31-Aug-2020
  • (2019)The Impact of Comfortable Viewing Positions on Smart TV Gestures2019 International Conference on Information Systems and Computer Science (INCISCOS)10.1109/INCISCOS49368.2019.00054(296-303)Online publication date: Nov-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MIR '10: Proceedings of the international conference on Multimedia information retrieval
March 2010
600 pages
ISBN:9781605588155
DOI:10.1145/1743384
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 March 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cluster applications
  2. computational perception
  3. multimedia
  4. parallel computing
  5. sensing
  6. stream processing

Qualifiers

  • Demonstration

Conference

MIR '10
Sponsor:
MIR '10: International Conference on Multimedia Information Retrieval
March 29 - 31, 2010
Pennsylvania, Philadelphia, USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Dance Gestures Recognition for Wheelchair Control2023 8th International Conference on Control and Robotics Engineering (ICCRE)10.1109/ICCRE57112.2023.10155605(84-90)Online publication date: 21-Apr-2023
  • (2020)Mid-Air Gesture Control of Multiple Home Devices in Spatial Augmented Reality PrototypeMultimodal Technologies and Interaction10.3390/mti40300614:3(61)Online publication date: 31-Aug-2020
  • (2019)The Impact of Comfortable Viewing Positions on Smart TV Gestures2019 International Conference on Information Systems and Computer Science (INCISCOS)10.1109/INCISCOS49368.2019.00054(296-303)Online publication date: Nov-2019
  • (2019)How to Increase Older Adults’ Accessibility to Mobile Technology? The New ECOMODE CameraAmbient Assisted Living10.1007/978-3-030-04672-9_6(85-98)Online publication date: 31-Jan-2019
  • (2018)Human-Computer Interaction System Using 2D and 3D Hand Gestures2018 International Conference on Emerging Trends and Innovations In Engineering And Technological Research (ICETIETR)10.1109/ICETIETR.2018.8529064(1-4)Online publication date: Jul-2018
  • (2018)SEQUENCEPersonal and Ubiquitous Computing10.1007/s00779-018-1129-222:4(751-770)Online publication date: 1-Aug-2018
  • (2018)Potential of Wearable Technology for Super-Aging SocietiesDistributed, Ambient and Pervasive Interactions: Technologies and Contexts10.1007/978-3-319-91131-1_17(214-226)Online publication date: 30-May-2018
  • (2017)Remote Control by Body Movement in Synchrony with Orbiting WidgetsProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/31309101:3(1-22)Online publication date: 11-Sep-2017
  • (2017)MatchPointProceedings of the 30th Annual ACM Symposium on User Interface Software and Technology10.1145/3126594.3126626(179-192)Online publication date: 20-Oct-2017
  • (2017)Evaluation of in-air hand gestures interaction for older peopleUniversal Access in the Information Society10.1007/s10209-016-0483-y16:3(561-580)Online publication date: 1-Aug-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media