[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Kubota et al., 2008 - Google Patents

Design and implementation of 3d auditory scene visualizer towards auditory awareness with face tracking

Kubota et al., 2008

View PDF
Document ID
7292380292757120936
Author
Kubota Y
Yoshida M
Komatani K
Ogata T
Okuno H
Publication year
Publication venue
2008 Tenth IEEE International Symposium on Multimedia

External Links

Snippet

If machine audition can recognize an auditory scene containing simultaneous and moving talkers, what kinds of awareness will people gain from an auditory scene visualizer? This paper presents the design and implementation of 3D Auditory Scene Visualizer based on …
Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel

Similar Documents

Publication Publication Date Title
Funt et al. Color constancy computation in near-Mondrian scenes using a finite dimensional linear model
US12089026B2 (en) Processing segments or channels of sound with HRTFs
Yang et al. Telling left from right: Learning spatial correspondence of sight and sound
KR102694487B1 (en) Systems and methods supporting selective listening
Donley et al. Easycom: An augmented reality dataset to support algorithms for easy communication in noisy environments
McCowan et al. Automatic analysis of multimodal group actions in meetings
US7876914B2 (en) Processing audio data
KR20150006799A (en) Audio processing apparatus
GB2342802A (en) Indexing conference content onto a timeline
US20230164509A1 (en) System and method for headphone equalization and room adjustment for binaural playback in augmented reality
US11496830B2 (en) Methods and systems for recording mixed audio signal and reproducing directional audio
Kubota et al. Design and implementation of 3d auditory scene visualizer towards auditory awareness with face tracking
US11513762B2 (en) Controlling sounds of individual objects in a video
JP2004198656A (en) Robot audio-visual system
Berghi et al. Visually supervised speaker detection and localization via microphone array
JP5383056B2 (en) Sound data recording / reproducing apparatus and sound data recording / reproducing method
Nakagawa et al. Using vision to improve sound source separation
Thermos et al. Audio-visual speech activity detection in a two-speaker scenario incorporating depth information from a profile or frontal view
Berghi et al. Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization
Pingali et al. Audio-visual tracking for natural interactivity
Kubota et al. 3d auditory scene visualizer with face tracking: Design and implementation for auditory awareness compensation
JP3843743B2 (en) Robot audio-visual system
Berghi et al. Audio inputs for active speaker detection and localization via microphone array
Berghi Audio-Visual Detection and Localisation of Speech and Sound Events
WO2021206679A1 (en) Audio-visual multi-speacer speech separation