Kubota et al., 2008 - Google Patents

Design and implementation of 3d auditory scene visualizer towards auditory awareness with face tracking

Kubota et al., 2008

Document ID: 7292380292757120936
Author: Kubota Y; Yoshida M; Komatani K; Ogata T; Okuno H
Publication year: 2008
Publication venue: 2008 Tenth IEEE International Symposium on Multimedia

External Links

Cited by

Snippet

If machine audition can recognize an auditory scene containing simultaneous and moving talkers, what kinds of awareness will people gain from an auditory scene visualizer? This paper presents the design and implementation of 3D Auditory Scene Visualizer based on …

Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

230000000007 visual effect 0 abstract description 7

Classifications

- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel

Similar Documents

Publication	Publication Date	Title
Funt et al.	1988	Color constancy computation in near-Mondrian scenes using a finite dimensional linear model
US12089026B2 (en)	2024-09-10	Processing segments or channels of sound with HRTFs
Yang et al.	2020	Telling left from right: Learning spatial correspondence of sight and sound
KR102694487B1 (en)	2024-08-13	Systems and methods supporting selective listening
Donley et al.	2021	Easycom: An augmented reality dataset to support algorithms for easy communication in noisy environments
McCowan et al.	2005	Automatic analysis of multimodal group actions in meetings
US7876914B2 (en)	2011-01-25	Processing audio data
KR20150006799A (en)	2015-01-19	Audio processing apparatus
GB2342802A (en)	2000-04-19	Indexing conference content onto a timeline
US20230164509A1 (en)	2023-05-25	System and method for headphone equalization and room adjustment for binaural playback in augmented reality
US11496830B2 (en)	2022-11-08	Methods and systems for recording mixed audio signal and reproducing directional audio
Kubota et al.	2008	Design and implementation of 3d auditory scene visualizer towards auditory awareness with face tracking
US11513762B2 (en)	2022-11-29	Controlling sounds of individual objects in a video
JP2004198656A (en)	2004-07-15	Robot audio-visual system
Berghi et al.	2021	Visually supervised speaker detection and localization via microphone array
JP5383056B2 (en)	2014-01-08	Sound data recording / reproducing apparatus and sound data recording / reproducing method
Nakagawa et al.	1999	Using vision to improve sound source separation
Thermos et al.	2016	Audio-visual speech activity detection in a two-speaker scenario incorporating depth information from a profile or frontal view
Berghi et al.	2023	Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization
Pingali et al.	1999	Audio-visual tracking for natural interactivity
Kubota et al.	2008	3d auditory scene visualizer with face tracking: Design and implementation for auditory awareness compensation
JP3843743B2 (en)	2006-11-08	Robot audio-visual system
Berghi et al.	2023	Audio inputs for active speaker detection and localization via microphone array
Berghi	2024	Audio-Visual Detection and Localisation of Speech and Sound Events
WO2021206679A1 (en)	2021-10-14	Audio-visual multi-speacer speech separation