Kubota et al., 2008 - Google Patents
Design and implementation of 3d auditory scene visualizer towards auditory awareness with face trackingKubota et al., 2008
View PDF- Document ID
- 7292380292757120936
- Author
- Kubota Y
- Yoshida M
- Komatani K
- Ogata T
- Okuno H
- Publication year
- Publication venue
- 2008 Tenth IEEE International Symposium on Multimedia
External Links
Snippet
If machine audition can recognize an auditory scene containing simultaneous and moving talkers, what kinds of awareness will people gain from an auditory scene visualizer? This paper presents the design and implementation of 3D Auditory Scene Visualizer based on …
- 230000000007 visual effect 0 abstract description 7
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Funt et al. | Color constancy computation in near-Mondrian scenes using a finite dimensional linear model | |
US12089026B2 (en) | Processing segments or channels of sound with HRTFs | |
Yang et al. | Telling left from right: Learning spatial correspondence of sight and sound | |
KR102694487B1 (en) | Systems and methods supporting selective listening | |
Donley et al. | Easycom: An augmented reality dataset to support algorithms for easy communication in noisy environments | |
McCowan et al. | Automatic analysis of multimodal group actions in meetings | |
US7876914B2 (en) | Processing audio data | |
KR20150006799A (en) | Audio processing apparatus | |
GB2342802A (en) | Indexing conference content onto a timeline | |
US20230164509A1 (en) | System and method for headphone equalization and room adjustment for binaural playback in augmented reality | |
US11496830B2 (en) | Methods and systems for recording mixed audio signal and reproducing directional audio | |
Kubota et al. | Design and implementation of 3d auditory scene visualizer towards auditory awareness with face tracking | |
US11513762B2 (en) | Controlling sounds of individual objects in a video | |
JP2004198656A (en) | Robot audio-visual system | |
Berghi et al. | Visually supervised speaker detection and localization via microphone array | |
JP5383056B2 (en) | Sound data recording / reproducing apparatus and sound data recording / reproducing method | |
Nakagawa et al. | Using vision to improve sound source separation | |
Thermos et al. | Audio-visual speech activity detection in a two-speaker scenario incorporating depth information from a profile or frontal view | |
Berghi et al. | Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization | |
Pingali et al. | Audio-visual tracking for natural interactivity | |
Kubota et al. | 3d auditory scene visualizer with face tracking: Design and implementation for auditory awareness compensation | |
JP3843743B2 (en) | Robot audio-visual system | |
Berghi et al. | Audio inputs for active speaker detection and localization via microphone array | |
Berghi | Audio-Visual Detection and Localisation of Speech and Sound Events | |
WO2021206679A1 (en) | Audio-visual multi-speacer speech separation |