Nakadai et al., 2002 - Google Patents
Real-time speaker localization and speech separation by audio-visual integration
- Document ID
- 10861917188262159240
- Author
- Nakadai K
- Hidai K
- Okuno H
- Kitano H
- Publication year
- 2002
- Publication venue
- Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No. 02CH37292)
Snippet
Robot audition in the real world should cope with motor and other noises caused by the robot's own movements, in addition to environmental noise and reverberation. This paper reports how auditory processing is improved by audio-visual integration with active movements. The …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Nakadai et al. | Real-time speaker localization and speech separation by audio-visual integration | |
Nakadai et al. | Real-time sound source localization and separation for robot audition. | |
Nakadai et al. | Real-time auditory and visual multiple-object tracking for humanoids | |
Nakadai et al. | Improvement of recognition of simultaneous speech signals using av integration and scattering theory for humanoid robots | |
Takeda et al. | Sound source localization based on deep neural networks with directional activate function exploiting phase information | |
EP1818909B1 (en) | Voice recognition system | |
Okuno et al. | Human-robot interaction through real-time auditory and visual multiple-talker tracking | |
EP1691344B1 (en) | Speech recognition system | |
Aarabi et al. | Robust sound localization using multi-source audiovisual information fusion | |
Nakamura et al. | Intelligent sound source localization and its application to multimodal human tracking | |
US20090030552A1 (en) | Robotics visual and auditory system | |
Nakadai et al. | Epipolar geometry based sound localization and extraction for humanoid audition | |
Okuno et al. | Social interaction of humanoid robot based on audio-visual tracking | |
KR20060029043A (en) | Apparatus and method for object localization, tracking, and separation using audio and video sensors | |
CN110517705A (en) | Binaural sound source localization method and system based on deep neural network and convolutional neural network | |
Yamamoto et al. | Improvement of robot audition by interfacing sound source separation and automatic speech recognition with missing feature theory | |
JP3632099B2 (en) | Robot audio-visual system | |
CN103901400B (en) | Binaural sound source localization method based on delay compensation and binaural consistency | |
Cho et al. | Sound source localization for robot auditory systems | |
Yamamoto et al. | Assessment of general applicability of robot audition system by recognizing three simultaneous speeches | |
Murase et al. | Multiple moving speaker tracking by microphone array on mobile robot. | |
Okuno et al. | Computational auditory scene analysis and its application to robot audition | |
Wang et al. | A deconvolutive neural network for speech classification with applications to home service robot | |
Okuno et al. | Sound and visual tracking for humanoid robot | |
Nakadai et al. | Exploiting auditory fovea in humanoid-human interaction |