[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Nakadai et al., 2002 - Google Patents

Real-time speaker localization and speech separation by audio-visual integration

Nakadai et al., 2002

View PDF
Document ID
10861917188262159240
Author
Nakadai K
Hidai K
Okuno H
Kitano H
Publication year
Publication venue
Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No. 02CH37292)

External Links

Snippet

Robot audition in real-world should cope with motor and other noises caused by the robot's own movements in addition to environmental noises and reverberation. This paper reports how auditory processing is improved by audio-visual integration with active movements. The …
Continue reading at www.nue.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features

Similar Documents

Publication Publication Date Title
Nakadai et al. Real-time speaker localization and speech separation by audio-visual integration
Nakadai et al. Real-time sound source localization and separation for robot audition.
Nakadai et al. Real-time auditory and visual multiple-object tracking for humanoids
Nakadai et al. Improvement of recognition of simultaneous speech signals using av integration and scattering theory for humanoid robots
Takeda et al. Sound source localization based on deep neural networks with directional activate function exploiting phase information
EP1818909B1 (en) Voice recognition system
Okuno et al. Human-robot interaction through real-time auditory and visual multiple-talker tracking
EP1691344B1 (en) Speech recognition system
Aarabi et al. Robust sound localization using multi-source audiovisual information fusion
Nakamura et al. Intelligent sound source localization and its application to multimodal human tracking
US20090030552A1 (en) Robotics visual and auditory system
Nakadai et al. Epipolar geometry based sound localization and extraction for humanoid audition
Okuno et al. Social interaction of humanoid robot based on audio-visual tracking
KR20060029043A (en) Apparatus and method for object localization, tracking, and separation using audio and video sensors
CN110517705A (en) A kind of binaural sound sources localization method and system based on deep neural network and convolutional neural networks
Yamamoto et al. Improvement of robot audition by interfacing sound source separation and automatic speech recognition with missing feature theory
JP3632099B2 (en) Robot audio-visual system
CN103901400B (en) A kind of based on delay compensation and ears conforming binaural sound source of sound localization method
Cho et al. Sound source localization for robot auditory systems
Yamamoto et al. Assessment of general applicability of robot audition system by recognizing three simultaneous speeches
Murase et al. Multiple moving speaker tracking by microphone array on mobile robot.
Okuno et al. Computational auditory scene analysis and its application to robot audition
Wang et al. A deconvolutive neural network for speech classification with applications to home service robot
Okuno et al. Sound and visual tracking for humanoid robot
Nakadai et al. Exploiting auditory fovea in humanoid-human interaction