WO2008041878A3 - System and procedure of hands free speech communication using a microphone array - Google Patents

Publication number
WO2008041878A3
Authority
WO
WIPO (PCT)
Prior art keywords
speaker
microphone array
procedure
room
signal
Prior art date
Application number
PCT/RS2007/000017
Other languages
French (fr)
Other versions
WO2008041878A2 (en)
Inventor
Zoran Saric
Slobodan Jovicic
Vladimir Kovacevic
Nikola Teslic
Dragan Kukolj
Original Assignee
Micronas Nit
Zoran Saric
Slobodan Jovicic
Vladimir Kovacevic
Nikola Teslic
Dragan Kukolj
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Micronas Nit, Zoran Saric, Slobodan Jovicic, Vladimir Kovacevic, Nikola Teslic, Dragan Kukolj
Publication of WO2008041878A2
Publication of WO2008041878A3

Links

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S3/00 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
    • G01S3/80 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
    • G01S3/8006 Multi-channel systems specially adapted for direction-finding, i.e. having a single aerial system capable of giving simultaneous indications of the directions of different signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/141 Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142 Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/44 Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/445 Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
    • H04N5/45 Picture in picture, e.g. displaying simultaneously another television channel in a region of the screen

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)

Abstract

The invention relates to a system and procedure for hands-free voice communication in video-phone or teleconference applications using a microphone array. Its main purpose is to make a quality recording of the speaker in a room at a larger distance, in the presence of noise, acoustic echo produced by the far-end speaker and the TV program, room reverberation, and movement of the speaker in the room. The system contains: a digital TV receiver and a digital camera for picture reproduction and capture, respectively; stereo loudspeakers and a microphone array for sound reproduction and recording, respectively; an amplifier and acquisition module for the audio signals; and a DSP for acoustic signal processing. The microphone signals are processed in the frequency domain, and the procedure comprises: suppression of the acoustic echo of two signals, the far-end speaker signal and the stereo TV signal; acoustic spatial filtering of the near-end speaker with respect to noise sources and room reverberation, based on the adaptive directivity characteristic of the microphone array; localization of the speaker in the horizontal plane; suppression of all residual noise; and adaptive gain control of the transmitted signal.
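Two of the steps named in the abstract, speaker localization in the horizontal plane and spatial filtering by the microphone array, can be illustrated with a minimal frequency-domain sketch. This is not the patented procedure, which the abstract does not specify in detail. It swaps in a fixed delay-and-sum beamformer with a steered-response-power search, and it assumes a uniform linear array, a far-field source, and circular (FFT-based) delays. The function names, array geometry, sample rate, and noise level are all hypothetical.

```python
import numpy as np

C = 343.0  # assumed speed of sound in air, m/s


def steering_delays(mic_x, angle_deg):
    """Arrival delay (s) at each microphone for a far-field source.

    mic_x holds microphone positions (m) along a line; angle_deg is the
    source direction measured from broadside in the horizontal plane.
    """
    return -mic_x * np.sin(np.deg2rad(angle_deg)) / C


def simulate(source, mic_x, angle_deg, fs):
    """Create microphone signals by delaying the source via phase shifts."""
    n = len(source)
    freqs = np.fft.rfftfreq(n, 1.0 / fs)
    delays = steering_delays(mic_x, angle_deg)
    phase = np.exp(-2j * np.pi * freqs[None, :] * delays[:, None])
    return np.fft.irfft(np.fft.rfft(source)[None, :] * phase, n=n, axis=1)


def delay_and_sum(mics, mic_x, angle_deg, fs):
    """Align the channels toward angle_deg in the frequency domain and average."""
    n = mics.shape[1]
    freqs = np.fft.rfftfreq(n, 1.0 / fs)
    delays = steering_delays(mic_x, angle_deg)
    phase = np.exp(2j * np.pi * freqs[None, :] * delays[:, None])
    return np.fft.irfft((np.fft.rfft(mics, axis=1) * phase).mean(axis=0), n=n)


def localize(mics, mic_x, fs, candidates):
    """Return the candidate angle with the highest beamformer output power."""
    powers = [np.sum(delay_and_sum(mics, mic_x, a, fs) ** 2) for a in candidates]
    return candidates[int(np.argmax(powers))]


# Hypothetical setup: 4 microphones, 5 cm spacing, 16 kHz sampling.
rng = np.random.default_rng(0)
fs = 16000
mic_x = np.arange(4) * 0.05
source = rng.standard_normal(4096)        # broadband stand-in for speech
mics = simulate(source, mic_x, 30.0, fs)  # speaker at 30 degrees
mics += 0.1 * rng.standard_normal(mics.shape)  # uncorrelated sensor noise

candidates = np.arange(-90, 91, 5)
estimate = localize(mics, mic_x, fs, candidates)
enhanced = delay_and_sum(mics, mic_x, estimate, fs)
print("estimated direction (degrees):", estimate)
```

Averaging the aligned channels keeps the speaker's signal coherent while uncorrelated noise partially cancels. An adaptive beamformer of the kind the abstract describes would additionally adjust the array weights to the noise sources and the room reverberation.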
PCT/RS2007/000017 2006-10-04 2007-09-19 System and procedure of hands free speech communication using a microphone array WO2008041878A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
RSP-2006/0551A RS49875B (en) 2006-10-04 2006-10-04 System and technique for hands-free voice communication using microphone array
RSP-2006/0551 2006-10-04

Publications (2)

Publication Number Publication Date
WO2008041878A2 WO2008041878A2 (en) 2008-04-10
WO2008041878A3 WO2008041878A3 (en) 2009-02-19

Family

ID=39268910

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/RS2007/000017 WO2008041878A2 (en) 2006-10-04 2007-09-19 System and procedure of hands free speech communication using a microphone array

Country Status (2)

Country Link
RS (1) RS49875B (en)
WO (1) WO2008041878A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8660274B2 (en) 2008-07-16 2014-02-25 Nuance Communications, Inc. Beamforming pre-processing for speaker localization
US8981994B2 (en) 2011-09-30 2015-03-17 Skype Processing signals
US9269367B2 (en) 2011-07-05 2016-02-23 Skype Limited Processing audio signals during a communication event
CN109151370A (en) * 2018-09-21 2019-01-04 上海赛连信息科技有限公司 Intelligent video system and control of intelligent terminal

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5386936B2 (en) * 2008-11-05 2014-01-15 ヤマハ株式会社 Sound emission and collection device
US9215527B1 (en) 2009-12-14 2015-12-15 Cirrus Logic, Inc. Multi-band integrated speech separating microphone array processor with adaptive beamforming
US8861756B2 (en) 2010-09-24 2014-10-14 LI Creative Technologies, Inc. Microphone array system
US8811601B2 (en) 2011-04-04 2014-08-19 Qualcomm Incorporated Integrated echo cancellation and noise suppression
JP6064159B2 (en) 2011-07-11 2017-01-25 パナソニックIpマネジメント株式会社 Echo cancellation apparatus, conference system using the same, and echo cancellation method
GB2495128B (en) 2011-09-30 2018-04-04 Skype Processing signals
GB2495129B (en) 2011-09-30 2017-07-19 Skype Processing signals
GB2495278A (en) 2011-09-30 2013-04-10 Skype Processing received signals from a range of receiving angles to reduce interference
GB2495472B (en) 2011-09-30 2019-07-03 Skype Processing audio signals
GB2495130B (en) 2011-09-30 2018-10-24 Skype Processing audio signals
CN102968999B (en) * 2011-11-18 2015-04-22 斯凯普公司 Audio signal processing
GB2496660B (en) * 2011-11-18 2014-06-04 Skype Processing audio signals
GB201120392D0 (en) 2011-11-25 2012-01-11 Skype Ltd Processing signals
GB2497343B (en) 2011-12-08 2014-11-26 Skype Processing audio signals
TWI466108B (en) * 2012-07-31 2014-12-21 Acer Inc Audio processing method and audio processing device
EP2747451A1 (en) * 2012-12-21 2014-06-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Filter and method for informed spatial filtering using multiple instantaneous direction-of-arrivial estimates
WO2016054090A1 (en) * 2014-09-30 2016-04-07 Nunntawi Dynamics Llc Method to determine loudspeaker change of placement
KR20170035504A (en) 2015-09-23 2017-03-31 삼성전자주식회사 Electronic device and method of audio processing thereof
CN110099328B (en) * 2018-01-31 2024-03-29 北京塞宾科技有限公司 Intelligent sound box
CN109147813A (en) * 2018-09-21 2019-01-04 神思电子技术股份有限公司 A kind of service robot noise-reduction method based on audio-visual location technology
CN110366017A (en) * 2019-06-06 2019-10-22 深圳康佳电子科技有限公司 A kind of smart television voice cam device and intelligent TV set
CN110223690A (en) * 2019-06-10 2019-09-10 深圳永顺智信息科技有限公司 The man-machine interaction method and device merged based on image with voice
CN110956969B (en) * 2019-11-28 2022-06-10 北京达佳互联信息技术有限公司 Live broadcast audio processing method and device, electronic equipment and storage medium
CN111161751A (en) * 2019-12-25 2020-05-15 声耕智能科技(西安)研究院有限公司 Distributed microphone pickup system and method under complex scene
CN113470682B (en) * 2021-06-16 2023-11-24 中科上声(苏州)电子有限公司 Method, device and storage medium for estimating speaker azimuth by microphone array
CN118072744B (en) * 2024-04-18 2024-07-23 深圳市万屏时代科技有限公司 Voiceprint-based language identification method and device

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5305307A (en) * 1991-01-04 1994-04-19 Picturetel Corporation Adaptive acoustic echo canceller having means for reducing or eliminating echo in a plurality of signal bandwidths
US5550924A (en) * 1993-07-07 1996-08-27 Picturetel Corporation Reduction of background noise for speech enhancement
EP0762751A2 (en) * 1995-08-24 1997-03-12 Hitachi, Ltd. Television receiver
US5715319A (en) * 1996-05-30 1998-02-03 Picturetel Corporation Method and apparatus for steerable and endfire superdirective microphone arrays with reduced analog-to-digital converter and computational requirements
US6483532B1 (en) * 1998-07-13 2002-11-19 Netergy Microelectronics, Inc. Video-assisted audio signal processing system and method
WO2003043327A1 (en) * 2001-11-13 2003-05-22 Koninklijke Philips Electronics N.V. A system and method for providing an awareness of remote people in the room during a videoconference
US6593956B1 (en) * 1998-05-15 2003-07-15 Polycom, Inc. Locating an audio source
WO2004017303A1 (en) * 2002-08-16 2004-02-26 Dspfactory Ltd. Method and system for processing subband signals using adaptive filters
US20040252850A1 (en) * 2003-04-24 2004-12-16 Lorenzo Turicchia System and method for spectral enhancement employing compression and expansion
WO2006028587A2 (en) * 2004-07-22 2006-03-16 Softmax, Inc. Headset for separation of speech signals in a noisy environment
US20060132595A1 (en) * 2004-10-15 2006-06-22 Kenoyer Michael L Speakerphone supporting video and audio features

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8660274B2 (en) 2008-07-16 2014-02-25 Nuance Communications, Inc. Beamforming pre-processing for speaker localization
US9269367B2 (en) 2011-07-05 2016-02-23 Skype Limited Processing audio signals during a communication event
US8981994B2 (en) 2011-09-30 2015-03-17 Skype Processing signals
CN109151370A (en) * 2018-09-21 2019-01-04 上海赛连信息科技有限公司 Intelligent video system and control of intelligent terminal
CN109151370B (en) * 2018-09-21 2020-10-23 上海赛连信息科技有限公司 Intelligent video system and intelligent control terminal

Also Published As

Publication number Publication date
WO2008041878A2 (en) 2008-04-10
RS20060551A (en) 2007-06-04
RS49875B (en) 2008-08-07

Similar Documents

Publication Publication Date Title
WO2008041878A3 (en) System and procedure of hands free speech communication using a microphone array
US9014387B2 (en) Coordinated control of adaptive noise cancellation (ANC) among earspeaker channels
AU2013239736B2 (en) Pre-shaping series filter for active noise cancellation adaptive filter
JP4202640B2 (en) Short range wireless communication headset, communication system using the same, and acoustic processing method in short range wireless communication
CA2560034C (en) System for selectively extracting components of an audio input signal
US11373665B2 (en) Voice isolation system
US20060147063A1 (en) Echo cancellation in telephones with multiple microphones
DE602005021546D1 (en) Hung
CA2539798A1 (en) A method to reduce training time of an acoustic echo canceller in a full-duplex beamforming-based audio conferencing system
WO2010000878A3 (en) Speech enhancement method and system
KR20100022492A (en) Sound signal processor and delay time setting method
WO2018010375A1 (en) Method and device for realising karaoke function through earphone, and earphone
PT1745637E (en) Conference terminal comprising echo reduction for a voice conferencing system
US20190295525A1 (en) Method and Device for Generating and Providing an Audio Signal for Enhancing a Hearing Impression at Live Events
US20240251038A1 (en) Method for optimizing speech pickup in a communication device
US8923530B2 (en) Speakerphone feedback attenuation
JP5538249B2 (en) Stereo headset
JP2008245250A (en) Audio conferencing equipment
CN113038315A (en) Voice signal processing method and device
JP3314730B2 (en) Audio playback device and communication conference device
JP2003533110A (en) Audio system
US11700485B2 (en) Differential audio data compensation
JP7493158B2 (en) Audio processing device and audio processing method
Goldin Close talking autodirective dual microphone
Goodwin Joe DiBiase, Michael Brandstein (Box D, Brown Univ., Providence, RI 02912), and Harvey F. Silverman (Brown University, Providence, RI 02912) A frequency-domain delay estimator has been used as the basis of a microphone-array talker location and beamforming system [M. S. Brandstein and HF Silverman, Techn. Rep. LEMS-116 (1993)]. While the estimator has advantages over previously employed correlation-based delay estimation methods [HF Silverman and SE Kirtman, Comput. Speech Lang. 6, 129-152 (1990)], including

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07834923

Country of ref document: EP

Kind code of ref document: A2

122 Ep: pct application non-entry in european phase

Ref document number: 07834923

Country of ref document: EP

Kind code of ref document: A2