WO2008041878A3 - System and procedure of hands free speech communication using a microphone array - Google Patents
System and procedure of hands free speech communication using a microphone array Download PDFInfo
- Publication number
- WO2008041878A3 WO2008041878A3 PCT/RS2007/000017 RS2007000017W WO2008041878A3 WO 2008041878 A3 WO2008041878 A3 WO 2008041878A3 RS 2007000017 W RS2007000017 W RS 2007000017W WO 2008041878 A3 WO2008041878 A3 WO 2008041878A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speaker
- microphone array
- procedure
- room
- signal
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 3
- 230000003044 adaptive effect Effects 0.000 abstract 2
- 230000001629 suppression Effects 0.000 abstract 2
- 238000001914 filtration Methods 0.000 abstract 1
- 230000004807 localization Effects 0.000 abstract 1
- 230000005236 sound signal Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/8006—Multi-channel systems specially adapted for direction-finding, i.e. having a single aerial system capable of giving simultaneous indications of the directions of different signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/142—Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/445—Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
- H04N5/45—Picture in picture, e.g. displaying simultaneously another television channel in a region of the screen
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
Abstract
The invention relates to the system and procedure for hand-free voice communication in video-phone or teleconference using a microphone array, whose main purpose is to make a quality recording of speaker in room, in the situation of larger expansion, with presence noise, with acoustic echo, produced by distance speaker and TV program, room reverberation and movement of the speaker in room. System contains: digital TV receiver and digital camera for picture reproduction and shooting, respectively, stereo loudspeakers and microphone array for sound reproduction and recording, respectively, amplifier and acquisition module for audio signals and DSP for acoustic signal processing. The procedure for microphone signal processing is done in frequency domain and it contains: acoustic echo suppression made of two signals: far-end speaker signal and stereo TV signal, acoustic spatial filtering of near-end speaker in accordance with noise sources and room reverberation, based on adaptive characteristic of microphone array directivity, of speaker localization in horizontal plane, of suppression of all residual noises and adaptive gain control of transmitting signal.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
RSP-2006/0551A RS49875B (en) | 2006-10-04 | 2006-10-04 | System and technique for hands-free voice communication using microphone array |
RSP-2006/0551 | 2006-10-04 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2008041878A2 WO2008041878A2 (en) | 2008-04-10 |
WO2008041878A3 true WO2008041878A3 (en) | 2009-02-19 |
Family
ID=39268910
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/RS2007/000017 WO2008041878A2 (en) | 2006-10-04 | 2007-09-19 | System and procedure of hands free speech communication using a microphone array |
Country Status (2)
Country | Link |
---|---|
RS (1) | RS49875B (en) |
WO (1) | WO2008041878A2 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8660274B2 (en) | 2008-07-16 | 2014-02-25 | Nuance Communications, Inc. | Beamforming pre-processing for speaker localization |
US8981994B2 (en) | 2011-09-30 | 2015-03-17 | Skype | Processing signals |
US9269367B2 (en) | 2011-07-05 | 2016-02-23 | Skype Limited | Processing audio signals during a communication event |
CN109151370A (en) * | 2018-09-21 | 2019-01-04 | 上海赛连信息科技有限公司 | Intelligent video system and control of intelligent terminal |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5386936B2 (en) * | 2008-11-05 | 2014-01-15 | ヤマハ株式会社 | Sound emission and collection device |
US9215527B1 (en) | 2009-12-14 | 2015-12-15 | Cirrus Logic, Inc. | Multi-band integrated speech separating microphone array processor with adaptive beamforming |
US8861756B2 (en) | 2010-09-24 | 2014-10-14 | LI Creative Technologies, Inc. | Microphone array system |
US8811601B2 (en) | 2011-04-04 | 2014-08-19 | Qualcomm Incorporated | Integrated echo cancellation and noise suppression |
JP6064159B2 (en) | 2011-07-11 | 2017-01-25 | パナソニックIpマネジメント株式会社 | Echo cancellation apparatus, conference system using the same, and echo cancellation method |
GB2495128B (en) | 2011-09-30 | 2018-04-04 | Skype | Processing signals |
GB2495129B (en) | 2011-09-30 | 2017-07-19 | Skype | Processing signals |
GB2495278A (en) | 2011-09-30 | 2013-04-10 | Skype | Processing received signals from a range of receiving angles to reduce interference |
GB2495472B (en) | 2011-09-30 | 2019-07-03 | Skype | Processing audio signals |
GB2495130B (en) | 2011-09-30 | 2018-10-24 | Skype | Processing audio signals |
CN102968999B (en) * | 2011-11-18 | 2015-04-22 | 斯凯普公司 | Audio signal processing |
GB2496660B (en) * | 2011-11-18 | 2014-06-04 | Skype | Processing audio signals |
GB201120392D0 (en) | 2011-11-25 | 2012-01-11 | Skype Ltd | Processing signals |
GB2497343B (en) | 2011-12-08 | 2014-11-26 | Skype | Processing audio signals |
TWI466108B (en) * | 2012-07-31 | 2014-12-21 | Acer Inc | Audio processing method and audio processing device |
EP2747451A1 (en) * | 2012-12-21 | 2014-06-25 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Filter and method for informed spatial filtering using multiple instantaneous direction-of-arrivial estimates |
WO2016054090A1 (en) * | 2014-09-30 | 2016-04-07 | Nunntawi Dynamics Llc | Method to determine loudspeaker change of placement |
KR20170035504A (en) | 2015-09-23 | 2017-03-31 | 삼성전자주식회사 | Electronic device and method of audio processing thereof |
CN110099328B (en) * | 2018-01-31 | 2024-03-29 | 北京塞宾科技有限公司 | Intelligent sound box |
CN109147813A (en) * | 2018-09-21 | 2019-01-04 | 神思电子技术股份有限公司 | A kind of service robot noise-reduction method based on audio-visual location technology |
CN110366017A (en) * | 2019-06-06 | 2019-10-22 | 深圳康佳电子科技有限公司 | A kind of smart television voice cam device and intelligent TV set |
CN110223690A (en) * | 2019-06-10 | 2019-09-10 | 深圳永顺智信息科技有限公司 | The man-machine interaction method and device merged based on image with voice |
CN110956969B (en) * | 2019-11-28 | 2022-06-10 | 北京达佳互联信息技术有限公司 | Live broadcast audio processing method and device, electronic equipment and storage medium |
CN111161751A (en) * | 2019-12-25 | 2020-05-15 | 声耕智能科技(西安)研究院有限公司 | Distributed microphone pickup system and method under complex scene |
CN113470682B (en) * | 2021-06-16 | 2023-11-24 | 中科上声(苏州)电子有限公司 | Method, device and storage medium for estimating speaker azimuth by microphone array |
CN118072744B (en) * | 2024-04-18 | 2024-07-23 | 深圳市万屏时代科技有限公司 | Voiceprint-based language identification method and device |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5305307A (en) * | 1991-01-04 | 1994-04-19 | Picturetel Corporation | Adaptive acoustic echo canceller having means for reducing or eliminating echo in a plurality of signal bandwidths |
US5550924A (en) * | 1993-07-07 | 1996-08-27 | Picturetel Corporation | Reduction of background noise for speech enhancement |
EP0762751A2 (en) * | 1995-08-24 | 1997-03-12 | Hitachi, Ltd. | Television receiver |
US5715319A (en) * | 1996-05-30 | 1998-02-03 | Picturetel Corporation | Method and apparatus for steerable and endfire superdirective microphone arrays with reduced analog-to-digital converter and computational requirements |
US6483532B1 (en) * | 1998-07-13 | 2002-11-19 | Netergy Microelectronics, Inc. | Video-assisted audio signal processing system and method |
WO2003043327A1 (en) * | 2001-11-13 | 2003-05-22 | Koninklijke Philips Electronics N.V. | A system and method for providing an awareness of remote people in the room during a videoconference |
US6593956B1 (en) * | 1998-05-15 | 2003-07-15 | Polycom, Inc. | Locating an audio source |
WO2004017303A1 (en) * | 2002-08-16 | 2004-02-26 | Dspfactory Ltd. | Method and system for processing subband signals using adaptive filters |
US20040252850A1 (en) * | 2003-04-24 | 2004-12-16 | Lorenzo Turicchia | System and method for spectral enhancement employing compression and expansion |
WO2006028587A2 (en) * | 2004-07-22 | 2006-03-16 | Softmax, Inc. | Headset for separation of speech signals in a noisy environment |
US20060132595A1 (en) * | 2004-10-15 | 2006-06-22 | Kenoyer Michael L | Speakerphone supporting video and audio features |
-
2006
- 2006-10-04 RS RSP-2006/0551A patent/RS49875B/en unknown
-
2007
- 2007-09-19 WO PCT/RS2007/000017 patent/WO2008041878A2/en active Application Filing
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5305307A (en) * | 1991-01-04 | 1994-04-19 | Picturetel Corporation | Adaptive acoustic echo canceller having means for reducing or eliminating echo in a plurality of signal bandwidths |
US5550924A (en) * | 1993-07-07 | 1996-08-27 | Picturetel Corporation | Reduction of background noise for speech enhancement |
EP0762751A2 (en) * | 1995-08-24 | 1997-03-12 | Hitachi, Ltd. | Television receiver |
US5715319A (en) * | 1996-05-30 | 1998-02-03 | Picturetel Corporation | Method and apparatus for steerable and endfire superdirective microphone arrays with reduced analog-to-digital converter and computational requirements |
US6593956B1 (en) * | 1998-05-15 | 2003-07-15 | Polycom, Inc. | Locating an audio source |
US6483532B1 (en) * | 1998-07-13 | 2002-11-19 | Netergy Microelectronics, Inc. | Video-assisted audio signal processing system and method |
WO2003043327A1 (en) * | 2001-11-13 | 2003-05-22 | Koninklijke Philips Electronics N.V. | A system and method for providing an awareness of remote people in the room during a videoconference |
WO2004017303A1 (en) * | 2002-08-16 | 2004-02-26 | Dspfactory Ltd. | Method and system for processing subband signals using adaptive filters |
US20040252850A1 (en) * | 2003-04-24 | 2004-12-16 | Lorenzo Turicchia | System and method for spectral enhancement employing compression and expansion |
WO2006028587A2 (en) * | 2004-07-22 | 2006-03-16 | Softmax, Inc. | Headset for separation of speech signals in a noisy environment |
US20060132595A1 (en) * | 2004-10-15 | 2006-06-22 | Kenoyer Michael L | Speakerphone supporting video and audio features |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8660274B2 (en) | 2008-07-16 | 2014-02-25 | Nuance Communications, Inc. | Beamforming pre-processing for speaker localization |
US9269367B2 (en) | 2011-07-05 | 2016-02-23 | Skype Limited | Processing audio signals during a communication event |
US8981994B2 (en) | 2011-09-30 | 2015-03-17 | Skype | Processing signals |
CN109151370A (en) * | 2018-09-21 | 2019-01-04 | 上海赛连信息科技有限公司 | Intelligent video system and control of intelligent terminal |
CN109151370B (en) * | 2018-09-21 | 2020-10-23 | 上海赛连信息科技有限公司 | Intelligent video system and intelligent control terminal |
Also Published As
Publication number | Publication date |
---|---|
WO2008041878A2 (en) | 2008-04-10 |
RS20060551A (en) | 2007-06-04 |
RS49875B (en) | 2008-08-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2008041878A3 (en) | System and procedure of hands free speech communication using a microphone array | |
US9014387B2 (en) | Coordinated control of adaptive noise cancellation (ANC) among earspeaker channels | |
AU2013239736B2 (en) | Pre-shaping series filter for active noise cancellation adaptive filter | |
JP4202640B2 (en) | Short range wireless communication headset, communication system using the same, and acoustic processing method in short range wireless communication | |
CA2560034C (en) | System for selectively extracting components of an audio input signal | |
US11373665B2 (en) | Voice isolation system | |
US20060147063A1 (en) | Echo cancellation in telephones with multiple microphones | |
DE602005021546D1 (en) | Hung | |
CA2539798A1 (en) | A method to reduce training time of an acoustic echo canceller in a full-duplex beamforming-based audio conferencing system | |
WO2010000878A3 (en) | Speech enhancement method and system | |
KR20100022492A (en) | Sound signal processor and delay time setting method | |
WO2018010375A1 (en) | Method and device for realising karaoke function through earphone, and earphone | |
PT1745637E (en) | Conference terminal comprising echo reduction for a voice conferencing system | |
US20190295525A1 (en) | Method and Device for Generating and Providing an Audio Signal for Enhancing a Hearing Impression at Live Events | |
US20240251038A1 (en) | Method for optimizing speech pickup in a communication device | |
US8923530B2 (en) | Speakerphone feedback attenuation | |
JP5538249B2 (en) | Stereo headset | |
JP2008245250A (en) | Audio conferencing equipment | |
CN113038315A (en) | Voice signal processing method and device | |
JP3314730B2 (en) | Audio playback device and communication conference device | |
JP2003533110A (en) | Audio system | |
US11700485B2 (en) | Differential audio data compensation | |
JP7493158B2 (en) | Audio processing device and audio processing method | |
Goldin | Close talking autodirective dual microphone | |
Goodwin | Joe DiBiase, Michael Brandstein (Box D, Brown Univ., Providence, RI 02912), and Harvey F. Silverman (Brown University, Providence, RI 02912) A frequency-domain delay estimator has been used as the basis of a microphone-array talker location and beamforming system [M. S. Brandstein and HF Silverman, Techn. Rep. LEMS-116 (1993)]. While the estimator has advantages over previously employed correlation-based delay estimation methods [HF Silverman and SE Kirtman, Cornput. Speech Lang. 6, 129-152 (1990)], including |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07834923 Country of ref document: EP Kind code of ref document: A2 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07834923 Country of ref document: EP Kind code of ref document: A2 |