[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

EP0719439B1 - Voice activity detector - Google Patents

Voice activity detector Download PDF

Info

Publication number
EP0719439B1
EP0719439B1 EP94926317A EP94926317A EP0719439B1 EP 0719439 B1 EP0719439 B1 EP 0719439B1 EP 94926317 A EP94926317 A EP 94926317A EP 94926317 A EP94926317 A EP 94926317A EP 0719439 B1 EP0719439 B1 EP 0719439B1
Authority
EP
European Patent Office
Prior art keywords
gain
voice activity
speech
input signal
updating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
EP94926317A
Other languages
German (de)
English (en)
French (fr)
Other versions
EP0719439A1 (en
Inventor
Paul Alexander Barrett
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
British Telecommunications PLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=27235491&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=EP0719439(B1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority claimed from GB939324967A external-priority patent/GB9324967D0/en
Priority claimed from GB9412451A external-priority patent/GB9412451D0/en
Application filed by British Telecommunications PLC filed Critical British Telecommunications PLC
Priority to EP94926317A priority Critical patent/EP0719439B1/en
Publication of EP0719439A1 publication Critical patent/EP0719439A1/en
Application granted granted Critical
Publication of EP0719439B1 publication Critical patent/EP0719439B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q1/00Details of selecting apparatus or arrangements
    • H04Q1/18Electrical details
    • H04Q1/30Signalling arrangements; Manipulation of signalling currents
    • H04Q1/44Signalling arrangements; Manipulation of signalling currents using alternate current
    • H04Q1/444Signalling arrangements; Manipulation of signalling currents using alternate current with voice-band signalling frequencies
    • H04Q1/46Signalling arrangements; Manipulation of signalling currents using alternate current with voice-band signalling frequencies comprising means for distinguishing between a signalling current of predetermined frequency and a complex current containing that frequency, e.g. speech current
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M19/00Current supply arrangements for telephone systems
    • H04M19/08Current supply arrangements for telephone systems with current supply sources at the substations

Definitions

  • a voice activity detector is a device which is supplied with a signal with the object of detecting periods of speech, or periods containing only noise.
  • the present invention is not limited thereto, one application of particular interest for such detectors is in mobile radio telephone systems where the knowledge as to the presence or otherwise of speech can be exploited to reduce power consumption and interference by turning off a transmitter during periods of silence.
  • the noise level from a vehicle-mounted unit
  • Another possible use in radio systems is to improve the efficient utilisation of radio spectrum.
  • Figure 1 shows a voice activity detector as described in our International Patent Application WO89/08910.
  • noisy speech signals are received at an input 1.
  • a store 2 contains data defining an estimate or model of the frequency spectrum of the noise; a comparison is made (3) between this and the spectrum of the current signal to obtain a measure of similarity which is compared (4) with a threshold value.
  • the noise model is updated from the input only when speech is absent.
  • the threshold can be adapted (adaptor 6).
  • auxiliary detector 7 which comprises an unvoiced speech detector 8 and a voiced speech detector 9: the detector 7 deems speech to be present if either of the detectors recognises speech, and suppresses updating and threshold adaptation of the main detector.
  • the unvoiced speech detector 8 obtains a set of LPC coefficients for the signal and compares the autocorrelation function of these coefficients between successive frame periods, whilst the voiced speech detector 9 examines variations in the autocorrelation of the LPC residual.
  • tone detectors each tuned to the frequency(s) of a particular signalling tone; however, the diversity of different signalling tones throughout the world is considerable, so that a large number of individual detectors would be needed in order, for example, that a mobile telephone user making an international call may be able to hear the 'engaged' tone reliably, irrespective of the country from which it originates.
  • a voice activity detector for detecting the presence of speech in an input signal, comprising
  • a conventional speech coder 100 has a speech input 101, the speech signal being sampled at 8kHz and converted into digital form by an analogue-to-digital converter 102.
  • a windowing unit 103 divides the speech samples into frames of (for example) 160 samples (i.e. a 20ms frame) and multiplies it by a Hamming window or other function which reduces the contribution of samples at the beginning and end of the frame.
  • a correlator 104 receives the digitised speech samples and produces the autocorrelation coefficients R i for each frame.
  • An LPC analysis unit 105 calculates the coefficients a i of a filter (sometimes referred to as a synthesis filter) having a frequency response corresponding to the frequency spectrum of the input speech signal using a known method e. g. a Levinson-Durbin or Schurralgorithm.
  • the digitised input signal is also passed through an inverse filter (or analysis filter) 106 controlled by the coefficients, to produce a residual signal which is further analysed by a long term predictor analysis unit 107 which computes the optimum delay for predicting the LPC residual from its previous values, and a corresponding gain value for the prediction.
  • the analysis unit 106 also forms a second residual (i.e. the difference between the current LPC residual and the LPC residual when delayed and scaled by the parameters obtained).
  • An excitation unit 108 derives excitation parameters for transmission to a decoder, by simply quantisising the LTP residual, or by other conventional means.
  • the LPC coefficients a i , the long term predictor delay d and gain g, and excitation parameters e are transmitted to a decoder.
  • a main voice activity detector in accordance with our earlier patent application averages the autocorrelation coefficients R i by means of an averager 110 which produces a weighted sum R i ' of the current coefficients and those from previous frames stored in a buffer 111.
  • a further autocorrelator 112 forms the autocorrelation coefficients B. of the LPC coefficients a i which are passed to a buffer 113.
  • the contents of the buffer are updated only during periods deemed by an auxiliary detector (to be described below) to contain only noise, so that the contents of the buffer 113 B i ' represent an estimate of the noise spectrum of the input signal.
  • a multiplication/addition unit 114 forms a measure M of the spectral similarity between the input signal and the noise model defined as
  • n is the number of samples in a speech frame.
  • the measure M is compared in a comparator 115 against a threshold level and produces at an output 116 a signal indicating the presence of absence of speech.
  • the threshold may be adaptively adjusted (117) according to the current noise power level.
  • the updating of the noise estimate in the buffer store 113 is not controlled by the output 116 of the detector just described, since failure to recognise speech would result in updating of the buffer with speech information and consequent further recognition failures - a "lock" situation. Therefore updating is controlled by an auxiliary detector 200.
  • this forms (201) a sum of products of the (unaveraged) autocorrelation coefficients Ri of the input and the (unbuffered) autocorrelation coefficients Bi of the LPC coefficients.
  • a subtractor 202 compares this sum with the corresponding sum for a previous speech frame, delayed in a buffer 203. This difference representing the spectral similarity between successive frames of the input signal is thresholded (204) to produce a decision signal.
  • the long term predictor delay d is measured by a pitch analysis unit 205.
  • the outputs of this is combined with that of the thresholding stage 204 in an OR gate 206 - i.e. speech is deemed by the auxiliary detector 200 to be present if either (or both) of the units 204 or 205 products an output indicating that speech is present.
  • speech is deemed by the auxiliary detector 200 to be present if either (or both) of the units 204 or 205 products an output indicating that speech is present.
  • the auxiliary detector just described is not very effective at achieving this. Although it recognises some tones others (generally those with a relatively pure spectral content) are not recognised.
  • the main detector also fails since the noise estimate in the buffer 113 is then "trained" on the signalling tone.
  • a further auxiliary detector is provided for the detection of signalling tones.
  • signalling tones being artificially generated, contain a small number of frequency components (which may be modulated).
  • the performance of an LPC predictor is exceptionally high for such signals, and this is made use of to discriminate between tone-based signals (including multi-tone signals) and background or environmental noise signals.
  • the LPC prediction gain Gp is defined as the ratio of the input signal power to the output signal power for a frame of speech viz is where x i is the filter input and y i is the output of the inverse filter: (where m is the number of filter coefficients, typically 8 or 10).
  • T 63 or 18 dB
  • the further detector 300 may be considered as a detector for certain types of tone; alternatively (in the embodiment of figure 2) it may be viewed as detecting a situation where the residual y i is small, so that operation of the long term predictor 107 (and hence of the pitch analysis 205) is not robust.
  • An alternative option for detecting voiced speech is to replace the pitch detector 205 with items analogous to 301, 302, 303 and 304 to form (and threshold) a prediction gain based on the longterm predictor analysis 107.
  • the prediction gain calculated is that of the LPC analysis of the speech coder 100, which might typically employ an 8th or even 10th order predictor.
  • the basis of this part of the analysis is that information tones result in higher prediction gains than does environmental noise, and that the higher the order of the analysis the higher is the ability of the predictor to model the noise environment, it is found that, by limiting the gain calculation to a fourth order analysis, information signals consisting of one or two tones give a high prediction gain whilst the prediction gain for environmental noise can be reduced.
  • noise in a mobile radio environment contain very strong resonances at low frequencies, and a further test is made to determine whether the "tone" is below a threshold frequency. Selection of a threshold involves a degree of compromise but, since most signalling tones lie above 400Hz, 385 Hz is suggested.
  • This further test operates by determining the frequencies of the poles of the LPC filter.
  • a low order filter is preferred to reduce the complexity of analysis.
  • the pole lies on the real axis and the signal is not a tone. If it is positive, but the real part of the pole position is negative (i.e. a 1 ⁇ 0) then the pole is in the left-hand half of the z-plane. This necessarily implies that the frequency is more than 25% of the sampling rate - i.e. above 2000Hz for a sampling frequency f s of 8kHz, in which case the frequency calculation is unnecessary and a ">385" signal can be generated right away.
  • the pole frequency is given by:
  • Its output is combined in an and-gate 406 with that of the comparator 403 so that a 'tone' decision is produced only when both the prediction gain is high and the pole frequency is greater than 385Hz.
  • pole frequencies above 2000Hz may also be trapped so that high-frequencies above the expected signalling tone range may not be recognised as tones.
  • the initial processing may be by means of a covariance analysis 109, the output of which is supplied to a reflection coefficient calculator 400' and a modified autocorrelation coefficient unit 104'.
  • the LPC analysis unit 105 may be connected as before to the autocorrelation unit 104' or - as shown - directly to the covariance analysis unit 109.
  • filter 450 this is a low pass equiripple FIR filter having zeros on the unit circle having a passband up to 600 (3dB point) and having a stopband attenuation of 20dB at 1200 Hz. It is thought preferable that the stopband attenuation not be too great.
  • the filter output is subsampled at 1200 Hz in subsampling unit 451.
  • the filter 450 is fed directly with the digitised input signal from the analogue-to-digital converter 102, and feeds a reflection coefficient analysis unit 400", or covariance or autocorrelation analysis as discussed earlier.
  • the autocorrelation option will require windowing as explained above.
  • Another embodiment alleviates the "harmonics" problem without unduly limiting the frequency range of prediction gain analysis; this is achieved by using filters to divide the signal into two or more frequency bands each of which is narrow enough that it cannot contain both the fundamental and the third harmonic of a tone. Each channel is then subsampled and subjected to a separate prediction gain analysis.
  • the signal is divided into frequency bands 400-1200 Hz and 1200-2000 Hz by filters 450a, 450b, and subsampled at 1. 6 kHz (451a, 451b).
  • Reflection coefficient computation 400" a,b, prediction error analysis 401a,b and thresholding 403a,b are performed separately for the two bands.
  • the two outputs from comparators 403a, 403b are conducted to separate inputs of the OR gate 206, so that a high prediction gain in either of the channels is considered to indicate the presence of a tone.
  • the other items 100-303 of Figure 7 are not shown in figure 8 as they are unchanged.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Geophysics And Detection Of Objects (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Telephone Function (AREA)
  • Control Of Amplification And Gain Control (AREA)
  • Investigating Or Analyzing Materials By The Use Of Electric Means (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Cosmetics (AREA)
  • Electromechanical Clocks (AREA)
  • Investigating Or Analysing Materials By The Use Of Chemical Reactions (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measuring Fluid Pressure (AREA)
  • Burglar Alarm Systems (AREA)
  • Digital Transmission Methods That Use Modulated Carrier Waves (AREA)
  • Radio Relay Systems (AREA)
EP94926317A 1993-09-14 1994-09-14 Voice activity detector Expired - Lifetime EP0719439B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP94926317A EP0719439B1 (en) 1993-09-14 1994-09-14 Voice activity detector

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
EP93307211 1993-09-14
EP93307211 1993-09-14
GB9324967 1993-12-06
GB939324967A GB9324967D0 (en) 1993-12-06 1993-12-06 Voice activity detector
GB9412451 1994-06-21
GB9412451A GB9412451D0 (en) 1994-06-21 1994-06-21 Voice activity detector
EP94926317A EP0719439B1 (en) 1993-09-14 1994-09-14 Voice activity detector
PCT/GB1994/001999 WO1995008170A1 (en) 1993-09-14 1994-09-14 Voice activity detector

Publications (2)

Publication Number Publication Date
EP0719439A1 EP0719439A1 (en) 1996-07-03
EP0719439B1 true EP0719439B1 (en) 1999-07-21

Family

ID=27235491

Family Applications (1)

Application Number Title Priority Date Filing Date
EP94926317A Expired - Lifetime EP0719439B1 (en) 1993-09-14 1994-09-14 Voice activity detector

Country Status (23)

Country Link
US (2) US5749067A (es)
EP (1) EP0719439B1 (es)
JP (1) JP3224132B2 (es)
KR (1) KR100363309B1 (es)
CN (1) CN1064772C (es)
AT (1) ATE182420T1 (es)
BR (1) BR9407535A (es)
CA (1) CA2169745C (es)
CZ (1) CZ286743B6 (es)
DE (1) DE69419615T2 (es)
DK (1) DK0719439T3 (es)
ES (1) ES2136204T3 (es)
FI (1) FI118195B (es)
GR (1) GR3031515T3 (es)
HK (1) HK1014392A1 (es)
HU (1) HU219994B (es)
IN (1) IN184794B (es)
MY (1) MY111134A (es)
NO (1) NO307979B1 (es)
NZ (1) NZ273045A (es)
SG (1) SG48935A1 (es)
SK (1) SK281796B6 (es)
WO (1) WO1995008170A1 (es)

Families Citing this family (96)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IN184794B (es) * 1993-09-14 2000-09-30 British Telecomm
JP3522012B2 (ja) * 1995-08-23 2004-04-26 沖電気工業株式会社 コード励振線形予測符号化装置
FI100840B (fi) 1995-12-12 1998-02-27 Nokia Mobile Phones Ltd Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin
DE69716266T2 (de) * 1996-07-03 2003-06-12 British Telecommunications P.L.C., London Sprachaktivitätsdetektor
US6708146B1 (en) * 1997-01-03 2004-03-16 Telecommunications Research Laboratories Voiceband signal classifier
JPH10247098A (ja) * 1997-03-04 1998-09-14 Mitsubishi Electric Corp 可変レート音声符号化方法、可変レート音声復号化方法
US6531982B1 (en) 1997-09-30 2003-03-11 Sirf Technology, Inc. Field unit for use in a GPS system
US5970446A (en) 1997-11-25 1999-10-19 At&T Corp Selective noise/channel/coding models and recognizers for automatic speech recognition
US6385548B2 (en) * 1997-12-12 2002-05-07 Motorola, Inc. Apparatus and method for detecting and characterizing signals in a communication system
US6327471B1 (en) 1998-02-19 2001-12-04 Conexant Systems, Inc. Method and an apparatus for positioning system assisted cellular radiotelephone handoff and dropoff
US5991718A (en) * 1998-02-27 1999-11-23 At&T Corp. System and method for noise threshold adaptation for voice activity detection in nonstationary noise environments
US6182035B1 (en) 1998-03-26 2001-01-30 Telefonaktiebolaget Lm Ericsson (Publ) Method and apparatus for detecting voice activity
US6348744B1 (en) 1998-04-14 2002-02-19 Conexant Systems, Inc. Integrated power management module
US6453289B1 (en) 1998-07-24 2002-09-17 Hughes Electronics Corporation Method of noise reduction for speech codecs
US7711038B1 (en) 1998-09-01 2010-05-04 Sirf Technology, Inc. System and method for despreading in a spread spectrum matched filter
US7545854B1 (en) 1998-09-01 2009-06-09 Sirf Technology, Inc. Doppler corrected spread spectrum matched filter
US6693953B2 (en) 1998-09-30 2004-02-17 Skyworks Solutions, Inc. Adaptive wireless communication receiver
US6606349B1 (en) 1999-02-04 2003-08-12 Sirf Technology, Inc. Spread spectrum receiver performance improvement
US6448925B1 (en) 1999-02-04 2002-09-10 Conexant Systems, Inc. Jamming detection and blanking for GPS receivers
US6556967B1 (en) 1999-03-12 2003-04-29 The United States Of America As Represented By The National Security Agency Voice activity detector
US6304216B1 (en) * 1999-03-30 2001-10-16 Conexant Systems, Inc. Signal detector employing correlation analysis of non-uniform and disjoint sample segments
US6577271B1 (en) 1999-03-30 2003-06-10 Sirf Technology, Inc Signal detector employing coherent integration
US6618701B2 (en) * 1999-04-19 2003-09-09 Motorola, Inc. Method and system for noise suppression using external voice activity detection
US6381568B1 (en) 1999-05-05 2002-04-30 The United States Of America As Represented By The National Security Agency Method of transmitting speech using discontinuous transmission and comfort noise
US6351486B1 (en) 1999-05-25 2002-02-26 Conexant Systems, Inc. Accelerated selection of a base station in a wireless communication system
JP3929686B2 (ja) * 2000-08-14 2007-06-13 松下電器産業株式会社 音声スイッチング装置およびその方法
US6788655B1 (en) 2000-04-18 2004-09-07 Sirf Technology, Inc. Personal communications device with ratio counter
US6931055B1 (en) 2000-04-18 2005-08-16 Sirf Technology, Inc. Signal detector employing a doppler phase correction system
US6952440B1 (en) 2000-04-18 2005-10-04 Sirf Technology, Inc. Signal detector employing a Doppler phase correction system
US6714158B1 (en) * 2000-04-18 2004-03-30 Sirf Technology, Inc. Method and system for data detection in a global positioning system satellite receiver
FR2808391B1 (fr) * 2000-04-28 2002-06-07 France Telecom Systeme de reception pour antenne multicapteur
US7885314B1 (en) 2000-05-02 2011-02-08 Kenneth Scott Walley Cancellation system and method for a wireless positioning system
US6778136B2 (en) * 2001-12-13 2004-08-17 Sirf Technology, Inc. Fast acquisition of GPS signal
JP4201470B2 (ja) * 2000-09-12 2008-12-24 パイオニア株式会社 音声認識システム
JP4201471B2 (ja) * 2000-09-12 2008-12-24 パイオニア株式会社 音声認識システム
US7472059B2 (en) * 2000-12-08 2008-12-30 Qualcomm Incorporated Method and apparatus for robust speech classification
WO2002052546A1 (en) * 2000-12-27 2002-07-04 Intel Corporation Voice barge-in in telephony speech recognition
US6707869B1 (en) * 2000-12-28 2004-03-16 Nortel Networks Limited Signal-processing apparatus with a filter of flexible window design
DE10121532A1 (de) * 2001-05-03 2002-11-07 Siemens Ag Verfahren und Vorrichtung zur automatischen Differenzierung und/oder Detektion akustischer Signale
JP3859462B2 (ja) * 2001-05-18 2006-12-20 株式会社東芝 予測パラメータ分析装置および予測パラメータ分析方法
KR100399057B1 (ko) * 2001-08-07 2003-09-26 한국전자통신연구원 이동통신 시스템의 음성 활성도 측정 장치 및 그 방법
US20030110029A1 (en) * 2001-12-07 2003-06-12 Masoud Ahmadi Noise detection and cancellation in communications systems
ES2272952T3 (es) * 2002-03-08 2007-05-01 Koninklijke Kpn N.V. Procedimiento y sistema para medir la calidad de la transmision de un sistema.
US7454331B2 (en) 2002-08-30 2008-11-18 Dolby Laboratories Licensing Corporation Controlling loudness of speech in signals that contain speech and other types of audio material
US20040064314A1 (en) * 2002-09-27 2004-04-01 Aubert Nicolas De Saint Methods and apparatus for speech end-point detection
US7146316B2 (en) * 2002-10-17 2006-12-05 Clarity Technologies, Inc. Noise reduction in subbanded speech signals
US7230955B1 (en) 2002-12-27 2007-06-12 At & T Corp. System and method for improved use of voice activity detection
US7272552B1 (en) * 2002-12-27 2007-09-18 At&T Corp. Voice activity detection and silence suppression in a packet network
JP2004341339A (ja) * 2003-05-16 2004-12-02 Mitsubishi Electric Corp 雑音抑圧装置
ATE371246T1 (de) * 2003-05-28 2007-09-15 Dolby Lab Licensing Corp Verfahren, vorrichtung und computerprogramm zur berechung und einstellung der wahrgenommenen lautstärke eines audiosignals
EP1661916A4 (en) 2003-07-16 2008-10-01 Daikin Ind Ltd PROCESS FOR PREPARING FLUOROUS POLYMER, AQUEOUS DISPERSION OF FLUOROUS POLYMER, 2-ACYLOXYCARBOXYLENE DERIVATIVE AND TENSID
SG119199A1 (en) * 2003-09-30 2006-02-28 Stmicroelectronics Asia Pacfic Voice activity detector
JP4497911B2 (ja) * 2003-12-16 2010-07-07 キヤノン株式会社 信号検出装置および方法、ならびにプログラム
US20050209762A1 (en) * 2004-03-18 2005-09-22 Ford Global Technologies, Llc Method and apparatus for controlling a vehicle using an object detection system and brake-steer
FI20045315A (fi) * 2004-08-30 2006-03-01 Nokia Corp Ääniaktiivisuuden havaitseminen äänisignaalissa
CA2581810C (en) 2004-10-26 2013-12-17 Dolby Laboratories Licensing Corporation Calculating and adjusting the perceived loudness and/or the perceived spectral balance of an audio signal
US8199933B2 (en) 2004-10-26 2012-06-12 Dolby Laboratories Licensing Corporation Calculating and adjusting the perceived loudness and/or the perceived spectral balance of an audio signal
JP4729927B2 (ja) * 2005-01-11 2011-07-20 ソニー株式会社 音声検出装置、自動撮像装置、および音声検出方法
PL1931197T3 (pl) * 2005-04-18 2015-09-30 Basf Se Preparat zawierający co najmniej jeden fungicyd konazolowy, inny fungicyd i jeden kopolimer stabilizujący
US7826945B2 (en) * 2005-07-01 2010-11-02 You Zhang Automobile speech-recognition interface
DE102006032967B4 (de) * 2005-07-28 2012-04-19 S. Siedle & Söhne Telefon- und Telegrafenwerke OHG Hausanlage und Verfahren zum Betreiben einer Hausanlage
GB2430129B (en) * 2005-09-08 2007-10-31 Motorola Inc Voice activity detector and method of operation therein
ES2347473T3 (es) * 2005-12-05 2010-10-29 Qualcomm Incorporated Procedimiento y aparato de deteccion de componentes tonales de señales de audio.
US8417185B2 (en) * 2005-12-16 2013-04-09 Vocollect, Inc. Wireless headset and method for robust voice data communication
US7885419B2 (en) * 2006-02-06 2011-02-08 Vocollect, Inc. Headset terminal with speech functionality
US7773767B2 (en) 2006-02-06 2010-08-10 Vocollect, Inc. Headset terminal with rear stability strap
US8204754B2 (en) 2006-02-10 2012-06-19 Telefonaktiebolaget L M Ericsson (Publ) System and method for an improved voice detector
US8920343B2 (en) 2006-03-23 2014-12-30 Michael Edward Sabatino Apparatus for acquiring and processing of physiological auditory signals
TWI517562B (zh) 2006-04-04 2016-01-11 杜比實驗室特許公司 用於將多聲道音訊信號之全面感知響度縮放一期望量的方法、裝置及電腦程式
CN101410892B (zh) * 2006-04-04 2012-08-08 杜比实验室特许公司 改进的离散余弦变换域中的音频信号响度测量及修改
BRPI0711063B1 (pt) 2006-04-27 2023-09-26 Dolby Laboratories Licensing Corporation Método e aparelho para modificar um parâmetro de processamento de dinâmicas de áudio
CN101149921B (zh) * 2006-09-21 2011-08-10 展讯通信(上海)有限公司 一种静音检测方法和装置
BRPI0717484B1 (pt) 2006-10-20 2019-05-21 Dolby Laboratories Licensing Corporation Método e aparelho para processar um sinal de áudio
US8521314B2 (en) * 2006-11-01 2013-08-27 Dolby Laboratories Licensing Corporation Hierarchical control path with constraints for audio dynamics processing
US20080147389A1 (en) * 2006-12-15 2008-06-19 Motorola, Inc. Method and Apparatus for Robust Speech Activity Detection
EP2162881B1 (en) * 2007-05-22 2013-01-23 Telefonaktiebolaget LM Ericsson (publ) Voice activity detection with improved music detection
EP2168122B1 (en) * 2007-07-13 2011-11-30 Dolby Laboratories Licensing Corporation Audio processing using auditory scene analysis and spectral skewness
US20090043577A1 (en) * 2007-08-10 2009-02-12 Ditech Networks, Inc. Signal presence detection using bi-directional communication data
US8190440B2 (en) * 2008-02-29 2012-05-29 Broadcom Corporation Sub-band codec with native voice activity detection
EP2107553B1 (en) * 2008-03-31 2011-05-18 Harman Becker Automotive Systems GmbH Method for determining barge-in
US8275136B2 (en) * 2008-04-25 2012-09-25 Nokia Corporation Electronic device speech enhancement
US8244528B2 (en) 2008-04-25 2012-08-14 Nokia Corporation Method and apparatus for voice activity determination
WO2009130388A1 (en) * 2008-04-25 2009-10-29 Nokia Corporation Calibrating multiple microphones
CN101572090B (zh) * 2008-04-30 2013-03-20 向为 一种自适应多速率窄带编码方法及编码器
USD605629S1 (en) 2008-09-29 2009-12-08 Vocollect, Inc. Headset
KR101547344B1 (ko) 2008-10-31 2015-08-27 삼성전자 주식회사 음성복원장치 및 그 방법
TWI384423B (zh) * 2008-11-26 2013-02-01 Ind Tech Res Inst 以聲音事件為基礎之緊急通報方法與系統以及行為軌跡建立方法
CN101609678B (zh) 2008-12-30 2011-07-27 华为技术有限公司 信号压缩方法及其压缩装置
US8160287B2 (en) 2009-05-22 2012-04-17 Vocollect, Inc. Headset with adjustable headband
CN102576528A (zh) 2009-10-19 2012-07-11 瑞典爱立信有限公司 用于语音活动检测的检测器和方法
US8438659B2 (en) 2009-11-05 2013-05-07 Vocollect, Inc. Portable computing device and headset interface
FR2956539B1 (fr) * 2010-02-16 2012-03-16 Dominique Retali Procede de detection du fonctionnement d'un dispositif de transmission sans fil de signaux de voix.
US20120143604A1 (en) * 2010-12-07 2012-06-07 Rita Singh Method for Restoring Spectral Components in Denoised Speech Signals
US8954322B2 (en) * 2011-07-25 2015-02-10 Via Telecom Co., Ltd. Acoustic shock protection device and method thereof
US9363603B1 (en) 2013-02-26 2016-06-07 Xfrm Incorporated Surround audio dialog balance assessment
CN111261197B (zh) * 2020-01-13 2022-11-25 中航华东光电(上海)有限公司 一种复杂噪声场景下的实时语音段落追踪方法

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4358738A (en) * 1976-06-07 1982-11-09 Kahn Leonard R Signal presence determination method for use in a contaminated medium
JPS53105303A (en) * 1977-02-25 1978-09-13 Hitachi Ltd Preprocessing system for audio recognition
JPS5850360B2 (ja) * 1978-05-12 1983-11-10 株式会社日立製作所 音声認識装置における前処理方法
JPS59115625A (ja) * 1982-12-22 1984-07-04 Nec Corp 音声検出器
US4731846A (en) * 1983-04-13 1988-03-15 Texas Instruments Incorporated Voice messaging system with pitch tracking based on adaptively filtered LPC residual signal
EP0127718B1 (fr) * 1983-06-07 1987-03-18 International Business Machines Corporation Procédé de détection d'activité dans un système de transmission de la voix
US4700392A (en) * 1983-08-26 1987-10-13 Nec Corporation Speech signal detector having adaptive threshold values
US4696039A (en) * 1983-10-13 1987-09-22 Texas Instruments Incorporated Speech analysis/synthesis system with silence suppression
JPH0748695B2 (ja) * 1986-05-23 1995-05-24 株式会社日立製作所 音声符号化方式
AU608432B2 (en) * 1988-03-11 1991-03-28 Lg Electronics Inc. Voice activity detection
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection
JP2573352B2 (ja) * 1989-04-10 1997-01-22 富士通株式会社 音声検出装置
US5680508A (en) * 1991-05-03 1997-10-21 Itt Corporation Enhancement of speech coding in background noise for low-rate speech coder
IN184794B (es) * 1993-09-14 2000-09-30 British Telecomm
US5659622A (en) * 1995-11-13 1997-08-19 Motorola, Inc. Method and apparatus for suppressing noise in a communication system

Also Published As

Publication number Publication date
EP0719439A1 (en) 1996-07-03
IN184794B (es) 2000-09-30
NO961032D0 (no) 1996-03-13
JPH09502814A (ja) 1997-03-18
DE69419615D1 (de) 1999-08-26
NO307979B1 (no) 2000-06-26
CA2169745A1 (en) 1995-03-23
HK1014392A1 (en) 1999-09-24
AU673776B2 (en) 1996-11-21
US5749067A (en) 1998-05-05
CN1130952A (zh) 1996-09-11
NZ273045A (en) 1996-11-26
HUT73986A (en) 1996-10-28
CZ286743B6 (en) 2000-06-14
HU9600641D0 (en) 1996-05-28
CN1064772C (zh) 2001-04-18
ES2136204T3 (es) 1999-11-16
SK31896A3 (en) 1997-03-05
MY111134A (en) 1999-08-30
BR9407535A (pt) 1997-08-26
US6061647A (en) 2000-05-09
CZ67896A3 (en) 1996-07-17
ATE182420T1 (de) 1999-08-15
SK281796B6 (sk) 2001-08-06
DK0719439T3 (da) 2000-02-07
DE69419615T2 (de) 2000-05-25
KR100363309B1 (ko) 2003-02-17
GR3031515T3 (en) 2000-01-31
FI118195B (fi) 2007-08-15
WO1995008170A1 (en) 1995-03-23
FI961158A (fi) 1996-03-13
SG48935A1 (en) 1998-05-18
FI961158A0 (fi) 1996-03-13
HU219994B (hu) 2001-10-28
KR960705303A (ko) 1996-10-09
CA2169745C (en) 2000-05-16
JP3224132B2 (ja) 2001-10-29
NO961032L (no) 1996-03-13
AU7619894A (en) 1995-04-03

Similar Documents

Publication Publication Date Title
EP0719439B1 (en) Voice activity detector
EP0548054B1 (en) Voice activity detector
US5963901A (en) Method and device for voice activity detection and a communication device
EP2242049B1 (en) Noise suppression device
US6591234B1 (en) Method and apparatus for adaptively suppressing noise
US6023674A (en) Non-parametric voice activity detection
US5970441A (en) Detection of periodicity information from an audio signal
US6233549B1 (en) Low frequency spectral enhancement system and method
US20050108004A1 (en) Voice activity detector based on spectral flatness of input signal
EP1093112A2 (en) A method for generating speech feature signals and an apparatus for carrying through this method
US6199036B1 (en) Tone detection using pitch period
AU673776C (en) Voice activity detector
Vahatalo et al. Voice activity detection for GSM adaptive multi-rate codec
US6633847B1 (en) Voice activated circuit and radio using same
EP1748426A2 (en) Method and apparatus for adaptively suppressing noise

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19960217

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LI LU NL PT SE

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

17Q First examination report despatched

Effective date: 19980223

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LI LU NL PT SE

REF Corresponds to:

Ref document number: 182420

Country of ref document: AT

Date of ref document: 19990815

Kind code of ref document: T

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REF Corresponds to:

Ref document number: 69419615

Country of ref document: DE

Date of ref document: 19990826

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

ITF It: translation for a ep patent filed
ET Fr: translation filed
REG Reference to a national code

Ref country code: CH

Ref legal event code: NV

Representative=s name: JACOBACCI & PERANI S.A.

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2136204

Country of ref document: ES

Kind code of ref document: T3

REG Reference to a national code

Ref country code: PT

Ref legal event code: SC4A

Free format text: AVAILABILITY OF NATIONAL TRANSLATION

Effective date: 19990810

REG Reference to a national code

Ref country code: DK

Ref legal event code: T3

PLBQ Unpublished change to opponent data

Free format text: ORIGINAL CODE: EPIDOS OPPO

PLBI Opposition filed

Free format text: ORIGINAL CODE: 0009260

PLBI Opposition filed

Free format text: ORIGINAL CODE: 0009260

PLBF Reply of patent proprietor to notice(s) of opposition

Free format text: ORIGINAL CODE: EPIDOS OBSO

26 Opposition filed

Opponent name: LM ERICSSON

Effective date: 20000420

26 Opposition filed

Opponent name: SIEMENS AG

Effective date: 20000425

Opponent name: LM ERICSSON

Effective date: 20000420

NLR1 Nl: opposition has been filed with the epo

Opponent name: SIEMENS AG

Opponent name: LM ERICSSON

PLBF Reply of patent proprietor to notice(s) of opposition

Free format text: ORIGINAL CODE: EPIDOS OBSO

REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

PLCK Communication despatched that opposition was rejected

Free format text: ORIGINAL CODE: EPIDOSNREJ1

APAY Date of receipt of notice of appeal deleted

Free format text: ORIGINAL CODE: EPIDOSDNOA2O

APBP Date of receipt of notice of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA2O

APBQ Date of receipt of statement of grounds of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA3O

NLS Nl: assignments of ep-patents

Owner name: LG ELECTRONICS INC.

REG Reference to a national code

Ref country code: GB

Ref legal event code: 732E

RAP2 Party data changed (patent owner data changed or rights of a patent transferred)

Owner name: LG ELECTRONICS INC.

REG Reference to a national code

Ref country code: FR

Ref legal event code: TP

NLT2 Nl: modifications (of names), taken from the european patent patent bulletin

Owner name: LG ELECTRONICS INC.

PLAB Opposition data, opponent's data or that of the opponent's representative modified

Free format text: ORIGINAL CODE: 0009299OPPO

REG Reference to a national code

Ref country code: PT

Ref legal event code: PC4A

Free format text: LG ELECTRONICS INC. KR

Effective date: 20040804

R26 Opposition filed (corrected)

Opponent name: SIEMENS AG

Effective date: 20000425

Opponent name: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)

Effective date: 20000420

REG Reference to a national code

Ref country code: CH

Ref legal event code: PUE

Owner name: LG ELECTRONICS INC.

Free format text: BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY#81 NEWGATE STREET#LONDON EC1A 7AJ (GB) -TRANSFER TO- LG ELECTRONICS INC.#LG TWIN TOWERS 20, YEOUIDO-DONG YEONGDEUNGPO-GU#SEOUL, 150-721 (KR)

Ref country code: CH

Ref legal event code: NV

Representative=s name: SAEGER & PARTNER

NLR1 Nl: opposition has been filed with the epo

Opponent name: SIEMENS AG

Opponent name: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)

APAA Appeal reference recorded

Free format text: ORIGINAL CODE: EPIDOS REFN

APAH Appeal reference modified

Free format text: ORIGINAL CODE: EPIDOSCREFNO

APBU Appeal procedure closed

Free format text: ORIGINAL CODE: EPIDOSNNOA9O

PLBN Opposition rejected

Free format text: ORIGINAL CODE: 0009273

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: OPPOSITION REJECTED

27O Opposition rejected

Effective date: 20051019

NLR2 Nl: decision of opposition

Effective date: 20051019

REG Reference to a national code

Ref country code: CH

Ref legal event code: PCAR

Free format text: MANFRED SAEGER;POSTFACH 5;7304 MAIENFELD (CH)

REG Reference to a national code

Ref country code: CH

Ref legal event code: PFA

Owner name: LG ELECTRONICS INC.

Free format text: LG ELECTRONICS INC.#LG TWIN TOWERS 20, YEOUIDO-DONG YEONGDEUNGPO-GU#SEOUL, 150-721 (KR) -TRANSFER TO- LG ELECTRONICS INC.#LG TWIN TOWERS 20, YEOUIDO-DONG YEONGDEUNGPO-GU#SEOUL, 150-721 (KR)

REG Reference to a national code

Ref country code: CH

Ref legal event code: PCAR

Free format text: NEW ADDRESS: FELDGUEETLIWEG 130, 8706 MEILEN (CH)

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: LU

Payment date: 20130816

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GR

Payment date: 20130813

Year of fee payment: 20

Ref country code: DE

Payment date: 20130813

Year of fee payment: 20

Ref country code: PT

Payment date: 20130314

Year of fee payment: 20

Ref country code: ES

Payment date: 20130827

Year of fee payment: 20

Ref country code: IE

Payment date: 20130812

Year of fee payment: 20

Ref country code: DK

Payment date: 20130812

Year of fee payment: 20

Ref country code: AT

Payment date: 20130813

Year of fee payment: 20

Ref country code: CH

Payment date: 20130813

Year of fee payment: 20

Ref country code: SE

Payment date: 20130812

Year of fee payment: 20

Ref country code: NL

Payment date: 20130812

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20130812

Year of fee payment: 20

Ref country code: FR

Payment date: 20130813

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20130923

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: BE

Payment date: 20130820

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 69419615

Country of ref document: DE

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

Ref country code: DK

Ref legal event code: EUP

Effective date: 20140914

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 69419615

Country of ref document: DE

REG Reference to a national code

Ref country code: PT

Ref legal event code: MM4A

Free format text: MAXIMUM VALIDITY LIMIT REACHED

Effective date: 20140914

REG Reference to a national code

Ref country code: NL

Ref legal event code: V4

Effective date: 20140914

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20140913

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140916

REG Reference to a national code

Ref country code: SE

Ref legal event code: EUG

REG Reference to a national code

Ref country code: IE

Ref legal event code: MK9A

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK07

Ref document number: 182420

Country of ref document: AT

Kind code of ref document: T

Effective date: 20140914

REG Reference to a national code

Ref country code: GR

Ref legal event code: MA

Ref document number: 990402610

Country of ref document: GR

Effective date: 20140915

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140913

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140923

REG Reference to a national code

Ref country code: ES

Ref legal event code: FD2A

Effective date: 20150107

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140915

Ref country code: IE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140914