CN108899044B - 语音信号处理方法及装置 - Google Patents
语音信号处理方法及装置 Download PDFInfo
- Publication number
- CN108899044B CN108899044B CN201810845900.4A CN201810845900A CN108899044B CN 108899044 B CN108899044 B CN 108899044B CN 201810845900 A CN201810845900 A CN 201810845900A CN 108899044 B CN108899044 B CN 108899044B
- Authority
- CN
- China
- Prior art keywords
- signal
- voice
- noise
- source
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 18
- 238000000034 method Methods 0.000 claims abstract description 42
- 238000012545 processing Methods 0.000 claims abstract description 26
- 238000004364 calculation method Methods 0.000 claims description 24
- 238000000926 separation method Methods 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims description 7
- 238000000605 extraction Methods 0.000 claims description 3
- 238000004422 calculation algorithm Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000010295 mobile communication Methods 0.000 description 3
- 230000007547 defect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 230000002618 waking effect Effects 0.000 description 2
- 208000001992 Autosomal Dominant Optic Atrophy Diseases 0.000 description 1
- 206010011906 Death Diseases 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000012938 design process Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810845900.4A CN108899044B (zh) | 2018-07-27 | 2018-07-27 | 语音信号处理方法及装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810845900.4A CN108899044B (zh) | 2018-07-27 | 2018-07-27 | 语音信号处理方法及装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108899044A CN108899044A (zh) | 2018-11-27 |
CN108899044B true CN108899044B (zh) | 2020-06-26 |
Family
ID=64352278
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810845900.4A Active CN108899044B (zh) | 2018-07-27 | 2018-07-27 | 语音信号处理方法及装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108899044B (zh) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109545238B (zh) * | 2018-12-11 | 2022-05-10 | 珠海一微半导体股份有限公司 | 一种基于清洁机器人的语音去噪装置 |
CN109410928B (zh) * | 2018-12-11 | 2022-03-04 | 珠海一微半导体股份有限公司 | 一种基于语音识别的去噪方法和芯片 |
CN109360580B (zh) * | 2018-12-11 | 2022-01-04 | 珠海一微半导体股份有限公司 | 一种基于语音识别的迭代去噪装置和清洁机器人 |
CN109584899B (zh) * | 2018-12-11 | 2022-02-08 | 珠海一微半导体股份有限公司 | 一种基于语音识别的去噪装置和清洁机器人 |
CN109841214B (zh) * | 2018-12-25 | 2021-06-01 | 百度在线网络技术(北京)有限公司 | 语音唤醒处理方法、装置和存储介质 |
CN110012331B (zh) * | 2019-04-11 | 2021-05-25 | 杭州微纳科技股份有限公司 | 一种红外触发的远场双麦远场语音识别方法 |
CN110223708B (zh) * | 2019-05-07 | 2023-05-30 | 平安科技(深圳)有限公司 | 基于语音处理的语音增强方法及相关设备 |
CN110459234B (zh) * | 2019-08-15 | 2022-03-22 | 思必驰科技股份有限公司 | 用于车载的语音识别方法及系统 |
CN110673096B (zh) * | 2019-09-30 | 2022-02-01 | 北京地平线机器人技术研发有限公司 | 语音定位方法和装置、计算机可读存储介质、电子设备 |
CN112820310B (zh) * | 2019-11-15 | 2022-09-23 | 北京声智科技有限公司 | 一种来波方向估计方法及装置 |
CN111223497B (zh) * | 2020-01-06 | 2022-04-19 | 思必驰科技股份有限公司 | 一种终端的就近唤醒方法、装置、计算设备及存储介质 |
CN111276143B (zh) * | 2020-01-21 | 2023-04-25 | 北京远特科技股份有限公司 | 声源定位方法、装置、语音识别控制方法和终端设备 |
CN111402883B (zh) * | 2020-03-31 | 2023-05-26 | 云知声智能科技股份有限公司 | 一种复杂环境下分布式语音交互系统中就近响应系统和方法 |
CN112217577B (zh) * | 2020-10-14 | 2021-07-06 | 哈尔滨工程大学 | 一种基于频点存在概率的水下通信节点唤醒信号检测方法 |
CN113496698B (zh) * | 2021-08-12 | 2024-01-23 | 云知声智能科技股份有限公司 | 训练数据的筛选方法、装置、设备和存储介质 |
CN113658593B (zh) * | 2021-08-14 | 2024-03-12 | 普强时代(珠海横琴)信息技术有限公司 | 基于语音识别的唤醒实现方法及装置 |
CN113744752A (zh) * | 2021-08-30 | 2021-12-03 | 西安声必捷信息科技有限公司 | 语音处理方法及装置 |
CN114639398B (zh) * | 2022-03-10 | 2023-05-26 | 电子科技大学 | 一种基于麦克风阵列的宽带doa估计方法 |
CN115346527A (zh) * | 2022-08-08 | 2022-11-15 | 科大讯飞股份有限公司 | 语音控制方法、装置、系统、车辆和存储介质 |
CN118395579B (zh) * | 2024-06-27 | 2024-09-27 | 深圳大学 | 一种隧道中心线选点方法、系统、智能终端及存储介质 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9704486B2 (en) * | 2012-12-11 | 2017-07-11 | Amazon Technologies, Inc. | Speech recognition power management |
CN105792074B (zh) * | 2016-02-26 | 2019-02-05 | 西北工业大学 | 一种语音信号处理方法和装置 |
US9640197B1 (en) * | 2016-03-22 | 2017-05-02 | International Business Machines Corporation | Extraction of target speeches |
CN106251877B (zh) * | 2016-08-11 | 2019-09-06 | 珠海全志科技股份有限公司 | 语音声源方向估计方法及装置 |
CN108122556B (zh) * | 2017-08-08 | 2021-09-24 | 大众问问(北京)信息科技有限公司 | 减少驾驶人语音唤醒指令词误触发的方法及装置 |
CN108122563B (zh) * | 2017-12-19 | 2021-03-30 | 北京声智科技有限公司 | 提高语音唤醒率及修正doa的方法 |
CN108107403B (zh) * | 2017-12-20 | 2020-07-03 | 北京声智科技有限公司 | 一种波达方向估计方法和装置 |
CN108198568B (zh) * | 2017-12-26 | 2020-10-16 | 太原理工大学 | 一种多声源定位的方法及系统 |
-
2018
- 2018-07-27 CN CN201810845900.4A patent/CN108899044B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN108899044A (zh) | 2018-11-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108899044B (zh) | 语音信号处理方法及装置 | |
CN110600017B (zh) | 语音处理模型的训练方法、语音识别方法、系统及装置 | |
CN110164469B (zh) | 一种多人语音的分离方法和装置 | |
CN110473539B (zh) | 提升语音唤醒性能的方法和装置 | |
CN109461449B (zh) | 用于智能设备的语音唤醒方法及系统 | |
CN108352818B (zh) | 用于增强声音信号的声音信号处理装置和方法 | |
CN108417224B (zh) | 双向神经网络模型的训练和识别方法及系统 | |
CN108922553B (zh) | 用于音箱设备的波达方向估计方法及系统 | |
CN108510982B (zh) | 音频事件检测方法、装置及计算机可读存储介质 | |
CN108597505B (zh) | 语音识别方法、装置及终端设备 | |
CN110261816B (zh) | 语音波达方向估计方法及装置 | |
CN112017681B (zh) | 定向语音的增强方法及系统 | |
CN110554357B (zh) | 声源定位方法和装置 | |
US10602270B1 (en) | Similarity measure assisted adaptation control | |
CN110400572B (zh) | 音频增强方法及系统 | |
CN110610718B (zh) | 一种提取期望声源语音信号的方法及装置 | |
CN111868823B (zh) | 一种声源分离方法、装置及设备 | |
CN109270493B (zh) | 声源定位方法和装置 | |
CN111722696B (zh) | 用于低功耗设备的语音数据处理方法和装置 | |
CN111179915A (zh) | 基于语音的年龄识别方法及装置 | |
CN111415653B (zh) | 用于识别语音的方法和装置 | |
CN107274892A (zh) | 说话人识别方法及装置 | |
JP6265903B2 (ja) | 信号雑音減衰 | |
CN115662409B (zh) | 一种语音识别方法、装置、设备及存储介质 | |
CN106340310A (zh) | 语音检测方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder |
Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Patentee after: Sipic Technology Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Patentee before: AI SPEECH Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Voice signal processing methods and devices Effective date of registration: 20230726 Granted publication date: 20200626 Pledgee: CITIC Bank Limited by Share Ltd. Suzhou branch Pledgor: Sipic Technology Co.,Ltd. Registration number: Y2023980049433 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |