CN103035236A - High-quality voice conversion method based on modeling of signal timing characteristics - Google Patents
- Publication number
- CN103035236A
- Authority
- CN
- China
- Prior art keywords
- signal
- kalman filter
- parameters
- parameter
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Complex Calculations (AREA)
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210490464.6A CN103035236B (zh) | 2012-11-27 | 2012-11-27 | High-quality voice conversion method based on modeling of signal timing characteristics |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103035236A (zh) | 2013-04-10 |
CN103035236B (zh) | 2014-12-17 |
Family
ID=48022068
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210490464.6A (Expired - Fee Related), CN103035236B (zh) | 2012-11-27 | 2012-11-27 | High-quality voice conversion method based on modeling of signal timing characteristics |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103035236B (zh) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
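CN103413548A (zh) * | 2013-08-16 | 2013-11-27 | University of Science and Technology of China | Voice conversion method using joint spectral modeling based on restricted Boltzmann machines |
CN105425319A (zh) * | 2015-09-16 | 2016-03-23 | Hohai University | Rainfall satellite rainstorm assimilation method based on ground measurement data correction |
CN106782599A (zh) * | 2016-12-21 | 2017-05-31 | Hohai University, Changzhou Campus | Voice conversion method based on Gaussian process output post-filtering |
CN107068165A (zh) * | 2016-12-31 | 2017-08-18 | Nanjing University of Posts and Telecommunications | Voice conversion method |
CN107103914A (zh) * | 2017-03-20 | 2017-08-29 | Nanjing University of Posts and Telecommunications | High-quality voice conversion method |
CN108681709A (zh) * | 2018-05-16 | 2018-10-19 | Shenzhen University | Intelligent input method and system based on bone conduction vibration and machine learning |
CN110097193A (zh) * | 2019-04-28 | 2019-08-06 | Fourth Paradigm (Beijing) Technology Co., Ltd. | Method and system for training a model, and method and system for predicting sequence data |
CN110880315A (zh) * | 2019-10-17 | 2020-03-13 | Shenzhen Shengxi Technology Co., Ltd. | Personalized speech and video generation system based on phoneme posterior probabilities |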
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108613679B (zh) * | 2018-06-14 | 2020-06-16 | Hebei University of Technology | Extended Kalman filter simultaneous localization and mapping method for mobile robots |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
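CN101751921A (zh) * | 2009-12-16 | 2010-06-23 | Nanjing University of Posts and Telecommunications | Real-time voice conversion method under conditions of very little training data |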
- 2012-11-27: CN application CN201210490464.6A, patent CN103035236B (zh), not active (Expired - Fee Related)
Non-Patent Citations (3)
Title |
---|
Arantza del Pozo: "Voice Source and Duration Modelling for Voice Conversion and Speech Repair", Dissertation submitted to the University of Cambridge, 30 April 2008 (2008-04-30), pages 32-37 * |
Ning Xu, Zhen Yang, and Wei-Ping Zhu: "Modeling Articulatory Movements for Voice Conversion Using State Space Model", Natural Computation, 2009. ICNC '09. Fifth International Conference on, vol. 5, 16 August 2009 (2009-08-16), pages 236-240 * |
Ning Xu, Zhen Yang, Haiyan Guo: "Voice conversion with a strategy for separating speaker individuality using state-space model", Wireless Communications, Networking and Information Security (WCNIS), 2010 IEEE International Conference on, 27 June 2010 (2010-06-27), pages 298-301 * |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
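CN103413548B (zh) * | 2013-08-16 | 2016-02-03 | University of Science and Technology of China | Voice conversion method using joint spectral modeling based on restricted Boltzmann machines |
CN103413548A (zh) * | 2013-08-16 | 2013-11-27 | University of Science and Technology of China | Voice conversion method using joint spectral modeling based on restricted Boltzmann machines |
CN105425319B (zh) * | 2015-09-16 | 2017-10-13 | Hohai University | Rainfall satellite rainstorm assimilation method based on ground measurement data correction |
CN105425319A (zh) * | 2015-09-16 | 2016-03-23 | Hohai University | Rainfall satellite rainstorm assimilation method based on ground measurement data correction |
CN106782599A (zh) * | 2016-12-21 | 2017-05-31 | Hohai University, Changzhou Campus | Voice conversion method based on Gaussian process output post-filtering |
CN107068165A (zh) * | 2016-12-31 | 2017-08-18 | Nanjing University of Posts and Telecommunications | Voice conversion method |
CN107068165B (zh) * | 2016-12-31 | 2020-07-24 | Nanjing University of Posts and Telecommunications | Voice conversion method |
CN107103914A (zh) * | 2017-03-20 | 2017-08-29 | Nanjing University of Posts and Telecommunications | High-quality voice conversion method |
CN107103914B (zh) * | 2017-03-20 | 2020-06-16 | Nanjing University of Posts and Telecommunications | High-quality voice conversion method |
CN108681709A (zh) * | 2018-05-16 | 2018-10-19 | Shenzhen University | Intelligent input method and system based on bone conduction vibration and machine learning |
WO2019218725A1 (zh) * | 2018-05-16 | 2019-11-21 | Shenzhen University | Intelligent input method and system based on bone conduction vibration and machine learning |
CN108681709B (zh) * | 2018-05-16 | 2020-01-17 | Shenzhen University | Intelligent input method and system based on bone conduction vibration and machine learning |
CN110097193A (zh) * | 2019-04-28 | 2019-08-06 | Fourth Paradigm (Beijing) Technology Co., Ltd. | Method and system for training a model, and method and system for predicting sequence data |
CN110880315A (zh) * | 2019-10-17 | 2020-03-13 | Shenzhen Shengxi Technology Co., Ltd. | Personalized speech and video generation system based on phoneme posterior probabilities |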
Also Published As
Publication number | Publication date |
---|---|
CN103035236B (zh) | 2014-12-17 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
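CN103035236B (zh) | High-quality voice conversion method based on modeling of signal timing characteristics | |
CN101751921B (zh) | Real-time voice conversion method under conditions of very little training data | |
CN105023580B (zh) | Unsupervised noise estimation and speech enhancement method based on separable deep autoencoder technology | |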
Weninger et al. | Single-channel speech separation with memory-enhanced recurrent neural networks | |
CN103531205B (zh) | Asymmetric voice conversion method based on deep neural network feature mapping | |
Sun et al. | Unseen noise estimation using separable deep auto encoder for speech enhancement | |
Deng et al. | Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition | |
CN102664003B (zh) | Residual excitation signal synthesis and voice conversion method based on the harmonic-plus-noise model | |
Du et al. | Speaker augmentation for low resource speech recognition | |
CN114141238A (zh) | Speech enhancement method fusing Transformer and U-net networks | |
Wang et al. | Neural harmonic-plus-noise waveform model with trainable maximum voice frequency for text-to-speech synthesis | |
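CN110648684B (zh) | WaveNet-based bone-conducted speech enhancement waveform generation method | |
CN102568476B (zh) | Voice conversion method based on self-organizing feature map network clustering and radial basis function networks | |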
Juvela et al. | Speaker-independent raw waveform model for glottal excitation | |
Saito et al. | Text-to-speech synthesis using STFT spectra based on low-/multi-resolution generative adversarial networks | |
Fei et al. | Research on speech emotion recognition based on deep auto-encoder | |
Saito et al. | Voice conversion using input-to-output highway networks | |
Parmar et al. | Effectiveness of cross-domain architectures for whisper-to-normal speech conversion | |
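CN106782599A (zh) | Voice conversion method based on Gaussian process output post-filtering | |
CN104392717A (zh) | Fast voice conversion system and method based on Gaussian mixture modeling of the vocal tract spectrum | |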
Tobing et al. | Voice conversion with cyclic recurrent neural network and fine-tuned WaveNet vocoder | |
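CN102436815B (zh) | Speech recognition device for an online computer-based spoken English test system | |
CN103886859B (zh) | Voice conversion method based on one-to-many codebook mapping | |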
Liu et al. | A novel pitch extraction based on jointly trained deep BLSTM recurrent neural networks with bottleneck features | |
Takamichi et al. | Sampling-based speech parameter generation using moment-matching networks | |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
C41 | Transfer of patent application or patent right or utility model | |
CB03 | Change of inventor or designer information | Inventors after: Xu Ningtao, Liu Pingsheng, Xie Daokuang. Inventors before: Xu Ning, Bao Jingyi, Tang Yibin |
COR | Change of bibliographic data | |
TR01 | Transfer of patent right | Effective date of registration: 2016-05-04. Address after: 518042 Guangdong city of Shenzhen province Futian District Che Kung Temple Cheonan Digital City Tienhsiang building 7B1. Patentee after: SHENZHEN TENGRUIFENG TECHNOLOGY CO.,LTD. Address before: 213022 Changzhou Jin Ling North Road, Jiangsu, No. 200. Patentee before: Changzhou Campus of Hohai University |
CB03 | Change of inventor or designer information | Inventors after: Xu Ningtao, Liu Pingsheng, Xie Daokuang. Inventors before: Xu Ningtao, Liu Pingsheng, Xie Daokuang |
COR | Change of bibliographic data | |
PP01 | Preservation of patent right | Effective date of registration: 2019-08-14. Granted publication date: 2014-12-17 |
PD01 | Discharge of preservation of patent | Date of cancellation: 2021-08-14. Granted publication date: 2014-12-17 |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 2014-12-17. Termination date: 2019-11-27 |