CN101710488A - 语音合成方法及装置 - Google Patents
语音合成方法及装置 Download PDFInfo
- Publication number
- CN101710488A CN101710488A CN200910222899A CN200910222899A CN101710488A CN 101710488 A CN101710488 A CN 101710488A CN 200910222899 A CN200910222899 A CN 200910222899A CN 200910222899 A CN200910222899 A CN 200910222899A CN 101710488 A CN101710488 A CN 101710488A
- Authority
- CN
- China
- Prior art keywords
- synthesized
- waveform
- key frame
- frame
- statement
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 37
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 20
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 20
- 239000012634 fragment Substances 0.000 claims abstract description 16
- 230000002194 synthesizing effect Effects 0.000 claims abstract description 13
- 238000012545 processing Methods 0.000 claims abstract description 10
- 238000001228 spectrum Methods 0.000 claims description 36
- 238000012549 training Methods 0.000 claims description 14
- 238000011084 recovery Methods 0.000 claims description 11
- 238000013179 statistical model Methods 0.000 claims description 10
- 238000004458 analytical method Methods 0.000 claims description 7
- 230000003595 spectral effect Effects 0.000 claims description 4
- 230000000694 effects Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000009499 grossing Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000003066 decision tree Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 230000001737 promoting effect Effects 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 238000010189 synthetic method Methods 0.000 description 3
- 238000007476 Maximum Likelihood Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000005311 autocorrelation function Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102228990A CN101710488B (zh) | 2009-11-20 | 2009-11-20 | 语音合成方法及装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102228990A CN101710488B (zh) | 2009-11-20 | 2009-11-20 | 语音合成方法及装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101710488A true CN101710488A (zh) | 2010-05-19 |
CN101710488B CN101710488B (zh) | 2011-08-03 |
Family
ID=42403270
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2009102228990A Active CN101710488B (zh) | 2009-11-20 | 2009-11-20 | 语音合成方法及装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101710488B (zh) |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103226946A (zh) * | 2013-03-26 | 2013-07-31 | 中国科学技术大学 | 一种基于受限玻尔兹曼机的语音合成方法 |
CN105654940A (zh) * | 2016-01-26 | 2016-06-08 | 百度在线网络技术(北京)有限公司 | 一种语音合成方法和装置 |
CN107133580A (zh) * | 2017-04-24 | 2017-09-05 | 杭州空灵智能科技有限公司 | 一种3d打印监控视频的合成方法 |
CN107564511A (zh) * | 2017-09-25 | 2018-01-09 | 平安科技(深圳)有限公司 | 电子装置、语音合成方法和计算机可读存储介质 |
CN107924677A (zh) * | 2015-06-11 | 2018-04-17 | 交互智能集团有限公司 | 用于异常值识别以移除语音合成中的不良对准的系统和方法 |
CN108053821A (zh) * | 2017-12-12 | 2018-05-18 | 腾讯科技(深圳)有限公司 | 生成音频数据的方法和装置 |
CN108182936A (zh) * | 2018-03-14 | 2018-06-19 | 百度在线网络技术(北京)有限公司 | 语音信号生成方法和装置 |
CN108648733A (zh) * | 2018-03-15 | 2018-10-12 | 北京雷石天地电子技术有限公司 | 一种迪曲生成方法及系统 |
CN109416911A (zh) * | 2016-06-30 | 2019-03-01 | 雅马哈株式会社 | 声音合成装置及声音合成方法 |
CN109599090A (zh) * | 2018-10-29 | 2019-04-09 | 阿里巴巴集团控股有限公司 | 一种语音合成的方法、装置及设备 |
CN109686358A (zh) * | 2018-12-24 | 2019-04-26 | 广州九四智能科技有限公司 | 高保真的智能客服语音合成方法 |
CN112562637A (zh) * | 2019-09-25 | 2021-03-26 | 北京中关村科金技术有限公司 | 拼接语音音频的方法、装置以及存储介质 |
CN112863530A (zh) * | 2021-01-07 | 2021-05-28 | 广州欢城文化传媒有限公司 | 一种声音作品的生成方法和装置 |
CN113066476A (zh) * | 2019-12-13 | 2021-07-02 | 科大讯飞股份有限公司 | 合成语音处理方法及相关装置 |
CN115440205A (zh) * | 2021-06-04 | 2022-12-06 | 中国移动通信集团浙江有限公司 | 语音处理方法、装置、终端以及程序产品 |
US12046227B2 (en) | 2022-04-19 | 2024-07-23 | Google Llc | Key frame networks |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4989250A (en) * | 1988-02-19 | 1991-01-29 | Sanyo Electric Co., Ltd. | Speech synthesizing apparatus and method |
CN1119793C (zh) * | 1998-08-17 | 2003-08-27 | 英业达股份有限公司 | 声频信号特征波形的合成方法 |
EP1872361A4 (en) * | 2005-03-28 | 2009-07-22 | Lessac Technologies Inc | HYBRID SPEECH SYNTHESIZER, METHOD AND USE |
CN1835075B (zh) * | 2006-04-07 | 2011-06-29 | 安徽中科大讯飞信息科技有限公司 | 一种结合自然样本挑选与声学参数建模的语音合成方法 |
CN101178896B (zh) * | 2007-12-06 | 2012-03-28 | 安徽科大讯飞信息科技股份有限公司 | 基于声学统计模型的单元挑选语音合成方法 |
-
2009
- 2009-11-20 CN CN2009102228990A patent/CN101710488B/zh active Active
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103226946A (zh) * | 2013-03-26 | 2013-07-31 | 中国科学技术大学 | 一种基于受限玻尔兹曼机的语音合成方法 |
CN107924677A (zh) * | 2015-06-11 | 2018-04-17 | 交互智能集团有限公司 | 用于异常值识别以移除语音合成中的不良对准的系统和方法 |
CN107924677B (zh) * | 2015-06-11 | 2022-01-25 | 交互智能集团有限公司 | 用于异常值识别以移除语音合成中的不良对准的系统和方法 |
CN105654940A (zh) * | 2016-01-26 | 2016-06-08 | 百度在线网络技术(北京)有限公司 | 一种语音合成方法和装置 |
CN109416911B (zh) * | 2016-06-30 | 2023-07-21 | 雅马哈株式会社 | 声音合成装置及声音合成方法 |
CN109416911A (zh) * | 2016-06-30 | 2019-03-01 | 雅马哈株式会社 | 声音合成装置及声音合成方法 |
CN107133580B (zh) * | 2017-04-24 | 2020-04-10 | 杭州空灵智能科技有限公司 | 一种3d打印监控视频的合成方法 |
CN107133580A (zh) * | 2017-04-24 | 2017-09-05 | 杭州空灵智能科技有限公司 | 一种3d打印监控视频的合成方法 |
CN107564511A (zh) * | 2017-09-25 | 2018-01-09 | 平安科技(深圳)有限公司 | 电子装置、语音合成方法和计算机可读存储介质 |
WO2019056500A1 (zh) * | 2017-09-25 | 2019-03-28 | 平安科技(深圳)有限公司 | 电子装置、语音合成方法和计算机可读存储介质 |
CN108053821A (zh) * | 2017-12-12 | 2018-05-18 | 腾讯科技(深圳)有限公司 | 生成音频数据的方法和装置 |
CN108182936A (zh) * | 2018-03-14 | 2018-06-19 | 百度在线网络技术(北京)有限公司 | 语音信号生成方法和装置 |
CN108648733B (zh) * | 2018-03-15 | 2020-07-03 | 北京雷石天地电子技术有限公司 | 一种迪曲生成方法及系统 |
CN108648733A (zh) * | 2018-03-15 | 2018-10-12 | 北京雷石天地电子技术有限公司 | 一种迪曲生成方法及系统 |
CN109599090A (zh) * | 2018-10-29 | 2019-04-09 | 阿里巴巴集团控股有限公司 | 一种语音合成的方法、装置及设备 |
CN109599090B (zh) * | 2018-10-29 | 2020-10-30 | 创新先进技术有限公司 | 一种语音合成的方法、装置及设备 |
CN109686358B (zh) * | 2018-12-24 | 2021-11-09 | 广州九四智能科技有限公司 | 高保真的智能客服语音合成方法 |
CN109686358A (zh) * | 2018-12-24 | 2019-04-26 | 广州九四智能科技有限公司 | 高保真的智能客服语音合成方法 |
CN112562637A (zh) * | 2019-09-25 | 2021-03-26 | 北京中关村科金技术有限公司 | 拼接语音音频的方法、装置以及存储介质 |
CN112562637B (zh) * | 2019-09-25 | 2024-02-06 | 北京中关村科金技术有限公司 | 拼接语音音频的方法、装置以及存储介质 |
CN113066476A (zh) * | 2019-12-13 | 2021-07-02 | 科大讯飞股份有限公司 | 合成语音处理方法及相关装置 |
CN113066476B (zh) * | 2019-12-13 | 2024-05-31 | 科大讯飞股份有限公司 | 合成语音处理方法及相关装置 |
CN112863530A (zh) * | 2021-01-07 | 2021-05-28 | 广州欢城文化传媒有限公司 | 一种声音作品的生成方法和装置 |
CN115440205A (zh) * | 2021-06-04 | 2022-12-06 | 中国移动通信集团浙江有限公司 | 语音处理方法、装置、终端以及程序产品 |
US12046227B2 (en) | 2022-04-19 | 2024-07-23 | Google Llc | Key frame networks |
Also Published As
Publication number | Publication date |
---|---|
CN101710488B (zh) | 2011-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101710488B (zh) | 语音合成方法及装置 | |
CN101178896B (zh) | 基于声学统计模型的单元挑选语音合成方法 | |
US20170162186A1 (en) | Speech synthesizer, and speech synthesis method and computer program product | |
US20120143611A1 (en) | Trajectory Tiling Approach for Text-to-Speech | |
US8494856B2 (en) | Speech synthesizer, speech synthesizing method and program product | |
US20130066631A1 (en) | Parametric speech synthesis method and system | |
CN106649644B (zh) | 一种歌词文件生成方法及装置 | |
Ling et al. | The USTC and iFlytek speech synthesis systems for Blizzard Challenge 2007 | |
CN105654940B (zh) | 一种语音合成方法和装置 | |
Ryant et al. | Highly accurate mandarin tone classification in the absence of pitch information | |
US20160027430A1 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
Yin et al. | Modeling F0 trajectories in hierarchically structured deep neural networks | |
CN102982799A (zh) | 一种融合引导概率的语音识别优化解码方法 | |
CN105654942A (zh) | 一种基于统计参数的疑问句、感叹句的语音合成方法 | |
CN111599339B (zh) | 具有高自然度的语音拼接合成方法、系统、设备及介质 | |
AU2015411306A1 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
CN103226946B (zh) | 一种基于受限玻尔兹曼机的语音合成方法 | |
JP6142401B2 (ja) | 音声合成モデル学習装置、方法、及びプログラム | |
Yu et al. | Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis | |
Zhou et al. | Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis. | |
KR102051235B1 (ko) | 스피치 합성에서 푸어 얼라인먼트를 제거하기 위한 아웃라이어 식별 시스템 및 방법 | |
Chandra et al. | Towards the development of accent conversion model for (l1) bengali speaker using cycle consistent adversarial network (cyclegan) | |
Jiao et al. | Improving voice quality of HMM-based speech synthesis using voice conversion method | |
Yu | Review of F0 modelling and generation in HMM based speech synthesis | |
Pour et al. | Persian Automatic Speech Recognition by the use of Whisper Model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee |
Owner name: IFLYTEK CO., LTD. Free format text: FORMER NAME: ANHUI USTC IFLYTEK CO., LTD. |
|
CP03 | Change of name, title or address |
Address after: Wangjiang Road high tech Development Zone Hefei city Anhui province 230088 No. 666 Patentee after: IFLYTEK Co.,Ltd. Address before: 230088 No. 616, Mount Huangshan Road, hi tech Development Zone, Anhui, Hefei Patentee before: ANHUI USTC IFLYTEK Co.,Ltd. |
|
TR01 | Transfer of patent right |
Effective date of registration: 20190213 Address after: 510335 Guangzhou Haizhu District Yuanjiang West Road 218, 220 Guangzhou International Media Port Office Building West Port 10 Floor Northeast 22-26 Property Patentee after: Ke Da Southern China Co.,Ltd. Address before: 230088 666 Wangjiang West Road, Hefei hi tech Development Zone, Anhui Patentee before: IFLYTEK Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231212 Address after: 130012 Room 1632, Floor 16, Building B, Liwang Plaza, No. 996, Qianjin Street, Chaoyang District, Changchun, Jilin Patentee after: Jilin Kexun Information Technology Co.,Ltd. Address before: 510335 Guangzhou Haizhu District Yuanjiang West Road 218, 220 Guangzhou International Media Port Office Building West Port 10 Floor Northeast 22-26 Property Patentee before: Ke Da Southern China Co.,Ltd. |
|
TR01 | Transfer of patent right |