IBM Voice Conversion Systems for 2007 TC-STAR Evaluation

Source: Tsinghua Science and Technology
This paper proposes a novel voice conversion method based on frequency warping. The frequency warping function is generated by mapping the formants of the source speaker to those of the target speaker. In addition to frequency warping, fundamental frequency adjustment, spectral envelope equalization, breathiness addition, and duration modification are used to improve similarity to the target speaker. The proposed method needs only a very small amount of training data to generate the warping function, which greatly facilitates its application. Systems based on the proposed method were used in the 2007 TC-STAR intra-lingual voice conversion evaluation for English and Spanish and in a cross-lingual voice conversion evaluation for Spanish. The evaluation results show that the proposed method achieves a much better quality of converted speech than other methods, as well as a good balance between quality and similarity. The IBM1 system was ranked No. 1 in the English evaluation and No. 2 in the Spanish evaluation. The results also show that the proposed method is a convenient and competitive approach to cross-lingual voice conversion tasks.
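The abstract does not specify the form of the warping function, so the following is only a minimal sketch of the general idea, assuming a piecewise-linear map anchored at matched formant pairs. The function names (`build_warp_anchors`, `warp_frequency`, `warp_envelope`) and the formant values are hypothetical illustrations, not the authors' implementation.

```python
import numpy as np


def build_warp_anchors(source_formants_hz, target_formants_hz, nyquist_hz):
    """Build anchor points for a piecewise-linear warping function.

    Assumption: each source formant is paired with the target formant at the
    same index, and the endpoints 0 Hz and the Nyquist frequency are fixed.
    """
    src = np.concatenate(([0.0], np.asarray(source_formants_hz, dtype=float), [nyquist_hz]))
    tgt = np.concatenate(([0.0], np.asarray(target_formants_hz, dtype=float), [nyquist_hz]))
    if src.shape != tgt.shape:
        raise ValueError("source and target need the same number of formants")
    if np.any(np.diff(src) <= 0) or np.any(np.diff(tgt) <= 0):
        raise ValueError("formants must be strictly increasing and below Nyquist")
    return src, tgt


def warp_frequency(freq_hz, src_anchors, tgt_anchors):
    """Map source-speaker frequencies to target-speaker frequencies."""
    return np.interp(freq_hz, src_anchors, tgt_anchors)


def warp_envelope(envelope, src_anchors, tgt_anchors, nyquist_hz):
    """Resample a spectral envelope so its formants move to the target positions.

    `envelope` is sampled uniformly from 0 Hz to the Nyquist frequency.
    """
    envelope = np.asarray(envelope, dtype=float)
    freqs = np.linspace(0.0, nyquist_hz, len(envelope))
    # For each output frequency, find the source frequency that maps onto it
    # (the inverse of the warping function), then read the envelope there.
    source_freqs = np.interp(freqs, tgt_anchors, src_anchors)
    return np.interp(source_freqs, freqs, envelope)


if __name__ == "__main__":
    nyquist = 8000.0
    # Hypothetical formant pairs for one phoneme, in Hz.
    src, tgt = build_warp_anchors([500.0, 1500.0, 2500.0], [600.0, 1700.0, 2700.0], nyquist)
    freqs = np.linspace(0.0, nyquist, 801)
    # Toy envelope with a single resonance at the first source formant.
    envelope = np.exp(-((freqs - 500.0) / 100.0) ** 2)
    warped = warp_envelope(envelope, src, tgt, nyquist)
    print("peak before:", freqs[np.argmax(envelope)], "Hz")
    print("peak after: ", freqs[np.argmax(warped)], "Hz")
```

Because only a handful of formant pairs are needed to define the anchors, a sketch like this is consistent with the abstract's claim that very little training data is required. The other steps it lists (fundamental frequency adjustment, envelope equalization, breathiness addition, duration modification) are not covered here.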