The text design for continuous speech database of standard Chinese

来源 :Chinese Journal of Acoustics | 被引量 : 0次 | 上传用户:chichilela
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Well developed continuous speech recognition and synthesis systems demand a high quality continuous speech database which is compact and valid, and whose scientific design would benefit from incorporating linguistic and phonetic knowledge. It is argued that at the present stage the database should be limited to read speech. To describe those very complex variabilities in continuous speech, the following speech units are proposed: (1) 401syllables without tone; (2) 415 inter-syllabic diphones, (3) 3035 inter-syllabic triphones, (4) 781 inter-syllabic final-initial structures. The 17 basic sefltence patterns in standard Chinese are summarized to cover the most important prosodic phenomena. By using the automatic method,2393 sentences and 388 phrases are selected by above phonetic rules from a large corpus, which includes People’s Daily in recent years, TV play scripts and dictionary entries, as the reading text of continuous speech recognition database in standard Chinese. This set of sentences and pbrases covers 99.8% syllables without counting tones, 100% inter-syllable diphones, 99.6% inter-syllable triphones and 100% sentence patterns. Well developed continuous speech recognition and synthesis systems demand a high quality continuous speech database which is compact and valid, and whose scientific design would benefit from incorporating linguistic and phonetic knowledge. It is argued that at the present stage the database should be limited to read speech . To describe those very complex variabilities in continuous speech, the following speech units are proposed: (1) 401 syllables without tone; (2) 415 inter-syllabic diphones, (3) 3035 inter-syllabic triphones, The 17 basic sefltence patterns in standard Chinese are summarized to cover the most important prosodic phenomena. By using the automatic method, 2393 sentences and 388 phrases are selected by above phonetic rules from a large corpus, which includes People’s Daily in recent years, TV play scripts and dictionary entries, as the reading text of continuous speech recognition database in standard Chinese. This set of se ntences and pbrases covers 99.8% syllables without counting tones, 100% inter-syllable diphones, 99.6% inter-syllable triphones and 100% sentence patterns.
其他文献
近日,好友李君买了一套三室二厅的房子,一番装修之后,登门要笔者为他配置一套全部用国产器材组建的家庭影院系统。经配置组合后,视听效果不错,李君也很满意。现将这套用国产
一个好的房子,要经得起时间的洗礼和磨砺。现在“做加法”、用心对待每一个产品细节,才能真正打造出精品豪宅。“山明水嫩。潇洒桐庐郡。极目风烟无限景。说也如何得尽。”自
本文通过对济宁二号煤矿基本地质条件进行分析,探讨了影响瓦斯赋存的主控因素,并围绕主控因素总结出该煤矿瓦斯涌出量的地质规律,这对于准确预测瓦斯涌出量、圈定矿井瓦斯涌
一座城堡,一壁江山,万古基业,由此独尊。绿尚·春江城堡,位于中国第三大风景资源带——风光旖(?)的富春江畔,桐庐珍稀福地,背倚连绵青山,面向一江春水,于绝版名胜之地精工豪
摘要:初中阶段学生能否打下良好的英语基础,直接影响到学生以后的英语学习,因此教师应该对初中英语课堂教学中的问题引起重视,并积极采取有效的应对措施,使学生的英语成绩能够稳步提升。  关键词:初中英语;课堂教学;问题 措施  课堂教学在不断发生变化,因此影响课堂教学有效性的因素也在不断地发生变化,传统英语课堂教学中存在的问题得到了解决,新的影响课堂教学有效性的问题又出现,因此,作为一名初中英语教师,应
PCB是重要的电子部件,其中铝基箔材是重要的基板材料,其质量直接决定电子产品的性能和使用寿命。本文使用铝混合废料生产的新型专用合金PCB铝基箔材,采用连铸连轧工艺,在提高
“年年岁岁花相似,岁岁年年人不同.”唐代诗人刘希夷的这两句诗,用来描述我的此时状况最恰当不过了父亲从我手里接过我带来的礼物,习惯性地往旁边看了看,我知道他在找什么.去
期刊
日本东京大学医学博士的研究报告指出:性激素是延缓衰老的物质基础。老年人适当的性生活,可促进人体新陈代谢,延缓早衰,防止脑老化。根据长寿人口的调查,大部分长寿者都有经
色彩是绘画艺术中的最具有生命力的表现因素之一,正确运用色彩直接影响到艺术形式的视觉审美冲击力。受到中国古代社会发展变化与中国传统哲学思想的影响,色彩成为中国画发展
回望走过的路,久久激荡在心怀的,总是那些穿越风雨、激流勇进的岁月。  2020注定载入史册,不仅仅是因为病毒肆虐,而是疫情下勇毅前行、共克时艰的我们。  随着山东各地和全国各大城市2020年经济数据陆续发布,烟台在全省全国的位置逐渐明晰。  看省内,烟台占山东GDP的比重达到10.7%,较“十二五”末提高了0.5个百分点,城市首位度再提升,和青岛、济南携手领跑山东;比全国,烟台继续保持全国大中城市
期刊