Mandarin Chinese Tone Recognition with an Artificial Neural Network

来源 :Journal of Otology | 被引量 : 0次 | 上传用户:BluePrince
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Mandarin Chinese tone patterns vary in one of the four ways, i.e, (1) high level; (2) rising; (3) low falling and rising; and (4) high falling. The present study is to examine the efficacy of an artificial neural network in recognizing these tone patterns. Speech data were recorded from 12 children (3-6 years of age) and 15 adults. All subjects were native Mandarin Chinese speakers. The fundamental frequencies (F0) of each monosyllabic word of the speech data were extracted with an autocorrelation method. The pitch data(i.e., the F0 contours) were the inputs to a feed-forward backpropagation artificial neural network. The number of inputs to the neural network varied from 1 to 16 and the hidden layer of the network contained neurons that varied from 1 to 16 in number. The output of the network consisted of four neurons representing the four tone patterns of Mandarin Chinese. After being trained with the Levenberg-Marquardt optimization, the neural network was able to successfully classify the tone patterns with an accuracy of about 90% correct for speech samples from both adults and children. The artificial neural network may provide an objective and effective way of assessing tone production in prelingually-deafened children who have received cochlear implants. Mandarin Chinese tone patterns vary in one of the four ways, ie, (1) high level; (2) rising; (3) low falling and rising; and (4) high falling. The present study is to examine the efficacy of an All of them were native Mandarin Chinese speakers. The fundamental frequencies (F0) of each monosyllabic word of the speech data were extracted with an autocorrelation method. The pitch of data (ie, the F0 contours) were the inputs to a feed-forward backpropagation artificial neural network. The number of inputs to the neural network varied from 1 to 16 and the hidden layer of the network contained neurons that varied from 1 to 16 in number. The output of the network consisted of four neurons representing the four tone patterns of Mandarin Chinese. After being trained with the Levenberg-Marquardt optimization, the neural network was able to successfully class ify the tone patterns with an accuracy of about 90% correct for speech samples from both adults and children. The artificial neural network may provide objective and effective way of assessing tone production in prelingually-deafened children who have received cochlear implants.
其他文献
一引言奥斯田体等温变态曲线,或简单的常按照其形状称为S曲线或C曲线。但是由于钢成份的不同,等温变态曲线形状变化很大。根据苏联科学家们研究的结果,得出了许多不同成份钢
绿茵世界,风云变幻。没有永远的主宰,也没有永远的 败寇。所有的风光与落寞都像四季轮回中的一道道风景,不 会成为足球王国永远的景致。新人在一夜间横空出世,转瞬 间又可能
我厂在52年中苏友好月里,学习了几种苏联的先进经验,发气压力冒口是其中比较成功的一种。事实证明,苏联的先进经验是科学的,实用的,同时也说明了我们为什么要在技术上一边倒
离子镀涂层刀具是国际上最新高速钢刀具,被称为高速钢刀具的一次革命.中国科学院金属研究所研制的氮化钛涂层刀具于1984年9月通过了技术鉴定. 鉴定认为,空心阴极离子镀刀具
和球队积分榜注重团体效率不同,射手榜更能反映球员的自身价值,更多是 球星个人技艺的展现。不过,看惯了舍甫琴科、范尼、马凯、罗纳尔多等巨星 多年对射手榜的统治,多少会有
大模数齿轮用历来的剃齿刀加工时,因加工精度低,刀具制造困难,故在精加工时很少用剃齿方法;一般用精滚;精滚能提高齿距精度,但齿形精度和齿面光洁度较差。作者研制成功一种
前皇马主教练博斯克一直想将皇马的8号球衣交给马克 莱莱,但马克莱莱三次拒绝,理由是自嘲自己的脚法只适合 20号开外的大号码,因为他最擅长的工作就是防守和大脚解 围。善良
现金流量表是以现金为基础编制的反映企业财务状况变动的报表,它综合反映企业一定会计期间内现金的流入和流出,表明企业获得现金的能力。 The cash flow statement is a sta
跑步?太累了!跳健美操?落伍了吧!练器械?我可不想变成施瓦辛格!但那些脂肪怎么办,难道脱下冬衣的你还能掩饰住腰间的“游泳圈”?想体验最流行的减肥方式,轻松拥有轻盈身材就
日前,福州市出台吸引台湾青年人才来榕创业创新相关政策。今后,符合条件的台湾青年可在福州申请担任聘任制公务员或应聘事业单位特聘岗位的相关工作,还可购买福州市人才公寓,