An Empirical Study of Good-Turing Smoothing for Language Models on Different Size Corpora of Chinese

Source: 电脑和通信 (English edition) | Citations: 0 | Uploaded by: vikdl

【Abstract】:

Data sparseness is an inherent problem of statistical language models, and smoothing methods are usually used to resolve the zero-count problem. In this pape

【Authors】:

Feng-Long Huang, Ming-Shing Yu

【Affiliations】:

Department of Computer Science and Information Engineering, Department of Computer Science

【Published in】:

电脑和通信 (English edition)

【Publication date】:

2013, Issue 5

【Keywords】:

Good-Turing methods; smoothing; language models; perplexity


Excerpt from the paper

Data sparseness is an inherent problem of statistical language models, and smoothing methods are usually used to resolve the zero-count problem. In this paper, we empirically studied and analyzed the well-known smoothing methods of Good-Turing and adva
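The excerpt breaks off before any method details, so the following is only a minimal sketch of the textbook Good-Turing estimate, not the specific variants the paper evaluates. It assumes the basic formulation in which an event seen r times receives the adjusted count r* = (r + 1) · N_{r+1} / N_r, where N_r is the number of distinct events seen exactly r times, and the probability mass reserved for unseen events is N_1 / N. The function name `good_turing`, the toy data, and the fallback to the raw count when N_{r+1} = 0 are illustrative assumptions.

```python
from collections import Counter


def good_turing(counts):
    """Basic Good-Turing estimate (illustrative sketch, not the paper's code).

    counts: mapping from event (e.g. an n-gram) to its raw count r.
    Returns (adjusted_counts, p_unseen).
    """
    total = sum(counts.values())              # N: total observed events
    freq_of_freq = Counter(counts.values())   # N_r: number of events seen r times

    adjusted = {}
    for event, r in counts.items():
        n_r = freq_of_freq[r]
        n_r_plus_1 = freq_of_freq.get(r + 1, 0)
        if n_r_plus_1 > 0:
            # r* = (r + 1) * N_{r+1} / N_r
            adjusted[event] = (r + 1) * n_r_plus_1 / n_r
        else:
            # Assumption: keep the raw count when N_{r+1} is zero.
            adjusted[event] = float(r)

    # Probability mass reserved for events never seen in training: N_1 / N
    p_unseen = freq_of_freq.get(1, 0) / total
    return adjusted, p_unseen


# Hypothetical toy data: counts a=3, b=2, c=2, d=1, e=1, f=1
toy_counts = Counter("a a a b b c c d e f".split())
adjusted, p_unseen = good_turing(toy_counts)
```

Because of the raw-count fallback, the adjusted counts in this sketch are not renormalized into a proper distribution. A practical implementation would usually smooth the N_r values or apply Good-Turing only to low counts.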
