基于随机行走N步的汉语复述短语获取方法

来源 :中国科学:信息科学 | 被引量 : 6次 | 上传用户:sunny_cui
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
在利用大规模双语语料获取复述知识方面,传统的基于"枢轴"方法只能考虑两步以内的复述现象.本文针对已有方法的局限性,对不同语言之间互为翻译的短语对构建翻译关系图,提出基于随机行走N步的复述获取算法,改进已有方法以获取更多潜在的复述知识.本文描述了由汉英短语翻译表构建翻译关系图的方法、基于N步的随机行走算法和基于期望步数的复述短语可信度计算方法.同时,本文提出面向多语言对的翻译关系图扩展方法.在NTCIR汉英和英日双语平行语料上进行了实验与评测,并与传统方法进行了对比.实验结果表明本文所提出的方法能够获
其他文献
  Previous researches on event relation classification primarily rely on lexical and syntactic features.In this paper,we use a Shallow Convolutional Neural Ne
会议
  The dialog manager is the most important component for a dialog system,in which the dialog state tracking is crucial to a real-world system.We claim that th
会议
  The algorithms for discovering global community structure require the knowledge about entire network structures,which are still difficult and unrealistic to
会议
  Finding similarity degree is one of the significant technologies used in the sample-based machine translation.It works in the following principle,first matc
会议
  Previous work has shown that joint modeling of two Natural Language Processing(NLP)tasks are effective for achieving better performances for both tasks.Lots
会议
  The rapid development of new media results in a lot of redundant information,increasing the difficulty of quickly obtaining useful information and browsing
会议
  This paper describes a mixing model of joint POS tagging and chunking for Kazakh where partial optimal solution provide feature information for joint model.
会议
  In this paper,we propose a neural graph-based dependency parsing model which utilizes hierarchical LSTM networks on character level and word level to learn
会议
  Traditional Mongolian Unicode Encoding has serious problems as several pairs of vowels with the same glyphs but different pronunciations are coded different
会议
  This paper describes an approach to identify suspected cybermob on social media.Many researches involve making predictions of group emotion on Internet(such
会议