Arabic Collocation Extraction Based on Hybrid Methods

来源 :第十六届全国计算语言学学术会议暨第五届基于自然标注大数据的自然语言处理国际学术研讨会 | 被引量 : 0次 | 上传用户:jsq
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Collocation Extraction plays an important role in machine transla-tion,information retrieval,secondary language learning,etc.,and has obtained significant achievements in other languages,e.g.English and Chinese.There are some studies for Arabic collocation extraction using POS annotation to ex-tract Arabic collocation.We used a hybrid method that included POS patterns and syntactic dependency relations as linguistics information and statistical methods for extracting the collocation from Arabic corpus.The experiment re-sults showed that using this hybrid method for extracting Arabic words can guarantee a higher precision rate,which heightens even more after dependency relations are added as linguistic rules for filtering,having achieved 85.11%.This method also achieved a higher precision rate rather than only resorting to syntactic dependency analysis as a collocation extraction method.
其他文献
Mongolian text proofreading is the particularly difficult task because of its unique polyphonic alphabet,morphological ambiguity and agglutinative feature,and coding errors are currently pervasive in
Given a source document with extracted mentions,entity linking callsfor map-ping the mention to an entity in reference knowledge base.Previous en-tity linking approaches mainly focus on generic statis
Neural Machine Translation(NMT)has drawn much attention due to its promising translation performance in recent years.The conventional optimiza-tion algorithm for NMT sets a unified learning rate for e
This paper investigates relations between word semantic den-sity and word frequency.A distributed representations based word av-erage similarity is defined as the measure of word semantic density.We f
This paper presents a character-level encoder-decoder mod-eling method for question answering(QA)from large-scale knowledge bases(KB).This method improves the existing approach [9] from three aspects.
Deep networks have been widely used in many domains in recentyears.However,the pre-training of deep networks is time consuming with greedy layer-wise algorithm,and the scalability of this algorithm is
Current research on emotion detection focuses on the recognizingexplicit emotion expressions in text.In this paper,we propose an approach based on textual inference to detect implicit emotion expressi
We have presented a simple algorithm to noun phrases interpretation based on hand-crafted knowledge-base containing detailed semantic information.The main idea is to define a set of relations that can
Neural attention-based models have been widely used recently in head-line generation by mapping source document to target headline.However,the traditional neural headline generation models utilize the
As a minority language,Tibetan has received relatively little atten-tion in the field of natural language processing(NLP),especially in current var-ious neural network models.In this paper,we investig
会议