Question Answering with Character-Level LSTM Encoders and Model-Based Data Augmentation

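Source: The 16th China National Conference on Computational Linguistics and the 5th International Symposium on Natural Language Processing Based on Naturally Annotated Big Data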
  This paper presents a character-level encoder-decoder modeling method for question answering (QA) from large-scale knowledge bases (KB). This method improves the existing approach [9] in three aspects. First, long short-term memory (LSTM) structures are adopted to replace the convolutional neural networks (CNN) for encoding the candidate entities and predicates. Second, a new strategy of generating negative samples for model training is adopted. Third, a data augmentation strategy is applied to increase the size of the training set by generating factoid questions using another trained encoder-decoder model. Experimental results on the SimpleQuestions dataset and the Freebase5M KB demonstrate the effectiveness of the proposed method, which improves the state-of-the-art accuracy from 70.3% to 78.8% when augmenting the training set with 70,000 generated triple-question pairs.
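The augmentation step described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names `augment` and `template_question` are hypothetical, the template generator merely stands in for the trained encoder-decoder question generator, and sampling only triples that are absent from the training set is an assumption the abstract does not confirm.

```python
import random


def template_question(triple):
    """Hypothetical stand-in for the trained question generator.

    The paper uses another encoder-decoder model here; a fixed template
    is used only so that the sketch runs on its own.
    """
    subject, predicate, _obj = triple
    return f"what is the {predicate} of {subject}?"


def augment(train_pairs, kb_triples, generate_question, n_generated, seed=0):
    """Return the original (triple, question) pairs plus generated ones.

    Assumption: new questions are generated only for KB triples that do
    not already appear in the training set.
    """
    rng = random.Random(seed)
    seen = {triple for triple, _question in train_pairs}
    unseen = [t for t in kb_triples if t not in seen]
    # Sample without replacement, capped by the number of unseen triples.
    sampled = rng.sample(unseen, min(n_generated, len(unseen)))
    generated = [(t, generate_question(t)) for t in sampled]
    return list(train_pairs) + generated


if __name__ == "__main__":
    kb = [
        ("barack obama", "place of birth", "honolulu"),
        ("inception", "director", "christopher nolan"),
        ("paris", "country", "france"),
    ]
    train = [(kb[0], "where was barack obama born?")]
    for triple, question in augment(train, kb, template_question, n_generated=2):
        print(triple, "->", question)
```

In the setting reported above, `n_generated` would correspond to the 70,000 generated triple-question pairs added to the SimpleQuestions training set.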
Other literature
We consider the task of entity linking over question answering pairs (QA-pairs). In conventional approaches to entity linking, all the entities, whether in one sentence or not, are considered the same. We foc
Obtaining bilingual parallel data from multilingual websites is a long-standing research problem, and such data is very beneficial for resource-scarce languages. In this paper, we present an approach for obtain
This paper proposes a neural model for closed-set Chinese word segmentation. The model follows the character-based approach, which assigns a class label to each character, indicating its relative positi
Event detection suffers from data sparseness and label imbalance problems due to the high cost of manual annotation of events. To address this problem, we propose a novel approach that allows for
In this paper, we focus on the problem of answer triggering addressed by Yang et al. (2015), which is a critical component for a real-world question answering system. We employ a hierarchical gated recur
This paper proposes a novel end-to-end neural model to jointly extract entities and relations in a sentence. Unlike most existing approaches, the proposed model uses a hybrid neural network to automati
Mongolian text proofreading is a particularly difficult task because of its unique polyphonic alphabet, morphological ambiguity and agglutinative features, and coding errors are currently pervasive in
Given a source document with extracted mentions, entity linking calls for mapping each mention to an entity in a reference knowledge base. Previous entity linking approaches mainly focus on generic statis
Neural Machine Translation (NMT) has drawn much attention due to its promising translation performance in recent years. The conventional optimization algorithm for NMT sets a unified learning rate for e
This paper investigates relations between word semantic density and word frequency. A word average similarity based on distributed representations is defined as the measure of word semantic density. We f