Title-Aware Neural News Topic Prediction

Source: The 18th China National Conference on Computational Linguistics and the 2019 Annual Conference of the Chinese Information Processing Society of China
Online news platforms have become hugely popular for reading news. The topic categories of news are very important for these platforms to target user interests and make personalized recommendations. However, a massive number of news articles are generated every day, and it is too expensive and time-consuming to categorize all of them manually. News bodies usually convey the detailed information of a news article, while news titles usually contain summarized and complementary information. Existing news topic prediction methods, however, usually simply aggregate titles and bodies together and ignore the differences in their characteristics. In this paper, we propose a title-aware neural news topic prediction approach to classify the topic categories of online news articles. In our approach, we propose a multi-view learning framework that incorporates news titles and bodies as different views of news to learn unified news representations. In the title view, we learn title representations from words via a long short-term memory (LSTM) network, and use an attention mechanism to select important words according to their contextual representations. In the body view, we propose a hierarchical LSTM network that first learns sentence representations from words and then learns body representations from sentences. In addition, we apply attention networks at both the word and sentence levels to recognize important words and sentences. Furthermore, we use the representation vector of the news title to initialize the hidden states of the LSTM networks for the news body, in order to capture the summarized news information condensed in the title. Extensive experiments on a real-world dataset validate that our approach achieves good performance in news topic prediction and consistently outperforms many baseline methods.
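The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several parts are assumptions made for brevity: a simplified element-wise tanh recurrence stands in for the LSTM, attention is plain dot-product scoring against a query vector, the two views are combined by concatenation, and all names (`attention_pool`, `run_rnn`, `encode_news`, `w_in`, `w_rec`) and the random inputs are hypothetical. The final topic classifier is omitted.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(states, query):
    """Weighted sum of states; weights are softmax(dot(state, query)).

    Assumption: dot-product scoring; the abstract does not specify the form.
    """
    scores = [sum(h_d * q_d for h_d, q_d in zip(h, query)) for h in states]
    weights = softmax(scores)
    dim = len(states[0])
    pooled = [sum(w * h[d] for w, h in zip(weights, states)) for d in range(dim)]
    return pooled, weights


def run_rnn(inputs, h0, w_in, w_rec):
    """Simplified element-wise tanh recurrence (a stand-in for an LSTM).

    Starts from the initial hidden state h0 and returns every hidden state.
    """
    h = h0
    states = []
    for x in inputs:
        h = [math.tanh(w_in * x_d + w_rec * h_d) for x_d, h_d in zip(x, h)]
        states.append(h)
    return states


def encode_news(title, body, query_word, query_sent, w_in=0.8, w_rec=0.5):
    """Encode a title (list of word vectors) and a body (list of sentences)."""
    dim = len(title[0])
    zero = [0.0] * dim

    # Title view: recurrence over title words, then word-level attention.
    title_vec, _ = attention_pool(run_rnn(title, zero, w_in, w_rec), query_word)

    # Body view, word level: each sentence's recurrence starts from the
    # title vector instead of zeros, then word-level attention.
    sent_vecs = []
    for sentence in body:
        states = run_rnn(sentence, title_vec, w_in, w_rec)
        sent_vec, _ = attention_pool(states, query_word)
        sent_vecs.append(sent_vec)

    # Body view, sentence level: also initialized from the title vector,
    # followed by sentence-level attention.
    sent_states = run_rnn(sent_vecs, title_vec, w_in, w_rec)
    body_vec, sent_weights = attention_pool(sent_states, query_sent)

    # Assumption: the two views are combined by concatenation.
    return title_vec + body_vec, sent_weights


random.seed(0)
DIM = 4


def rand_vec():
    return [random.uniform(-1, 1) for _ in range(DIM)]


title = [rand_vec() for _ in range(3)]                      # 3 title words
body = [[rand_vec() for _ in range(5)] for _ in range(2)]   # 2 sentences, 5 words each
news_vec, sent_weights = encode_news(title, body, rand_vec(), rand_vec())
print(len(news_vec), sent_weights)
```

In a real model the recurrence weights and attention queries would be learned, and the concatenated vector would feed a softmax layer over topic categories. The point of the sketch is only the wiring: the title vector is computed first and then reused as the initial hidden state at both levels of the body encoder.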