Name disambiguation in AMiner

来源 :中国科学:信息科学(英文版) | 被引量 : 0次 | 上传用户:saialmaster
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Name disambiguation, aiming at disambiguating who is who, is one of the fundamental problems of the online academic network platforms such as Google scholar, microsoft academic and AMiner.This study takes AMiner1), a free online academic search and mining system [1], as the example to explain how we deal with the name ambiguity problem under three different scenarios.AMiner has already extracted 1.3 × 108 researchers' profiles from the Web and integrated with 2 × 108 papers from heterogeneous publication databases, with a growth rate of over 500000 per month.From the beginning when the system is built to the running and updating phases, we need to pay continuous attention on the problem of name disambiguation.In the following parts, we discuss the problem on three scenarios during the whole life cycle of AMiner, i.e., name disambiguation when the system is built from scratch (full ND), name disambiguation when persons' profiles are continuously updated (continuous ND) and error detection upon existing persons' profiles (error detection).Figure 1(a) illustrates an example of the disambiguating results for the researchers named “Jing Zhang” in AMiner and Figure 1(b)-(d) explains the problem of name disambiguation under three scenarios.
其他文献
老子《道德经》中有日: “九层之台,起于累土”,苏辙《新论》中有云: “欲筑室者,先治其基”,都生动地诠释了筑基的显著作用.rn统计基层基础建设(以下简称“双基”建设)是统
期刊
迁移体(migrasome)是俞立教授于2015年报道的新细胞器.迁移体是细胞迁移过程中尾部产生的收缩丝的尖端或交叉点产生出的膜性细胞器.细胞产生迁移体的过程称为迁移性胞吐(migr
在辽宁省参加第七次人口普查事后质量抽查的这段日子里,每一天我都会被一些瞬间感动,这些瞬间串联起来,成为我生命里难以忘怀的一段经历.rn记得2020年12月15号那天晚上,我正
期刊
力争2030年前实现碳达峰,2060年前实现碳中和,是党中央作出的重大战略决策,对中华民族永续发展,应对全球气候变化意义重大.做好应对气候变化基础统计,服务“碳达峰、碳中和”
期刊
Dear editor,rnNeural networks (NNs) and fuzzy systems are commonly used computational intelligence techniques, each with their own merits in terms of applicatio
期刊
根据第七次全国人口普查结果,现将2020年11月1日零时我国大陆31个省、自治区、直辖市(以下简称省份)和现役军人的人口性别构成情况公布如下:rn一、全国人口性别构成rn全国人
期刊
宫颈癌作为女性第2大恶性肿瘤,仍然是全球范围内的公共卫生问题.外泌体是活细胞主动分泌的一种具有脂质双分子层结构的纳米级囊泡,能够携带蛋白质、脂质、DNA和RNA(包括mRNA
整合因子复合物(integrator complex,INT)的发现极大地拓展了对小核RNA转录成熟和基因转录调控的认知,也重新掀起了相关领域的研究热潮.INT是1个至少由14个亚基组成、分子量
N6-甲基腺嘌呤(N6-methyladenosine,m6A)是发生在腺嘌呤N6位的甲基化修饰,它是真核生物信使RNA(messenger RNA,mRNA)中最丰富的转录后修饰.m6A修饰是由甲基化酶、去甲基化酶
Dear editor,rnOver the past decade, germanium has attracted great interest as a promising channel material for p-channel metal oxide semiconductor field-effect-
期刊