Ranking Weibo posts is an important problem in public opinion control. This paper describes the basic characteristics of three ranking algorithms, PageRank, HITS and SALSA, and analyzes the features of Sina Weibo and their influence on the spread of public opinion. Using a distributed computing platform built on Hadoop, each of the three algorithms is applied to rank Sina Weibo posts from trending public opinion events. Based on the ranking results, the paper examines in depth the strengths and weaknesses of the three algorithms when ranking Weibo content, as well as each algorithm's preference for different types of posts, and offers recommendations for applying ranking algorithms to Weibo.
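As a rough illustration of two of the algorithms named in the abstract, the following is a minimal single-machine sketch of PageRank and HITS in Python. It is not the paper's Hadoop implementation, and SALSA is not covered. The edge list, the function names `pagerank` and `hits`, the damping factor of 0.85, and the reading of an edge `(a, b)` as "post a reposts or links to post b" are all assumptions made for this example.

```python
from collections import defaultdict


def pagerank(edges, damping=0.85, iterations=50):
    """Power-iteration PageRank on a directed edge list (illustrative only)."""
    nodes = sorted({n for edge in edges for n in edge})
    out_links = defaultdict(list)
    for src, dst in edges:
        out_links[src].append(dst)

    n_nodes = len(nodes)
    rank = {n: 1.0 / n_nodes for n in nodes}
    for _ in range(iterations):
        # Rank held by nodes without out-links is spread evenly over all nodes.
        dangling = sum(rank[n] for n in nodes if not out_links[n])
        base = (1.0 - damping) / n_nodes + damping * dangling / n_nodes
        new_rank = {n: base for n in nodes}
        for src in nodes:
            for dst in out_links[src]:
                new_rank[dst] += damping * rank[src] / len(out_links[src])
        rank = new_rank
    return rank


def hits(edges, iterations=50):
    """Iterative HITS; returns (hub, authority) score dictionaries."""
    nodes = sorted({n for edge in edges for n in edge})
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority score: sum of hub scores of the nodes pointing at it.
        auth = {n: 0.0 for n in nodes}
        for src, dst in edges:
            auth[dst] += hub[src]
        norm = sum(auth.values()) or 1.0
        auth = {n: v / norm for n, v in auth.items()}

        # Hub score: sum of authority scores of the nodes it points at.
        hub = {n: 0.0 for n in nodes}
        for src, dst in edges:
            hub[src] += auth[dst]
        norm = sum(hub.values()) or 1.0
        hub = {n: v / norm for n, v in hub.items()}
    return hub, auth


# Hypothetical toy graph: posts "a" and "b" repost "c", and "c" reposts "d".
toy_edges = [("a", "c"), ("b", "c"), ("c", "d")]
pr_scores = pagerank(toy_edges)
hub_scores, auth_scores = hits(toy_edges)
```

Sorting `pr_scores` or `auth_scores` by value gives one possible ordering of the posts. On a graph the size of a real trending event, the same per-edge updates would have to be distributed, which is presumably the role Hadoop plays in the paper.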