HybridTune: Spatio-Temporal Performance Data Correlation for Performance Diagnosis of Big Data Syste

Source: Journal of Computer Science and Technology (English edition)
With the tremendous growth of interest in Big Data, improving the performance of Big Data systems is becoming increasingly important. The first step is to analyze and diagnose the performance bottlenecks of these systems. Currently, there are two major solutions. One is the purely data-driven diagnosis approach, which may be very time-consuming; the other is the rule-based analysis method, which usually requires prior knowledge. For Big Data applications such as Spark workloads, we observe that the tasks in the same stage normally execute the same or similar code on each data partition. On the basis of this stage similarity and the distributed characteristics of Big Data systems, we analyze the behavior of Big Data applications in terms of both the system and micro-architectural metrics of each stage. Furthermore, for different performance problems, we propose a hybrid approach that combines prior rules and machine learning algorithms to detect performance anomalies, such as straggler tasks, task assignment imbalance, data skew, abnormal nodes, and outlier metrics. Following this methodology, we design and implement a lightweight, extensible tool named HybridTune, and measure its overhead and anomaly detection effectiveness using the BigDataBench benchmarks. Our experiments show that the overhead of HybridTune is only 5%, and the accuracy of the outlier detection algorithm reaches up to 93%. Finally, we report several use cases diagnosing Spark and Hadoop workloads using BigDataBench, which demonstrate the potential use of HybridTune.
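The stage-similarity observation suggests a simple rule-based check: because tasks in one stage run the same or similar code, a task whose duration is far above the stage median is a straggler candidate. The sketch below illustrates that idea only; it is not HybridTune's actual algorithm. The function name `find_stragglers`, the 1.5x-median threshold, and the sample durations are all assumptions made for illustration.

```python
from statistics import median


def find_stragglers(task_durations, threshold=1.5):
    """Return the ids of tasks whose duration exceeds threshold * stage median.

    task_durations maps a task id to its duration in seconds for ONE stage.
    The default threshold of 1.5 is an illustrative assumption, not a value
    taken from the paper.
    """
    if not task_durations:
        return []
    stage_median = median(task_durations.values())
    return sorted(
        task_id
        for task_id, duration in task_durations.items()
        if duration > threshold * stage_median
    )


# Hypothetical durations (seconds) for four tasks in the same stage.
stage = {"task-0": 10.2, "task-1": 9.8, "task-2": 10.5, "task-3": 31.0}
print(find_stragglers(stage))  # prints ['task-3']
```

Comparing against the median rather than the mean keeps a single very slow task from shifting the baseline it is measured against. A fixed rule of this kind covers only one anomaly type; the abstract describes combining such prior rules with machine learning algorithms for the others.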