MapReduce optimization algorithm based on machine learning in heterogeneous cloud environment

来源 :The Journal of China Universities of Posts and Telecommunica | 被引量 : 0次 | 上传用户:xiaoshuishe
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
We present an approach to optimize the MapReduce architecture,which could make heterogeneous cloud environment more stable and efficient.Fundamentally different from previous methods,our approach introduces the machine learning technique into MapReduce framework,and dynamically improve MapReduce algorithm according to the statistics result of machine learning.There are three main aspects:learning machine performance,reduce task assignment algorithm based on learning result,and speculative execution optimization mechanism.Furthermore,there are two important features in our approach.First,the MapReduce framework can obtain nodes’performance values in the cluster through machine learning module.And machine learning module will daily calibrate nodes’performance values to make an accurate assessment of cluster performance.Second,with the optimization of tasks assignment algorithm,we can maximize the performance of heterogeneous clusters.According to our evaluation result,the cluster performance could have 19%improvement in current heterogeneous cloud environment,and the stability of cluster has greatly enhanced. We present an approach to optimize the MapReduce architecture, which could make heterogeneous cloud environments more stable and efficient. Fundamentally different from previous methods, our approach introduces the machine learning technique into MapReduce framework, and dynamically improve MapReduce algorithm according to the statistics result of machine learning.There are three main aspects: learning machine performance, reduce task assignment algorithm based on learning result, and speculative execution optimization mechanism. Future, there are two important features in our approach. First, the MapReduce framework can obtain nodes’ performance values ​​in the cluster through machine learning module. And machine learning module will daily calibrate nodes’ performance values ​​to make an accurate assessment of cluster performance. Second, with the optimization of tasks assignment algorithm, we can maximize the performance of heterogeneous clusters. According to our evaluation result, the cluster per formance could have 19% improvement in current heterogeneous cloud environment, and the stability of cluster has greatly enhanced.
其他文献
自上世纪80年代起,我国各地晚报、晨报等都市类报纸纷纷走自办发行道路,一张报纸成立一个发行部门,组建一支发行队伍。2000年以后,各地报社成立报业集团。为了整合资源,实行
倏忽之间,《艺术评论》已经发行一百期了!从2003年夏天创刊到现在,《艺术评论》在近十年的路途中沐雨栉风、默默耕耘,始终坚持着本刊的初衷和理想。在一个越来越物质化的社会
历时3年.拙作终于由河南人民出版社出版.这是一部关于滑县农民工在全国各地工作生活的全景观式的影像记录,也是我尝试用影像从一个方面反映改革开放给社会带来的深刻变化,见
本文阐述了邓小平理论的哲学思想是新时期马列主义、毛泽东思想的具体体现,是我们党坚持实事求是思想路线的理论基础,并着重邓小平哲学思想论述了邓小平理论的哲学思想在新时
棉花(Gossypium hirsutum)是世界性的重要经济作物,在我国国民经济中占有重要地位。棉纤维作为重要的天然纤维,是纺织工业的主要原材料,广泛应用于纺织、造纸、生物燃料和化学工业,棉纤维品质决定了其作为商品的价值。然而,棉花经常受到诸如虫害等生物胁迫以及高盐、干旱、低温、营养缺乏等非生物胁迫,造成棉花产量急剧下降。因此,通过基因工程手段将一些具有抗逆功能的基因导入到植物基因组中获得抗逆新
草地是地球上分布最大的陆地生态系统之一,草地生物量和物种多样性是衡量草地生态系统功能和生态服务的重要指标。由于长期过度利用引起我国草地大面积退化,迫切需要科学的草
中国制造业主动将国外的垃圾和废料作为可回收原材料的廉价来源,发达国家也乐得将难以处理的垃圾出口到中国。中国稳坐全球最大废弃物进口国的头把交椅。当然,中国大大小小的
警卫战士警卫首长天天要与武器为伍,整天摆弄,常在河边走哪能免得了湿鞋呢?不说因不小心而使火器走火是经常的吧,也是时不时就容易来一下子的事。 例如,毛泽东那里发生过一
柘城县人民医院现有职工干部630人,其中党员134人。院党总支下设机关、后勤、医技、药械、内科、外科六个党支部,14个党小组。 医疗卫生是社会文明的窗口,医院的精神文明建
粉丝很丰满,效果很骨感。在以互联网和移动终端为核心的新媒体营销蓝海中,确实有不少企业做得风生水起、获益不浅,但是,仍有大批企业摸不着北、找不到头绪。人人都知道新媒体