Reinventing Memory System Design for Many-Accelerator Architecture

Source: Journal of Computer Science & Technology
The many-accelerator architecture, composed mostly of general-purpose cores and accelerator-like function units (FUs), has become an attractive alternative to homogeneous chip multiprocessors (CMPs) because of its superior power efficiency. However, the emerging many-accelerator processor exhibits a much more complicated memory access pattern than general-purpose processors (GPPs), because the abundant on-chip FUs tend to generate highly concurrent memory streams with distinct locality and bandwidth demands. The disordered memory streams issued by diverse accelerators interfere with one another and cannot be handled efficiently by the orthodox main memory interface, which provides an inflexible data fetching mode. Unlike traditional DRAM memory, our proposed Aggregation Memory System (AMS) adapts to the characterized memory streams from different FUs: it provides the FUs with different data fetching sizes and protects their memory access locality by intelligently interleaving their data across memory devices through sub-rank binding. Moreover, AMS can batch requests that have no sub-rank conflict into a single read burst using our optimized memory scheduling policy. Experimental results from trace-based simulation show both a conspicuous performance boost and energy savings brought by AMS.
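To make the batching idea concrete, the following is a minimal sketch of grouping pending requests into bursts so that no two requests in the same burst target the same sub-rank. It is not the scheduling policy evaluated in the paper: the `Request` record, the `batch_requests` function, the default of four sub-ranks, and the greedy first-come-first-served order are all assumptions made for illustration.

```python
from collections import namedtuple

# Hypothetical request record: issuing FU, bound sub-rank, and address.
Request = namedtuple("Request", ["fu", "subrank", "addr"])


def batch_requests(queue, num_subranks=4):
    """Greedily group requests into bursts with at most one request per sub-rank.

    Requests are scanned in arrival order; a request whose sub-rank is
    already used in the current burst is deferred to a later burst.
    """
    pending = list(queue)
    bursts = []
    while pending:
        used = set()      # sub-ranks already claimed by the current burst
        burst = []
        leftover = []
        for req in pending:
            if req.subrank not in used and len(burst) < num_subranks:
                used.add(req.subrank)
                burst.append(req)
            else:
                leftover.append(req)  # sub-rank conflict: wait for a later burst
        bursts.append(burst)
        pending = leftover
    return bursts


# Illustrative queue: FU "A" is bound to sub-rank 0, "B" to 1, "C" to 2.
example_queue = [
    Request("A", 0, 0x000),
    Request("B", 1, 0x040),
    Request("A", 0, 0x080),
    Request("C", 2, 0x0C0),
    Request("B", 1, 0x100),
]
example_bursts = batch_requests(example_queue)
```

With this example queue, the second requests from FUs "A" and "B" conflict with their first ones on the same sub-rank, so they are deferred and served together in a second burst. A real scheduler would also need to account for DRAM timing constraints and fairness across FUs, which this sketch ignores.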