A General-Purpose Many-Accelerator Architecture Based on Dataflow Graph Clustering of Applications

Source: Journal of Computer Science & Technology
The combination of growing transistor counts and a limited power budget within a silicon die leads to the utilization wall problem (a.k.a. "Dark Silicon"): only a small fraction of a chip can run at full speed at any given time. Designing accelerators for specific applications or algorithms is considered one of the most promising approaches to improving energy efficiency. However, most current accelerator design methods are dedicated to particular applications or algorithms, which greatly constrains their applicability. In this paper, we propose a novel general-purpose many-accelerator architecture. Our contributions are two-fold. Firstly, we propose to cluster the dataflow graphs (DFGs) of hotspot basic blocks (BBs) in applications and to use the resulting DFG clusters for accelerator design, because a DFG is the largest program unit that is not specific to a certain application. We analyze 17 benchmarks in SPEC CPU 2006, acquire over 300 hotspot DFGs by using the LLVM compiler tool, and divide them into 15 clusters based on graph similarity. Secondly, we introduce a function instruction set architecture (FISC) and illustrate how DFG accelerators can be integrated with a processor core and how they can be used by applications. Our results show that the proposed DFG clustering and FISC design can speed up SPEC benchmarks by 6.2x on average.
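The clustering step described above can be illustrated with a minimal sketch. The abstract does not state which graph-similarity measure or clustering algorithm the authors use, so everything here is an assumption made for illustration: each DFG is reduced to a set of (producer opcode, consumer opcode) edges, similarity is the Jaccard index over those sets, and clustering is a greedy single pass against a hypothetical threshold of 0.5. The names `jaccard`, `cluster_dfgs`, and the `bb_*` labels are invented for the example.

```python
def jaccard(a, b):
    """Jaccard similarity of two edge sets (assumed metric, not the paper's)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_dfgs(dfgs, threshold=0.5):
    """Greedy single-pass clustering of DFGs.

    Each DFG joins the first cluster whose representative (its first member)
    is at least `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # each cluster is a list of DFG names
    for name, edges in dfgs.items():
        for cluster in clusters:
            representative = dfgs[cluster[0]]
            if jaccard(edges, representative) >= threshold:
                cluster.append(name)
                break
        else:
            clusters.append([name])
    return clusters


# Hypothetical hotspot basic blocks, each modeled as a set of opcode edges.
example = {
    "bb_a": frozenset({("load", "mul"), ("mul", "add"), ("add", "store")}),
    "bb_b": frozenset({("load", "mul"), ("mul", "add"), ("add", "add")}),
    "bb_c": frozenset({("load", "icmp"), ("icmp", "br")}),
}

if __name__ == "__main__":
    print(cluster_dfgs(example))
```

In this toy input, `bb_a` and `bb_b` share two of four distinct edges and fall into one cluster, while `bb_c` shares none and forms its own. Each resulting cluster would then stand for one candidate accelerator, in the spirit of the 15 clusters reported for the SPEC CPU 2006 hotspots.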