Pragma Directed Shared Memory Centric Optimizations on GPUs

来源 :计算机科学技术学报（英文版） | 被引量 : 0次 | 上传用户：trulyliu

【摘要】

：

GPUs become a ubiquitous choice as coprocessors since they have excellent ability in concurrent processing. In GPU architecture, shared memory plays a very impo

【作者】

：

Jing Li Lei Liu Yuan Wu Xiang-Hua Liu Yi Gao Xiao-Bing Feng Cheng-Yong Wu

【机构】

：

State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of

【出处】

：

计算机科学技术学报（英文版）

【发表日期】

：

2016年2期

【关键词】

：

GPU shared memory pragma directed data centric

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

GPUs become a ubiquitous choice as coprocessors since they have excellent ability in concurrent processing. In GPU architecture, shared memory plays a very important role in system performance as it can largely improve bandwidth utilization and accelerate memory operations. However, even for a?ne GPU applications that contain regular access patts, optimizing for shared memory is not an easy work. It often requires programmer expertise and nontrivial parameter selection. Improper shared memory usage might even underutilize GPU resource. Even using state-of-the-art high level programming models (e.g., OpenACC and OpenHMPP), it is still hard to utilize shared memory since they lack inherent support in describing shared memory optimization and selecting suitable parameters, let alone maintaining high resource utilization. Targeting higher productivity for a?ne applications, we propose a data centric way to shared memory optimization on GPU. We design a pragma extension on OpenACC so as to convey data management hints of programmers to compiler. Meanwhile, we devise a compiler framework to automatically select optimal parameters for shared arrays, using the polyhedral model. We further propose optimization techniques to expose higher memory and instruction level parallelism. The experimental results show that our shared memory centric approaches effectively improve the performance of five typical GPU applications across four widely used platforms by 3.7x on average, and do not burden programmers with lots of pragmas.

其他文献

胆道支架置入术姑息性治疗恶性梗阻性黄疸临床疗效观察

目的　观察经皮肝穿胆道支架置入术 ,在姑息治疗恶性梗阻性黄疸的临床疗效。方法　对恶性梗阻黄疸的 74例胆道支架 (包括 42例金属支架与 3 2例内涵管 )内引流术后临床资料作

期刊

恶性梗阻黄疸支架

瑞舒伐他汀对老年冠心病伴高脂血症患者血脂及高敏C反应蛋白的影响

目的探讨瑞舒伐他汀对老年冠心病伴高脂血症患者血脂及高敏C反应蛋白的影响。方法采用对照研究的方法,将72例患者随机分为治疗组(36例)和对照组(36例),治疗组服用瑞舒伐他汀

期刊

瑞舒伐他汀老年冠心病高脂血症血脂高敏C反应蛋白

插秧机闲置期间的保养/不可用洗衣粉清洗农机具/农机产品质量维权常识

期刊

插秧机保养洗衣粉清洗农机具产品质量维权

借助品牌优势带动农户脱贫

封丘生命果有机食品股份有限公司是亚洲地区树莓种植面积、加工能力、生产工艺领先的现代化龙头企业.该公司主要有16个系列产品,其中4个产品2015年获绿色食品证书,并荣获第十

期刊

肺内淋巴瘤1例报告

1　病例介绍男 ,47岁。 2年前出现低热盗汗 ,颈部、腋下及腹股沟淋巴结肿大。穿刺活检诊为非何杰金氏淋巴瘤ＩＶ期。胸片示右肺中叶肿块。放疗化疗后全身肿大淋巴结消失 ,右肺肿

期刊

肺内何杰金氏淋巴瘤肿大淋巴结淋巴结肿大肺肿块右肺中叶低热穿刺活检病例介绍化疗后腹股沟颈部检查放疗盗汗

阿司匹林联合缓释型双嘧达莫防治急性缺血性卒中后脑血管缺血事件的临床评价

目的评价阿司匹林联合缓释型双嘧达莫防治急性缺血性卒中后脑血管缺血事件的临床作用。方法选择2009年2月-2012年1月清苑县人民医院收治的63例急性缺血性卒中患者作为研究对

期刊

急性缺血性卒中脑血管缺血事件阿司匹林双嘧达莫

延边州农机局开展“践行宗旨、勤政廉政、政策法律”教育活动/敦化开展安全生产月宣传活动/以民为本为民服务提升便民服务水平

期刊

延边州农机宗旨勤政廉政政策法律教育活动敦化安全生产宣传活动以民为本服务提升

三一挖掘机的十五年人机缘

从黑河瑷珲机场沿310省道一路向北,在经过稗子沟后直转向东,再沿嫩黑公路向南行驶约半个小时便抵达了我们此行的目的地——黑河市锦河农场.机主王德修在农场门口早已等候多时

期刊

Wide Operational Range Processor Power Delivery Design for Both Super-Threshold Voltage and Near-Thr

The load power range of mod processors is greatly enlarged because many advanced power management techniques are employed, such as dynamic voltage frequency sca

期刊

voltage regulatorpower deliverynear-threshold computingmulticore processor

肝血管瘤选择性动脉造影加栓塞治疗

目的　探讨肝血管瘤的动脉造影表现 ,寻找理想的栓塞方法 ,提高介入治疗效果。方法　经股动脉插管、选择性肝动脉造影后 ,采用超液化碘油加平阳霉素栓塞治疗肝血管瘤 40例 ,

期刊

肝血管瘤血管造影栓塞治疗性

Pragma Directed Shared Memory Centric Optimizations on GPUs

与本文相关的学术论文