Elastic pointer directory organization for scalable shared memory multiprocessors

来源 :Journal of Systems Engineering and Electronics | 被引量 : 0次 | 上传用户:gaozhanlong
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In the field of supercomputing, one key issue for scalable sharedmemory multiprocessors is the design of the directory which denotes the sharing state for a cache block. A good directory design intends to achieve three key attributes: reasonable memory overhead, sharer position precision and implementation complexity. However, researchers often face the problem that gaining one attribute may result in losing another. The paper proposes an elastic pointer directory(EPD) structure based on the analysis of sharedmemory applications, taking the fact that the number of sharers for each directory entry is typically small. Analysis results show that for 4 096 nodes, the ratio of memory overhead to the fullmap directory is 2.7%. Theoretical analysis and cycleaccurate executiondriven simulations on a 16 and 64node cache coherence non uniform memory access(CCNUMA) multiprocessor show that the corresponding pointer overflow probability is reduced significantly. The performance is observed to be better than that of a limited pointers directory and almost identical to the fullmap directory, except for the slight implementation complexity. Using the directory cache to explore directory access locality is also studied. The experimental result shows that this is a promising approach to be used in the stateoftheart high performance computing domain. In the field of supercomputing, one key issue for scalable shared memory multiprocessors is the design of the directory which represents the sharing state for a cache block. A good directory design intends to achieve three key attributes: reasonable memory overhead, sharer position precision and implementation complexity . However, researchers often face the problem that gaining one attribute may result in losing another. The paper proposes an elastic pointer directory (EPD) structure based on the analysis of shared memory applications, taking the fact that the number of sharers for each directory entry is typically small. Analysis results show that for 4 096 nodes, the ratio of memory overhead to the fullmap directory is 2.7%. Theoretical analysis and cycle accurate execution sim simulations on a 16 and 64 node cache coherence non uniform memory access (CCNUMA) multiprocessor show that the corresponding pointer overflow probability is reduced significantly. The performance is observed to be b etter than that of a limited pointers directory and almost identical to the fullmap directory, except for the slight implementation complexity. Using the directory cache to explore directory access locality is also studied. The experimental result shows that this is a promising approach to be used in the stateoftheart high performance computing domain.
其他文献
针对一起起重机吊具故障导致坠落伤人事故案例进行分析,找出故障原因及事故根源,并提出相应的预防措施,从而避免类似事故的再次发生,可供起重作业人员进行参考。 Aiming at
区间值信念结构下证据理论在信息融合、决策分析中有着广泛的应用前景.针对区间值冲突证据组合出现反直观结果问题,提出一种新的证据组合方法.从整体角度构建证据间Pignistic概
为了提高终端空域航空器到达时间预测的准确性和排序队列的稳定性,本文从分析影响航空器预计到达时间ETA的不确定因素出发,建立了同一机型航空器过同一边界点到达时间的概率
Frame and frequency synchronization are essential for orthogonal frequency division multiplexing (OFDM) systems. The frame offset owing to incorrect start point
This paper considers a project scheduling problem with the objective of minimizing resource availability costs appealed to finish al activities before the deadl
A combination method of optimization of the back-ground value and optimization of the initial item is proposed. The sequences of the unbiased exponential distri
This paper focuses on fast algorithm for computing the assignment reduct in inconsistent incomplete decision systems. It is quite inconvenient to judge the assi
A novel strategy of probability density function (PDF) shape control is proposed in stochastic systems. The control er is designed whose parameters are optimal
This paper focuses on the performance analysis of flexible reactive systems. The performance analysis consists of two phases: first system modeling, second perf
Mineral sulphide (MS)-lime (CaO) ion exchange reactions (MS+CaO=MO+CaS) and the effect of CaO/C mole ratio during carbothermic reduction (MS+CaO+C=M+CaS+CO(g))