基于多GPU的深层神经网络快速训练方法

来源 :清华大学学报(自然科学版) | 被引量 : 0次 | 上传用户:f54265932
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
近年来,深层神经网络(deep neural network,DNN)被成功应用于语音识别领域,成为一种很具发展潜力的语音识别模型。然而,由于其训练算法复杂度高,随着训练数据和网络规模增大,DNN模型训练将非常耗时。为提高DNN的训练效率,该文研究了基于多图形处理器(graph-ic processing unit,GPU)的DNN快速训练算法。在TIMIT数据集上的音素识别实验显示:在基本保证识别性能的前提下,优化后的DNN快速训练方法在4个GPU下训练速度相比单GPU有约3.3倍的提升。实验结果表明该快速训练方法可以显著提升DNN模型的训练速度。 In recent years, the deep neural network (DNN) has been successfully applied in the field of speech recognition and has become a promising speech recognition model. However, due to the complexity of its training algorithm, DNN model training will be very time-consuming as training data and network size increase. In order to improve the training efficiency of DNN, this paper studies DNN fast training algorithm based on graph-ic processing unit (GPU). The experiments of phoneme identification on TIMIT dataset show that under the premise of ensuring recognition performance, the optimized DNN training method can improve the training speed by about 3.3 times compared with single GPU under 4 GPUs. Experimental results show that the fast training method can significantly improve the training speed of DNN model.
其他文献
党的十七届六中全会决定推动社会主义文化大发展大繁荣,提高国家文化软实力,在日趋激烈的综合国力竞争中赢得主动.中央的这一决定给文化传媒领域重要的有生力量——出版战线
在界定重复发表和一稿多发两个概念的基础上,运用合同法和知识产权法理论分析,认为违法的应是重复发表,而一稿多发则是著作权人的合法权利。报刊社若要对作者的一稿多发的权
期刊
中国证券市场的10年是只生不死的10年,从上海开市时的“老八股”到今天沪深两市的1000家上市公司,就象“南征北战”的我军,仗越战越大,队伍越走越多,只不过,走着走着就开始
In this paper, the Space Weather Modeling Framework (SWMF) is used to simulate the real-time response of the magnetosphere to a solar wind event on June 5, 1998
期刊
期刊
痴迷的电脑神童 盖茨从小就有强烈的进取精神,对电脑有一种执着的迷恋,并能熟练操作电脑,进入别人的电脑加密区,因而被人誉为“电脑神童”。1971年暑假,盖茨和好友艾伦在湖
期刊
Using wave measurements from the EMFISIS instrument onboard Van Allen Probes, we investigate statistically the spatial distributions of the intensity of plasmas