CONVERGENCE OF BACKPROPAGATION WITH MOMENTUM FOR NETWORK ARCHITECTURES WITH SKIP CONNECTIONS

来源 :计算数学(英文版) | 被引量 : 0次 | 上传用户:xsxiaomo
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
We study a class of deep neural networks with architectures that form a directed acyclic graph(DAG).For backpropagation defined by gradient descent with adaptive momentum,we show weights converge for a large class of nonlinear activation functions.The proof generalizes the results of Wu et al.(2008)who showed convergence for a feed-forward network with one hidden layer.For an example of the effectiveness of DAG architectures,we describe an example of compression through an AutoEncoder,and compare against sequential feed-forward networks under several metrics.
其他文献
We demonstrate an all-fiberized narrow-linewidth nanosecond amplifier with high peak power,tunable pulse width,and repetition rate.A fiber-coupled narrow-linewi
动物造型石的立意创作,先要捕捉石象中能够表现某种特定动物的形态细节,对其进行运用;而对于不合这种动物的形态细节予以合理的解释,从而首先对“这一只动物”确认是什么动物
Terahertz polarization devices are an important part of terahertz optical systems.Traditional terahertz polarization devices rely on birefringent crystals,and t
本文采用模板法制备出具有宏孔单向排列结构的多级孔块体碳材料.制备过程分为两个步骤:首先通过可溶性淀粉溶胶的定向凝固制备出具有宏孔单向排列的淀粉块体;然后将制备的多
This article introduces a novel variational model for restoring images degraded by Cauchy noise and/or blurring.The model integrates a nonconvex data-fidelity t
Na0.5Bi0.5TiO3-BiMnO3(NBT-BM)limited solid solution films were fabricated to investigate the lattice modifi-cation on the energy storage performances.The introd