CONVERGENCE OF BACKPROPAGATION WITH MOMENTUM FOR NETWORK ARCHITECTURES WITH SKIP CONNECTIONS

Source: 计算数学（英文版） (Journal of Computational Mathematics)

Authors: Chirag Agarwal, Joe Klobusicky, Dan Schonfeld

Affiliation: Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, IL 60607

Published in: 计算数学（英文版） (Journal of Computational Mathematics)

Publication date: 2021, Issue 1

Keywords: Backpropagation with momentum; Autoencoders; Directed acyclic graphs

Abstract

We study a class of deep neural networks with architectures that form a directed acyclic graph (DAG). For backpropagation defined by gradient descent with adaptive momentum, we show that the weights converge for a large class of nonlinear activation functions. The proof generalizes the results of Wu et al. (2008), who showed convergence for a feed-forward network with one hidden layer. To illustrate the effectiveness of DAG architectures, we describe an example of compression through an autoencoder and compare it against sequential feed-forward networks under several metrics.
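The abstract does not spell out the update rule, so the following is only a rough illustration of the setting. It trains a three-weight network with one skip edge from input to output (so its graph is a DAG rather than a chain) using gradient descent with a momentum coefficient that is rescaled at every step. The rescaling rule `tau = mu * ||g|| / ||dw||`, the toy network, the values of `eta` and `mu`, and all function names are assumptions made for this sketch, not details taken from the paper.

```python
import math


def forward(w, x):
    """Toy DAG: input -> tanh hidden node -> output, plus a skip edge input -> output."""
    w1, w2, w3 = w
    h = math.tanh(w1 * x)
    return w2 * h + w3 * x, h


def loss_and_grad(w, data):
    """Mean squared error and its gradient, computed by hand-written backpropagation."""
    w1, w2, w3 = w
    loss = 0.0
    g = [0.0, 0.0, 0.0]
    for x, t in data:
        y, h = forward(w, x)
        err = y - t
        loss += 0.5 * err * err
        g[0] += err * w2 * (1.0 - h * h) * x  # path through the hidden node
        g[1] += err * h                       # hidden-to-output weight
        g[2] += err * x                       # path through the skip edge
    n = len(data)
    return loss / n, [gi / n for gi in g]


def norm(v):
    return math.sqrt(sum(vi * vi for vi in v))


def train(data, w0, eta=0.1, mu=0.05, steps=500):
    """Gradient descent with an adaptive momentum coefficient (assumed rule)."""
    w = list(w0)
    dw = [0.0] * len(w)  # previous weight change
    losses = []
    for _ in range(steps):
        loss, g = loss_and_grad(w, data)
        losses.append(loss)
        dn = norm(dw)
        # Assumed rule: the momentum term is scaled to have norm mu * ||g||,
        # and is switched off when there is no previous step.
        tau = mu * norm(g) / dn if dn > 0.0 else 0.0
        dw = [-eta * gi + tau * di for gi, di in zip(g, dw)]
        w = [wi + di for wi, di in zip(w, dw)]
    return w, losses


if __name__ == "__main__":
    # Synthetic targets produced by the same architecture with known weights.
    data = [(x, 0.5 * math.tanh(x) + 0.3 * x) for x in (-1.0, -0.5, 0.0, 0.5, 1.0)]
    w, losses = train(data, [0.1, 0.1, 0.1])
    print("first loss:", losses[0], "last loss:", losses[-1])
```

With `mu < eta`, the momentum term in this sketch is always shorter than the gradient term, so each step keeps a descent component. That is a property of the assumed rule only and makes no claim about the conditions used in the paper's proof.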
