Progressive framework for deep neural networks: from linear to non-linear

来源 :中国邮电高校学报（英文版） | 被引量 : 0次 | 上传用户：game00vergoo

【摘要】

：

【作者】

：

Shao Jie Cai Anni

【机构】

：

School of Information and Communication Engineering, Beijing University of Posts and Telecommunicati

【出处】

：

中国邮电高校学报（英文版）

【发表日期】

：

2016年6期

【关键词】

：

framework neural network DCCA semantic RankNet

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

We propose a novel progressive framework to optimize deep neural networks.The idea is to try to combine the stability of linear methods and the ability of learning complex and abstract internal representations of deep learning methods.We insert a linear loss layer between the input layer and the first hidden non-linear layer of a traditional deep model.The loss objective for optimization is a weighted sum of linear loss of the added new layer and non-linear loss of the last output layer.We modify the model structure of deep canonical correlation analysis (DCCA),i.e.,adding a third semantic view to regularize text and image pairs and embedding the structure into our framework,for cross-modal retrieval tasks such as text-to-image search and image-to-text search.The experimental results show the performance of the modified model is better than similar state-of-art approaches on a dataset of National University of Singapore (NUS-WIDE).To validate the generalization ability of our framework,we apply our framework to RankNet,a ranking model optimized by stochastic gradient descent.Our method outperforms RankNet and converges more quickly,which indicates our progressive framework could provide a better and faster solution for deep neural networks.

其他文献

High precision approximate analytical solutions to ODE using LS-SVM

The problem of solving differential equations and the properties of solutions have always been an important content of differential equation study.In practical application and scientific research,it is difficult to obtain analytical solutions for most dif

期刊

the kernel functionLS-SVMODEnumerical solutionapproximate analytical solutio

Dynamic model of computer viruses under the effect of removable media and external computers

This paper investigates the propagation of computer viruses and establishes a novel propagation model.In contrast to the existing models,this model can directly indicate the impact of removable media and external computers on the propagation of computer v

期刊

computer virusexternal computerremovable mediumviral equilibrium pointglobal

Improved statistical sparse decomposition principle method for underdetermined blind source signal r

Aiming at the statistical sparse decomposition principle (SSDP) method for underdetermined blind source signal recovery with problem of requiring the number of active signals equal to that of the observed signals,which leading to the application bound of

期刊

underdetermined blind source separationsignal recoveryISSDP

Low complexity detection algorithm based on optimized Neumann series for massive MIMO system

An optimized Neumann series (NS) approximation is described based on Frobenius matrix decomposition,this method aims to reduce the high complexity,which caused by the large matrix inversion of detection algorithm in the massive multiple input multiple out

期刊

massive MIMOjacobi iterationzero forcing precodinglow complexityweighted two

Convergence analysis and performance optimization of parallel concatenated systematic polar code

Similar to the analysis of Turbo codes,the parallel concatenated systematic polar code (PCSPC) can also be analyzed by the extrinsic information transfer (EXIT) chart.The convergence of the iterative decoding based on soft cancellation (SCAN) and belief p

期刊

PCSPCEXIT chartSCAN decoderBP decoderweight coefficient

Joint channel selection and power control for video streaming over D2D communications based cognitiv

A joint channel selection and power control scheme is developed for video streaming in device-to-device (D2D) communications based cognitive radio networks.In particular,physical queue and virtual queue models by applying ‘M/G/1 queue\' and ‘ M/G/1 queu

期刊

channel selectionpower controlcognitive radio networksD2D communicationsvide

Noisy speech emotion recognition using sample reconstruction and multiple-kernel learning

Speech emotion recognition (SER) in noisy environment is a vital issue in artificial intelligence (AI).In this paper,the reconstruction of speech samples removes the added noise.Acoustic features extracted from the reconstructed samples are selected to bu

期刊

speech emotion recognitioncompressed sensingmultiple-kernel learningfeature s

Performance analysis of LTE-U coexistence network with WiFi using queueing model

The unforeseen mobile data explosion as well as the scarce of spectrum resource pose a major challenge to the performance of today\'s cellular networks which are in urgent need of novel solutions to handle such voluminous mobile data.Long term evolution

期刊

LTE-UWiFiqueueing modelmatrix geometric method

Efficient closed-form solution for target localization using TDOA measurements in the presence of se

An efficient solution for locating a target was proposed,which by using time difference of arrival (TDOA) measurements in the presence of random sensor position errors to increase the accuracy of estimation.The cause of position estimation errors in two-s

期刊

target localizationTDOAsensor position errorsCRLB

Measuring web page complexity by analyzing TCP flows and HTTP headers

To understand website complexity deeply,a web page complexity measurement system is developed.The system measures the complexity of a web page at two levels:transport-level and content-level,using a packet trace-based approach rather than server or client

期刊

hyper text transfer protocolconcurrent TCP flowsworld wide webweb page comple

Progressive framework for deep neural networks: from linear to non-linear

与本文相关的学术论文