Progressive framework for deep neural networks: from linear to non-linear

来源 :中国邮电高校学报(英文版) | 被引量 : 0次 | 上传用户:game00vergoo
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
We propose a novel progressive framework to optimize deep neural networks.The idea is to try to combine the stability of linear methods and the ability of learning complex and abstract internal representations of deep learning methods.We insert a linear loss layer between the input layer and the first hidden non-linear layer of a traditional deep model.The loss objective for optimization is a weighted sum of linear loss of the added new layer and non-linear loss of the last output layer.We modify the model structure of deep canonical correlation analysis (DCCA),i.e.,adding a third semantic view to regularize text and image pairs and embedding the structure into our framework,for cross-modal retrieval tasks such as text-to-image search and image-to-text search.The experimental results show the performance of the modified model is better than similar state-of-art approaches on a dataset of National University of Singapore (NUS-WIDE).To validate the generalization ability of our framework,we apply our framework to RankNet,a ranking model optimized by stochastic gradient descent.Our method outperforms RankNet and converges more quickly,which indicates our progressive framework could provide a better and faster solution for deep neural networks.
其他文献
The problem of solving differential equations and the properties of solutions have always been an important content of differential equation study.In practical application and scientific research,it is difficult to obtain analytical solutions for most dif
This paper investigates the propagation of computer viruses and establishes a novel propagation model.In contrast to the existing models,this model can directly indicate the impact of removable media and external computers on the propagation of computer v
Aiming at the statistical sparse decomposition principle (SSDP) method for underdetermined blind source signal recovery with problem of requiring the number of active signals equal to that of the observed signals,which leading to the application bound of
An optimized Neumann series (NS) approximation is described based on Frobenius matrix decomposition,this method aims to reduce the high complexity,which caused by the large matrix inversion of detection algorithm in the massive multiple input multiple out
Similar to the analysis of Turbo codes,the parallel concatenated systematic polar code (PCSPC) can also be analyzed by the extrinsic information transfer (EXIT) chart.The convergence of the iterative decoding based on soft cancellation (SCAN) and belief p
A joint channel selection and power control scheme is developed for video streaming in device-to-device (D2D) communications based cognitive radio networks.In particular,physical queue and virtual queue models by applying ‘M/G/1 queue\' and ‘ M/G/1 queu
Speech emotion recognition (SER) in noisy environment is a vital issue in artificial intelligence (AI).In this paper,the reconstruction of speech samples removes the added noise.Acoustic features extracted from the reconstructed samples are selected to bu
The unforeseen mobile data explosion as well as the scarce of spectrum resource pose a major challenge to the performance of today\'s cellular networks which are in urgent need of novel solutions to handle such voluminous mobile data.Long term evolution
An efficient solution for locating a target was proposed,which by using time difference of arrival (TDOA) measurements in the presence of random sensor position errors to increase the accuracy of estimation.The cause of position estimation errors in two-s
To understand website complexity deeply,a web page complexity measurement system is developed.The system measures the complexity of a web page at two levels:transport-level and content-level,using a packet trace-based approach rather than server or client