Multi-Distributed Speech Emotion Recognition Based on Mel Frequency Cepstogram and Parameter Transfe

来源 :电子学报(英文版) | 被引量 : 0次 | 上传用户:gsbyqjkwkw
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Speech emotion recognition(SER)is the use of speech signals to estimate the state of emotion.At present,machine learning is one of the main research methods of SER,the test and training dataS of tradition-al machine learning all have the same distribution and feature space,but the data ofspeech is accessed from dif-ferent environments and devices,with different distribu-tion characteristics in real life.Thus,the traditional ma-chine learning method is applied to the poor performance of SER.This paper proposes a multi-distributed SER method based on Mel frequency cepstogram(MFCC)and parameter transfer.The method is based on single-layer long short-term memory(LSTM),pre-trained inception-v3 network and multi-distribution corpus.The speech pre-processed MFCC is taken as the input of single-layer LSTM,and input to the pre-trained inception-v3 network.The features are extracted through the pre-trained incep-tion-v3 model.Then the features are sent to the newly defined the fully connected layer and classification layer,let the parameters of the fully connected layer be fine-tuned,finally get the classification result.The experi-ment proves that the method can effectively complete the classification of multi-distribution speech emotions and is more effective than the traditional machine learning framework of SER.
其他文献
The increasing commercialization and massive deployment of radio frequency identification(RFID)systems has raised many security related issues which in return evokes the need of security protocols.Lo-gic of events theory(LoET)is a formal method for con-st
This paper presents a low power con-sumption and low cost electrically erasable program-mable read-only memory(EEPROM)for radio frequency identification(RFID)tag chip.A read-write circuit with parallel input and serial output is proposed.Only one sensitiv
GIFT is a lightweight block cipher with an substitution-permutation-network(SPN)structure proposed in CHES 2017.It has two different versions whose block sizes are 64 and 128 respectively.In RSA 2019,Zhu et al.found some differential characteristics of GI
第五代移动通信技术(5G)已正式投入商用,卫星网络与地面移动通信系统的融合是其中一个重要研究方向.针对5G体制在低轨(Low Earth Orbiting,LEO)卫星场景中应用的适应性问题,总结了不同研究机构、标准化组织以及学术界的研究进展,分析了由于卫星互联网特点带来的5G空口影响与挑战,给出了低轨卫星互联网5G空口的时频同步、随机接入以及小区切换现有解决方案.相关内容可为低轨卫星互联网与5G的融合研究提供参考.
In this paper,a novel maximum corren-tropy high-order extended Kalman filter(H-MCEKF)is proposed for a class of nonlinear non-Gaussian systems presented by polynomial form.All high-order polynomial terms in the state model are defined as implicit variable
The conventional convolutional neural network performs not well enough in the ground objects classification because of its insufficient ability in maintain-ing sensitive spectral information and characterizing the covariance of spatial structure,resulting
In this paper,a hybrid deep learning network-based model is proposed and implemented for maneuver decision-making in an air combat environment.The model consists of stacked sparse auto-encoder net-work for dimensionality reduction of high-dimensional,dyna
Unsupervised person re-identification(Re-ID)aims to improve the model\'s scalability and ob-tain better Re-ID results in the unlabeled data domain.In this paper,we propose an unsupervised person Re-ID method based on multi-granularity feature representa
A new classification model,the fuzzy hy-brid twin support vector machine(TWSVM),namely FHTWSVM,is proposed by combining the fuzzy TWS-VM and the hypersphere support vector machine(SVM).The hypersphere SVM is utilized for generating the hy-perspheres for t
2月16日,据世界经济论坛网站发布的WEF智能工业指数认证白皮书《制造业转型洞察报告》显示,海尔洗衣机旗下中德滚筒互联工厂和海尔合肥洗衣机互联工厂入选该报告,成为全球洗衣机行业唯一入选的优秀案例代表,再次证明了海尔洗衣机的智能制造实力.
期刊