Monaural voiced speech segregation based on elaborate harmonic grouping strategies

Source: Science China (Information Sciences)
This paper proposes an enhanced algorithm for monaural voiced speech segregation based on several elaborate harmonic grouping strategies. Its contributions lie in three aspects. First, the algorithm classifies time-frequency (T-F) units as resolved or unresolved by their carrier-to-envelope energy ratio, which yields more accurate classification than cross-channel correlation. Second, resolved T-F units are grouped according to the harmonic principle together with the minimum amplitude principle, which has been verified to exist in human perception. Third, an "enhanced" envelope autocorrelation function is employed to detect amplitude modulation rates, which helps reduce half-frequency errors in the grouping of unresolved units. Systematic evaluation and comparison show that the proposed algorithm greatly improves separation performance. Specifically, the signal-to-noise ratio (SNR) is improved by 0.96 dB over the previous method. The algorithm also improves the PESQ score and the subjective perception score.
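
The abstract does not define the "enhanced" envelope autocorrelation function, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes a Hilbert envelope and borrows the pruning step commonly used in the multipitch literature: clip the autocorrelation to positive values, subtract a copy stretched by a factor of two along the lag axis, and clip again. This suppresses the peak at twice the true period, which is the peak responsible for half-frequency errors. The function names, the 80-400 Hz search range, and the two-tone test signal are all assumptions made for illustration.

```python
import numpy as np

def envelope(x):
    """Hilbert envelope, computed from an FFT-based analytic signal."""
    n = len(x)
    gain = np.zeros(n)
    gain[0] = 1.0
    if n % 2 == 0:
        gain[n // 2] = 1.0
        gain[1:n // 2] = 2.0
    else:
        gain[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(x) * gain))

def autocorr(x, max_lag):
    """Unnormalized autocorrelation of the mean-removed signal, lags 0..max_lag."""
    x = x - np.mean(x)
    full = np.correlate(x, x, mode="full")
    mid = len(x) - 1
    return full[mid:mid + max_lag + 1]

def enhance(acf):
    """Assumed enhancement: clip, subtract a 2x lag-stretched copy, clip again."""
    clipped = np.maximum(acf, 0.0)
    lags = np.arange(len(clipped))
    stretched = np.interp(lags / 2.0, lags, clipped)  # value at lag/2
    return np.maximum(clipped - stretched, 0.0)

def am_rate(x, fs, f_min=80.0, f_max=400.0):
    """Estimate the amplitude modulation rate (Hz) of one channel response."""
    max_lag = int(fs / f_min)
    min_lag = int(fs / f_max)
    enhanced = enhance(autocorr(envelope(x), max_lag))
    lag = min_lag + int(np.argmax(enhanced[min_lag:max_lag + 1]))
    return fs / lag

# Hypothetical unresolved channel: harmonics 10 and 11 of a 200 Hz voice.
fs = 16000
t = np.arange(1600) / fs
x = np.sin(2 * np.pi * 2000 * t) + np.sin(2 * np.pi * 2200 * t)
rate = am_rate(x, fs)
print(rate)
```

For this synthetic channel the envelope beats at the 200 Hz difference frequency, so the estimate should land close to 200 Hz. The stretched copy removes the competing peak at the double period (lag 160 samples), which is the sense in which such an enhancement guards against half-frequency errors.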