Audio-Visual Underdetermined Blind Source Separation Algorithm Based on Gaussian Potential Function

Source: China Communications
Most existing algorithms for the underdetermined blind source separation (UBSS) problem are two-stage algorithms: mixing-parameter estimation followed by source estimation. In the mixing-parameter estimation stage, traditional clustering algorithms are sensitive to the initialization of the mixing parameters. To reduce this sensitivity, we propose a new algorithm for the UBSS problem with anechoic speech mixtures that uses visual information to initialize the mixing parameters, namely the interaural time difference (ITD) and the interaural level difference (ILD). In our algorithm, the video signals are used to estimate the distances between the microphones and the sources, from which estimates of the ITD and ILD are obtained. Under the sparsity assumption in the time-frequency domain, the Gaussian potential function algorithm then estimates the mixing parameters, using these ITDs and ILDs as initializations. Finally, time-frequency masking recovers the sources by evaluating the various ITDs and ILDs. Experimental results demonstrate the competitive performance of the proposed algorithm compared with the baseline algorithms.
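To make the initialization step concrete, the following is a minimal sketch and not the authors' implementation. It assumes a free-field (anechoic) model in which delay equals distance divided by the speed of sound and amplitude falls off as the inverse of distance, and it assumes that the Gaussian potential is a sum of Gaussian kernels centred on per-bin (ITD, ILD) observations, climbed with a mean-shift style fixed-point update. The function names, the kernel width sigma, and the speed-of-sound constant are all hypothetical choices made for illustration; the abstract does not specify them.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at about 20 degrees C


def itd_ild_from_distances(d1, d2, c=SPEED_OF_SOUND):
    """Initial (ITD in seconds, ILD in dB) at microphone 2 relative to
    microphone 1, from source-to-microphone distances in metres.

    Assumption: free-field propagation, so delay = distance / c and
    amplitude is proportional to 1 / distance.
    """
    if d1 <= 0 or d2 <= 0:
        raise ValueError("distances must be positive")
    itd = (d2 - d1) / c                    # extra travel time to microphone 2
    ild_db = 20.0 * math.log10(d1 / d2)    # level at mic 2 relative to mic 1
    return itd, ild_db


def _kernel(p, theta, sigma):
    """Gaussian kernel weight of observation p as seen from theta."""
    sq = (p[0] - theta[0]) ** 2 + (p[1] - theta[1]) ** 2
    return math.exp(-sq / (2.0 * sigma ** 2))


def gaussian_potential(theta, points, sigma):
    """Assumed potential: sum of Gaussian kernels centred on the points."""
    return sum(_kernel(p, theta, sigma) for p in points)


def refine(theta0, points, sigma, iters=50, tol=1e-9):
    """Climb the potential from theta0 using the kernel-weighted mean of
    the points as the next estimate (a mean-shift style update).

    points are 2-D features assumed to be pre-scaled to comparable ranges.
    """
    theta = (float(theta0[0]), float(theta0[1]))
    for _ in range(iters):
        weights = [_kernel(p, theta, sigma) for p in points]
        total = sum(weights)
        if total == 0.0:
            break  # initialization too far from every point for this sigma
        new = (
            sum(w * p[0] for w, p in zip(weights, points)) / total,
            sum(w * p[1] for w, p in zip(weights, points)) / total,
        )
        step = math.hypot(new[0] - theta[0], new[1] - theta[1])
        theta = new
        if step < tol:
            break
    return theta
```

In this sketch, each source would get one starting point from itd_ild_from_distances, fed with the video-derived distances, and refine would then move that point to a nearby peak of the potential. Because the climb starts close to the expected peak, the outcome depends less on an arbitrary initial guess, which is the sensitivity the abstract sets out to reduce.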