Stream Weight Training Based on MCE for Audio-Visual LVCSR

来源 :清华大学学报（英文版） | 被引量 : 0次 | 上传用户：fanjolly

【摘要】

：

In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based o

【作者】

：

LIU Peng WANG Zuoying

【机构】

：

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

【出处】

：

清华大学学报（英文版）

【发表日期】

：

2005年2期

【关键词】

：

audio-visual speech recognition (AVSR) large vocabulary continuous speech recogn

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re-scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental results show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state-based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments.

其他文献

Actuator Fault Detection for Sampled-Data Systems in H∞ Setting

Actuator fault detection for sampled-data systems was investigated from the viewpoint of jump systems.With the aid of a prior frequency information on fault, su

期刊

sampled-data systemsfault detectionactuatorRiccati equation

Research on the Dyeing Properties of Silk-like Knitted Fabric of Ultrafine Polyester Fiber

In this paper, dyeing processes of silk-like fabric of ultrafine polyester fiber are studied through orthogonal experiment, dyeing properties (K/S value, L* val

期刊

ultra-fine polyesterknitted fabricdyeing

提高二灰碎石早期强度试验研究

通过室内外试验 ,提出了采用 Na2 CO3提高二灰碎石早期强度的方法、措施和使用效果 Through indoor and outdoor experiments, the methods, measures and effects of using

期刊

半刚性基层二灰碎石早期强度收缩裂缝

基于DSP的C0mpactFlash卡接口设计

介绍Compact Flash卡的基本结构和工作原理;结合美国德州仪器(TI)公司的TMS320C54x系列数字信号处理器(DSP),详细地说明了DSP与CompactFlash卡接口设计中的关键软硬件技术;同

期刊

DSP CompactFLash卡 CPLD

On SCP Overload Control in Mobile Intelligent Network Based on Queue Size

For there is no overload control signal between Service Switch Point and Service Control Point at CALEL 2 stage of mobile intelligent network, we provide an ove

期刊

mobile intelligent networkservice control pointleaky bucket algorithmM/M/1/N

Joint Synchronization and Channel Estimation for OFDM Systems

OFDM systems are extremely sensitive to synchronization and channel estimation imperfections. Meanwhile the timing, frequency synchronization and channel estima

期刊

OFDMWLANsynchronizationchannel estimationmaximum likelihood

科技期刊插图中线条图的计算机加工方法

充分利用科技期刊作者提供的电子文档,采用 Photoshop 以及其他相应的绘图软件,运用其强大的处理工具直接修改插图中的线条图,可以获得不失真且清晰的图像。实际应用结果表明

期刊

科技期刊线条图图像处理计算机

Numerical Simulation of Recirculating Flow and Decarburization in RH Vacuum Refining Degasser

Based on the principle of RH process and the mechanism of decarburization,a three-dimensional mathematical model to represent the flow and decarburization of mo

期刊

RH vacuum refiningfluid flowdecarburizationnumerical simulation

Ag、Ta元素对MOS2抗氧化性影响的研究

用离子束辅助沉积方法(IBAD)制备MoS2-Ag和MoS2-Ta复合膜以及MoS2膜.用XPS分别检测在相对湿度100%室温环境下存放45天和室温去离子水浸泡158h以及在430℃加热1h后的三种膜中M

期刊

离子束辅助沉积MoS2-Ag复合膜MoS2-Ta复合膜氧化

Application of Six-Sequence Fault Components in Fault Location for Joint Parallel Transmission Line

A new fault location method based on six-sequence fault components was developed for parallel lines based on the fault analysis of a joint parallel transmission

期刊

fault locationjoint parallel linesix-sequence componentstwo-terminal

Stream Weight Training Based on MCE for Audio-Visual LVCSR

与本文相关的学术论文