Nonlinear Time-Frequency Distributions of Spectrum Energy Operator in Large Vocabulary Mandarin Spe

来源 :Tsinghua Science and Technology | 被引量 : 0次 | 上传用户:wgsgdy
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This work demonstrates the use of the nonlinear time-frequency distribution (NLTFD) of a discrete time energy operator (DTEO) based on amplitude modulation-frequency modulation demodulation techniques as a feature in speech recognition. The duration distribution based hidden Markov module in a speaker independent large vocabulary mandarin speech recognition system was reconstructed from the feature vectors in the front-end detection stage. The goal was to improve the performance of the existing system by combining new features to the baseline feature vector. This paper also deals with errors associated with using a pre-emphasis filter in the front end processing of the present scheme, which causes an increase in the noise energy at high frequencies above 4 kHz and in some cases degrades the recognition accuracy. The experimental results show that eliminating the pre-emphasis filters from the pre-processing stage and using NLTFD with compensated DTEO combined with Mel frequency cepstrum components give a 21.95% reduction in the relative error rate compared to the conventional technique with 25 candidates used in the test. This work demonstrates the use of the nonlinear time-frequency distribution (NLTFD) of a discrete time energy operator (DTEO) based on amplitude modulation-frequency modulation demodulation techniques as a feature in speech recognition. The duration distribution based hidden Markov module in a speaker independent large vocabulary mandarin speech recognition system was reconstructed from the feature vectors in the front-end detection stage. The goal was to improve the performance of the existing system by combining new features to the baseline feature vector. This paper also deals with errors associated with using a pre-emphasis filter in the front end processing of the present scheme, which causes an increase in the noise energy at high frequencies above 4 kHz and in some cases degrades the recognition accuracy. The experimental results show that eliminating the pre-emphasis filters from the pre-processing stage and using NLTFD with compensated DTEO combined with Mel frequency cepstr um components give a 21.95% reduction in the relative error rate compared to the conventional technique with 25 candidates used in the test.
其他文献
怎样区别苹果轮纹病和炭疽病苹果轮纹病和炭疽病是危害苹果果实的两种主要病害,越接近果实成熟期越严重,二者经常同时出现,病症相近,容易混淆,可从以下几个方面区别:①患轮纹病和炭
中国通信学会定于1982年第四季度召开第二次学术年会。年会内容包括两部分,即邮政网路及自动化学术交流会和通信网路学术交流会。各专业委员会将开展以专题、小型为主的学术
在小型电视机上,如何用单极天线能够接收良好,图象清晰,这是用户所关心的问题。这里所介绍的一种改进的单极天线装置,就能满足这方面的要求。图1是过去采用单极天线的电视机
  目的:本课题通过采用蛋白质组学的方法对粪肠球菌ATCC 19433进行分析,以获得其蛋白质组信息.方法:样品制备:粪肠球菌ATCC19433菌株在PYG培养基中于37℃恒温培养箱中震摇培养
会议
[摘要]文章结合《合唱与指挥》的课程特点,提出了“合唱视唱”这一课题,并阐述了“合唱视唱”在《合唱与指挥》教学中的重要意义和实施的方法。  [关键词]合唱视唱 练声曲 课程设计 开放性教学  一、合唱教学中“合唱视唱”的意义  作曲家甘霖曾说:“如果交响乐是人类智慧文明的高端文化,合唱曲则是集这种高端艺术和天然魅力于一身的产物,用运动项目来比喻,它就是音乐里的高尔夫,是一种修养和技术很高的艺
  研究目的:采用体外培养的牙髓细胞,探讨Rho/ROCK通路对细胞内β-catenin释放、活化及核转位的影响。材料方法:采用LPA、TGF-β 1刺激牙髓细胞,Y-27632阻断剂抑制ROCK,通过
会议
读及一则有趣的寓言.rn蝎子想过河,但它不会游泳,请求青蛙驮它.青蛙起初不答应,因为怕蝎子蜇它.但是,蝎子反问道:“如果我这样做,大家不是会同归于尽吗?”青蛙认为有道理,就
期刊
  目的:选用NF-PLLA/nHA和海藻酸钙水凝胶两种支架材料在体内构建组织工程化的牙本质和牙髓样组织.方法:将支架材料制备成园盘状,由内外两部分组成.外部为NF-PLLA/nHA材料,用于
会议
当代社恐青年的现状:听见邻居门响,会默默等待对方走了再出门; 在走廊遇见同事,会赶紧低头假装刷微信……然而以上这些场面都不算什么.对社恐青年来说,更难熬的是出门消费不
期刊
  目的:评价经使用甲硝唑、米诺环素和环丙沙星三联抗生素消毒后,再血管化法对无髓伴根尖周炎的年轻犬牙的治疗效果及探讨根管内新生组织的组织学特征。方法:杂种犬3只,选取牙
会议