Topic-aware pivot language approach for statistical machine translation

Source: Journal of Zhejiang University (English Edition), Series C: Computers & Electronics
The pivot language approach for statistical machine translation (SMT) is an effective way to break the resource bottleneck for certain language pairs. In conventional implementations, however, pivot-side context information is far from fully utilized, which leads to erroneous estimates of translation probabilities. In this study, we propose two topic-aware pivot language approaches that exploit different levels of pivot-side context. The first method uses document-level context, assuming that bridged phrase pairs should have similar document-level topic distributions. The second method focuses on local context. Central to this approach are the assumptions that the sense of a phrase is reflected by its local context in the form of probabilistic topics, and that bridged phrase pairs should be compatible in their latent sense distributions. We then build an interpolated model that combines the two methods to further improve system performance. Experimental results on French-Spanish and French-German translation, using English as the pivot language, demonstrate the effectiveness of topic-based context in pivot-based SMT.
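The idea of bridging phrase pairs through a pivot can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy French-English-Spanish entries, the two-topic distributions, and the choice of a Hellinger-based similarity as the topic weight are all assumptions made for illustration. The baseline is standard phrase-table triangulation, p(t|s) = Σ_p p(t|p) · p(p|s); the topic-aware variant simply down-weights bridges whose pivot-side topic distributions disagree, then renormalizes.

```python
from collections import defaultdict
from math import sqrt


def hellinger_similarity(p, q):
    """Return 1 minus the Hellinger distance of two discrete distributions."""
    bc = sum(sqrt(a * b) for a, b in zip(p, q))  # Bhattacharyya coefficient
    return 1.0 - sqrt(max(0.0, 1.0 - bc))


def triangulate(src_pivot, pivot_tgt, topic_aware=True):
    """Bridge source->pivot and pivot->target phrase tables.

    Each table maps phrase -> {phrase: (probability, topic_distribution)}.
    With topic_aware=False this is plain triangulation; with True, each
    bridge is scaled by a topic similarity (a hypothetical weighting,
    not necessarily the one used in the paper).
    """
    scores = defaultdict(lambda: defaultdict(float))
    for src, pivots in src_pivot.items():
        for piv, (p_piv_src, topics_sp) in pivots.items():
            for tgt, (p_tgt_piv, topics_pt) in pivot_tgt.get(piv, {}).items():
                weight = (hellinger_similarity(topics_sp, topics_pt)
                          if topic_aware else 1.0)
                scores[src][tgt] += p_tgt_piv * p_piv_src * weight

    # Renormalize so that the scores for each source phrase sum to one.
    table = {}
    for src, tgts in scores.items():
        total = sum(tgts.values())
        if total > 0:
            table[src] = {tgt: s / total for tgt, s in tgts.items()}
    return table


# Toy data (invented): topic distributions are over [finance, nature].
SRC_PIVOT = {"banque": {"bank": (1.0, [0.9, 0.1])}}
PIVOT_TGT = {"bank": {"banco": (0.5, [0.9, 0.1]),
                      "orilla": (0.5, [0.1, 0.9])}}

if __name__ == "__main__":
    print(triangulate(SRC_PIVOT, PIVOT_TGT, topic_aware=False))
    print(triangulate(SRC_PIVOT, PIVOT_TGT, topic_aware=True))
```

In this toy case the plain bridge cannot distinguish the two Spanish candidates for the ambiguous pivot "bank", while the topic-weighted bridge prefers the candidate whose pivot-side topics match those of the French phrase.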