Automatic Construction of Domain Terminology Knowledge Base for HowNet Based on the Headword

来源 :第十二届全国机器翻译研讨会 | 被引量 : 0次 | 上传用户:ufs2269acjx
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  HowNet is a Chinese-English Bilingual common-sense knowledge base,playing an important role in machine translation tasks.However,when fac-ing domain-specific machine translation tasks,HowNet must be supplemented with domain-specific terminologies.In other words,we need to construct domain terminology semantic knowledge base.In this paper,we propose a method to automatically construct domain terminology knowledge base,based on the head-word of a terminology.Specifically,the semantic meaning(HowNet DEF)of an unseen terminology is defined as one of the semantic meanings of the headword of the terminology.Headword disambiguation is done by considering the context of headwords and adding domain-specific disambiguation rules to the general disambiguation rules.Experiments on aviation domain show that our proposed method on headword disambiguation achieves 9.4%improvement based on the default disambiguation tools in HowNet.We also find that with our automatically constructed domain terminology knowledge base,HowNet machine translation system achieves better translation quality.
其他文献
  基于《傲慢与偏见》一对五平行语料库,以百度在线翻译的《傲慢与偏见》为重点,选取叠词作为研究对象,进行百度在线翻译与人工译本在12 类叠词翻译上异同的对比,以及百度在线
基于平行语料库的机器翻译研究是机器翻译的研究重点。平行语料库为机器学习翻译所需的句法规则提供了保障,也为统计机器翻译提供了资源。本文选择联合国官方资料库
会议
  Machine Translation(MT)evaluation is very important for a MT system.In this paper,we investigate an improved Cilin-based smoothed BLEU(BLEUS-syn).As the pos
会议
  深度神经网络是深度学习技术的代表,是一种模拟人脑的计算模型,已经在人工智能的很多领域取得了突破性的成果。基于神经网络的机器翻译是利用深度神经网络进行源语言到目标
由于汉语药品种类繁多,人工翻译药品说明书对译员专业水平要求较高,翻译成本较高.为了降低翻译成本,提高效率,本文根据药品说明书的特点,对药品用法用量机器翻译方
会议
  Current neural machine translation(NMT)usually extracts a fixed-length semantic representation for source sentence,and then depends on this representation t
会议
该文将深度学习技术应用于汉藏机器翻译任务中,采用了编码器-解码器结构.在编码阶段,首先将汉语句子中的每个词映射为定长的词向量,并通过循环神经网络压缩整个句子
会议
面向专利领域的机器翻译近年来已成为机器翻译的重要应用领域之一.本文提出了一个汉英专利文本机器翻译融合系统,该系统以规则系统为主导搭建,并把规则翻译方法和基
会议
  翻译模板是对自然语言现象高度的总结概括,也是机器翻译重要的资源,翻译模板的质量关系着机器翻译系统的效能,所以对翻译模板的编写一直是机器翻译领域研究的难题。本文采用
  In order to improve the efficiency of human translation,there is an increasing interest in applying machine translation(MT)to computer assisted translation(
会议