Adversarial Domain Adaptation for Chinese Semantic Dependency Graph Parsing

来源 :第十八届中国计算语言学大会暨中国中文信息学会2019学术年会 | 被引量 : 0次 | 上传用户:fangfei123456
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  The Chinese Semantic Dependency Graph(CSDG)Parsing reveals the deep and fine-grained semantic relationship of Chinese sentences,and the parsing results have a great help to the downstream NLP tasks.However,most of the existing work focuses on parsing in a single domain.When transferring to other domains,the performance of the parser tends to drop dramatically.And the target domain often lacks the annotated data,so it is difficult to train the parser directly in the target domain.To solve this problem,we propose a lightweight yet effective domain adaptation component for CSDG parsing that can be easily added to the architecture of existing single domain parser.It contains a data sampling module and an adversarial training module.Furthermore,we present CC SD,the first Chinese Cross-domain Semantic graph Dependency dataset.Experiments show that with the domain adaptation component we proposed,the model can effectively improve the performance in the target domain.On the CCSD dataset,our model achieved state-of-the-art performance with significant improvement compared to the strong baseline model.
其他文献
In order to solve the problem of data sparseness caused by less training corpus in Tibetan-Chinese transliteration,this paper ana-lyzes the alignment granularity of Tibetan-Chinese names as the resear
It is widely accepted that part-of-speech(POS)tagging and dependency parsing are highly related.Most state-of-the-art dependency parsing methods still rely on the results of POS tagging,though the tag
Text correction after automatic speech recognition(ASR)is an im-portant method to improve the speech recognition system.We regard the speech error correction as a translation task—from the language of
Online news platforms have gained huge popularity for online news reading.The topic categories of news are very important for these platforms to target user interests and make personalized recommendat
Sentence selection and summary generation are two main steps to generate informative and readable summaries.However,most previous works treat them as two separated subtasks.In this paper,we propose a
学位
学位
Learning multi-lingual sentence embeddings usually requires large scale of parallel sentences which are difficult to obtain.We propose a novel self-learning approach which is capable of learning multi
学位
Online news platforms have attracted massive users to read digital news online.The demographic information of these users such as gender is critical for these platforms to provide personalized service