Linked Document Classification by Network Representation Learning

来源 :第十七届全国计算语言学学术会议暨第六届基于自然标注大数据的自然语言处理国际学术研讨会(CCL 2018) | 被引量 : 0次 | 上传用户:menglimengwaiszy
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Network Representation Learning(NRL)can learn a latent space rep-resentation of each vertex in a topology network structure to reflect linked in-formation.Recently,NRL algorithms have been applied to obtain document embedding in linked document network,such as citation websites.However,most existing document representation methods with NRL are unsupervised and they cannot combine NRL with a concrete task-specific NLP tasks.So in this paper,we propose a unified end-to-end hybrid Linked Document Classification(LDC)model which can capture semantic features and topological structure of documents to improve the performance of document classification.In addition,we investigate to use a more flexible strategy to capture structure similarity to improve the traditional rigid extraction of linked document topology structure.The experimental results suggest that our proposed model outperforms other document classification methods especially in the case of having less training sets.
其他文献
Relation extraction is an important part of many information extrac-tion systems that mines structured facts from texts.Recently,deep learning has achieved good results in relation extraction.Attentio
Using sequence-to-sequence models for abstractive text sum-marization is generally plagued by three problems: inability to deal with out-of-vocabulary words,repetition in summaries and time-consuming
As an essential sub-task of frame-semantic parsing,Frame Identifica-tion(FI)is a fundamentally important research topic in shallow semantic pars-ing.However,most existing work is based on sophisticate
The evaluation of word embeddings has received a considerable amount of attention in recent years,but there have been some debates about whether intrinsic measures can predict the performance of downs
Task-oriented dialog systems usually face the challenge of querying knowledge base.However,it usually cannot be explicitly modeled due to the lack of annotation.In this paper,we introduce an explicit
At present,the research on Tibetan machine translation is mainly fo-cused on Tibetan-Chinese machine translation and the research on Chinese-Tibetan machine translation is almost blank.In this paper,t
Extracting term translation pairs is of great help for Chinese histori-cal classics translation since term translation is the most time-consuming and challenging part in the translation of historical
Nowadays,research on stylistic features(SF)mainly focuses on two aspects: lexical elements and syntactic structures.The lexical elements act as the content of a sentence and the syntactic structures c
Dialogue intent detection and semantic slot filling are two critical tasks in nature language understanding(NLU)for task-oriented dialog systems.In this paper,we present an attention-based encoder-dec
In recent years,mining opinions from customer reviews has been widely explored.Aspect-level sentiment analysis is a fine-grained subtask,which aims to detect the sentiment polarity towards a partic-ul