,Knowledge extraction from Chinese wiki encyclopedias

来源 :浙江大学学报(英文版)(C辑:计算机与电子) | 被引量 : 0次 | 上传用户:qq12433184000
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
The vision of the Semantic Web is to build a ‘Web of data’ that enables machines to understand the semantics of information on the Web.The Linked Open Data (LOD) project encourages people and organizations to publish various open data sets as Resource Description Framework (RDF) on the Web,which promotes the development of the Semantic Web.Among various LOD datasets,DBpedia has proved a successful structured knowledge base,and has become the central interlinking-hub of the Web of data in English.However,in the Chinese language,there is little linked data published and linked to DBpedia.This hinders the structured knowledge sharing of both Chinese and cross-lingual resources.This paper deals with an approach for building a large-scale Chinese structured knowledge base from Chinese wiki resources,including Hudong and Baidu Baike.The proposed approach first builds an ontology based on the wiki category system and infoboxes,and then extracts instances from wiki articles.Using Hudong as our source,our approach builds an ontology containing 19 542 concepts and 2381 properties.802 593 instances are extracted and described using the concepts and properties in the extracted ontology and 62 679 of them are linked to equivalent instances in DBpedia.As from Baidu Baike,our approach builds an ontology containing 299 concepts,37 object properties,and 5590 data type properties.1319 703 instances are extracted from Baidu Baike,and 84 343 of them are linked to instances in DBpedia.We provide RDF dumps and SPARQL endpoint to access the established Chinese knowledge bases.The knowledge bases built using our approach can be used not only in Chinese linked data building,but also in many useful applications of large-scale knowledge bases,such as question-answering and semantic search.
其他文献
Methods for pressure sore monitoring remain both a clinical and research challenge.Improved methodologies could assist physicians in developing prompt and effective pressure sore interventions.In this
学位
学位
当前就业工作是高校的一项重要工作.对于成立时间短,就业经验并丰富的独立学院而言,在如今严峻的就业形势下,面临的挑战更多.针对这一问题,独立院校应该提供学生需要的就业指
Contrast evaluation can be used as a criterion to evaluate performance of contrast enhancement algorithms and to compare contrast capability of display systems.
Inverse distance weighting (IDW) interpolation and viewshed are two popular algorithms for geospatial analysis.IDW interpolation assigns geographical values to
三月三十日,我国许多报纸刊登了新华社播发的我国秦始皇陵考古工作有新的突破一稿,所用标题各有千秋。上海《文汇报》的标题是: 我考古工作取得新突破(肩题) 秦皇安眠两千载
学位