,Heuristics based semantic annotation of biodiversity documents in Chinese

来源 :中国文献情报(英文刊) | 被引量 : 0次 | 上传用户:gba2008
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Purpose:To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.Design/methodology/approach:Data set consists of 1,000 randomly selected documents from Flora of China.Comparative evaluation of the proposed approach with the Naive Bayes algorithm have been developed before for the same purpose.Findings:Experimental results show that the heuristics based algorithm outperformed the Naive Bayes algorithm.The use of leading words helped improving the annotation performance while prioritizing rule application based on their weights had no significant impact on algorithm performance.Research limitations:The ICTCLAS was used to identify word boundaries off-shelf without optimatization for biodiversity domain.This may have not made the best use of the tool.Practical implications & Originality/value:The performance of heuristics based approach,enhanced by leading words analysis,reached an F value of 0.9216,which is sufficiently accurate for practical use.
其他文献
最近,某报登了一篇短文。内容是,电视新闻报道了某回顾展中的一幅19年前的照片,引起了被摄进此照片的短文作者的回忆。作者介绍了拍摄此照片的过程。其中一段话颇值得玩味:
结合实际,针对智慧城镇建设与地市级高等职业教育互动机制研究进行了论述。 According to the actual situation, this paper discusses the research on the interaction m
Purpose:To meet the changing needs of academic and specialized users,university and research libraries are transforming their collections,staff,and services.At
在生活中,青年朋友们都有自己的业余爱好,有的喜欢体育,有的爱好音乐,而我呢,却爱好新闻写作。屈指算来,从我写的第一篇小稿被采用至今,已整整5年了。这5年里,全国各级报刊
期刊
采集东莞电镀工业区周边农田的表层及不同深度的土壤和农产品样品,分析土壤和农产品可食部分Cu、Zn、Pb、Cd、Hg、As的含量,根据国家环境质量标准和广东省土壤元素背景值,采用综
随着我国城市化速度的加快,蔬菜的种植面积以及产量都急剧增加。蔬菜田土壤质量的可持续发展有着非常重要的意义。   本研究以太湖地区典型的蔬菜基地为背景,经详细调查,分别
Purpose:This study examines Chinese college students’ awareness of ethical issues surrounding the use of information resources and the Intet and their attitude
本文通过盆栽试验,采用平板培养法、氯仿熏蒸浸提法等传统方法以及脂肪酸甲脂(FAME)、末端限制性片段长度多态(T-RFLP)和梯度凝胶电泳(PCR-DGGE)等现代微生物分析技术,研究了
Purpose:The purpose of this research is to investigate Chinese rural women’s information needs and information seeking behavior,with an emphasis on exploration