AUTOMATIC PATENT DOCUMFNT SUMMARIZATION FOR COLLABORATIVE KNOWLEDGE SYSTEMS AND SERVICES

来源 :Journal of Systems Science and Systems Engineering | 被引量 : 0次 | 上传用户:hyzxp01
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Engineering and research teams often develop new products and technologies by referring to inventions described in patent databases.Efficient patent analysis builds R&D knowledge,reduces new product development time,increases market success,and reduces potential patent infringement. Thus,it is beneficial to automatically and systematically extract information from patent documents in order to improve knowledge sharing and collaboration among R&D team members.In this research, patents are summarized using a combined ontology based and TF-IDF concept clustering approach. The ontology captures the general knowledge and core meaning of patents in a given domain.Then, the proposed methodology extracts,clusters,and integrates the content of a patent to derive a summary and a cluster tree diagram of key terms.Patents from the International Patent Classification(IPC) codes B25C,B25D,B25F(categories for power hand tools)and B24B,C09G and H011(categories for chemical mechanical polishing)are used as case studies to evaluate the compression ratio,retention ratio,and classification accuracy of the summarization results.The evaluation uses statistics to represent the summary generation and its compression ratio,the ontology based keyword extraction retention ratio,and the summary classification accuracy.The results show that the ontology based approach yields about the same compression ratio as previous non-ontology based research but yields on average an 11%improvement for the retention ratio and a 14%improvement for classification accuracy. Engineering and research teams often develop new products and technologies by referring to inventions described in patent databases. Efficient patent analysis builds R & D knowledge, reduces new product development time, increases market success, and reduces potential patent infringement. Thus, it is beneficial to automatically and systematically extract information from patent documents in order to improve knowledge sharing and collaboration among R & D team members. In this research, patents are summarized using a combined ontology based and TF-IDF concept clustering approach. The ontology captures the general knowledge and core meaning of patents in a given domain. Chen, the proposed methodology extracts, clusters, and integrates the content of a patent to derive a summary and a cluster tree diagram of key terms. Patents from the International Patent Classification (IPC) codes B25C, B25D, B25F categories for power hand tools) and B24B, C09G and H011 (categories for chemical mechanical polishing) are use d as case studies to evaluate the compression ratio, retention ratio, and classification accuracy of the summarization results. The evaluation uses statistics to represent the summary generation and its compression ratio, the ontology based keyword keyword retention retention, and the summary classification accuracy. results show that the ontology based approach yields about the same compression ratio as previous non-ontology based research but yields on average an 11% improvement for the retention ratio and a 14% improvement for classification accuracy.
其他文献
在课程改革的推动和素质教育的贯彻推行之下,我国高中阶段各学科教研都处于转型关键时期,特别是普高历史课程标准在高中历史教育中的实施,针对于高中历史教师提出了新的要求
目的:探讨中药有效成分穿心莲内酯对白念珠菌生物膜分散细胞凋亡的影响。方法:Hoechst33258染色荧光显微镜检测白念珠菌生物膜细胞凋亡的形态;Rh123染色流式细胞仪检测白念珠
年收入12万元以上者须自行申报个税。两年前,中国首次用立法手段对“为富不税”、“为富不露”行为说“不”。不过,具体执行时,多半还是凭个人自觉。当收入在中国还停留在“
林业统计工作是我国林业建设中的关键内容,其在实际实施的过程中能促进我国林业建设的良好发展.本文将重点研究计算机技术在林业统计中的应用,利用现今的科学技术提高林业统
[目的]了解上海市中心城区有5岁以下儿童家庭吸烟者的吸烟知识、信念、行为,为制定控烟干预措施和相关政策提供依据。[方法]采用结构式问卷进行基线数据的收集,对上海市2个社
5月12日全省地税系统“解放思想、深化改革、扩大开放、科学发展”大讨论活动视频动员大会、专题报告会在省局八楼会议室召开,16个州、市地税局、129个区(县、市)地税局设分
随着社会经济的迅速发展,我国的交通事业有了长足的进步。作为交通建设的重要组成部分,桥梁的建设步伐更是以前所未有的规模在全国各地展开。再加上桥梁技术的突飞猛进,当下
曹雁:生于1960年,现为北京少儿特长生学校校长。幼时因患小儿麻痹症而双腿致残。1993年,她凭借两根拐杖支撑身体,以坚强的毅力战胜种种困难,创办了北京少儿特长生学校。曾荣
Regulatory T cells (Treg) play important roles in immune system homeostasis, and may also be involved in tumorimmunotolerance by suppressing Th1 immune respons
本文通过(+)-樟脑缩呋喃甲亚胺的不对称烷基化反应,合成了(R)-α-烷基糠胺。反应的非对映选择性经~1H NMR测定为5~67%(d.e.).用1,3-二碘丙烷和α,α-二溴邻二甲苯作烷基化试剂,