WordNet-based lexical semantic classification for text corpus analysis

来源 :中南大学学报(英文版) | 被引量 : 0次 | 上传用户:chunlai_zhang
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Many text classifications depend on statistical term measures to implement document representation. Such document representations ignore the lexical semantic contents of terms and the distilled mutual information, leading to text classification errors. This work proposed a document representation method, WordNet-based lexical semantic VSM, to solve the problem. Using WordNet, this method constructed a data structure of semantic-element information to characterize lexical semantic contents, and adjusted EM modeling to disambiguate word stems. Then, in the lexical-semantic space of corpus, lexical-semantic eigenvector of document representation was built by calculating the weight of each synset, and applied to a widely-recognized algorithm NWKNN. On text corpus Reuter-21578 and its adjusted version of lexical replacement, the experimental results show that the lexical-semantic eigenvector performsF1 measure and scales of dimension better than term-statistic eigenvector based on TF-IDF. Formation of document representation eigenvectors ensures the method a wide prospect of classification applications in text corpus analysis.
其他文献
In order to enhance the robustness and contrast in the minimum variance (MV) beamformer, adaptive diagonal loading method was proposed. The conventional diagona
期刊
能源是人类赖以生存、经济发展和社会进步不可缺少的重要物质资源,是社会经济可持续发展的物质基础,能源问题一直是我国经济和社会发展中的热点和难点。中国是能源生产和消费大
学位
1月11日,由信雅达文化艺术与钱江晚报共同主办的“精彩写神——余宏达人物画展”在信雅达·三清上艺术中心拉开帷幕。展览展出了余宏达近几年创作的工笔、写意人物画60余幅及
本文以科技攻关课题《金属支架承载力增强技术研究》为背景,以提高U型钢金属支架承载力为研究中心,目的在于揭示金属支架在不同数量锚杆(索)梁补强情况下的承载力增强的程度,
请下载后查看,本文暂不支持在线获取查看简介。 Please download to view, this article does not support online access to view profile.
期刊
自愿性信息披露的技巧主要体现在对披露内容、披露的详细程度及披露时间的把握上。 The techniques of voluntary disclosure are mainly reflected in the content of disc
在金融市场,如何合理地配置资金,使得投资的风险最小化、收益最大化是一个热门的问题,投资组合的效用理论是优化投资组合的一种有效途径.本文从效用函数出发,在已有效用函数
请下载后查看,本文暂不支持在线获取查看简介。 Please download to view, this article does not support online access to view profile.
期刊