Innovating Web Page Classification Through Reducing Noise

来源 :计算机科学技术学报 | 被引量 : 0次 | 上传用户:yy349764474
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This paper presents a new method that eliminates noise in Web page classification. It first describes the presentation of a Web page based on HTML tags. Then through a novel distance formula, it eliminates the noise in similarity measure. After carefully analyzing Web pages, we design an algorithm that can distinguish related hyperlinks from noisy ones.We can utilize non-noisy hyperlinks to improve the performance of Web page classification (the CAWN algorithm). For any page, wecan classify it through the text and category of neighbor pages related to the page. The experimental results show that our approach improved classification accuracy.
其他文献
The title copolymer(PDEBO) was synthesized. The thermal characteristics of the polymer were determined by means of DSC and TGA, revealing that the polymer has a
The two-parameter family of Estevez-Mansfield-Clarkson equations with fully nonlinear dispersion (called E(m, n) equations), (uzm)zzr + γ(unzur)z + urr = 0 whi
Carbon-coated oxidized graphite has been prepared by a liquid-state deposition method. Oxidized graphite was prepared by wet chemical oxidation. Oxidation incre
The synthesis and crystal structure of a novel calix[8]arene ester are reported herein. The calix[8]arene ester derivative has been characterized by IR, NMR an
研究飞机外挂物在按给定轨迹下沉旋转的离机过渡过程中,流场非结构网格的调整和再生成方法以及过渡过程欧拉方程有限元数值解法.文中指出外挂物运动时流场网格调整或再生成仅
As a potential application of titanium-oxide nanoparticles, it is extremely importantto investigate a detailed picture of the surface and interior structural pr
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7
A linear modelling of aeroacoustic waves propagation is discussed. The first point is an existence and uniqueness theorem. But restrictive assumptions are requi
The Se-Se bond in diaryl diselenides was reduced by Zn/ZrCl4 system to produce selenide anions, which react with acyl chlorides or acid anhydrides to afford sel
This paper shows that the -problem for holomorphic (0, 2)-forms on Hilbert spaces is solv able on pseudoconvex open subsets. By using this result, the authors i