Dimensionality Reduction by Mutual Information for Text Classification

Source: Journal of Beijing Institute of Technology (English Edition)
The framework of a text classification system is presented, and the high dimensionality of the feature space in text classification is studied. Mutual information is a widely used information-theoretic measure that describes the stochastic dependency between discrete random variables. It is used here as a criterion for reducing the high dimensionality of feature vectors in Web text classification. Feature selection or conversion is performed by maximizing mutual information, covering both linear and non-linear feature conversions. Entropy is used and extended to find suitable features in pattern recognition systems. This lays a sound foundation for text classification mining.
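
The abstract does not give the paper's exact formulation, so the following is only a minimal sketch of mutual information used as a feature selection criterion. It assumes the standard definition I(X;Y) = sum over x, y of p(x,y) log2( p(x,y) / (p(x) p(y)) ), with X taken as the presence or absence of a term in a document and Y as the class label. The function names `mutual_information` and `select_terms` and the toy corpus are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical I(X;Y) in bits for two equal-length sequences of discrete values."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written in terms of counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def select_terms(docs, labels, k):
    """Keep the k terms whose presence shares the most information with the label."""
    tokenized = [set(d.split()) for d in docs]
    vocab = sorted(set().union(*tokenized))
    scores = {}
    for term in vocab:
        presence = [term in toks for toks in tokenized]
        scores[term] = mutual_information(presence, labels)
    # Stable sort: ties keep alphabetical order from the sorted vocabulary.
    ranked = sorted(vocab, key=lambda t: -scores[t])
    return ranked[:k], scores


# Toy corpus (hypothetical): two classes, two documents each.
docs = ["goal match team", "team win goal", "stock market price", "price market fall"]
labels = ["sport", "sport", "finance", "finance"]
top_terms, scores = select_terms(docs, labels, k=4)
```

In this toy corpus, terms that occur in every document of one class and in none of the other reach the maximum of 1 bit, while a term seen in a single document scores lower. Keeping only the top-ranked terms shrinks the feature vector, which is the dimensionality reduction the abstract describes. The linear and non-linear feature conversions mentioned in the abstract are not covered by this sketch.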