Dimensionality Reduction by Mutual Information for Text Classification

来源 :北京理工大学学报（英文版） | 被引量 : 0次 | 上传用户：troy0215

【摘要】

：

The frame of text classification system was presented. The high dimensionality in feature space for text classification was studied. The mutual information is a

【作者】

：

LIU Li-zhen SONG Han-tao LU Yu

【机构】

：

Information Engineering College,School of Information Science and Technology,Laboratory of Intellige

【出处】

：

北京理工大学学报（英文版）

【发表日期】

：

2004年期

【关键词】

：

text classification mutual information dimensionality reduction

【基金项目】

：

国家重点基础研究发展计划(973计划)

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

The frame of text classification system was presented. The high dimensionality in feature space for text classification was studied. The mutual information is a widely used information theoretic measure, in a descriptive way, to measure the stochastic dependency of discrete random variables. The measure method was used as a criterion to reduce high dimensionality of feature vectors in text classification on Web. Feature selections or conversions were performed by using maximum mutual information including linear and non-linear feature conversions. Entropy was used and extended to find right features commendably in pattern recognition systems. Favorable foundation would be established for text classification mining.

其他文献

Effect of lead foam grid on performance of lead-acid battery

In order to increase the specific energy and specific power of a lead-acid battery, lead foam grid was prepared by electrodepositing Pb-Sn alloy on a copper foa

期刊

lead acid batterynegative active materialgrid materiallead foamspecific capa

Principle of Coordinates Acquisition Based on Single Camera

The principle and accuracy of 3-D coordinates acquisition using one single camera and the Aided Measuring Probe(AMP) are discussed in this paper. Using one sing

期刊

3-D coordinates acquisitionsingle cameracollinear equationspatial resectiona

Carrier synchronization for STBC OFDM systems

All-digital carrier synchronization strategies and algorithms for space-time block coding (STBC) orthogonal frequency division multiplexing (OFDM) are proposed

期刊

OFDMSTBCCarrier synchronization

Recent Development of Data Acquisition System on HL-2A

This paper introduced recent development of data acquisition system(DAS) on the HL-2A tokamak. The existing DAS has to be remodeled because of the evident impro

期刊

tokamakthe data acquisition systemlight-emitting diode

Bayesian texture segmentation based on wavelet domain hidden markov tree and the SMAP rule

According to the sequential maximum a posteriori probability (SMAP) rule, this paper proposes a novel multi-scale Bayesian texture segmentation algorithm based

期刊

wavelet transformhidden markov treeEM algorithm

孩子吃饭无需孙子兵法

三岁男孩童童总是不好好吃饭，一小碗饭得吃一个多小时，几乎每一口都要费一番周折才能吃进去。他妈妈、爸爸和奶奶每天为孩子吃饭用尽了招数，连哄带骗，软硬兼施。

期刊

孙子兵法吃饭孩子

Sensitivity of a new-developed neutron detector

We develop a kind of neutron detector, which consists of a polyethylene thin film and two PIN semiconductors connected face-to-face. The detector is insensitive

期刊

SensitivityPIN detectorNeutronPulse radiationPolyethylene converter

Investigation of optimal control system for arc spraying

Arc-voltage feedback PID ( Proportional plus Integral plus Differential) controller and arc-current feedback PID controller are designed with an algorithm of di

期刊

arc-sprayingmathematical modelPID controllergenetic algorithmadaptive contro

The study of antioxidant activities of fucoidan from Laminaria japonica

Two large molecular weight fucoidan fractions F-A and F-B were obtained by water extraction and anion-exchange chromatography and then L-A and L-B with low mole

期刊

Laminaria japonicafucoidanoxygen radicalslow-density lipoproteinantioxidant

A high-precision control system for robotic welding positioner

Aiming at the robotic welding positioner with characteristic of parameter change, load change, nonlinearity, and an intelligent control system was researched an

期刊

numerically controlled welding positionerintelligent tow-mode controllermulti-

Dimensionality Reduction by Mutual Information for Text Classification

与本文相关的学术论文