Document indexing is the conversion of the content of collected documents into retrieval identifiers, according to a controlled vocabulary or classification scheme, so that each document carries identifiers that guide retrieval. In automatic indexing of Chinese information, the computer itself marks documents with these identifier symbols. Indexing is the process of building the computer's index entries; an index may be organized by classification, subject, author, publication notes, and so on, and thus provides one or more routes and means of retrieval. Below, the author focuses on the dictionary-matching indexing method, the segmentation-marking method, the word-frequency statistical indexing method, the semantic and pragmatic analysis word-segmentation method, and other Chinese