Building on rule-based information extraction, this work designs a rule document that specifies the extraction rules, then uses XML technology to extract information from Taiwanese scientific and technical literature in PDF format. The resulting structured data is imported into a SQL Server database. Finally, ASP is used to build a convenient and intelligent information retrieval platform.
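The pipeline described above (rules, XML, database) can be illustrated with a minimal sketch. It is not the authors' implementation: the rule set, the field names (`title`, `author`, `year`), the input layout, and the `documents` table are all hypothetical, the text is assumed to have already been extracted from the PDF, and Python's built-in `sqlite3` stands in for SQL Server so that the example stays self-contained.

```python
import re
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical extraction rules: field name -> regex with one capture group.
# In the paper these rules live in a separate rule document; here they are inline.
RULES = {
    "title": re.compile(r"^Title:\s*(.+)$", re.MULTILINE),
    "author": re.compile(r"^Author:\s*(.+)$", re.MULTILINE),
    "year": re.compile(r"^Year:\s*(\d{4})$", re.MULTILINE),
}


def extract(text):
    """Apply every rule to the text and return one structured record."""
    record = {}
    for field, pattern in RULES.items():
        match = pattern.search(text)
        record[field] = match.group(1).strip() if match else None
    return record


def to_xml(record):
    """Serialize a record as a small XML fragment (assumed intermediate format)."""
    root = ET.Element("document")
    for field, value in record.items():
        ET.SubElement(root, field).text = value or ""
    return ET.tostring(root, encoding="unicode")


def store(conn, record):
    """Insert the record into a hypothetical 'documents' table.

    sqlite3 is a stand-in for SQL Server in this sketch.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS documents (title TEXT, author TEXT, year TEXT)"
    )
    conn.execute(
        "INSERT INTO documents (title, author, year) VALUES (:title, :author, :year)",
        record,
    )
    conn.commit()
```

Keeping the rules in a data structure separate from the extraction loop mirrors the idea of a standalone rule document: adding a field means adding a rule, not changing code.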