Biodiversity research urgently needs a digital library built on multi-source data. A biodiversity digital library based on a virtual user community shares the characteristics of general digital libraries in data types, storage requirements, and sharing methods, but it also has distinctive features in data mining and application. Drawing on a survey of digital libraries in China and abroad and on cooperation with the Biodiversity Heritage Library and Internet Archive projects, this paper summarizes the data types found in various digital libraries and briefly introduces two data standards relevant to building a biodiversity digital library: Dublin Core and TaxonX. It then designs a system framework with three modules (data aggregation; data cleaning, conversion, and translation; and external data services), proposes the system architecture and functions of a biodiversity digital library, and shows the running results of the parts of the system already implemented. Finally, it discusses future issues concerning copyright, full-text recognition, massive data volumes, and scalability.
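As a rough illustration of the three-module framework described above, the following minimal Python sketch chains an aggregation step, a cleaning and conversion step, and a serving step that emits Dublin Core XML. It is not the paper's implementation: the function names (aggregate, normalize, to_dc_xml), the source field names (title, author, lang), and the sample record are all hypothetical. Only the Dublin Core element namespace and the standard-library xml.etree.ElementTree calls are real.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def aggregate(sources):
    """Module 1 (aggregation): merge records from several named sources.

    `sources` maps a source name to a list of raw record dicts
    (hypothetical input shape, assumed for this sketch).
    """
    return [dict(rec, source=name) for name, recs in sources.items() for rec in recs]


def normalize(record):
    """Module 2 (cleaning/conversion): map assumed raw fields to Dublin Core terms."""
    return {
        "title": record["title"].strip(),
        "creator": record.get("author", "").strip(),
        "language": record.get("lang", "und"),  # "und" = undetermined
        "source": record["source"],
    }


def to_dc_xml(dc_record):
    """Module 3 (external service): serialize one record as Dublin Core XML."""
    root = ET.Element("record")
    for term, value in dc_record.items():
        if value:  # skip empty fields
            ET.SubElement(root, f"{{{DC_NS}}}{term}").text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample data, for illustration only.
sources = {"partner-library": [{"title": "  Example Flora ", "author": "A. Author", "lang": "zh"}]}
records = [normalize(r) for r in aggregate(sources)]
xml_text = to_dc_xml(records[0])
```

Keeping each module as a plain function over dictionaries makes the stages easy to test in isolation. A real system would presumably add persistent storage, the translation step, and TaxonX markup for taxonomic treatments, none of which is modeled here.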