论文部分内容阅读
大数据的产生为电子政务带来了新的机遇与挑战,也为作为电子政务信息资源之一的组织机构代码提供了全新的认知理解角度。目前政府决策时使用的数据信息资源仍未完全统一,存在数据结构和类型差异明显、数据资源不统一等问题。为了使这些孤立的数据能够更好地实现资源共享,把位于不同信息源上的数据融合起来,本文在分析讨论组织机构代码和大数据共同特点的基础上,提出一种基于多源组织机构代码信息的数据融合方法。该方法基于组织机构代码、法人信息、组织机构名称3个方面信息,实现不同来源的信息融合。实验表明,该方法的融合率达到97%,准确率为87.4%。
The emergence of big data has brought new opportunities and challenges for e-government, and also provided a completely new cognitive perspective for the organization code as one of the e-government information resources. At present, the data and information resources used by the government in decision-making have not yet been completely unified. There are such problems as obvious differences in data structure and types and inconsistent data resources. In order to make these isolated data to achieve better resource sharing, the data are located on different sources of information fusion, the paper analyzes and discusses the common characteristics of the organization code and big data, based on the proposed a multi-source organization code Information fusion method. The method based on the organization code, corporate information, organization name three aspects of information, to achieve the integration of information from different sources. Experiments show that the fusion rate of the method reaches 97% and the accuracy rate is 87.4%.