A Clustering Algorithm for Planning the Integration Process of a Large Number of Conceptual Schemas

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:yht52119
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
When tens and even hundreds of schemas are involved in the integration process, criteria are needed for choosing clusters of schemas to be integrated, so as to deal with the integration problem through an e?cient iterative process. Schemas in clusters should be chosen according to cohesion and coupling criteria that are based on similarities and dissimilarities among schemas. In this paper, we propose an algorithm for a novel variant of the correlation clustering approach that addresses the problem of assisting a designer in integrating a large number of conceptual schemas. The novel variant introduces upper and lower bounds to the number of schemas in each cluster, in order to avoid too complex and too simple integration contexts respectively. We give a heuristic for solving the problem, being an NP hard combinatorial problem. An experimental activity demonstrates an appreciable increment in the effectiveness of the schema integration process when clusters are computed by means of the proposed algorithm w.r.t. the ones manually defined by an expert.
其他文献
Big data processing is becoming a standout part of data center computation. However, latest research has indicated that big data workloads cannot make full use
随着我国经济的不断发展以及人们生活水平的不断提高,人们对电的需求也越来越大,因而对电力行业的要求也越来越高。为了保证电力行业改革的有序推进,必须提高我国电力营销的信息
Phase change memory (PCM) is a promising technology for future memory thanks to its better scalability and lower leakage power than DRAM (dynamic random-access
E?cient resource utilization requires that emerging datacenter interconnects support both high performance communication and e?cient remote resource sharing. Th
As the scaling of applications increases, the demand of main memory capacity increases in order to serve large working set. It is di?cult for DRAM (dynamic rand
Social network analysis (SNA) views social relationships in terms of network theory consisting of nodes and ties. Nodes are the individual actors within the net
Due to advances in semiconductor techniques, many-core processors have been widely used in high performance computing. However, many applications still cannot b
The first issue of Advances in Atmospheric Sciences (AAS) was published in 1984.Originally quarterly,the journal later became bimonthly and will now be publishe
期刊
@@
随着市场经济的发展,电力行业的发展也面临着全新的挑战,因此在新的电力市场环境中,我国的供电企业应该重新审视自己的电力营销服务管理,积极发现其中的问题,并采取相应的措施对电
设计了连铸结晶器弯月面处传热模拟实验,测量了在结晶器振动情况下弯月面处的温度,发现该处的温度随着结晶器的振动而产生周期性的变化.实验结果表明,结晶器振动频率越大,振