Estimating Data Warehouse Granularity

Source: 电脑编程技巧与维护 (Computer Programming Skills & Maintenance) | Cited by: 2
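In recent years, with the emergence of big data, traditional database technology has to some extent struggled to meet the query and storage demands that big data imposes, and newer, higher standards have been set for data query and storage technology. The data warehouse emerged on the foundation of database technology as a result. When designing a data warehouse, determining the granularity level is an important factor for data query and storage. At present the granularity level is determined mainly by estimation algorithms, and such estimates carry a degree of fuzziness. After reviewing several granularity estimation algorithms and the related literature, this paper proposes a granularity-level estimation algorithm with the MMTD algorithm at its core. The idea of the proposed algorithm is: make a preliminary estimate of the maximum and minimum row counts for the granularity, then take the actual row count together with the estimated maximum and minimum row counts and ...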
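The first step named in the abstract, a preliminary estimate of the minimum and maximum row counts, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `RowEstimate`, `estimate_rows`, and `relative_position`, the per-day growth inputs, and the linear position ratio are hypothetical and are not the paper's MMTD-based procedure, whose details are not given in the available text.

```python
from dataclasses import dataclass


@dataclass
class RowEstimate:
    """Estimated row-count bounds for one table over a planning horizon."""
    min_rows: int
    max_rows: int


def estimate_rows(min_rows_per_day: int, max_rows_per_day: int, days: int) -> RowEstimate:
    """Scale assumed daily growth bounds to a horizon of `days` days.

    Assumption: growth is linear in time; a real estimate may need
    seasonality or retention rules.
    """
    if days <= 0:
        raise ValueError("days must be positive")
    if min_rows_per_day < 0 or max_rows_per_day < min_rows_per_day:
        raise ValueError("need 0 <= min_rows_per_day <= max_rows_per_day")
    return RowEstimate(min_rows_per_day * days, max_rows_per_day * days)


def relative_position(actual_rows: int, est: RowEstimate) -> float:
    """Locate the actual row count relative to the estimated bounds.

    Returns 0.0 at the minimum and 1.0 at the maximum; values below 0
    or above 1 mean the actual count falls outside the estimated range.
    This plain ratio is a stand-in, not the MMTD measure.
    """
    span = est.max_rows - est.min_rows
    if span == 0:
        raise ValueError("estimated bounds are identical")
    return (actual_rows - est.min_rows) / span


if __name__ == "__main__":
    est = estimate_rows(min_rows_per_day=1_000, max_rows_per_day=5_000, days=365)
    print(est)
    print(relative_position(1_095_000, est))
```

A value near 1 or above it would suggest that the chosen granularity is too fine for the expected volume, while a value near 0 would suggest room for more detail; how the paper itself draws that conclusion is not recoverable from the abstract.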