Interval Estimation for Aggregate Queries on Incomplete Data

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:sddhyyj
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Incomplete data has been a longstanding issue in the database community, and the subject is yet poorly handled by both theories and practices. One common way to cope with missing values is to complete their imputation (filling in) as a preprocessing step before analyses. Unfortunately, not a single imputation method could impute all missing values correctly in all cases. Users could hardly trust the query result on such complete data without any confidence guarantee. In this paper, we propose to directly estimate the aggregate query result on incomplete data, rather than to impute the missing values. An interval estimation, composed of the upper and the lower bound of aggregate query results among all possible interpretations of missing values, is presented to the end users. The ground-truth aggregate result is guaranteed to be among the interval. We believe that decision support applications could benefit significantly from the estimation, since they can tolerate inexact answers, as long as there are clearly defined semantics and guarantees associated with the results. Our main techniques are parameter-free and do not assume prior knowledge about the distribution and missingness mechanisms. Experimental results are consistent with the theoretical results and suggest that the estimation is invaluable to better assess the results of aggregate queries on incomplete data.
其他文献
本文对3 4例40个卵巢囊肿进行了CT引导下的穿刺注入硬化剂治疗,并进行了追踪观察随访其结果满意。现报告如下。1 资料与方法1.1 一般资料 本组3 4例,年龄18~5 7岁,平均2 9
With the popularity of storing large data graph in cloud, the emergence of subgraph patt matching on a remote cloud has been inspired. Typically, subgraph patt
想念故乡那小镇,可是,如今却不能回去安居,那儿早已没有我们的老屋,也没有什么直系亲属。但每年的端午,我却想方设法地要回去,因为小镇的青山绿水间,有独特的过节习俗。每到端午,都是我和故乡亲近的日子。  端午,历来为小镇人所看重,这是祖辈传下来的。过端午的所有民俗仪式,都在悄无声息的秩序中进行着。早在端午前几天,人们就计算着日子,联络亲友,安排食品和船只。青绿的菖蒲静悄悄地插于大门上,粽子、包子、皮蛋
期刊
视网膜母细胞瘤是幼儿常见的眼内恶性肿瘤。其诊断除依靠典型的临床病史、体征外,CT是首选的检查方法。对明确疾病的诊断、病变范围及向球外蔓延发展进程有独特优势。现将我
期刊
期刊
期刊
体外试验表明,噬菌蛭弧菌BD57能够裂解K88大肠杆菌、K99大肠杆菌、嗜水气单胞菌.分别用107、105 PFU/mL浓度噬菌蛭弧菌BD57治疗人工感染鼠伤寒沙门氏菌的小鼠,小鼠存活率分别
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊