Fast filtering false active subspaces for efficient high dimensional similarity processing

来源 :Science in China(Series F:Information Sciences) | 被引量 : 0次 | 上传用户:jiugeqingjiao
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
The query space of a similarity query is usually narrowed down by pruning inactive query subspaces which contain no query results and keeping active query subspaces which may contain objects corre-sponding to the request. However,some active query subspaces may contain no query results at all,those are called false active query subspaces. It is obvious that the performance of query processing degrades in the presence of false active query subspaces. Our experiments show that this problem becomes seriously when the data are high dimensional and the number of accesses to false active sub-spaces increases as the dimensionality increases. In order to solve this problem,this paper proposes a space mapping approach to reducing such unnecessary accesses. A given query space can be re-fined by filtering within its mapped space. To do so,a mapping strategy called maxgap is proposed to improve the efficiency of the refinement processing. Based on the mapping strategy,an index structure called MS-tree and algorithms of query processing are presented in this paper. Finally,the performance of MS-tree is compared with that of other competitors in terms of range queries on a real data set. The query space of a similarity query is usually narrowed down by pruning inactive query subspaces which contain no query results and keeping active query subspaces which may contain objects corre-sponding to the request. However, some active query subspaces may contain no query results at all , who are false that the performance of query processing degrades in the presence of false active query subspaces. Our experiments show that this problem becomes seriously when the data are high dimensional and the number of accesses to false active In order to solve this problem, this paper proposes a space mapping approach to reduce such unnecessary accesses. A given query space can be re-fined by filtering within its mapped space. To do so, a mapping strategy called maxgap is proposed to improve the efficiency of the refinement processing. Based on the mapping strategy, an index structure called MS -tree and algorithms of query processing are presented in this paper. Finally, the performance of MS-tree is compared with that of other competitors in terms of range queries on a real data set.
其他文献
今年2月,一位日本公民在北京医科大学第三医院成形外科研究中心通过手术获得了理想的社会性别,这是中国整形外科成功驾驭变性手术技术以来,接受的第一例来自境外的患者。近
对经济的高增长和通货膨胀现象的思考向新民改革以来,我国经济发展中有二个值得注意的现象令人深思;一是国民经济在扩张一紧缩一再扩张一再紧缩的循环交替中运行。二是高赤字、高通货膨胀伴随着经济的高增长。上述二种现象如长期持续下去的话,是否会把我国国民经济的运...
记得十余年前“新时期”刚刚开始时,艺文作品中的“罗曼史”都“一窝蜂”地以图书馆为背景,男女主角大都呆头呆脑傻模傻样酸酸唧唧地在“为四化”苦读中羞羞答答地渐通款曲,
不久前召开的市委六届二次全体(扩大)会议,提出了发奋图强,大胆开拓,实现全市改革开放的大突破,经济建设大发展的宏伟目标。科技工作如何适应新形势的发展,贴紧经济工作这个
3月26日下午,春阳和暖,与本刊记者殷金娣、王辉相约前去拜访夏衍同志。夏公的卧室,逼狭简朴。一张小床、一张茶几和几把椅子之外,便是满架的书籍,床上、几上也散放着杂志与
为做好十八城市试点技术改造工作,10月8至9日,国家经贸委技改司在沈阳市召开了全国十八城市试点技术改造工作座谈会。十八城市经委主任、技改处长及所在省经贸委技改处长共6
1992~1993年度是国家“八五”计划中具有承上启下重要意义的年头,也是我校“八五”科技发展规划实施的关键时期。一年来,全校教师、科技人员、管理人员继续发扬“太行山道路
1944年我由敌后根据地调回延安,12月到中央党校六部学习.入校后才知道校长是毛主席.1945年2月15日下午,我们六部和五部的同学,都到北关外中央党校礼堂听报告.当我们入室坐好
然而,如许众多三资企业的诞生,也由此而衍变出一系列的“病毒现象”,并大有愈演愈烈的趋势,着实令人不敢小觑。今采撷几则,以作前车之鉴,抑或“亡羊补牢”之用。 However,
在5月28日结束于福建省厦门市的中国林学会第八次全国会员代表大会上,徐永椿、毛品一、李文政、陈介、易培同等同志合作完成的《云南树木图志》获第二届梁希奖。中国林学会