论文部分内容阅读
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of cachemisses by improving the data temporal locality for multiple concurrent queries. Our idea is based on the observation that, inmany workloads such as decision support systems (DSS), there is usually significant amount of data sharing among differentconcurrent queries. MiniTasking exploits such data sharing to improve data temporal locality by scheduling query executionat three levels: query level batching, operator level grouping and mini-task level scheduling. The experimental results withvarious types of concurrent TPC-H query workloads show that, with the traditional N-ary Storage Model (NSM) layout,MiniTasking significantly reduces the L2 cache misses by up to 83%, and thereby achieves 24% reduction in execution time.With the Partition Attributes Across (PAX) layout, MiniTasking further reduces the cache misses by 65% and the executiontime by 9%. For the TPC-H throughput test workload, MiniTasking improves the end performance up to 20%.