Knowledge Discovery using sequential pattern mining

来源 :山东建筑大学 | 被引量 : 0次 | 上传用户:bigfish
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Mining of sequential pattern algorithms is the most important in data mining field and is the key of many knowledge discovery applications.However,running such applications need memory and time,particularly when dealing with vast amounts databases.Choosing the unsuitable support threshold is the main factors to consume additional memory as well as time.On the other hand,it may present huge numerous of frequent patterns and that is hard to obtain the useful patterns,and it is not easy to compare the results.The problem itself will be increased and be more complicated,especially if the sequences are long such as stream sequences.  To solve this problem,we redefine the problem of mining sequence patterns as the problem of mining the Top-K Sequential Patterns,where K is the number of sequential patterns to be set by the user.The current best algorithms for this problem are TSP,TKS.This study introduces the research on the conception of developing an effective pattern sequential model to overcome the aforementioned problem by organizing discovered patterns.There are three aims of this study:1)To reduce melnory consumption:a dynamic technic is proposed,where the minimum support is set dynamically instead of static;and,the algorithln is based on pseudo-projection and BI-Directional Extension collectively;2)To reduce time consumption:we supported the algorithm with three space pruning functions in order to update the minimum support upon the discovered patterns.Finally,a more efficient algorithm than standard algorithms is proposed;3)improve the accuracy of multilinear regression energy system model through cleaning training sets using the proposed efficient algorithm.  The extensive study and experiments were done on various real datasets with different sizes,which demonstrates that the proposed algorithm is more efficient compared with the other related algorithms.
其他文献
实际生活中的大多数控制系统都是非线性控制系统,在非线性控制中,反馈线性化是常用的方法。当系统存在参数不确定性或未建模动态时,采用反馈线性化设计的控制器鲁棒性不能保证。
城市交通系统作为一切城市活动的载体,是整个城市系统赖以生存和发挥效能的物质基础。道路交通运输的好坏对一个地区经济和文化的发展影响甚大。随着汽车行业的发展,交通需求日
最近几年,随着视频编解码技术的发展、网络架构开发部署的加快、存储能力的增强以及计算能力的提高,互联网上的多媒体服务,尤其是视频服务得到了飞速的发展。视频业务涵盖了包括
基于Internet的移动机器人远程控制是当前移动机器人领域的研究热点。传统的移动机器人远程控制系统大多以实际移动机器人为研究对象,当前研究缺少仿真程度较高的移动机器人算
学位
定位技术是无线传感器网络的核心支撑技术,基于RSSI的定位技术以其无需额外的硬件,定位成本低,已被无线传感器网络作为重要选择之一。作为GPS系统的补充,RSSI定位技术可广泛
自平衡两轮小车是一种结构简单、体积小、运动灵活的轮式机器人,类似于一级倒立摆,可以实现零半径转向以及复杂路径的跟踪,鉴于以上自平衡两轮小车的优点,在实际的军用领域与民用
化工过程生产条件较为复杂,其生产装置往往处于高温、高压等极端条件下,因此,即便是一些微小的异常变动也有可能引起整个系统的崩坏,从而导致生产中断甚至装置爆炸、毒气泄漏等一系列后果。化工过程一旦出现故障,不仅会给工厂带来严重经济损失,也会对周围环境造成严重的破坏,更甚者会威胁现场工人的人身安全。倘若能提前检测和诊断出过程故障,不但能有效够缩减停产时间和降低生产成本,也能增强设备运行的安全性,从而保证现
时滞现象广泛存在于实际的系统中,且往往引发系统的不稳定,从而使系统的稳定性研究和控制变得很复杂。因此对时滞系统进行稳定性分析及控制具有重要的理论意义和实用价值。本
溴素在医药、阻燃剂、油田、摄影材料、农药等多个领域均具有十分广泛的应用。本文应用的空气吹出法提溴是比较常用的海水提溴方法,但是在很多情况下,由于环境因素以及介质的腐