Mining Interesting Knowledge from Web-Log

来源 :Wuhan University Journal of Natural Sciences | 被引量 : 0次 | 上传用户:wjw842008
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Web-log contains a lot of information related with user activities on the Internet. How to mine user browsing interest patterns effectively is an important and challengeable research topic. On the analysis of the present algorithm’s advantages and disadvantages, we propose a new concept: support-interest. Its key insight is that visitor will backtrack if they do not find the information where they expect. And the point from where they backtrack is the expected location for the page. We present User Access Matrix and the corresponding algorithm for discovering such expected locations that can handle page caching by the browser. Since the URL-URL matrix is a sparse matrix which can be represented by List of 3-tuples, we can mine user preferred sub-paths from the computation of this matrix. Accordingly, all the sub-paths are merged, and user preferred paths are formed. Experiments showed that it was accurate and scalable. It’s suitable for website based application, such as to optimize website’s topological structure or to design personalized services. How to mine user browsing interest patterns effectively is an important and challengeable research topic. On the analysis of the present algorithm’s advantages and disadvantages, we propose a new concept: support And key point is that the expected location for the page. We present User Access Matrix and the corresponding algorithm for discovering such expected locations that can handle page caching by the browser. Since the URL-URL matrix is ​​a sparse matrix which can be represented by List of 3-tuples, we can mine user preferred sub-paths from the computation of this matrix. sub-paths are merged, and user preferred paths are formed. Experiments showed that it was accurate and scalable. It’s suitable for website based application, such as to optimiz e website’s topological structure or to design personalized services.
其他文献
“四网一会”是全员覆盖、快速反应、整体链接、功能互补的网络化思想政治工作体系,是闭合运转的整体。“四网一会”保证体系是开滦集团根据企业思想政治工作面临的新形势、
革命烈军工属居住农村的占百分之九十,他们的父兄儿女,有的已经投身成仁,为革命而牺牲;有的则仍继续为捍卫祖国而奋斗,一般说来,大多数家里是贫困而又缺乏劳动力,生活是很困
湖南株洲陈斌读者: 弹壳虽然较薄,但由于武器发射时,自动机与枪膛成闭锁状态,弹壳在弹膛内,弹壳壁紧贴枪管弹膛,虽然火药气体压力作用于弹壳,却不会炸烂弹壳。大多数弹壳是
这个星期天正好是母亲节,学校为我们准备了一项特别的体验活动——让我们每人保护一个鸡蛋,做一回鸡蛋的父母。在一天一夜的时间中,我必须时刻陪伴着鸡蛋,保护它让它免受伤害
可可的爸爸是个老烟民,他每天10分钟就要抽一根烟,平均下来,每天抽烟的总数竟然达到了96根,他的身体越来越不好。可可终于看不下去了,她对爸爸说:“爸爸,你再抽这么多烟,身体
期刊
各专署、县(市)人民政府直辖市、县人民政府自治区:目前秋收完后,在今年春季各地,因春荒曾提倡「自由借贷,保证有借有还」借贷政策的口号下,各地曾借贷了很多粮款,大大的活
福州市军事管制委员会已公布「反动党团及特务人员申请登记实施办法」,指定军事管制委员会公安部自即日起办理登记工作。这是进一步肃清反革命残余势力,彻底摧毁反动组织,巩
作为本刊2003年11期发表的题为《购买防火墙和VPN产品需要考虑哪些问题?》的续篇,我们在本期推出旨在进一步帮助企业选购更深层网络安全防护产品指南。希望借此为企业网管、I
主送机关:江西省人民政府交通厅抄送机关:公路局三月十四日你们送来的交人(25)字第二七二号报告收悉。你厅公路局关于道班劳动竞赛运动和发动群众搬运砂石挖掘古坆砖料的检
平时,我们上网的时候,会有一种文件被自动保存在本地计算机的硬盘上,这就是“Internet临时文件”。在“Internet临时文件”项目中,虽然微软公司说将查看过的网页保存起来,以