A Cooperative Schema between Web Sever and Search Engine for Improving Freshness of Web Repository

来源 :Wuhan University Journal of Natural Sciences | 被引量 : 0次 | 上传用户:qutong19921107
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Because the web is huge and web pages are updated frequently, the index maintained by a search engine has to refresh web pages periodically. This is extremely resource consuming because the search engine needs to crawl the web and download web pages to refresh its index. Based on present technologies of web refreshing, we present a cooperative schema between web server and search engine for maintaining freshness of web repository. The web server provides meta-data defined through XML standard to describe web sites. Before updating the web page the crawler visits the meta-data files. If the meta-data indicates that the page is not modified, then the crawler will not update it. So this schema can save bandwidth resource. A primitive model based on the schema is implemented. The cost and efficiency of the schema are analyzed. Because the web is huge and web pages are updated frequently, the index maintained by a search engine has to refresh web pages periodically. This is extremely resource consuming because the search engine needs to crawl the web and download web pages to refresh its index. Based on present technologies of web refreshing, we present a cooperative schema between web server and search engine for maintaining freshness of web repository. The web server provides meta-data defined through XML standard to describe web sites. Before updating the web page the crawler visits the meta-data files. If the meta-data indicates that the page is not modified, then the crawler will not update it. So this schema can save bandwidth resource. A primitive model based on the schema is implemented. The cost and efficiency of the schema are analyzed.
其他文献
<正> 近年来,经济增长理论的研究越来越受到我国经济学界和实际部门同志的重视。由于我国对经济增长理论的研究还刚刚起步,而国外对这一课题的研究已有相当长的历史,因此科学地借鉴国外的研究成果是十分必要的。最近,中国人民大学出版社出版了由胡乃武、金碚主编的《国外经济增长理论比较研究》一书,就是系统地研究国外经济增长理论的一个很有
你在阅读时总是为了文章中的人物、事件、情节而悲伤、欢笑吗?通过阅读你总能够获得无限的乐趣吗?那么,在互联网中,你只需轻点鼠标,你喜欢的各种小说、卡通漫画以及时尚的网
音乐使人快乐,蕴含着丰富的音乐文化,音乐可以调动人的情感、舒缓人的心情。小学音乐学习属于最初级的音乐学习,是进一步学习音乐的基础,在小学音乐学习中同学们的互动意识在
称霸的格局之下,霸主如果蛮横不讲理,小国就会遭受欺侮。这种情况下,小国在与大国的交往过程之中,尤其需要选择好外交官,外交人员的外交智慧也显得尤为重要。  春秋时期,由于周天子地位日渐式微,诸侯称霸遂成常态。霸主更迭,各领风骚,主持大局的不再是周王,而是占据霸主地位的诸侯。就称霸这种模式而言,定期召开盟会非常重要。这种盟会,军事目的和政治意图都非常明确。霸主召集大小诸侯一起开会,缔结条约,谋求互惠共
手机网游企业必须加强新品研发和推广,不能陷入同质化的恶性竞争中,重走SP的老路 Mobile online games companies must step up research and development and promotion of
1997年,全国155家医院普查肛肠疾病的结果显示,肛肠良性疾病占肛肠疾病总数的58.4%,其中痔占87.3%[1].大约有10%的痔患者需外科手术治疗。
经过十年快速的发展,中国互联网已经形成规模,互联网应用走向多元化。加之网络和无线的结合,对于信息时代的人们更能享受到其中带来的便利,此时的个性化需求也日显突出。在
魏征和房玄龄是大唐历史上两颗最耀眼的政治明星。房玄龄辅佐唐太宗治平天下,在宰相位上去世,共32年,是天下人称颂的贤相。魏征犯龙颜,批逆鳞,直言敢谏,为流芳史册的诤臣。但
北魏太武帝神■三年九月在其生母的故乡邺城为杜密太后立别庙,并享有较高的祭祀规格。与汉晋间为女性立庙的事例相较,密太后在太庙和故乡同时享有祭祀的事实显得十分特殊。又