Efficient Model Store and Reuse in an OLML Database System

Source: Journal of Computer Science and Technology (English edition)
Deep learning has brought significant improvements to a variety of machine learning tasks by introducing a wide spectrum of neural network models. Yet training these models requires labeling a tremendous amount of data, which is prohibitively expensive in practice. In this paper, we propose the OnLine Machine Learning (OLML) database, which stores trained models and reuses them in new training tasks to achieve a better training effect with a small amount of training data. An efficient model reuse algorithm, AdaReuse, is developed in the OLML database. Specifically, AdaReuse first estimates the reuse potential of trained models from domain relatedness and model quality, so that a group of trained models with high reuse potential for the training task can be selected efficiently. Then, the multiple selected models are trained iteratively to encourage diversity among them, and a better training effect is achieved by ensembling them. We evaluate AdaReuse on two types of natural language processing (NLP) tasks, and the results show that AdaReuse improves the training effect significantly compared with models trained from scratch when the training data is limited. Based on AdaReuse, we implement an OLML database prototype system that accepts a training task as an SQL-like query and automatically generates a training plan by selecting and reusing trained models. Usability studies are conducted to illustrate that the OLML database can properly store trained models and reuse them efficiently in new training tasks.
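The select-then-ensemble idea described above can be illustrated with a minimal sketch. It is not the AdaReuse algorithm itself: the abstract does not give the formula for reuse potential, so the product of domain relatedness and model quality used here is an assumption, as are the names `StoredModel`, `reuse_potential`, `select_models`, and `ensemble_predict`. The iterative training step that encourages diversity is omitted, and the ensemble is a plain sum of class probabilities.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class StoredModel:
    """A trained model kept in the model store (hypothetical structure)."""
    name: str
    domain_relatedness: float  # assumed to lie in [0, 1] for the new task
    model_quality: float       # assumed to lie in [0, 1]
    predict: Callable[[str], Dict[str, float]]  # text -> class probabilities


def reuse_potential(model: StoredModel) -> float:
    # Assumption: combine the two signals by multiplication.
    # The paper's actual estimator is not stated in the abstract.
    return model.domain_relatedness * model.model_quality


def select_models(models: Sequence[StoredModel], k: int) -> List[StoredModel]:
    """Return the k stored models with the highest reuse potential."""
    return sorted(models, key=reuse_potential, reverse=True)[:k]


def ensemble_predict(models: Sequence[StoredModel], text: str) -> str:
    """Sum class probabilities over the models and return the top label."""
    totals: Dict[str, float] = {}
    for model in models:
        for label, prob in model.predict(text).items():
            totals[label] = totals.get(label, 0.0) + prob
    return max(totals, key=totals.get)


if __name__ == "__main__":
    # Toy stand-ins for trained NLP classifiers.
    store = [
        StoredModel("a", 0.9, 0.8, lambda t: {"pos": 0.7, "neg": 0.3}),
        StoredModel("b", 0.2, 0.9, lambda t: {"pos": 0.1, "neg": 0.9}),
        StoredModel("c", 0.8, 0.7, lambda t: {"pos": 0.6, "neg": 0.4}),
    ]
    chosen = select_models(store, k=2)
    print([m.name for m in chosen])
    print(ensemble_predict(chosen, "example sentence"))
```

In this toy setting, model "b" has high quality but low relatedness to the new task, so it is ranked below the other two and left out of the ensemble.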