Stochastic Approximation for Expensive One-Bit Feedback Systems

Source: Tsinghua Science and Technology
One-bit feedback systems generate binary data as their output, and system performance is usually measured by the success rate under a fixed parameter combination. Traditional methods need many system executions for parameter optimization. Hence, it is impractical to use these methods in Expensive One-Bit Feedback Systems (EOBFSs), where a single system execution is costly in terms of time or money. In this paper, we propose a novel algorithm for parameter optimization, named Iterative Regression and Optimization (IRO), and a corresponding scheme for EOBFSs, named MLEPSO-IRO, which is based on the Maximum Likelihood Estimation (MLE) method and the Particle Swarm Optimization (PSO) method. IRO is an iterative algorithm in which each iteration comprises two parts: regression and optimization. Considering the structure of IRO and the Bernoulli distribution of the output of EOBFSs, MLE and a modified PSO are selected to implement the regression and optimization parts, respectively, in MLEPSO-IRO. We also provide a theoretical analysis of the convergence of MLEPSO-IRO, and numerical experiments on hypothesized EOBFSs and one real EOBFS in comparison with traditional methods. The results indicate that MLEPSO-IRO can provide a much better result with only a small number of system executions.
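The regression-then-optimization loop described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the one-dimensional parameter, the logistic-quadratic surrogate in `predict`, the plain gradient-ascent fit in `fit_mle`, the basic (unmodified) swarm in `pso_maximize`, and the function names themselves. The sketch only shows the general shape of the idea: fit a success-probability model to the one-bit outcomes by maximizing the Bernoulli log-likelihood, pick the next parameter by maximizing the fitted model, execute the system once, and repeat.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict(theta, x):
    # Assumed surrogate (not from the paper): sigmoid of a quadratic in x.
    a, b, c = theta
    return sigmoid(a + b * x + c * x * x)

def log_likelihood(theta, data):
    # Bernoulli log-likelihood of one-bit observations (x, y), y in {0, 1}.
    eps = 1e-12
    total = 0.0
    for x, y in data:
        p = min(max(predict(theta, x), eps), 1.0 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total

def fit_mle(data, steps=500, lr=0.1):
    # Regression part: gradient ascent on the mean log-likelihood.
    theta = [0.0, 0.0, 0.0]
    for _ in range(steps):
        grad = [0.0, 0.0, 0.0]
        for x, y in data:
            err = y - predict(theta, x)  # d(log-lik)/d(logit)
            grad[0] += err
            grad[1] += err * x
            grad[2] += err * x * x
        theta = [t + lr * g / len(data) for t, g in zip(theta, grad)]
    return theta

def pso_maximize(f, lo, hi, rng, particles=10, iters=30):
    # Optimization part: a basic 1-D particle swarm, clamped to [lo, hi].
    pos = [rng.uniform(lo, hi) for _ in range(particles)]
    vel = [0.0] * particles
    best_pos = pos[:]
    best_val = [f(p) for p in pos]
    g = max(range(particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g], best_val[g]
    for _ in range(iters):
        for i in range(particles):
            vel[i] = (0.7 * vel[i]
                      + 1.5 * rng.random() * (best_pos[i] - pos[i])
                      + 1.5 * rng.random() * (g_pos - pos[i]))
            pos[i] = min(max(pos[i] + vel[i], lo), hi)
            val = f(pos[i])
            if val > best_val[i]:
                best_pos[i], best_val[i] = pos[i], val
                if val > g_val:
                    g_pos, g_val = pos[i], val
    return g_pos

def iro(system, lo, hi, budget, rng, n_init=5):
    # Alternate regression and optimization until the execution budget is spent.
    data = [(x, system(x)) for x in (rng.uniform(lo, hi) for _ in range(n_init))]
    while len(data) < budget:
        theta = fit_mle(data)
        x_next = pso_maximize(lambda x: predict(theta, x), lo, hi, rng)
        data.append((x_next, system(x_next)))  # one costly execution
    theta = fit_mle(data)
    return pso_maximize(lambda x: predict(theta, x), lo, hi, rng), data

if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical one-bit system: success probability peaks near x = 0.6.
    toy = lambda x: 1 if rng.random() < math.exp(-8 * (x - 0.6) ** 2) else 0
    best_x, history = iro(toy, 0.0, 1.0, budget=20, rng=rng)
    print(best_x, len(history))
```

In this sketch each loop iteration costs exactly one system execution, which is the quantity an expensive system needs to keep small. The selection rule is purely greedy; how the actual scheme balances exploration and how its PSO is modified are not specified in the abstract.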