The skinner automaton: A psychological model formalizing the theory of operant conditioning

来源 :Science China(Technological Sciences) | 被引量 : 0次 | 上传用户:qq540531049
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Operant conditioning is one of the fundamental mechanisms of animal learning,which suggests that the behavior of all animals,from protists to humans,is guided by its consequences.We present a new stochastic learning automaton called a Skinner automaton that is a psychological model for formalizing the theory of operant conditioning.We identify animal operant learning with a thermodynamic process,and derive a so-called Skinner algorithm from Monte Carlo method as well as Metropolis algorithm and simulated annealing.Under certain conditions,we prove that the Skinner automaton is expedient,ε-optimal,optimal,and that the operant probabilities converge to the set of stable roots with probability of 1.The Skinner automaton enables machines to autonomously learn in an animal-like way. Operant conditioning is one of the fundamental mechanisms of animal learning, which suggests that the behavior of all animals, from protists to humans, is guided by its consequences. We present a new stochastic learning automaton called a Skinner automaton that is a psychological model for formalizing the theory of operant conditioning. We identify animal operant learning with a thermodynamic process, and derive a so-called Skinner algorithm from Monte Carlo method as well as Metropolis algorithm and simulated annealing. Undertaken conditions, we prove that the Skinner automaton is expedient, ε-optimal, optimal, and that the operant probabilities converge to the set of stable roots with probability of 1. The Skinner automaton enables machines to autonomously learn in an animal-like way.
其他文献
这不公平    每个赛季,我们都能听到主教练对联赛赛程的抱怨、争吵:“这就不是人做的事”(弗格森),“他们是一群从特殊教育学院里出来的人”(温格)……没有比这更难听的了。  其实,那些负责制定赛程的人也非常苦恼,英超联赛的主席理查兹说:“教练们总是认为赛程的制定有内幕,但现在的赛程都是通过电脑程序随机产生的,当然,抽到好签的确是需要一点运气。”意甲联赛主席贝雷塔也说:“不能让所有人满意,我们只能尽