Optimal Strategy for Concurrent Variable Interval Reinforcement Schedule

摘要：

Herrnstein experimentally studied the choice behavior of pigeons on a special reinforcement schedule, the concurrent variable interval (CVI) schedule, and found a famous matching law. The empirical behavior law is remarkably conserved across many kinds of species, but it has been viewed as an irrational behavior, which means that the matching behavior does not maximize reward. In this paper, we succinctly demonstrate that any strategies leading to matching law can obtain maximal rewards for the CVI reinforcement schedule in discrete time steps. In addition, we put forward a novel strategy algorithm that can earn the maximal reward in the CVI reinforcement schedule. Our results reveal that the matching behavior can be seen as a rational behavior in the reinforcement schedule.

关键词： Reinforcement Schedule Matching Law Optimal Strategy Matching Strategy

作者: Zhenbo Cheng Ming Liang Zhidong Deng

作者单位: Information and Engineering College, Zhejiang University of Technology, Hangzhou, 310014, China Stat State Key Laboratory on Intelligent Technology and Systems, Tsinghua National Laboratory for Informa

会议类型: 国际会议

会议名称: The 22nd China Control and Decision Conference(2010年中国控制与决策会议)

会议地点: 徐州

会议语种:英文

页码: 642-647

在线出版日期: 2010-05-26（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Optimal Strategy for Concurrent Variable Interval Reinforcement Schedule