离散非线性零和博弈的事件驱动最优控制方案

张欣; 薄迎春; 崔黎黎

引用本文:	张欣,薄迎春,崔黎黎.离散非线性零和博弈的事件驱动最优控制方案[J].控制理论与应用,2018,35(5):619~626.[点击复制]
	ZHANG Xin,BO Ying-chun,CUI Lili.Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games[J].Control Theory and Technology,2018,35(5):619~626.[点击复制]

离散非线性零和博弈的事件驱动最优控制方案

Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games

摘要点击 3043 全文点击 1587 投稿时间：2017-11-01 修订日期：2018-03-20

查看全文查看/发表评论下载PDF阅读器

DOI编号 10.7641/CTA.2018.70791

2018,35(5):619-626

中文关键词博弈论事件驱动自适应动态规划最优控制

英文关键词 game theory event-triggered adaptive dynamic programming optimal control

基金项目山东省自然科学基金项目(BS2015DX009), 国家自然科学基金项目(61703289)资助.

作者	单位	E-mail
张欣^*	中国石油大学(华东)	zhangxin@upc.edu.cn
薄迎春	中国石油大学(华东)
崔黎黎	沈阳师范大学

中文摘要

在求解离散非线性零和博弈问题时, 为了在有效降低网络通讯和控制器执行次数的同时保证良好的控制效果, 本文提出了一种基于事件驱动机制的最优控制方案. 首先, 设计了一个采用新型事件驱动阈值的事件驱动条件, 并根据贝尔曼最优性原理获得了最优控制对的表达式. 为了求解该表达式中的最优值函数, 提出了一种单网络值迭代算法. 利用一个神经网络构建评价网. 设计了新的评价网权值更新规则. 通过在评价网、控制策略及扰动策略之间不断迭代, 最终获得零和博弈问题的最优值函数和最优控制对. 然后, 利用Lyapunov稳定性理论证明了闭环系统的稳定性. 最后, 将该事件驱动最优控制方案应用到了两个仿真例子中, 验证了所提方法的有效性.

英文摘要

In order to reduce the network communication and controller execution frequency while guarantee a desired control performance, an event-triggered optimal control scheme is proposed for solving the optimal control pair of discretetime nonlinear zero-sum games in this paper. Firstly, an event-triggered condition with new event-triggered threshold is designed. The expression of the optimal control pair is obtained based on the Bellman optimality principle. Then, a single network value iteration algorithm is proposed to solve the optimal value function in this expression. A neural network is used to construct the critic network. Novel weight update rule of the critic network is derived. Through the iteration between the critic network, the control policy and the disturbance policy, the optimal value function and the optimal control pair can be solved. Further, the Lyapunov theory is used to prove the stability of the event-triggered closed-loop system. Finally, the event-triggered optimal control mechanism is applied to two examples to verify its effectiveness.