引用本文: | 张欣,薄迎春,崔黎黎.离散非线性零和博弈的事件驱动最优控制方案[J].控制理论与应用,2018,35(5):619~626.[点击复制] |
ZHANG Xin,BO Ying-chun,CUI Lili.Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games[J].Control Theory and Technology,2018,35(5):619~626.[点击复制] |
|
离散非线性零和博弈的事件驱动最优控制方案 |
Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games |
摘要点击 2907 全文点击 1569 投稿时间:2017-11-01 修订日期:2018-03-20 |
查看全文 查看/发表评论 下载PDF阅读器 |
DOI编号 10.7641/CTA.2018.70791 |
2018,35(5):619-626 |
中文关键词 博弈论 事件驱动 自适应动态规划 最优控制 |
英文关键词 game theory event-triggered adaptive dynamic programming optimal control |
基金项目 山东省自然科学基金项目(BS2015DX009), 国家自然科学基金项目(61703289)资助. |
|
中文摘要 |
在求解离散非线性零和博弈问题时, 为了在有效降低网络通讯和控制器执行次数的同时保证良好的控制
效果, 本文提出了一种基于事件驱动机制的最优控制方案. 首先, 设计了一个采用新型事件驱动阈值的事件驱动条
件, 并根据贝尔曼最优性原理获得了最优控制对的表达式. 为了求解该表达式中的最优值函数, 提出了一种单网络
值迭代算法. 利用一个神经网络构建评价网. 设计了新的评价网权值更新规则. 通过在评价网、控制策略及扰动策
略之间不断迭代, 最终获得零和博弈问题的最优值函数和最优控制对. 然后, 利用Lyapunov稳定性理论证明了闭环
系统的稳定性. 最后, 将该事件驱动最优控制方案应用到了两个仿真例子中, 验证了所提方法的有效性. |
英文摘要 |
In order to reduce the network communication and controller execution frequency while guarantee a desired
control performance, an event-triggered optimal control scheme is proposed for solving the optimal control pair of discretetime
nonlinear zero-sum games in this paper. Firstly, an event-triggered condition with new event-triggered threshold is
designed. The expression of the optimal control pair is obtained based on the Bellman optimality principle. Then, a single
network value iteration algorithm is proposed to solve the optimal value function in this expression. A neural network
is used to construct the critic network. Novel weight update rule of the critic network is derived. Through the iteration
between the critic network, the control policy and the disturbance policy, the optimal value function and the optimal control
pair can be solved. Further, the Lyapunov theory is used to prove the stability of the event-triggered closed-loop system.
Finally, the event-triggered optimal control mechanism is applied to two examples to verify its effectiveness. |
|
|
|
|
|