quotation:[Copy]
Yongfeng Lv1,Jun Zhao2,Wan Zhang1,Huimin Chang3.[en_title][J].Control Theory and Technology,2025,23(1):118~130.[Copy]
【Print page】 【Online reading】【Download 【PDF Full text】 View/Add CommentDownload reader Close

←Previous page|Page Next →

Back Issue    Advanced search

This Paper:Browse 34   Download 0 本文二维码信息
码上扫一扫!
Data-based neural controls for an unknown continuous-time multi-input system with integral reinforcement
YongfengLv1,JunZhao2,WanZhang1,HuiminChang3
0
(1 College of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan, Shanxi 030024, China;2 College of Transportation, Shandong University of Science and Technology, Qingdao 266590, Shandong, China;3 School of Mathematical Sciences, Shanxi University, Taiyuan 030006, Shanxi, China)
摘要:
Integral reinforcement learning (IRL) is an effective tool for solving optimal control problems of nonlinear systems, and it has been widely utilized in optimal controller design for solving discrete-time nonlinearity. However, solving the Hamilton– Jacobi–Bellman (HJB) equations for nonlinear systems requires precise and complicated dynamics. Moreover, the research and application of IRL in continuous-time (CT) systems must be further improved. To develop the IRL of a CT nonlinear system, a data-based adaptive neural dynamic programming (ANDP) method is proposed to investigate the optimal control problem of uncertain CT multi-input systems such that the knowledge of the dynamics in the HJB equation is unnecessary. First, the multi-input model is approximated using a neural network (NN), which can be utilized to design an integral reinforcement signal. Subsequently, two criterion networks and one action network are constructed based on the integral reinforcement signal. A nonzero-sum Nash equilibrium can be reached by learning the optimal strategies of the multi-input model. In this scheme, the NN weights are constantly updated using an adaptive algorithm. The weight convergence and the system stability are analyzed in detail. The optimal control problem of a multi-input nonlinear CT system is effectively solved using the ANDP scheme, and the results are verified by a simulation study.
关键词:  Adaptive dynamic programming · Integral reinforcement · Neural networks · Heuristic dynamic programming · Multi-input system
DOI:https://doi.org/10.1007/s11768-024-00238-2
基金项目:
Data-based neural controls for an unknown continuous-time multi-input system with integral reinforcement
Yongfeng Lv1,Jun Zhao2,Wan Zhang1,Huimin Chang3
(1 College of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan, Shanxi 030024, China;2 College of Transportation, Shandong University of Science and Technology, Qingdao 266590, Shandong, China;3 School of Mathematical Sciences, Shanxi University, Taiyuan 030006, Shanxi, China)
Abstract:
Integral reinforcement learning (IRL) is an effective tool for solving optimal control problems of nonlinear systems, and it has been widely utilized in optimal controller design for solving discrete-time nonlinearity. However, solving the Hamilton– Jacobi–Bellman (HJB) equations for nonlinear systems requires precise and complicated dynamics. Moreover, the research and application of IRL in continuous-time (CT) systems must be further improved. To develop the IRL of a CT nonlinear system, a data-based adaptive neural dynamic programming (ANDP) method is proposed to investigate the optimal control problem of uncertain CT multi-input systems such that the knowledge of the dynamics in the HJB equation is unnecessary. First, the multi-input model is approximated using a neural network (NN), which can be utilized to design an integral reinforcement signal. Subsequently, two criterion networks and one action network are constructed based on the integral reinforcement signal. A nonzero-sum Nash equilibrium can be reached by learning the optimal strategies of the multi-input model. In this scheme, the NN weights are constantly updated using an adaptive algorithm. The weight convergence and the system stability are analyzed in detail. The optimal control problem of a multi-input nonlinear CT system is effectively solved using the ANDP scheme, and the results are verified by a simulation study.
Key words:  Adaptive dynamic programming · Integral reinforcement · Neural networks · Heuristic dynamic programming · Multi-input system