强化学习在导弹制导中的应用

周　锐; 陈宗基

引用本文:	周　锐,陈宗基.强化学习在导弹制导中的应用[J].控制理论与应用,2001,18(5):748~750.[点击复制]
	ZHOU Rui,CHEN Zong-ji.Application of Reinforcement Learning in Missile Guidance[J].Control Theory and Technology,2001,18(5):748~750.[点击复制]

强化学习在导弹制导中的应用

Application of Reinforcement Learning in Missile Guidance

摘要点击 2389 全文点击 1681 投稿时间：2000-01-10 修订日期：2000-11-22

查看全文查看/发表评论下载PDF阅读器

DOI编号 10.7641/j.issn.1000-8152.2001.5.023

2001,18(5):748-750

中文关键词神经网络强化学习微分对策导弹制导

英文关键词 neural networks reinforcement learning differential games missile guidance

基金项目国家自然科学基金(69904002); 国防预研基金; 航天科技创新基金资助项目.

作者	单位
周　锐	北京航空航天大学自动控制系, 北京 100083
陈宗基	北京航空航天大学自动控制系, 北京 100083

中文摘要

简述了强化学习的基本原理和特点, 讨论了强化学习中评价函数的神经网络近似问题, 重点分析了采用多神经网络近似评价函数的学习问题, 实现了状态空间或任务的自动分解, 提高了评价函数的推广能力. 网络的学习是离线进行, 并作为反馈控制器在线应用. 并以A 学习为例, 将强化学习应用于导弹的制导问题, 仿真结果表明了强化学习在导弹制导或控制问题中的应用前景和有效性.

英文摘要

Principle and characteristic of reinforcement learning are outlined. The value function approximation of reinforcement learning with neural networks is studied, and the learning algorithm using modular neural networks to approximate the value function is emphatically analyzed, which decomposes the state space automatically and increases the generalizing ability of the neural networks. The neural networks are trained offline, and is used online as a feedback controller. The A learning algorithm is applied in the missile guidance problem, and the simulation results show the good performance and effectiveness of the application of reinforcement learning in those problems of missile guidance and control.