迭代学习神经网络控制在机器人示教学习中的应用(英文)

蒋　平; 李自育; 陈阳泉

引用本文:	蒋　平，李自育，陈阳泉.迭代学习神经网络控制在机器人示教学习中的应用(英文)[J].控制理论与应用,2004,21(3):447~452.[点击复制]
	JIANG Ping, LI Zi-yu, CHEN Yang-quan.Iterative learning neural network control for robot learning from demonstration[J].Control Theory and Technology,2004,21(3):447~452.[点击复制]

迭代学习神经网络控制在机器人示教学习中的应用(英文)

Iterative learning neural network control for robot learning from demonstration

摘要点击 2525 全文点击 1290 投稿时间：2001-03-30 修订日期：2002-12-31

查看全文查看/发表评论下载PDF阅读器

DOI编号 10.7641/j.issn.1000-8152.2004.3.025

2004,21(3):447-452

中文关键词迭代学习控制神经网络控制视觉伺服模仿学习

英文关键词 iterative learning control neural network control visual servoing imitation learning

基金项目 supportedbytheNationalNaturalScienceFoundationofChina ( 60 175 0 2 8)

作者	单位
蒋　平，李自育，陈阳泉	同济大学信息与控制工程系，上海200092；尤他州立大学电气与计算机工程系自组织与智能系统中心，美国尤他州 84322-4160

中文摘要

示教学习是机器人运动技能获取的一种高效手段.当采用摄像机作为示教轨迹记录部件时,示教学习涉及如何通过反复尝试获得未知机器人摄像机模型问题.本文力图针对非线性系统重复作业中的可重复不确定性学习,提出一个迭代学习神经网络控制方案,该控制器将保证系统最大跟踪误差维持在神经网络有效近似域内.为此提出了一个适合于重复作业应用的分布式神经网络结构.该神经网络由沿期望轨线分布的一系列局部神经网络构成,每一局部神经网络对对应期望轨迹点邻域进行近似并通过重复作业完成网络训练.由于所设计的局部神经网络相互独立,因此一个全程轨迹可以通过分段训练完成,由起始段到结束段,逐段实现期望轨迹的准确跟踪.该方法在具有未知机器人摄像机模型的轨迹示教模仿中得到验证,显示了它是一种高效的训练方法,同时具有一致的误差限界能力.

英文摘要

Learning from demonstration is an efficient way for transferring movement skill from a human teacher to a robot. Using a camera as a recorder of the demonstrated movement, a learning strategy is required to acquire knowledge about the \{nonlinearity\} and uncertainty of a robot-camera system through repetitive practice. The purpose of this paper is to design a neural network controller for vision-based movement imitation by repetitive tracking and to keep the maximum training deviation from a demonstrated trajectory in a permitted region. A distributed neural network structure along a demonstrated trajectory is proposed. The local \{networks\} for a segment of the trajectory are invariant or repetitive over repeated training and are independent of the other segments. As a result, a demonstrated trajectory can be decomposed into short segments and the training of the local neural \{networks\} can be done segment-wise progressively from the starting segment to the ending one. The accurate tracking of the whole demonstrated trajectory is thus accomplished in a step-by-step or segment-by-segment manner. It is used for trajectory imitation by demonstration with an unknown robot-camera model and shows that it is effective in ensuring uniform boundedness and efficient training.