Skill Learning for Robotic Insertion Based on One-shot Demonstration and Reinforcement Learning

来源 :国际自动化与计算杂志(英文版) | 被引量 : 0次 | 上传用户:keenkingzhu
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In this paper,an efficient skill learning framework is proposed for robotic insertion,based on one-shot demonstration and reinforcement learning.First,the robot action is composed of two parts:expert action ahd refinement action.A force Jacobian matrix is calibrated with only one demonstration,based on which stable and safe expert action can be generated.The deep deterministic policy gradients (DDPG) method is employed to learn the refinement action,which aims to improve the assembly efficiency.Second,an epis-ode-step exploration strategy is developed,which uses the expert action as a benchmark and adjusts the exploration intensity dynamic-ally.A safety-efficiency reward function is designed for the compliant insertion.Third,to improve the adaptability with different com-ponents,a skill saving and selection mechanism is proposed.Several typical components are used to train the skill models.And the trained models and force Jacobian matrices are saved in a skill pool.Given a new component,the most appropriate model is selected from the skill pool according to the force Jacobian matrix and directly used to accomplish insertion tasks.Fourth,a simulation environ-ment is established under the guidance of the force Jacobian matrix,which avoids tedious training process on real robotic systems.Simu-lation and experiments are conducted to validate the effectiveness of the proposed methods.
其他文献
针对柳城特长隧道穿越高地温地区施工存在温度高、湿度大、高温段落长的特点,系统分析了高地温隧道衬砌的受力特征,提出了高地温隧道隔热和降温综合控制措施,解决了隧道内施
设备的合理选择与科学应用关系着整个工程的质量,并且对国家公路工程行业的健康发展有着重要的影响.对公路工程使用机械选择与应用关键性进行阐述,并对压路机合理选择与应用
以河北省衡水市前进街铁路桥梁转体桥作业施工作为项目研究对象,基于项目工程特点,介绍了以主墩、主梁和主塔施工为主的转体桥设计要点,提出转体系统施工和连续梁转体施工的
山西省中部引黄工程深埋长隧洞的涌水具有涌水量大、水头压力高、补给丰富等特点,同时施工中不断会出现断层、涌水涌泥、岩溶、软岩、岩爆和煤层等现象,因此工程采用双护盾TB
杨振宁先生指出,自然科学研究与人文创作一样,也有因人而异的风格,并且这种风格会对研究工作产生重大的影响.2020年诺贝尔物理学奖授予了黑洞的理论和观测研究,其中一半授予
Recently, deep learning has achieved great success in visual tracking tasks, particularly in single-object tracking. This paper provides a comprehensive review of state-of-the-art single-object tracki
Human group activity recognition(GAR) has attracted significant attention from computer vision researchers due to its wide practical applications in security surveillance, social role understanding an
Audio-visual learning, aimed at exploiting the relationship between audio and visual modalities, has drawn considerable attention since deep learning started to be used successfully. Researchers tend
Markov decision process (MDP) offers a general framework for modelling sequential decision making where outcomes are random.In particular,it serves as a mathema
Pointwise convolution is usually utilized to expand or squeeze features in modern lightweight deep models. However, it takes up most of the overall computational cost(usually more than 90%). This pape