An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots

Chen, Jie; 陳杰

File Download

FullText.pdf

Links for fulltext

(May Require Subscription)

DOI: 10.5353/th_991043962676303414

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Industrial & Manufacturing Systems Engineering: Theses

postgraduate thesis: An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots

Title	An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots
Authors	Chen, Jie 陳杰
Advisors	Advisor(s):Lau, HYK Or, KL
Issue Date	2017
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Chen, J. [陳杰]. (2017). An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Robots have impacted human everyday life deeply over the past decades, from traditional automated manufacturing to the state-of-the-art robot-assisted surgery. With the advancement of robotic technologies, numerous different kinds of robots have been invented to deal with the increasingly complicated requirement. For instance, continuum robots have been developed to enhance the performance of robot-assisted minimally invasive surgery (MIS), which is one of the major focuses of this thesis. However, effective and efficient motion control of robots, including inverse kinematics modelling and motion planning, remains challenging, especially for the continuum robots. In this thesis, we propose an imitation learning-based framework to solve this issue, both discrete and continuum robots have been investigated. For discrete manipulators, conventional Jacobian-based methods are adopted to address the corresponding inverse kinematic problems. While for the continuum robot, we implement three machine learning algorithms, K-Nearest Neighbors Regression (KNNR), Gaussian Mixture Regression (GMR), and Extreme Learning Machine Regression (ELMR), to approximate its inverse kinematics model. We also evaluate the possibility of applying reinforcement learning to further improve the inverse kinematics model of continuum robots. Two different types of imitation learning-based frameworks have been investigated to plan the motion path of robots, namely model-free and model-based methods. In model-free imitation learning, human demonstrations are provided in both actuation space and task space, and Gaussian Mixture Model (GMM) and GMR are used to encode the demonstrations directly and then generalize an executable path for the robot to reproduce the learned task. For complicated tasks, the demonstration has to be segmented into multiple movement primitives to simplify the learning process, and Support Vector Machine (SVM) is used for the segmentation purpose. Dynamical systems have been deployed to facilitate the model-based method, whose effectiveness is ensured by Lyapunov Stability Theorem, and the demonstrations are only available in the task space. A number of robots have been developed in this thesis to evaluate the proposed approaches, including a re-designed 7-DoF Mitsubishi PA-10 robot, a 6-DoF ionic polymer-metal composite (IPMC) flexible manipulator, a 3-DoF tendon-driven continuum manipulator (TCM), and a 2-DoF TCM. Inverse kinematics of the 3-DoF TCM is learned by KNNR, GMR, and ELMR. While reinforcement learning has been applied to the 2-DoF TCM, and in a few iterations the trajectory tracking error reduces significantly from 8.045 mm to 1.101 mm. With model-free imitation learning, the 6-DoF IPMC manipulator acquires the skill of navigation through a narrow hole, and the 3-DoF TCM learns to reproduce two surgical related tasks, namely complaint tube insertion and simplified endoscopic submucosal dissection (ESD). While with model-based imitation learning, the PA-10 robot successfully bats a fast flying ball, a 7-DoF KUKA LBR iiwa robot learns to perform manipulation tasks under both spatial and temporal perturbations, and the 3-DoF TCM obtains the skill of obstacle avoidance. The significance of this thesis is three-fold. Firstly, machine learning and reinforcement learning are investigated to address the inverse kinematics of continuum robots. Secondly, motion planning of both discrete and continuum robots is facilitated via imitation learning. Lastly, both model-free and model-based imitation learning have been developed.
Degree	Doctor of Philosophy
Subject	Robots - Motion
Dept/Program	Industrial and Manufacturing Systems Engineering
Persistent Identifier	http://hdl.handle.net/10722/249221

DC Field	Value	Language
dc.contributor.advisor	Lau, HYK	-
dc.contributor.advisor	Or, KL	-
dc.contributor.author	Chen, Jie	-
dc.contributor.author	陳杰	-
dc.date.accessioned	2017-11-01T09:59:51Z	-
dc.date.available	2017-11-01T09:59:51Z	-
dc.date.issued	2017	-
dc.identifier.citation	Chen, J. [陳杰]. (2017). An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/249221	-
dc.description.abstract	Robots have impacted human everyday life deeply over the past decades, from traditional automated manufacturing to the state-of-the-art robot-assisted surgery. With the advancement of robotic technologies, numerous different kinds of robots have been invented to deal with the increasingly complicated requirement. For instance, continuum robots have been developed to enhance the performance of robot-assisted minimally invasive surgery (MIS), which is one of the major focuses of this thesis. However, effective and efficient motion control of robots, including inverse kinematics modelling and motion planning, remains challenging, especially for the continuum robots. In this thesis, we propose an imitation learning-based framework to solve this issue, both discrete and continuum robots have been investigated. For discrete manipulators, conventional Jacobian-based methods are adopted to address the corresponding inverse kinematic problems. While for the continuum robot, we implement three machine learning algorithms, K-Nearest Neighbors Regression (KNNR), Gaussian Mixture Regression (GMR), and Extreme Learning Machine Regression (ELMR), to approximate its inverse kinematics model. We also evaluate the possibility of applying reinforcement learning to further improve the inverse kinematics model of continuum robots. Two different types of imitation learning-based frameworks have been investigated to plan the motion path of robots, namely model-free and model-based methods. In model-free imitation learning, human demonstrations are provided in both actuation space and task space, and Gaussian Mixture Model (GMM) and GMR are used to encode the demonstrations directly and then generalize an executable path for the robot to reproduce the learned task. For complicated tasks, the demonstration has to be segmented into multiple movement primitives to simplify the learning process, and Support Vector Machine (SVM) is used for the segmentation purpose. Dynamical systems have been deployed to facilitate the model-based method, whose effectiveness is ensured by Lyapunov Stability Theorem, and the demonstrations are only available in the task space. A number of robots have been developed in this thesis to evaluate the proposed approaches, including a re-designed 7-DoF Mitsubishi PA-10 robot, a 6-DoF ionic polymer-metal composite (IPMC) flexible manipulator, a 3-DoF tendon-driven continuum manipulator (TCM), and a 2-DoF TCM. Inverse kinematics of the 3-DoF TCM is learned by KNNR, GMR, and ELMR. While reinforcement learning has been applied to the 2-DoF TCM, and in a few iterations the trajectory tracking error reduces significantly from 8.045 mm to 1.101 mm. With model-free imitation learning, the 6-DoF IPMC manipulator acquires the skill of navigation through a narrow hole, and the 3-DoF TCM learns to reproduce two surgical related tasks, namely complaint tube insertion and simplified endoscopic submucosal dissection (ESD). While with model-based imitation learning, the PA-10 robot successfully bats a fast flying ball, a 7-DoF KUKA LBR iiwa robot learns to perform manipulation tasks under both spatial and temporal perturbations, and the 3-DoF TCM obtains the skill of obstacle avoidance. The significance of this thesis is three-fold. Firstly, machine learning and reinforcement learning are investigated to address the inverse kinematics of continuum robots. Secondly, motion planning of both discrete and continuum robots is facilitated via imitation learning. Lastly, both model-free and model-based imitation learning have been developed.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Robots - Motion	-
dc.title	An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Industrial and Manufacturing Systems Engineering	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.5353/th_991043962676303414	-
dc.date.hkucongregation	2017	-
dc.identifier.mmsid	991043962676303414	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

postgraduate thesis: An imitation learning-based approach to the motion planning of robots : from discrete to continuum robots

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats