Deep reinforcement learning for aggressive quadrotor maneuvers

Xie, Yuhan; 谢宇涵

File Download

FullText.pdf

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Mechanical Engineering: Theses

postgraduate thesis: Deep reinforcement learning for aggressive quadrotor maneuvers

Title	Deep reinforcement learning for aggressive quadrotor maneuvers
Authors	Xie, Yuhan 谢宇涵
Issue Date	2023
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Xie, Y. [谢宇涵]. (2023). Deep reinforcement learning for aggressive quadrotor maneuvers. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Quadrotors are highly agile and versatile aerial vehicles, rendering them ideal for complicated tasks within cluttered environments. The agility of quadrotors is being exploited, posing challenges with escalated performance demands. Meanwhile, reinforcement learning has demonstrated its strong potential of exploiting the robots’ agility. Therefore, this thesis discusses the domain of aggressive quadrotor flights with the deep reinforcement learning method. Optimization-based agile quadrotor flight methods typically decouple trajectory planning and control. Among these, differential-flatness-based planning and control are widely used for their computational convenience. Leveraging the differential flat property, the author extends the minimum-jerk motion primitive algorithm to minimum snap, formulating closed-loop solutions of minimum snap trajectories. The proposed algorithm achieves rapid generation of one million minimum-snap primitives per second on a standard onboard computer. The method is validated through repetitive acrobatics flights. In contrast to traditional optimization-based approaches, learning-based methods address the problem by learning an end-to-end policy that predicts control commands directly from high-dimensional observations. This thesis proposes a learning-based method to tackle the specific problem of traversing through unknown, tilted, and narrow gaps for quadrotors. This problem is challenging for two reasons: a precise and robust whole-body planning and control policy is required to accommodate variable-tilt and narrow gaps, and an effective Sim2Real approach is necessary for successful real-world experiments. To this end, the author propounds a novel learning framework for gap traversal flights. The algorithm effectively trains the quadrotor to negotiate gap centers with an approximate tilt attitude to the gap. The policy trained solely in simulation demonstrates domain transfer-ability without necessitating additional training data, while maintaining the success rate. The author further incorporates onboard sensing, eliminating the reliance on precise environmental knowledge, including gap dimensions and orientation. Our proposed framework, encompassing onboard sensing and a neural network controller, attains a success rate of 87.36% in real-world experiments, with gap orientations up to 60°. To the best of our knowledge, this is the first work that performs the learning-based variable-tilted narrow gap traversal flight in the real world, without prior knowledge of the environment. The inherent under-actuation of conventional quadrotors constrains their agility. To enhance quadrotor agility while preserving conciseness, a novel tilt-rotor quadrotor is introduced lastly. This design incorporates a compact uniaxial rotor tilting mechanism, enabling approximate 6-degree-of-freedom control with only one additional servo motor. The modeling and control framework is derived and then validated through a custom MATLAB simulator. A practical application scenario, horizontal aerial photography around a central objective, is proposed and demonstrated, highlighting the real-world application potential of the proposed tilt-rotor quadrotor. A comparative study over traditional quadrotors is conducted, demonstrating the superiority of the proposed tilt-rotor quadrotor system in aggressive tasks.
Degree	Master of Philosophy
Subject	Quadrotor helicopters - Control Reinforcement learning
Dept/Program	Mechanical Engineering
Persistent Identifier	http://hdl.handle.net/10722/336619

DC Field	Value	Language
dc.contributor.author	Xie, Yuhan	-
dc.contributor.author	谢宇涵	-
dc.date.accessioned	2024-02-26T08:30:45Z	-
dc.date.available	2024-02-26T08:30:45Z	-
dc.date.issued	2023	-
dc.identifier.citation	Xie, Y. [谢宇涵]. (2023). Deep reinforcement learning for aggressive quadrotor maneuvers. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/336619	-
dc.description.abstract	Quadrotors are highly agile and versatile aerial vehicles, rendering them ideal for complicated tasks within cluttered environments. The agility of quadrotors is being exploited, posing challenges with escalated performance demands. Meanwhile, reinforcement learning has demonstrated its strong potential of exploiting the robots’ agility. Therefore, this thesis discusses the domain of aggressive quadrotor flights with the deep reinforcement learning method. Optimization-based agile quadrotor flight methods typically decouple trajectory planning and control. Among these, differential-flatness-based planning and control are widely used for their computational convenience. Leveraging the differential flat property, the author extends the minimum-jerk motion primitive algorithm to minimum snap, formulating closed-loop solutions of minimum snap trajectories. The proposed algorithm achieves rapid generation of one million minimum-snap primitives per second on a standard onboard computer. The method is validated through repetitive acrobatics flights. In contrast to traditional optimization-based approaches, learning-based methods address the problem by learning an end-to-end policy that predicts control commands directly from high-dimensional observations. This thesis proposes a learning-based method to tackle the specific problem of traversing through unknown, tilted, and narrow gaps for quadrotors. This problem is challenging for two reasons: a precise and robust whole-body planning and control policy is required to accommodate variable-tilt and narrow gaps, and an effective Sim2Real approach is necessary for successful real-world experiments. To this end, the author propounds a novel learning framework for gap traversal flights. The algorithm effectively trains the quadrotor to negotiate gap centers with an approximate tilt attitude to the gap. The policy trained solely in simulation demonstrates domain transfer-ability without necessitating additional training data, while maintaining the success rate. The author further incorporates onboard sensing, eliminating the reliance on precise environmental knowledge, including gap dimensions and orientation. Our proposed framework, encompassing onboard sensing and a neural network controller, attains a success rate of 87.36% in real-world experiments, with gap orientations up to 60°. To the best of our knowledge, this is the first work that performs the learning-based variable-tilted narrow gap traversal flight in the real world, without prior knowledge of the environment. The inherent under-actuation of conventional quadrotors constrains their agility. To enhance quadrotor agility while preserving conciseness, a novel tilt-rotor quadrotor is introduced lastly. This design incorporates a compact uniaxial rotor tilting mechanism, enabling approximate 6-degree-of-freedom control with only one additional servo motor. The modeling and control framework is derived and then validated through a custom MATLAB simulator. A practical application scenario, horizontal aerial photography around a central objective, is proposed and demonstrated, highlighting the real-world application potential of the proposed tilt-rotor quadrotor. A comparative study over traditional quadrotors is conducted, demonstrating the superiority of the proposed tilt-rotor quadrotor system in aggressive tasks.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Quadrotor helicopters - Control	-
dc.subject.lcsh	Reinforcement learning	-
dc.title	Deep reinforcement learning for aggressive quadrotor maneuvers	-
dc.type	PG_Thesis	-
dc.description.thesisname	Master of Philosophy	-
dc.description.thesislevel	Master	-
dc.description.thesisdiscipline	Mechanical Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2024	-
dc.identifier.mmsid	991044770611503414	-

File Download

Supplementary

postgraduate thesis: Deep reinforcement learning for aggressive quadrotor maneuvers

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats