TY - GEN
T1 - Uas conflict resolution integrating a risk-based operational safety bound as airspace reservation with reinforcement learning
AU - Hu, Jueming
AU - Liu, Yongming
N1 - Funding Information:
The research reported in this paper was supported by funds from NASA University Leadership Initiative program (Contract No. NNX17AJ86A, PI: Yongming Liu, Technical Officer: Kai Goebel and Anupa Bajwa). The authors would like to thank Weichang Wang and Dr. Lei Ying for the helpful discussions and valuable suggestions. Yutian Pang also provided much-needed support with LATEX typesetting. The support is gratefully acknowledged.
Publisher Copyright:
© 2020, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.
PY - 2020
Y1 - 2020
N2 - UAS trajectory prediction is stochastic in nature and randomness exists in almost every aspect of UAS Traffic Management. In order to address this challenge, it is critical to ensure a reasonable separation between UAS and obstacles when doing path planning and conflict resolution. In this paper, a novel method to deconflict for rotary-wing UAS traffic management is proposed. The main idea is to integrate a probabilistic dynamic anisotropic operational safety bound as airspace reservation with reinforcement learning method. The operational safety bound is based on UAS performance, weather condition and uncertainties in UAS operations, such as positioning error. Based on the operational safety bound concept, a new reward function in reinforcement learning is developed. The proposed methodology results in a trajectory prediction model under risk-based dynamic separation criterion. The algorithm of Q learning is adopted to find the optimal path planning. Simulations of avoiding static obstacles and multi-UAS conflict resolution are conducted to show the deconflict capability. Comparisons between results with operational safety bound and without operational safety bound are shown and analyzed.
AB - UAS trajectory prediction is stochastic in nature and randomness exists in almost every aspect of UAS Traffic Management. In order to address this challenge, it is critical to ensure a reasonable separation between UAS and obstacles when doing path planning and conflict resolution. In this paper, a novel method to deconflict for rotary-wing UAS traffic management is proposed. The main idea is to integrate a probabilistic dynamic anisotropic operational safety bound as airspace reservation with reinforcement learning method. The operational safety bound is based on UAS performance, weather condition and uncertainties in UAS operations, such as positioning error. Based on the operational safety bound concept, a new reward function in reinforcement learning is developed. The proposed methodology results in a trajectory prediction model under risk-based dynamic separation criterion. The algorithm of Q learning is adopted to find the optimal path planning. Simulations of avoiding static obstacles and multi-UAS conflict resolution are conducted to show the deconflict capability. Comparisons between results with operational safety bound and without operational safety bound are shown and analyzed.
UR - http://www.scopus.com/inward/record.url?scp=85091926247&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85091926247&partnerID=8YFLogxK
U2 - 10.2514/6.2020-1372
DO - 10.2514/6.2020-1372
M3 - Conference contribution
AN - SCOPUS:85091926247
SN - 9781624105951
T3 - AIAA Scitech 2020 Forum
SP - 1
EP - 10
BT - AIAA Scitech 2020 Forum
PB - American Institute of Aeronautics and Astronautics Inc, AIAA
T2 - AIAA Scitech Forum, 2020
Y2 - 6 January 2020 through 10 January 2020
ER -