English
期刊论文

[2023]Continuous improvement of self-driving cars using dynamic confidence-aware reinforcement learning

Zhong Cao, Kun Jiang, Weitao Zhou, Shaobing Xu, Huei Peng, Diange Yang

Today’s self-driving vehicles have achieved impressive driving capabilities, but still suffer from uncertain performance in long-tail cases. Training a reinforcement-learning-based self-driving algorithm with more data does not always lead to better performance, which is a safety concern. Here we present a dynamic confidence-aware reinforcement learning (DCARL) technology for guaranteed continuous improvement. Continuously improving means that more training always improves or maintains its current performance. Our technique enables performance improvement using the data collected during driving, and does not need a lengthy pre-training phase. We evaluate the proposed technology both using simulations and on an experimental vehicle. The results show that the proposed DCARL method enables continuous improvement in various cases, and, in the meantime, matches or outperforms the default self-driving policy at any stage. This technology was demonstrated and evaluated on the vehicle at the 2022 Beijing Winter Olympic Games.

实验室负责人

杨殿阁 ydg@tsinghua.edu.cn

实验室副主任

江昆 jiangkun@tsinghua.edu.cn