Kochliaridis, V., Kostinoudis, E., & Vlahavas, I. (2024, September). Optimizing Pretrained Transformers for Autonomous Driving. In Proceedings of the 13th Hellenic Conference on Artificial Intelligence (pp. 1-9)
Vasileios Kochliaridis, Evaggelos Kostinoudis, Ioannis Vlahavas
Keywords:
Autonomous Vehicles, Deep Learning, Transformers, Computer Vision
Tags:
Abstract:
Vision-based end-to-end driving systems have demonstrated impressive capabilities through the utilization of large Transformer architectures. More specifically, researchers have combined Transformers with Imitation Learning, in order to construct agents that learn to map navigation states to actions from large datasets created by experts. Although this approach usually works well, it relies on specific datasets and expert actions, and thus achieving limited generalization capability, which can be quite catastrophic in uncertain navigation environments, such as urban areas. To overcome this limitation, we further expand the training process of the agent by applying the Phasic Policy Gradient algorithm, a Deep Reinforcement Learning (DRL) method that improves its generalization capability by enabling it to explore and interact with the environment. We further enhance our approach by integrating a custom reward function that penalizes the weaknesses of the pretrained agent, alongside with additional DRL techniques to enhance its efficiency and accelerate convergence. Our experimental results in the CARLA simulation environment demonstrate that our approach not only achieves robustness in comparison to previous approaches, but also shows potential for wider application in similar navigation scenarios.