Proposal
Project Name: End-to-end autonomous vehicle driving based on text-based instructions.
Abstract
JdeRobot is an open-source toolkit for developing robotics applications. Among its projects is Behavior Metrics, a tool for evaluating the performance of deep learning (DL) models in autonomous driving tasks. This project aims to integrate a Language Model (LM) with an end-to-end autonomous driving model. Building on knowledge from previous successful projects, the goal is to let users give text-based commands directly to the vehicle, much as a passenger would instruct a taxi driver. The project will start simple, with models such as BERT, and iterate towards more complex architectures.
Integrating BERT or a similar Language Model (LM) to generate High-Level Commands (HLC) for autonomous driving in the CARLA simulator.
Using Vision-Language Models (VLMs) such as LLaVA to provide feedback to the language model.
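To make the first objective concrete, here is a minimal sketch of the interface between a text instruction and a High-Level Command. It is not the proposed BERT-based system: a keyword matcher stands in for the language model so the example runs without any model download. The `HighLevelCommand` enum, its four members, and the `instruction_to_hlc` function are hypothetical names chosen for illustration; the actual command set used in the project may differ.

```python
import re
from enum import Enum


class HighLevelCommand(Enum):
    """Hypothetical HLC set (assumption); the project's real set may differ."""
    FOLLOW_LANE = "follow_lane"
    TURN_LEFT = "turn_left"
    TURN_RIGHT = "turn_right"
    GO_STRAIGHT = "go_straight"


# Keyword patterns standing in for a learned text classifier.
# This is an assumption for illustration, not BERT.
_PATTERNS = [
    (re.compile(r"\bleft\b"), HighLevelCommand.TURN_LEFT),
    (re.compile(r"\bright\b"), HighLevelCommand.TURN_RIGHT),
    (re.compile(r"\b(straight|ahead)\b"), HighLevelCommand.GO_STRAIGHT),
]


def instruction_to_hlc(instruction: str) -> HighLevelCommand:
    """Map a free-text instruction to an HLC, defaulting to lane following."""
    text = instruction.lower()
    for pattern, command in _PATTERNS:
        if pattern.search(text):
            return command
    return HighLevelCommand.FOLLOW_LANE


if __name__ == "__main__":
    for sentence in ("Turn left at the next intersection", "Drive carefully"):
        print(sentence, "->", instruction_to_hlc(sentence).name)
```

Keeping the function signature fixed (text in, one discrete command out) would allow the keyword matcher to be swapped for a fine-tuned LM classifier later without changing the driving model that consumes the command.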
Timeline
| Time | Tasks |
|---|---|
| Community Bonding Period | • Thoroughly familiarize myself with the code base. |
| Week 1, 2 & 3 | • Implement a basic NLP-based controller using BERT or a similar LM. |
| Week 4 & 5 | • Understand and integrate vision encoders (like LMDrive) for closed-loop control. (The controller from Weeks 1 to 3 operates in an open-loop setting.) |
| Evaluation Week 6 & 7 | • Train the integrated system on the LMDrive dataset for performance evaluation. |
| Jul 12 | Phase 1 Evaluation deadline |
| Week 8 & 9 | • Explore the use of Vision-Language Models (VLMs) like LLaVA to improve the system’s understanding of visual inputs. |
| Week 10 & 11 | • Investigate extending the evaluation metrics using Visual Question Answering (VQA) techniques, such as LingoQA, to enhance system comprehension and response accuracy. |
| Week 12 & 13 | • Finalize project deliverables, including code, documentation, and any additional materials. |
| Week 14 & 15 | • Buffer period for any unexpected delays or additional tasks. |