Proposal#
Project Name: End-to-end autonomous vehicle driving based on text-based instructions.
Abstract#
JdeRobot is an open-source toolkit for developing Robotics applications. Amongst various projects in Behavior Metrics for evaluating DL models’ performance in autonomous driving tasks. In this project, the aim is to integrate a Language Model (LM) system with an end-to-end autonomous driving model. By combining previous knowledge and successful projects, the goal is to enable users to provide text-based commands directly to the vehicle, similar to interacting with a real-life taxi. The project will commence with a focus on simplicity, utilizing models like BERT, and gradually iterating towards more complex architectures.
Timeline#
Time |
Tasks |
---|---|
Community Bonding Period |
• Thoroughly familiarize myself with the code base. |
Week 1, 2 & 3 |
• Implement a basic NLP-based controller using BERT or a similar LM. |
Week 4 & 5 |
• Understanding and integrating vision encoders (like LMDrive) for closed-loop control. (The above is open-loop setting) |
Evaluation Week 6 & 7 |
• Train the integrated system on the LMDrive dataset for performance evaluation. |
Jul 12 |
Phase 1 Evaluation deadline |
Week 8 & 9 |
• Explore the use of Vision-Language Models (VLMs) like LLaVA to improve the system’s understanding of visual inputs. |
Week 10 & 11 |
• Investigate extending the evaluation metrics using Visual Question Answering (VQA) techniques, such as LingoQA, to enhance system comprehension and response accuracy. |
Week 12 & 13 |
• Finalize project deliverables, including code, documentation, and any additional materials. |
Week 14 & 15 |
• Buffer period for any unexpected delays or additional tasks. |