Tags
3 pages
强化学习
Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling
RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework
DiffusionDriveV2: Truncated Diffusion Model for End-to-End Autonomous Driving