Reinforcement Learning and World Model for Autonomous Driving Intern - 2026

All the best with your application!

Want more jobs like this straight to your inbox?

Summary

Location

Shanghai / Beijing

Work

Internship

About this Job

We are in search of a hardworking intern with expertise in Reinforcement Learning and Multi-Modal World Simulation Model to propel the evolution of ML-centric autonomous driving and Physical AI solutions. The focus of this role lies in model-centric RL, learning about world simulation models, and translating state-of-the-art (SOTA) algorithms into real-world applications, allowing vehicles to interpret, anticipate, and respond astutely in challenging dynamic contexts. This is a rare opportunity to shape the next frontier of intelligent driving, where imagination meets real-world impact. If you’re excited by the idea of building SOTA simulation techs and systems that learn, adapt, and truly “think,” we’d love to have you on board. Join us, join a team where your input plays a crucial role in fast-tracking the growth of autonomous vehicles with the state of art solutions.

What you'll be doing:

  • Develop and refine multi-modal world models and integrate them into our simulation system.

  • Train and evaluate self-supervised latent dynamics and sensor generation models for the joint tasks of trajectory prediction, goal-conditioned ego control, and sensor data synthesis. Explore and prototype hybrid architectures combining world models, generative (e.g., diffusion, flow matching) models, and policy gradients for realistic and robust simulation.

  • Collaborate with End-to-End Driving Model teams to deploy world-model-based policies to simulated RL environments and accelerate the training of the driving systems.

  • Contribute to system development for continuous learning and simulation adaptation (Sim2Real transfer).

What we need to see:

  • Pursuing PhD in Computer Science, Machine Learning, or a related field, with neural rendering, robotics, or simulation background.

  • Strong understanding of reinforcement learning (policy gradients, actor-critic, offline RL).

  • Familiarity with visual representation learning and 4D scene representation (NeRF, Gaussian Splatting, occupancy networks and contrastive, masked modeling, or generative world simulation) for world simulation.

  • Experience building large-scale training pipelines with temporal consistency and simulation data replay.

  • Publications or open-source contributions in RL, model-based control, or autonomous systems.

  • Passion for developing learning systems that can “imagine” and plan in the real world.

About the Company

NVIDIA logo

NVIDIA

Public Company
Transportation & Autonomous VehiclesRobotics Hardware & ComponentsRobotics Software & AI

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

View details
Related Jobs

Get the week's best robotics jobs

We review hundreds of postings weekly and hand-pick the top roles for you. High-salary positions, top companies, remote opportunities.

Please enter a valid email address

Unsubscribe anytime. We respect your privacy.