ByteDance
San Jose, CA

Student Researcher [Seed – Multimodal Interaction & World Model - RL Focused] – 2026 Start (PhD)
About the job
The Seed Multimodal Interaction and World Model team is dedicated to developing models with human-level multimodal understanding and interaction capabilities. The team also aims to advance the exploration and development of multimodal assistant products.
Responsibilities:
- Design and implement reinforcement learning (RL) training systems for large-scale multimodal foundation models
- Develop unified modeling frameworks that integrate video, audio, and language, with a focus on visual latent reasoning
- Explore RL-based approaches to bridge understanding and generation for multimodal visual reasoning
- Collaborate with researchers to evaluate models on tasks involving world modeling, reasoning, and instruction-conditioned generation
Minimum Qualifications:
- Currently pursuing a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline
- Publications in top-tier venues, such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, or other leading conferences in AI and ML
- Strong research background in at least one of the following: reinforcement learning, multimodal learning, video understanding, or vision-language modeling
- Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Preferred Qualifications:
- Experience with reinforcement learning in multimodal or interactive environments
- Familiarity with video generation or diffusion-based generative models
- Experience with large-scale model training (e.g., distributed training, curriculum learning, or memory-augmented transformers)
- Solid programming and engineering skills, with experience building training or evaluation pipelines for ML models