Magicβs mission is to build safe AGI that accelerates humanityβs progress on the worldβs most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.
As a Research Engineer in post-training, you'll help develop novel techniques and datasets to maximize model performance for real-world applications, leveraging data and compute at scale. Youβll enable our models to complete engineering, code review, and software design tasks in large, real-world codebases while incorporating cutting-edge reinforcement learning (RL) methods.
Research and develop innovative post-training techniques and reinforcement learning strategies to enable models to autonomously generate, debug, and optimize software
Build dynamic reward systems and feedback pipelines to align model outputs with human-like decision-making in software development
Scale up synthetic dataset generation and evaluations to drive iterative improvements in autonomous coding and problem-solving tasks
Improve model capabilities for generating substantial, high-quality, functional code
Design scalable approaches for evaluations and synthetic dataset generation that align with reinforcement learning objectives
Explore and implement novel methods to align AI behavior with human intent, ensuring reliability and performance in high-stakes environments
Strong experience deploying and fine-tuning LLMs for real-world applications.
Strong general software engineering skills
Thorough knowledge of the deep learning literature
Expertise in reinforcement learning techniques such as actor-critic, self-play or self-evaluation, or RLHF
Ability to come up with and evaluate novel research ideasΒ
Obsession with details, reliability, and good testing to ensure data quality and integrity
Willingness to dive deeply into a large ML codebase to debug
Passion for building systems that redefine software engineering through fully autonomous AI
Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.
Integrity. Words and actions should be aligned
Hands-on. At Magic, everyone is buildingΒ
Teamwork. We move as one team, not N individuals
Focus. Safely deploy AGI. Everything else is noise
Quality. Magic should feel like magic
Annual salary range: $100K - $550K
Equity is a significant part of total compensation, in addition to salary
401(k) plan with 6% salary matching
Generous health, dental and vision insurance for you and your dependents
Unlimited paid time off
Visa sponsorship and relocation stipend to bring you to SF, if possible
A small, fast-paced, highly focused team