Member of Technical Staff - Safety
About the role
Own the red-teaming and adversarial evaluation pipeline for Reflection's models, continuously probing for failure modes across security, misuse, and alignment gaps.
Work hand in hand with the Alignment team to translate safety findings into concrete guardrails, ensuring models behave reliably under stress and adhere to deployment policies.
Validate that every release meets the lab's risk thresholds before it ships, serving as a critical gatekeeper for our open weight releases.
Develop scalable, automated safety benchmarks that evolve alongside our model capabilities, moving beyond static datasets to dynamic adversarial testing.
Research and implement state-of-the-art jailbreaking techniques and defenses to stay ahead of potential vulnerabilities in the wild.
About You
Graduate degree (MS or PhD) in Computer Science, Machine Learning, or a related discipline, or equivalent practical experience in AI Safety.
Deep technical understanding of LLM safety, including adversarial attacks, red-teaming methodologies, and interpretability.
Strong software engineering capabilities, with experience building automated evaluation pipelines or large-scale ML systems.
Experience with Reinforcement Learning (RLHF/RLAIF) and an understanding of how it impacts model safety and alignment is a strong plus.
Thrive in a fast-paced, high-agency startup environment with a bias toward action.
Willing to make high-stakes decisions regarding model release and safety thresholds.
Passionate about advancing the frontier of intelligence.
Additional Information
Our Mission
Reflection's mission is to build open superintelligence and make it accessible to all. We're developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders comes from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.