Skip to content
flint
Back to jobs
far.ai

Senior Research Engineer

$150k – $250k/yr Berkeley, US remote full time senior Nov 26, 2025

About this role

ABOUT US FAR.AI http://FAR.AI is a non-profit AI research institute working to ensure advanced AI is safe and beneficial for everyone. Our mission is to facilitate breakthrough AI safety research, advance global understanding of AI risks and solutions, and foster a coordinated global response. Since our founding in July 2022, we've grown to 40+ staff https://www.far.ai/about/team, published 40+ academic papers https://scholar.google.com/citations?user=FVJ24k8AAAAJ, and convened leading AI safety events https://far.ai/events/. Our work is recognized globally, with publications at premier venues such as NeurIPS, ICML, and ICLR, and features in the Financial Times https://www.ft.com/content/175e5314-a7f7-4741-a786-273219f433a1, Nature News https://www.nature.com/articles/d41586-024-02218-7 and MIT Technology Review https://www.technologyreview.com/2020/02/28/905615/reinforcement-learning-adversarial-attack-gaming-ai-deepmind-alphazero-selfdriving-cars/. We conduct pre-deployment testing on behalf of frontier developers such as OpenAI and independent evaluations for governments including the EU AI Office https://www.far.ai/news/far-ai-selected-to-lead-eu-ai-act-cbrn-risk-consortium. We help steer and grow the AI safety field through developing https://arxiv.org/abs/2405.06624 research https://arxiv.org/abs/2506.20702 roadmaps https://www.researchgate.net/publication/396910034_Open_Technical_Problems_in_Open-Weight_AI_Model_Risk_Management with renowned researchers such as Yoshua Bengio; running FAR.Labs https://www.far.ai/programs/far-labs, an AI safety-focused co-working space in Berkeley housing 40 members; and supporting the community through targeted grants https://www.far.ai/programs/grantmaking to technical researchers. ABOUT FAR.RESEARCH We explore promising research directions in AI safety and scale up only those showing a high potential for impact. Once the core research problems are solved, we work to scale them to a minimum viable prototype, demonstrating their validity to AI companies and governments to drive adoption. We are aiming to rapidly grow our team in the following areas: - Mitigating AI deception: Studying when lie detectors induce honesty or evasion https://www.far.ai/news/avoiding-ai-deception, and developing for deception and sandbagging - Evals and red-teaming: Conducting pre- and post-release adversarial evaluations of frontier models (e.g. Claude 4 Opus https://x.com/ARGleave/status/1926138376509440433, ChatGPT Agent https://cdn.openai.com/pdf/839e66fc-602c-48bf-81d3-b21eacc3459d/chatgpt_agent_system_card.pdf, GPT-5 https://cdn.openai.com/gpt-5-system-card.pdf); developing novel attacks https://www.far.ai/news/defense-in-depth to support this work; and exploring new threat models (e.g. persuasion https://arxiv.org/abs/2506.02873, tampering risks https://arxiv.org/abs/2507.11630). - Infrastructure: Maintaining GPU compute infrastructure to support experiments with open-weight models and developing new tooling to allow our research teams to scale their fine-tuning and post-training workflows to frontier open-weight models. - Adversarial Robustness: Working to rigorously solve these security problems through building a science of security and robustness for AI, from demonstrating superhuman systems can be vulnerable https://far.ai/post/2023-07-superhuman-go-ais/, to scaling laws for robustness https://www.far.ai/news/does-robustness-improve-with-scale and jailbreaking constitutional classifiers https://arxiv.org/abs/2506.24068 - Mechanistic Interpretability: Finding https://arxiv.org/abs/2502.12892 issues https://arxiv.org/abs/2508.16560 with https://arxiv.org/abs/2505.11756 Sparse Autoencoders, probing deception using AmongUs https://arxiv.org/abs/2504.04072, understanding learned planning https://far.ai/post/2024-07-learned-planners/ in SokoBan and interpretable data attribution. FAR.AI http://FAR.AI is one of the largest independent AI safety research institutes, and is rapidly growing with the goal of diversifying and deepening our research portfolio. We would welcome the opportunity to add new research directions if you are a senior researcher with a strong vision and would like to pitch us on it. ABOUT THE ROLE This role would be a good fit for an experienced machine learning engineer, or an experienced software engineer looking to transition to AI safety research. All candidates are expected to: - Have significant software engineering experience. Evidence of this may include prior work experience and open-source contributions. - Be fluent working in Python. - Be results-oriented and motivated by impactful research. - Bring prior experience mentoring other engineers or scientists in engineering skills. Additionally, candidates are expected to bring expertise in one of the following areas corresponding to the core competencies our different research teams most need: - Option 1 – Machine Learning: - Substantial experience training transformers with common ML frameworks like PyTorch or jax. - Good knowledge of basic linear algebra, calculus, vector probability, and statistics. - Option 2 – High-Performance Computing: - Power user of cluster orchestrators such as Kubernetes (preferred) or SLURM - Experience building high-performance distributed-systems (e.g. multi-node training, large-scale numerical computation) - Experience optimizing and profiling code (ideally including on GPU, e.g. CUDA kernels). - Option 3 – Technical Leadership: - Experience designing large-scale software systems, whether as an architect in greenfield software development or leading a major refactor. - Comfortable project managing small teams, such as chairing stand-ups and developing detailed roadmaps to execute on a 3-6 month research vision. ABOUT THE PROJECTS As a Member of Technical Staff (Senior Research Engineer) you would join one of our existing workstreams and lead projects there: - Detecting and preventing deception. Under what conditions can we reliably detect deceptive behaviour from models, and can such behaviour be effectively mitigated at scale? This would focus on large-scale training of transformers. - Preventing catastrophic misuse. Apply our research insights to detect and mitigate vulnerabilities and other risks in frontier AI models. This would focus more on technical leadership - Accelerating our research. Build frameworks and infrastructure that allows us to ask bigger questions and more rapidly run new experiments, to deepen our research. This would focus more on high-performance computing. As we continue to grow our research portfolio, additional workstreams may open up for contribution, for example in mechanistic interpretability. LOGISTICS If based in the USA or Singapore, you will be an employee of FAR.AI http://FAR.AI (501(c)(3) research non-profit / non-profit CLG). Outside the USA or Singapore, you will be employed via an EOR organisation on behalf of FAR.AI http://FAR.AI or as a contractor. - Location: Both remote and in-person (Berkeley, CA or Singapore) are possible. We sponsor visas for in-person employees, and can hire remotely in most countries. - Hours: Full-time (40 hours/week). - Compensation: $150,000-$250,000/year depending on experience and location, with the potential for additional compensation for exceptional candidates. We will also pay for work-related travel and equipment expenses. We offer catered lunch and dinner at our offices in Berkeley. - Application process: A 72-minute programming assessment, two interviews with members of our technical staff, and a 1-2 week paid work trial. If you are not available for a work trial we may be able to find alternative ways of testing your fit. If you have any questions about the role, please do get in touch at talent@far.ai. Otherwise, if you don't have questions, the best way to ensure a proper review of your skills and qualifications is by applying directly via the application form. Please don't email us to share your resume (it won't have any impact on our decision). Thank you!
Sign in Apply