Op-Ed in TIME "When it Comes to AI, What We Don’t Know Can Hurt Us" written by Charlotte Stix (Apollo Research) and Yoshua Bengio (Law Zero). https://lnkd.in/eTSkRfW2
Apollo Research
Technology, Information and Internet
Technical AI safety organization specializing in auditing high-risk failure modes, particularly deceptive alignment.
About us
Apollo Research is an AI safety organization. We specialize in auditing high-risk failure modes, particularly deceptive alignment, in large AI models. Our primary objective is to minimize catastrophic risks associated with advanced AI systems that may exhibit deceptive behavior, where misaligned models appear aligned in order to pursue their own objectives. Our approach involves conducting fundamental research on interpretability and behavioral model evaluations, which we then use to audit real-world models. Ultimately, our goal is to leverage interpretability tools for model evaluations, as we believe that examining model internals in combination with behavioral evaluations offers stronger safety assurances compared to behavioral evaluations alone.
- Website
- https://www.apolloresearch.ai/
- Industry
- Technology, Information and Internet
- Company size
- 2-10 employees
- Headquarters
- London
- Type
- Privately Held
- Founded
- 2023
- Specialties
- Artificial Intelligence, Machine Learning, AI Safety, Interpretability, Model Evaluations, Audits, Research, and Policy Advising
Locations
-
Primary
1 Fore St Ave
London, EC2Y 9DT, GB
Employees at Apollo Research
-
Christopher Akin
COO | Strategy | Sales & Marketing | Operations | Advisor | New Market Entry
-
Joping Chai
People & Operations | AI Safety
-
Alex Lloyd
AI Safety Research at Apollo | Previously: CTO, Google SWE, Cambridge Maths
-
Jérémy Scheurer
Research Scientist - AI Alignment at Apollo Research
Updates
-
The journal Nature recently covered our work on AI scheming, spanning our Dec. 2024 paper on In-Context Scheming and our recent research (Sept. 2025) on Anti-Scheming mitigations. https://lnkd.in/grG-unYN
-
Recent coverage of Apollo Research's work in this Sunday's New York Times weekend feature: https://lnkd.in/eJPCG9DP
-
Our anti-scheming paper with OpenAI, unpacked by Marius Hobbhahn (Apollo CEO) and Bronson Schoen (lead author). Long-form interview https://lnkd.in/eN2en7H8
Can We Stop AI from Scheming? Lead Researcher Interview
-
Training AI not to scheme is hard: it may get better at hiding its scheming. Here is a sneak peek of tomorrow’s video with Marius Hobbhahn (Apollo CEO) and Bronson Schoen (lead author):
-
How much can today’s AI models scheme? Here is a teaser of a video we’re releasing tomorrow with Marius Hobbhahn (Apollo CEO) and Bronson Schoen (lead author) on our recent paper:
-
Marius will be a speaker at the AE Global Summit on Open Problems for AI, hosted by Thinking About Thinking this October in London. Marius' talk takes place on Day 2 (Oct. 29th) under the AI Safety, Enterprise, & Society track.
The Summit focuses on the pressing question of how to use AI to maximally boost value in enterprise, startups, and society. More than 1,000 researchers, policymakers, engineers, entrepreneurs, and artists will come together to explore breakthroughs in AI research, expected capabilities, effective rollout strategies, and brand new entrepreneurial ideas. More info here: https://lnkd.in/ezBwbdUN
The AE Global Summit on Open Problems for AI runs 28–30 October 2025 (Kings Cross / Euston).
-
New Research by Apollo Research & OpenAI: We partnered with OpenAI to assess frontier language models for early signs of scheming (covertly pursuing misaligned goals) in controlled stress tests using non-typical environments, and studied a training method that can significantly reduce, but not eliminate, these behaviors.
We tested "deliberative alignment": training models to cite a safety spec (such as "Do not lie") and take honest action. In this new research we see a ~30x reduction in covert actions (a proxy for scheming). This work significantly reduces, but does not eliminate, these behaviors, and there is much more work to do on the science of scheming.
- OpenAI o3: covert actions went from 13% -> 0.4%
- OpenAI o4-mini: covert actions went from 8.7% -> 0.3%
Rare but serious misbehavior remains, but this is a significant improvement. Our results are complicated by models’ increasing ability to recognize our evaluation environments as tests of their alignment ("evals awareness"). Of note, our joint research found that models from all major frontier AI providers, including OpenAI, Google, xAI, and Anthropic, engage in covert behavior.
You can read the full paper, transcripts, and blog posts at antischeming.ai
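As a quick sanity check on the ~30x figure quoted above, here is a minimal sketch (plain Python, a hypothetical script, using only the covert-action rates reported in this post) that computes the implied reduction factors:

```python
# Covert-action rates quoted above (before vs. after deliberative-alignment training).
# These numbers are taken directly from the post; the script only computes the
# implied reduction factor for each model.
rates = {
    "OpenAI o3": (13.0, 0.4),       # percent before, percent after
    "OpenAI o4-mini": (8.7, 0.3),
}

for model, (before, after) in rates.items():
    reduction = before / after
    print(f"{model}: {before}% -> {after}% (~{reduction:.0f}x reduction)")

# Approximate output:
# OpenAI o3: 13.0% -> 0.4% (~32x reduction)
# OpenAI o4-mini: 8.7% -> 0.3% (~29x reduction)
```

Both factors land around 30x, which is where the headline "~30x reduction in covert actions" comes from.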
-
Apollo's CEO and founder, Marius, is featured in this year's TIME100 AI list.
Honored and humbled to be included in the TIME100 AI 2025 list! https://lnkd.in/eDAnPWaT
-
We're hiring for an Evals Demonstrator Engineer. With the evals and governance teams, you'd build and perfect demonstrations for AI decision-makers and the general public. If you're a decent engineer and a great communicator, we'd love to work with you. https://lnkd.in/g5n27ncN