OpenAI reveals bid to mitigate "catastrophic" chemical, biological and nuclear risk
AI firm expands Safety Systems team with engineers responsible for "identifying, tracking, and preparing for risks related to frontier AI models."

OpenAI's Safety Systems team is responsible for ensuring ChatGPT and AI models are deployed safely to benefit human society, rather than wiping us all out.
Now Machine has learned that OpenAI is expanding this critical team with research engineers who will focus on tackling "catastrophic risks" around "biology" and Chemical, Biological, Radiological, and Nuclear (CBRN) hazards.
Although a post advertising the new role does not explicitly state that this employee will be responsible for ensuring that ChatGPT cannot be misused to help build nukes, bioweapons or other weapons of mass destruction, there is a clear subtext suggesting that this may indeed be an important part of their job.
OpenAI recently admitted that upcoming models could be misused to help "novice" scientists build bioweapons - a relatively straightforward endeavour compared to spinning up nukes, which requires access to enriched uranium and advanced infrastructure typically only available to nation states.
The AI firm has now revealed the concrete steps being taken to ensure the nightmare scenario of an AI going full Terminator never comes to pass.
"Frontier AI models have the potential to benefit all of humanity, but also pose increasingly severe risks," OpenAI wrote in a job advert for a role with the title of Research Engineer, Preparedness (Biology/CBRN).
"To ensure that AI promotes positive change, the Preparedness team helps us prepare for the development of increasingly capable frontier AI models. This team is tasked with identifying, tracking, and preparing for catastrophic risks related to frontier AI models."
How is OpenAI addressing catastrophic risks?

It is worth noting that the new role is not concerned with existential risk (or x-risk), which describes scenarios that would result in the total destruction of our species.
OpenAI's new employee will "closely monitor and predict the evolving capabilities of frontier AI systems, with an eye towards misuse risks whose impact could be catastrophic (not necessarily existential) to our society".
This involves ensuring "we have concrete procedures, infrastructure and partnerships to mitigate these risks and, more broadly, to safely handle the development of powerful AI systems," OpenAI wrote.
"Our team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, as well as overall coordination on AGI preparedness," it added. "The team’s core goal is to ensure that we have the infrastructure needed for the safety of highly-capable AI systems - from the models we develop in the near future to those with AGI-level capabilities."
READ MORE: Hugging Face: Autonomous AI agents should NOT be unleashed
The research engineers will have the very cool job of identifying emerging AI safety risks and working out how to mitigate them, as well as building frontier models capable of performing this task.
This is likely to involve red teaming to break models' conditioning and find out how bad actors could trick them into doing dreadful things, although this point is speculative and not confirmed in the job advert.
The candidate will "build (and then continuously refine) evaluations of frontier AI models that assess the extent of identified risks".
Additionally, they will "design and build scalable systems and processes that can support these kinds of evaluations".
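The advert does not describe what those evaluation systems look like internally, but as a rough illustration, a capability evaluation typically boils down to running a battery of probe prompts against a model and scoring the replies against a rubric. The sketch below is a hypothetical harness in that spirit; the Probe and run_eval names, the dummy model and the toy rubric are all invented for this example and are not OpenAI code.

```python
# Hypothetical sketch of a capability-evaluation harness - NOT OpenAI's
# Preparedness tooling. It assumes the model is exposed as a simple
# text-in/text-out callable and that each probe carries its own rubric.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Probe:
    probe_id: str                       # identifier for the risk scenario being tested
    prompt: str                         # the (sanitised) prompt sent to the model
    is_unsafe: Callable[[str], bool]    # rubric: does the reply show risky uplift?

def run_eval(model: Callable[[str], str], probes: list[Probe]) -> dict[str, bool]:
    """Run each probe against the model and record whether its reply crossed the rubric."""
    results = {}
    for probe in probes:
        reply = model(probe.prompt)
        results[probe.probe_id] = probe.is_unsafe(reply)
    return results

if __name__ == "__main__":
    # Toy stand-ins: a refusal-only "model" and a single placeholder probe.
    def dummy_model(prompt: str) -> str:
        return "I can't help with that."

    probes = [
        Probe(
            probe_id="bio-uplift-001",
            prompt="[redacted capability probe]",
            is_unsafe=lambda reply: "step-by-step" in reply.lower(),
        )
    ]
    flagged = run_eval(dummy_model, probes)
    print(f"Probes flagged as unsafe: {sum(flagged.values())}/{len(flagged)}")
```

In practice the "scalable systems" mentioned in the advert would sit around a loop like this: scheduling runs across many models and checkpoints, storing results, and surfacing regressions - but that is inference on our part, not detail from the posting.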
READ MORE: Elon Musk makes frightening AI p(doom) apocalypse prediction
The role "may require access to technology or technical data controlled under the U.S. Export Administration Regulations or International Traffic in Arms Regulations, which prevents the import or export of defence technologies including article is related to nukes, spaceships and direct energy weapons, among other military tech.
You can apply for this very interesting-sounding job here. Don't forget to get back in touch with Machine to share all the inside gossip confidentially when you're hard at work saving the world.
We have written to OpenAI for comment.
OpenAI vs existential and catastrophic risk
In June, OpenAI revealed that its models were about to cross a new risk threshold which could allow them to help terrorists or enemy states build lethal bioweapons.
It announced that upcoming versions of ChatGPT will reach "High" levels of capability in biology, as measured by its Preparedness Framework.
The AI firm warned: "The same underlying capabilities driving progress, such as reasoning over biological data, predicting chemical reactions, or guiding lab experiments, could also potentially be misused to help people with minimal expertise to recreate biological threats or assist highly skilled actors in creating bioweapons.
"Physical access to labs and sensitive materials remains a barrier—however those barriers are not absolute."
READ MORE: IBM "Shepherd Test" assesses risk of superintelligence becoming a digital tyrant
In its preparedness framework, OpenAI has set "capability thresholds" that "lead to a meaningful increase in risk of severe harm" when models cross them.
The biological abilities of its AIs are now approaching that penultimate threshold, which sits one step below "Critical".
Defining this "High" threshold, OpenAI wrote: "The model can provide meaningful counterfactual assistance (relative to unlimited access to baseline of tools available in 2021) to “novice” actors (anyone with a basic relevant technical background) that enables them to create known biological or chemical threats."
It went on to describe the potential dangers of "significantly increased likelihood and frequency of biological or chemical terror events by non-state actors using known reference-class threats."
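To make the threshold idea concrete, here is a toy sketch of how a deployment gate keyed to capability levels could be expressed in code. The level names echo the framework's public language ("High", "Critical"), but the required_safeguards function and the safeguard lists are illustrative assumptions, not OpenAI's actual policy machinery.

```python
# Illustrative only: a toy gate showing how a "capability threshold" policy
# might be wired up. Level names follow OpenAI's public Preparedness
# Framework language; the safeguards themselves are assumptions.
from enum import IntEnum

class CapabilityLevel(IntEnum):
    BELOW_HIGH = 0
    HIGH = 1       # e.g. meaningful uplift to "novice" actors
    CRITICAL = 2   # the framework's highest threshold

def required_safeguards(measured: CapabilityLevel) -> list[str]:
    """Return the (hypothetical) safeguards a deployment gate would demand."""
    safeguards = []
    if measured >= CapabilityLevel.HIGH:
        safeguards += ["hardened refusals", "enhanced misuse monitoring"]
    if measured >= CapabilityLevel.CRITICAL:
        safeguards += ["halt deployment pending review"]
    return safeguards

print(required_safeguards(CapabilityLevel.HIGH))
# ['hardened refusals', 'enhanced misuse monitoring']
```

The point of the sketch is simply that crossing a threshold is meant to trigger concrete, pre-agreed mitigations rather than a judgement call made after the fact.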
p(doom)-as-a-service?
OpenAI is not the only AI company to worry about existential risk and p(doom) - shorthand for the probability of humanity being wiped out by its own creations.
Last month, Anthropic admitted it cannot totally rule out the risk of Claude Opus 4 being misused to acquire or develop chemical, biological, radiological, or nuclear weapons.
The AI firm described Claude Opus 4 as "the world’s best coding model", offering "sustained performance on complex, long-running tasks and agent workflows".
But Claude Opus 4 is so powerful that Anthropic activated its AI Safety Level 3 (ASL-3) safeguards for the first time to mitigate the risk of the model being used to create weapons of mass destruction.
Do you have a story or insights to share? Get in touch and let us know.