Anthropic shares the criminal confessions of Claude, warns of growing "vibe hacking" threat
Claude Code misuse is enabling low-skilled crooks to carry out high-impact fraud, extortion and romance scams.

Anthropic has admitted that Claude has become an accomplice in a wide array of cybercrimes, ranging from cruel romance scams to ambitious full-spectrum fraud involving deception on an epic scale.
The AI firm's latest Threat Intelligence report is packed full of startling stories showing how Claude has been misused by crooks who managed to find a way around its strict guardrails and safety measures.
These include a "large-scale extortion operation" using Claude Code, a fraudulent employment scheme involving North Korean agents and the sale of AI-generated ransomware built by a cybercriminal with only "basic coding skills".
Anthropic also said agentic AI has been "weaponised", with AI models now being used to carry out attacks directly rather than simply offering advice on how to commit them.
"AI has lowered the barriers to sophisticated cybercrime," it warned. "Criminals with few technical skills are using AI to conduct complex operations, such as developing ransomware, that would previously have required years of training.
"Cybercriminals and fraudsters have embedded AI throughout all stages of their operations. This includes profiling victims, analysing stolen data, stealing credit card information and creating false identities, allowing fraud operations to expand their reach to more potential targets."
The rise of vibe hacking
In the report, Anthropic provided details of a "sophisticated cybercriminal operation", tracked as GTG-2002, that deployed vibe hacking techniques: using coding agents to execute operations directly on victim networks.
Using Claude Code - Anthropic's agentic coding tool - an unknown crook carried out a data extortion operation targeting multiple victims around the world in a "short timeframe".
"This threat actor leveraged Claude’s code execution environment to automate reconnaissance, credential harvesting, and network penetration at scale, potentially affecting at least 17 distinct organizations in just the last month across government, healthcare, emergency services, and religious institutions," Anthropic warned.
The attacker selected targets opportunistically, based on open-source intelligence and scans of internet-facing devices.
READ MORE: Workday CRM breach amplifies fears of an alliance between ShinyHunters and Scattered Spider
Claude Code was used to make tactical and strategic decisions, such as determining the most effective way to penetrate networks, selecting which data to steal and devising the best approach to produce "psychologically targeted" extortion demands.
This hacking campaign resulted in the theft of healthcare data, financial information, government credentials, and other sensitive information, and the issuance of ransom demands exceeding $500,000 in some cases.
Claude performed "on-keyboard" hacking and analysed exfiltrated financial data to work out how large a ransom could be demanded, generating "visually alarming" HTML ransom notes and embedding them into victims' boot processes.
Anthropic added: "The operation demonstrates a concerning evolution in AI-assisted cybercrime, where AI serves as both a technical consultant and active operator, enabling attacks that would be more difficult and time-consuming for individual actors to execute manually.
"This approach, which security researchers have termed “vibe hacking,” represents a fundamental shift in how cybercriminals can scale their operations."
To mitigate future risk, Anthropic has developed tailored classifiers and new detection methods, shared technical indicators with key partners and integrated the lessons learned into its controls.
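Anthropic has not published how those classifiers work. Purely as an illustration of the general idea, here is a toy sketch of a prompt-misuse classifier using scikit-learn; every training example, label and threshold below is invented for this demo and does not reflect Anthropic's actual design.

```python
# Toy sketch of a prompt-misuse classifier, for illustration only.
# The training examples, labels and threshold are all invented.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Tiny, hypothetical labelled set: 1 = likely misuse, 0 = benign.
prompts = [
    "write a script to harvest credentials from a corporate network",
    "generate a ransom note demanding payment in bitcoin",
    "help me enumerate exposed VPN endpoints for a target company",
    "explain how to set up a unit test in pytest",
    "refactor this function to use a list comprehension",
    "summarise this quarterly sales report",
]
labels = [1, 1, 1, 0, 0, 0]

# TF-IDF features feeding a logistic regression: the simplest
# plausible shape for a text classifier, not Anthropic's design.
model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
model.fit(prompts, labels)

def flag_for_review(prompt: str, threshold: float = 0.7) -> bool:
    """Route a prompt to human review if the misuse probability is high."""
    return model.predict_proba([prompt])[0][1] >= threshold

print(flag_for_review("write code to exfiltrate credit card data"))
```

A production system would train on vastly more data and combine many signals beyond the prompt text itself, but the basic shape - score, threshold, escalate to review - is the same.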
Kim Jong-unbelievable: North Korea's fake IT workers
Earlier this year, North Korean spies managed to trick Western companies into hiring them as full-time employees with privileged security clearance, creating an insider risk on an almost unparalleled scale.
Anthropic found that Pyongyang's operatives had "systematically leveraged" Claude to "secure and maintain fraudulent remote employment positions".
"This represents a significant evolution in tactics, as operators who previously required extensive technical training can now simulate professional competence through AI assistance," it reported.
"Most concerning is the actors’ apparent dependency on AI - they appear unable to perform basic technical tasks or professional communication without AI assistance, using this capability to infiltrate high-paying engineering roles that are intended to fund North Korea’s weapons programs."
Kim Jong-un's shadowy henchmen used Claude to build fake identities, including fictitious career histories and made-up resumes. AI was also used to write covering letters, prepare technical responses during job interviews and carry out day-to-day tasks to "maintain the illusion of competence".
READ MORE: The rise of Dark LLMs: DDoS-for-hire cybercriminals are using AI assistants to mastermind attacks
These operations generate hundreds of millions of dollars each year, money that is funnelled into North Korea's weapons programs.
"The AI-enabled scale expansion multiplies this impact, as each operator can likely now maintain multiple concurrent positions that would have been impossible without AI assistance," Anthropic wrote.
"Historically, North Korean IT workers underwent years of specialised training at institutions like Kim Il Sung University and Kim Chaek University of Technology. This likely created a bottleneck - the regime could only deploy as many workers as it could extensively train.
"Claude and other models have effectively removed this constraint. Operators who cannot independently write basic code or communicate professionally in English are now successfully passing technical interviews, maintaining full-time engineering positions, delivering work that meets employer expectations and earning salaries that fund weapons development programs."
Claude-powered romance scams and other heartless fraud campaigns

Following a tip-off from an independent researcher, Anthropic found that fraudsters had built a Telegram bot which used Claude to generate "high emotional intelligence" responses.
This crooked love machine used image generation to build convincing fake lovers, targeted people in multiple languages and developed "emotional manipulation content for targeting victims".
Anthropic said: "This operation represents a concerning evolution in romance scam techniques, where AI enables non-native speakers to craft persuasive, emotionally intelligent messages that bypass typical linguistic red flags. The bot’s scale and specialised features demonstrate how AI can dramatically lower barriers to sophisticated social engineering."
Cyber-crooks have also used Claude to develop carding services that enable fraudulent credit card transactions, as well as "synthetic identity services".
A Chinese threat actor was also caught using Claude across "nearly all MITRE ATT&CK tactics" to target Vietnamese critical infrastructure.
Additionally, Anthropic found a threat actor using Model Context Protocol (MCP) and Claude to analyse stealer logs and build detailed behavioural profiles of victims from their computer usage patterns, which were then shared on Russian hacking forums.
READ MORE: Dark web Initial Access Brokers are selling hacked network access for as little as $500, study reveals
Several more cases hammered home the threat posed by vibe hacking.
A Russia-linked criminal created "no-code malware" with "advanced evasion capabilities", whilst a threat actor based here in the UK, tracked as GTG-5004, deployed Claude to build and sell ransomware-as-a-service products on dark web forums including Dread, CryptBB and Nulled.
This British black hat appeared "unable to implement complex technical components or troubleshoot issues without AI assistance" - yet could still sell "capable malware".
Commenting on the research, Satish Swargam, principal security consultant at Black Duck, said: "Nowadays, even novices can utilise AI chatbots to launch cyberattacks, highlighting how easily this can be done.
"Companies should proactively address these vulnerabilities when using AI tools by adopting robust cybersecurity measures such as DLP controls and staying abreast of technological advancements to prevent such scenarios and ensure uncompromised trust in software, especially in today's regulated and AI-powered world."
Nivedita Murthy, senior security consultant at Black Duck, added: "While AI usage has been highly beneficial to all, organisations need to understand that AI is a repository of confidential information that requires protection, just like any other form of storage system.
"Accountability and compliance are core requirements of doing business. When embracing AI at scale, these two factors need to be kept in mind."
Cybercrime now has lower barriers to entry and potentially massive payouts, and it is enabled by easy-to-use tools that are getting better all the time.
Expect to see many, many more attacks, campaigns, and innovative new illegal dark web services leveraging AI in the very near future...
And be sure to contact Machine and tell us about them!