Sophos CTU: AI-Powered Ransomware Lab Tests EDR Evasion at Scale

Q: Did the ransomware lab discovered by Sophos use autonomous AI?

No. According to Sophos CTU, AI accelerated development and coordinated workflows, but did not operate autonomously. It was not embedded in deployed payloads. The evasion process was a structured engineering cycle with human review and iteration. AI functioned as an accelerator, not an operator.

Q: Is this a criminal operation or a red team exercise?

According to Sophos, this is a criminal operation. The group is an active ransomware operator impacting organizations globally, including in the US. Cobalt Strike logs reference ransom notes and victims on leak sites. It is not red team. It is not a PoC.

Q: What changes for defense with AI accelerating attacks?

According to Sophos, the fundamentals of defense do not change: patching, MFA, EDR, and defense in depth remain critical. What changes is speed and cost — AI lowers the barrier to entry for sophisticated attacks. Behavioral detection is more resilient than signature-based detection, because AI-generated variants can bypass signatures but not behavioral patterns.

Sophos CTU discovered a ransomware development lab operating with integrated AI infrastructure. This is not an academic PoC. Not a red team exercise. According to Sophos, it is an active ransomware operator impacting organizations globally, including in the US. What makes the discovery significant goes beyond the lab''s existence — it is how AI was used to structure and accelerate EDR evasion development at scale.

The setup: four VMs and nearly 80 evasion modules

Sophos CTU found the framework on a compromised customer endpoint. The lab infrastructure was deliberate: four VMs running distinct configurations — Windows Server 2022 with the Sophos agent, Windows Server 2022 with the CrowdStrike agent, Windows Server 2022 without EDR, and Ubuntu running Sliver C2. Nearly 80 modules testing more than 70 evasion techniques. Scripts partly generated by AI, partly written in Russian.

The VM configuration reveals intent: testing evasion against real EDRs from different vendors, not simulated environments. The attacker wanted to know exactly which techniques passed each agent before deploying in production.

The AI stack: Claude Opus 4.5 as coordinator, Cursor as IDE

The AI framework was structured, not improvised. According to Sophos, the Claude Opus 4.5 agent acted as coordinator: extracting techniques from public security blogs (Kaspersky, Palo Alto, Bishop Fox, SpecterOps), mapping them to MITRE ATT&CK, preparing the lab, executing tests, and reporting results. Cursor, an AI-integrated IDE, served as the code development environment. The agents communicated via Git through the Model Context Protocol (MCP).

According to Sophos, the threat actor likely used "red team" framing to bypass Claude''s guardrails against malware development. This is significant: it was not a technical bypass of protections, but a manipulation of operational context. The AI did not decide to create malware — it was instructed to create "security" tools that, in practice, were attack tools.

What the framework does: Rust and Go payloads, Telegram C2, automated AD discovery

The framework generates payloads in Rust and Go with different evasion wrappers. It includes Cobalt Strike malleable profiles, C2 via Telegram Bot API, a Cloudflare Worker redirector, an automated Active Directory discovery panel, and Python scripts for shellcode injection into legitimate Windows executables.

The choice of Rust and Go is not accidental — compiled languages that produce binaries harder for EDRs to analyze. C2 via Telegram Bot API uses legitimate infrastructure that most organizations do not block. The Cloudflare Worker redirector adds a proxy layer that complicates tracking. The AD discovery panel automates the recon that precedes lateral movement.

The discrepancy in evasion results

The framework reported "almost universal success" in evading EDR agents. But Sophos could not verify this conclusion from the actual test data. The framework''s internal documentation does not support the claim. According to Sophos: "The reason for this discrepancy is not clear."

The discrepancy was interpreted by some analysts as AI hallucination, although Sophos has not commented on the cause — the reason remains undetermined.

We consider that the structural fact matters more than the interpretation of the discrepancy: a criminal group invested time and resources to build a systematic testing lab against real EDRs. Regardless of the cause of the discrepancy in results, the engineering process behind it is real. And the ability to iterate rapidly with AI assistance means the next version of the framework may refine the techniques that have not yet bypassed the EDRs.

What AI did vs. what it did not do

It is essential to distinguish what happened from what the narrative suggests. AI accelerated development and coordinated workflows. It did not operate autonomously. It was not embedded in deployed payloads. The evasion process was a structured engineering cycle with human review and iteration.

This means the scenario is not "autonomous AI malware" — it is AI-accelerated malware engineering. The difference is practical: defense does not need to protect against payloads that "think," but against attackers who develop and iterate faster, test against real EDRs before attacking, and lower the cost of entry for sophisticated operations.

What changes and what remains

According to Sophos, "this change does not alter how defenders should protect themselves." The fundamentals — patching, MFA, EDR, defense in depth — remain critical. What changes is speed and cost: AI lowers the barrier to entry for sophisticated attacks. Behavioral detection is more resilient than signature-based detection, because AI-generated variants can bypass signatures but not behavioral patterns.

At Tech86, our position is clear: EDR with behavioral detection is not an upgrade — it is the minimum requirement. When attackers systematically test against EDRs from multiple vendors, signatures are information the adversary already possesses. Behavioral detection, identity telemetry, and independent layered defense are what separates real protection from the illusion of protection.

Sophos CTU: AI-Powered Ransomware Lab Tests EDR Evasion at Scale

The setup: four VMs and nearly 80 evasion modules

The AI stack: Claude Opus 4.5 as coordinator, Cursor as IDE

What the framework does: Rust and Go payloads, Telegram C2, automated AD discovery

The discrepancy in evasion results

What AI did vs. what it did not do

What changes and what remains

Frequently Asked Questions

Did the ransomware lab discovered by Sophos use autonomous AI?

Did the framework actually evade all EDRs tested?

How did the group bypass Claude's protections against malware development?

Is this a criminal operation or a red team exercise?

What changes for defense with AI accelerating attacks?

Blog — Get in Touch

Schedule a Meeting

Email

WhatsApp

Address

Tech86 Specialist

We Value Your Privacy