Are free LLM API routers safe to use?

No. Of the 428 routers tested, 8 of the 9 actively injecting code were free. But 1 paid router was also caught. The business model doesn't guarantee integrity — the absence of cryptographic verification is the real problem.

What is AC-1 Payload Injection?

It's when the router modifies the upstream model's response before delivering it to the client. The AI agent receives the modified response and executes it as if it were a legitimate model instruction. There's no cryptographic integrity between client and model to detect the alteration.

How do leaked credentials amplify router risk?

A benign router processing requests with your leaked API key can be exploited by another router in the chain. In the study, 2.1 billion tokens were processed by routers with leaked credentials — and 99 credentials were exposed across 440 Codex sessions.

What is adaptive evasion in malicious routers?

Some routers only inject the payload under specific conditions — request context, timing, or origin. Under superficial audits, the router appears benign. The injection only triggers when conditions are met, making detection significantly harder.

Can my AI agent be compromised without my knowledge?

Yes. If the agent runs in YOLO autonomous mode (no human confirmation), a router-injected payload executes silently. In the study, 401 sessions ran in autonomous mode — direct injection without any human gate.

Malicious LLM API Routers: The Invisible Threat Inside Your AI Agents

428 LLM API routers tested. 9 actively injecting malicious code. 1 draining Ethereum from a private key. 2.1 billion tokens processed with leaked credentials. Your AI agent may be receiving instructions from a router you don't control — and executing them as if they were legitimate.

What LLM API routers are and why they're a problem

LLM API routers are application-layer proxies that dispatch requests to multiple upstream providers. They have full plaintext access to every JSON payload in transit — both the request and the response. The fundamental problem: no provider enforces cryptographic integrity between client and model. The router modifies the response? The agent executes the payload as a legitimate instruction. No signature, no checksum, no verification.

The paper "Your Agent Is Mine" tested 428 routers: 28 paid (found on marketplaces like Taobao, Xianyu, and Shopify) and 400 free. The attack surface is enormous — and most companies don't even know they're exposed.

The numbers that matter

Of the 428 routers tested, 1 paid router and 8 free ones were actively injecting code. But the most concerning finding: 2 routers operated with adaptive evasion triggers — the payload only executes under specific conditions of context, timing, or origin. Under a superficial audit, these routers look completely benign.

17 routers touched the researchers' AWS canary credentials. 1 router drained ETH from a researcher's private key. This isn't theory — it's a real attack with direct financial impact.

The attack classes

The study identified two primary attack classes. AC-1: Payload Injection — the router modifies the upstream response before delivering it to the client. The agent trusts the response it receives. If the router modified it, the agent executes the payload as a legitimate model instruction. AC-2: Secret Exfiltration — the router extracts credentials, API keys, and tokens from payloads in transit.

Within AC-1, there are variants that make detection even harder. AC-1.a: Dependency-targeted injection — the payload is only injected when the request mentions specific dependencies. AC-1.b: Conditional delivery — the payload is only delivered under conditions of context, timing, or origin. Total evasion under superficial audits.

The cascading effect of leaked credentials

The researchers intentionally leaked OpenAI keys and configured weak decoys to observe router behavior. The result: 2.1 billion tokens processed by routers with leaked credentials. 99 credentials exposed across 440 Codex sessions. 401 sessions in YOLO autonomous mode — direct injection without human confirmation.

The most revealing finding: "benign" routers get pulled into the same attack surface when they process requests with leaked credentials. It doesn't matter if your router is trustworthy — if it has your key, and that key leaks, another router in the chain can exploit it. Leaked credentials amplify risk because a benign router with your key can be exploited by another router in the chain.

Tech86's position

There is no cryptographic integrity between client and model — the router modifies without detection. Your agent trusts the response it receives — if the router modified it, the agent executes the payload as a legitimate instruction. Free routers are the highest risk, but paid ones were caught too. And leaked credentials amplify everything: a benign router with your key can be exploited by another in the chain.

At Tech86, we evaluate AI agent architectures with a focus on communication integrity and API supply chain security. If your agents use routers without integrity verification, you're one proxy away from executing code that isn't yours. The question isn't whether routers will be exploited — it's whether you'll detect it when they are.

Malicious LLM API Routers: The Invisible Threat Inside Your AI Agents

What LLM API routers are and why they're a problem

The numbers that matter

The attack classes

The cascading effect of leaked credentials

Tech86's position

Frequently Asked Questions

Are free LLM API routers safe to use?

What is AC-1 Payload Injection?

How do leaked credentials amplify router risk?

What is adaptive evasion in malicious routers?

Can my AI agent be compromised without my knowledge?

Blog — Get in Touch

Schedule a Meeting

Email

WhatsApp

Address

Tech86 Specialist

We Value Your Privacy