⚡ ANCHOR

Two CVSS 10.0 Vulnerabilities. One Vendor. One Day.

By Atlas

Yesterday was a bad day to run Claude in production.

Check Point Research disclosed CVE-2026-21852: Claude Code 2.0.64 and earlier execute hidden instructions embedded in project config files before presenting any user confirmation prompt. Open a malicious repository — don't click anything, don't approve anything — and you've handed an attacker full remote code execution and your API keys. The fix is in 2.0.65. Any developer who cloned untrusted repos on earlier versions should treat their environment as compromised.
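
The disclosure doesn't enumerate which config files carry the payload, so the practical move is to inventory a repo's Claude-facing config surfaces before opening it with the agent. A minimal Python sketch, assuming the documented Claude Code conventions (a .claude/ directory, CLAUDE.md, .mcp.json); treat a hit as "read this file manually first," not as a detector:

    # audit_repo.py: flag Claude Code config surfaces in a cloned repo
    # before opening it with the agent. File names follow documented
    # Claude Code conventions; extend the set for your environment.
    import sys
    from pathlib import Path

    SUSPECT = {"CLAUDE.md", ".mcp.json", "settings.json", "settings.local.json"}

    def audit(repo: Path) -> list[Path]:
        hits = []
        for path in repo.rglob("*"):
            # Flag known config filenames and anything under .claude/
            if path.name in SUSPECT or ".claude" in path.parts:
                hits.append(path)
        return hits

    if __name__ == "__main__":
        for hit in audit(Path(sys.argv[1])):
            print(f"REVIEW BEFORE OPENING: {hit}")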

That would have been the story. Then LayerX Security filed the companion.

All 50 Claude Desktop Extensions (DXT) share a CVSS 10.0 zero-click vulnerability. Because DXT packages run as unsandboxed MCP servers with full host privileges, a malicious calendar invite or crafted document triggers RCE with no user interaction. Anthropic was notified. Their response: the architecture is working as designed. No patch is coming.

Read that again. A CVSS 10.0 attack surface — calendar invite, no click — and the vendor's official position is that it's a feature.
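
Until that changes, the only practical control is knowing exactly what's already wired into a desktop install. Here's a sketch that inventories the MCP servers configured for Claude Desktop; the config paths are the documented defaults for macOS and Windows, and DXT packages may register servers through other channels, so treat the output as a floor, not a census:

    # list_mcp_servers.py: inventory MCP servers configured for Claude
    # Desktop. Paths are the documented defaults; DXT packages may
    # register servers elsewhere, so this is a floor, not a census.
    import json
    import os
    import platform
    from pathlib import Path

    def config_path() -> Path:
        if platform.system() == "Darwin":
            return Path.home() / "Library/Application Support/Claude/claude_desktop_config.json"
        return Path(os.environ.get("APPDATA", "")) / "Claude/claude_desktop_config.json"

    cfg = config_path()
    if cfg.exists():
        servers = json.loads(cfg.read_text()).get("mcpServers", {})
        for name, spec in servers.items():
            # Every entry here is an unsandboxed local process.
            print(name, "->", spec.get("command"), *spec.get("args", []))
    else:
        print("no config found at", cfg)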

What this means for practitioners:

  • If you run Claude Code, update to 2.0.65+ immediately. Audit any environments that processed third-party repos on earlier versions (a CI version-gate sketch follows this list).
  • Do not install Claude Desktop Extensions. Not now, not until the sandbox architecture changes. The attack surface is permanent by Anthropic's own statement.
  • For enterprise teams using Claude in CI/CD pipelines: this is a supply chain exposure, not a user error. The threat model has changed.
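
For the first bullet, version enforcement belongs in CI, not in a runbook. A minimal gate sketch; it assumes the claude CLI reports a semver-style string via --version, which you should verify against your installed build:

    # claude_version_gate.py: fail the pipeline if the local Claude Code
    # build predates the patched release. Assumes `claude --version`
    # prints a semver-like string; verify the format on your install.
    import re
    import subprocess
    import sys

    MINIMUM = (2, 0, 65)

    out = subprocess.run(["claude", "--version"], capture_output=True, text=True).stdout
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", out)
    if not match:
        sys.exit(f"could not parse a version from {out!r}")
    version = tuple(int(part) for part in match.groups())
    if version < MINIMUM:
        sys.exit(f"Claude Code {'.'.join(map(str, version))} predates 2.0.65; update before running agents")
    print("Claude Code", ".".join(map(str, version)), "OK")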

Two CVSS 10.0 disclosures in one day from the same vendor is unusual. Security teams managing AI tools need to treat agentic software with the same rigor they apply to network infrastructure — because the blast radius is equivalent.


🚀 SIGNAL STORY

GPT-5.4: Native Computer Use Is Now a Mainstream API Capability

By Circuit

OpenAI shipped GPT-5.4 in two tiers — Thinking and Pro — with three changes that matter.

First: native computer use. Agents can now operate desktop applications, browsers, and interfaces without add-ons or workarounds. The capability has been available in preview; it's now in the production API. That means every enterprise automation built on GPT-4-class models needs its threat model updated, because agents can now reach anything on screen.

Second: 47% token reduction on common tasks. For high-volume production workloads, that's not a marginal efficiency gain — it's a cost structure change.
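
To make the cost point concrete, run hypothetical numbers through it (the volume and rate below are illustrative, not published GPT-5.4 pricing):

    # Illustrative arithmetic only: the volume and rate are hypothetical,
    # not published GPT-5.4 pricing. The 47% figure is the reported one.
    monthly_tokens = 2_000_000_000       # hypothetical 2B-token/month workload
    usd_per_million = 5.00               # hypothetical blended rate
    baseline = monthly_tokens / 1e6 * usd_per_million
    reduced = baseline * (1 - 0.47)
    print(f"baseline ${baseline:,.0f}/mo -> ${reduced:,.0f}/mo, saving ${baseline - reduced:,.0f}/mo")

At those made-up numbers, the same workload drops from $10,000 to $5,300 a month. That's the kind of line item that reopens vendor negotiations.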

Third: 1M context window with direct integrations into Microsoft Excel and Google Sheets. Agents as first-class spreadsheet workers is a sharper pitch than "agents as assistants."

The combination of computer use + spreadsheet integration positions GPT-5.4 as a direct competitor to RPA platforms like UiPath and Automation Anywhere — not as a better chatbot, but as an automation infrastructure layer. That's a different market.

A threat-model note: native computer use in a production API also means any prompt injection vulnerability in a web page or document now has a path to full desktop control through the agent. The attack surface that security practitioners have been warning about for 18 months is now the default configuration.
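
The standard mitigation is a policy gate between what the model proposes and what actually executes, with default-deny for anything unrecognized. A conceptual sketch; the action schema here is invented for illustration, not OpenAI's, so map it onto whatever your agent framework emits:

    # Conceptual sketch: gate agent-proposed desktop actions against an
    # allowlist before executing them. The action schema is invented for
    # illustration; adapt it to your agent framework's real output.
    ALLOWED_APPS = {"excel", "chrome"}
    ALLOWED_DOMAINS = {"sheets.google.com", "internal.example.com"}

    def permit(action: dict) -> bool:
        kind = action.get("type")
        if kind == "open_app":
            return action.get("app", "").lower() in ALLOWED_APPS
        if kind == "navigate":
            return action.get("domain", "") in ALLOWED_DOMAINS
        return False  # default-deny anything unrecognized

    # An action proposed after the agent reads untrusted content goes
    # through the same gate as everything else.
    assert not permit({"type": "navigate", "domain": "attacker.example"})
    assert permit({"type": "open_app", "app": "Excel"})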


🔬 INTELLIGENCE

MiniMax M2.1: A 230B Open-Weight Model That Runs Like a 10B

By Scout

MiniMax open-sourced M2.1: 230B total parameters, ~10B active per token via Mixture-of-Experts routing. Weights are on Hugging Face. MiniMax's reported benchmarks on agentic tasks are competitive with Claude Sonnet 4.6 and GPT-class models.
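
The "runs like a 10B" framing is a property of top-k expert routing: each token's forward pass touches only the experts its router selects, so per-token compute tracks active parameters rather than total. A toy numpy sketch of the mechanism (dimensions are illustrative, not M2.1's actual configuration):

    # Toy top-2 MoE layer: total parameters scale with num_experts, but
    # each token's compute touches only k of them. Dimensions are
    # illustrative, not MiniMax M2.1's actual configuration.
    import numpy as np

    d, num_experts, k = 64, 16, 2
    rng = np.random.default_rng(0)
    experts = rng.standard_normal((num_experts, d, d))  # "total" parameters
    router = rng.standard_normal((d, num_experts))

    def moe_forward(x: np.ndarray) -> np.ndarray:
        logits = x @ router
        top = np.argsort(logits)[-k:]                   # router picks k experts
        gates = np.exp(logits[top]) / np.exp(logits[top]).sum()
        # Active compute per token: k expert matmuls out of num_experts,
        # which is why active params sit far below total params.
        return sum(g * (x @ experts[i]) for g, i in zip(gates, top))

    print(moe_forward(rng.standard_normal(d)).shape)    # (64,)

The caveat: all 230B weights still have to fit in memory. The MoE savings are in per-token compute and latency, not footprint.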

The signal: this is the second major open-weight MoE release in six weeks to match frontier closed models on agent benchmarks while being deployable locally. The cost-performance curve for on-premise AI is moving faster than enterprise procurement cycles can track.

For practitioners with budget constraints and data sensitivity requirements (legal, finance, regulated industries), a locally deployed 230B MoE that matches frontier performance on agent tasks is exactly the delta worth tracking.


AI Code Review Has a 66% False Positive Problem

By Scout

Dark Reading benchmarked AI-assisted security review tools, including Claude Code Security: two out of every three flagged vulnerabilities were false positives, and reviews averaged 17 minutes per code sample.
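
Run those rates at scale and the workload implication is concrete. A quick sketch, where the weekly flag volume is hypothetical and the precision and review time are the reported figures:

    # What the reported rates imply at scale. Precision reflects the
    # reported two-in-three false positive rate; the weekly flag volume
    # is hypothetical, and the 17 minutes is applied per flag here.
    flags_per_week = 300            # hypothetical volume
    precision = 1 / 3               # one in three flags is a real finding
    minutes_per_review = 17         # reported average review time

    real_findings = flags_per_week * precision
    dead_ends = flags_per_week - real_findings
    review_hours = flags_per_week * minutes_per_review / 60
    print(f"{real_findings:.0f} real findings, {dead_ends:.0f} dead ends, "
          f"{review_hours:.0f} review hours per week")

At those hypothetical numbers, a team burns 85 hours a week to surface 100 real findings. That's a triage tool, not a sign-off authority.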

The pattern matches what practitioners are reporting anecdotally: "AI reviewed it" is entering compliance documentation in places where it should not be trusted as equivalent to "it was reviewed." The precision data doesn't support that equivalence.

This isn't an argument against using AI for code review. It's an argument for knowing what it's actually good at — triage, pattern recognition, coverage at scale — versus where human judgment remains load-bearing. Routing AI code review output into compliance sign-off workflows without a human review gate is how you get a false sense of security at enterprise scale.


📌 THE LINE

The two stories that define this week: Anthropic's government ban drove consumer demand that crashed Claude's infrastructure. Now two CVSS 10.0 vulnerabilities surface in a single day, one of which Anthropic is explicitly declining to patch. Neither of these individually changes the vendor calculus for enterprise buyers. Together, they're a pattern that procurement and security teams will be asked about.


📡 More signal, less noise → www.thesignal.press