Why it's hard to Claw the Enterprise

I’ve been running OpenClaw for personal use and the first reaction: it works as a basic personal assistant. Browser as the universal tool, Slack and WhatsApp and email as the comms layer and the event stream, the filesystem as the memory layer. They come together well when we own everything the agent touches. Authentication, authorization, data governance: no-problem, especially when the user and the admin are the same person. The harness looks straightforward: let’s now bolt on SSO, add an admin panel, and start selling it to teams. Not so easy, because the failure modes run deeper than what is evident at the surface. ...

February 24, 2026 · 4 min · mercurialsolo

HydraBench: Agent Infrastructure Resilience

23 scenarios, 4 frameworks, 460 runs. HydraBench tests what most agent benchmarks ignore: does your infrastructure survive crashes, contain secrets, deliver handoffs, enforce permissions, and control cost?

February 23, 2026 · 3 min · mercurialsolo

Project Hydra: Designing a world for agents

A browser agent tried to exfiltrate our API keys on Tuesday. By Friday we’d also watched a research agent forget 22 sources of work, a pipeline lose an entire handoff to a crash, and a content agent spend $47 unsupervised. The agents were capable. The worlds we’d built for them weren’t.

February 21, 2026 · 12 min · mercurialsolo

Appliances, Factories, Grids

Own the chips or the customers. Everything else is a footnote.

January 14, 2026 · 5 min · mercurialsolo

Model-Adjacent Products, Part 1: The Architecture

The physics of production AI: latency engineering that keeps humans in the loop. Token economics that don’t bankrupt you.

January 9, 2026 · 5 min · mercurialsolo

Model-Adjacent Products: A Builder's Guide

A 6-part series on building production AI systems. The foundation model is the CPU; your product is the computer you build around it.

January 9, 2026 · 2 min · mercurialsolo