Every Token an Iceberg

Inference workloads now account for 80% of AI compute spending. The hierarchy in tokens is no longer about information density—it’s what happens when the token leads somewhere wrong.

January 29, 2026 · 10 min · mercurialsolo

Model-Adjacent Products, Part 3: Quality Gates

Model outputs are hypotheses that need verification pipelines to catch errors before users do.

January 9, 2026 · 4 min · mercurialsolo