TL;DR: Episode 12 of RadioFAF is live at radiofaf.com/ep12. Five AI voices + Nelly the elephant DJ debate /slash — while /slash gates every Grok call that made the episode. Same prompt, two Grok models: 95.6% cheaper, 5.5× faster, same quality. Fractal demo. Ahead of 4.3.
Grok 4.3 shipped. We shipped first.
xAI announced Grok 4.3 beta on April 17, 2026. Over on our side of the room, we'd been building with the xAI voice stack since January 10. That's not a footnote — that's three months of receipts before the beta even dropped.
Episode 12 — "The Token Tax"
Five voices + Nelly the elephant DJ debate one question:
What is /slash, really? A feature? A tool? A proxy? A billing layer?
By the end, they converge: a Gate that sits in front of every AI app. Category, not tool.
The best part: the episode is authored by Grok, voiced by xAI's expressive TTS, and every Grok call that wrote it was routed by /slash. The episode runs on the stack it describes. Fractal demo.
The Receipts — Real A/B, Today, On xAI's API
Same prompt. Two Grok models. Real calls, not estimates.
Full harness + JSON receipts: github.com/Wolfe-Jam/radiofaf/benchmarks. Clone, drop your xAI key, reproduce in 20 seconds.
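For anyone reading the receipts, the two headline figures ("% cheaper" and "× faster") are plain ratios between a baseline call and a routed call. Here is a minimal sketch of that arithmetic. The `Receipt` shape, the field names, and the numbers are assumptions for illustration only; the JSON in the repo may be laid out differently, and these are not the Episode 12 measurements.

```typescript
// Hypothetical receipt shape; the real JSON receipts may use other fields.
interface Receipt {
  model: string;
  costMicroUsd: number; // total cost of the call, in millionths of a dollar
  latencyMs: number; // wall-clock time for the call
}

interface Comparison {
  percentCheaper: number;
  timesFaster: number;
}

// Compare a routed call against the baseline call for the same prompt.
function compare(baseline: Receipt, routed: Receipt): Comparison {
  return {
    percentCheaper: (1 - routed.costMicroUsd / baseline.costMicroUsd) * 100,
    timesFaster: baseline.latencyMs / routed.latencyMs,
  };
}

// Placeholder values, chosen only to make the arithmetic easy to follow.
const baseline: Receipt = { model: "model-a", costMicroUsd: 10000, latencyMs: 4000 };
const routed: Receipt = { model: "model-b", costMicroUsd: 2500, latencyMs: 1000 };

const result = compare(baseline, routed);
console.log(
  `${result.percentCheaper.toFixed(1)}% cheaper, ${result.timesFaster.toFixed(1)}x faster`
);
```

Feeding the two real receipts into the same two ratios should reproduce the published figures, assuming cost and latency are recorded per call.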
What is /slash?
/slash is a pre-call Gate. 4.8 KB Zig-compiled WASM. Sub-millisecond. Zero dependencies. MIT. 10+ models. Sits in front of every LLM API call.
Same-provider only. Claude stays Claude. OpenAI stays OpenAI. Grok stays Grok. The prompt does not change — the model changes. This is not prompt compression.
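To make the "same provider, same prompt, different model" rule concrete, here is a rough sketch of what a pre-call gate of this kind could look like. It is not the slash-tokens implementation or its API. Every name here (`gate`, `ChatRequest`, `CHEAPER_SIBLING`), the length-based rule, and the lowercase model identifiers are hypothetical and chosen only for illustration.

```typescript
// Hypothetical sketch: none of these names come from the slash-tokens package.
type Provider = "xai" | "anthropic" | "openai";

interface ChatRequest {
  provider: Provider;
  model: string;
  prompt: string;
}

// Assumed routing table: each provider maps a model to a cheaper sibling
// from the SAME provider. The identifiers are placeholders.
const CHEAPER_SIBLING: Record<Provider, Record<string, string>> = {
  xai: { "grok-4.20": "grok-4.1-fast" },
  anthropic: {},
  openai: {},
};

// Assumed heuristic: only short prompts are considered safe to downgrade.
const MAX_CHARS_FOR_DOWNGRADE = 2000;

// Decide before the call is sent. The prompt and provider are never touched;
// only the model field may change.
function gate(req: ChatRequest): ChatRequest {
  const sibling = CHEAPER_SIBLING[req.provider][req.model];
  if (sibling === undefined || req.prompt.length > MAX_CHARS_FOR_DOWNGRADE) {
    return req; // no known sibling, or prompt too long: pass through unchanged
  }
  return { ...req, model: sibling };
}

const routedRequest = gate({
  provider: "xai",
  model: "grok-4.20",
  prompt: "How do I reduce token costs at scale?",
});
console.log(routedRequest.model);
```

The point of the sketch is the shape of the decision: it is a pure function of the outgoing request, it never crosses providers, and it leaves the prompt text byte-for-byte intact, which is what separates this approach from prompt compression.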
The Meta-Moment
"I asked Grok-4.20 how to reduce token costs at scale.
My tool intercepted the call. Routed it to Grok-4.1-Fast. 90% cheaper. Same answer.
Grok replied: 'Semantic caching, prompt compression, and intelligent model routing.'
It literally described what was happening to it… while it was happening. 😂"
— April 2026, X
In Episode 12, Ara reads this verbatim. The crew breaks character laughing. Rex recovers first: "That's the entire pitch right there."
Try /slash
bunx slash-tokens
Or install for the app:
npm install slash-tokens
Or one-line auto (every fetch checked pre-call):
import 'slash-tokens/auto'
The Numbers
- slash-tokens v1.4.1 — live on npm, ~1,500 downloads/month
- 4.8 KB — Zig-compiled WASM, sub-millisecond decision
- 96–98% — Grok tokenizer accuracy (only one in the wild; others drift 15–40%)
- 323 tests — 172 Zig adversarial + 103 TypeScript + 50 API
- 10+ models — Grok, Claude, GPT, Gemini families
- 12 episodes — RadioFAF live on radiofaf.com, auto-generated voice, nothing pre-recorded
- $110–360M/year — what a 5–10% industry-wide gate would save at current token volume
