Blog

Thoughts on AI cost management and building sustainable products.

June 24, 2026

How to implement per-user AI cost limits in your app (without rebuilding it every time)

OpenAI and Anthropic have no native per-user cost controls. Here's the architecture that actually works — Redis counters, edge function intercepts, and fire-and-forget logging — plus how to handle streaming.

June 17, 2026

The right way to rate-limit AI API calls in your SaaS

Request-count limits miss the point. Spend-based limits are what actually protect your margins. Here's how to implement them properly without building your own Redis infrastructure.

June 10, 2026

OpenAI vs Anthropic: Token costs compared for production apps

A practical breakdown of what GPT-4o, GPT-4.1, Claude Sonnet, and Claude Haiku actually cost at scale — and how to pick the right model for your app without getting surprised by the bill.

June 1, 2026

Introducing Nasca: Per-user AI cost limits in 10 lines of code

We built Nasca because we kept running into the same problem: one power user could burn through your entire monthly OpenAI budget overnight. Here's how we solved it.