Cloudflare AI Gateway now supports real-time spend limits for AI usage across multiple providers. The feature is meant to stop runaway token bills before they accumulate. By integrating with Cloudflare Access, companies can apply identity-driven budgets and policies, which ties AI cost governance more closely to users, teams, and access rules.
TechCrunch reports that GitHub Copilot will move to token-based billing on June 1, replacing a more predictable flat or request-based model. Some developers say their expected monthly costs could jump dramatically, citing examples of bills rising from about $29 to nearly $750, or from $50 to around $3,000. Others argue that the worst cases may reflect heavy vibe-coding usage, while critics say Microsoft encouraged that behavior before changing the economics.