cost-optimizationApril 19, 202611 min read

I Replaced 12 AI Subscriptions With One BYOK Workspace. Here Is the Math.

A first-person account of cancelling ChatGPT Plus, Claude Pro, Midjourney, Perplexity, Gemini Advanced, and seven others. The real cost numbers, the real tradeoffs, the workflows that broke and the ones that got better.

TL;DR

  • I was paying $487/month across 12 AI subscriptions. After a month of consolidation through a BYOK workspace, I am paying $58/month in API costs and getting access to more models, not fewer.
  • The catch: BYOK requires you to be a moderately heavy user. Light users (under ~$15/month of actual model usage) are better off with one or two flat subscriptions.
  • The biggest surprise was not the savings. It was how much friction the multi-subscription life was adding to my actual work — context switching, paywalls, model fragmentation, lost conversations across tools.
  • Things I lost: a couple of niche features in specific products (Midjourney's community gallery, Perplexity's research interface). Things I gained: cost transparency, model choice per task, all my history in one place.
  • This is not for everyone. But if you are paying for three or more AI subscriptions and using them seriously, the math is going to push you to the same conclusion.

The starting line

Here is what I was paying as of three months ago, ranked by monthly cost:

  • ChatGPT Plus: $20
  • Claude Pro: $20
  • Gemini Advanced (in Google One AI): $20
  • Midjourney Standard: $30
  • Runway Standard: $35
  • Perplexity Pro: $20
  • Cursor Pro: $20
  • Notion AI add-on: $10
  • Grammarly Premium (with AI features): $30
  • Otter.ai Pro: $17
  • ElevenLabs Creator: $22
  • HeyGen Creator: $24

Plus a few API credits I bought ad-hoc:

  • OpenAI API: ~$15/month average
  • Anthropic API: ~$10/month average
  • Replicate: ~$6/month average

Total: roughly $279/month in subscriptions, $31/month in scattered API credits, plus $177/month in tools that bundled AI as a feature (Notion, Grammarly, Cursor, Otter). About $487/month.

I am a heavy user. I write, I code, I make videos, I research. I justified it. Every individual subscription seemed reasonable at the time it was added. Twelve of them in aggregate did not seem reasonable when I finally added it up.

The consolidation experiment

I gave myself one month to migrate everything that could be migrated to a single BYOK workspace, and to honestly evaluate what I lost.

The plan:

  1. Sign up for direct API access with OpenAI, Anthropic, Google, Replicate (which covers Flux and Sora-class models), ElevenLabs (still has the best voice generation), and a fal.ai-style image/video aggregator.
  2. Use a single BYOK chat workspace as the primary interface.
  3. Cancel everything else as I confirmed I had a working replacement.
  4. Track actual API spend daily for 30 days.

Spoiler: it worked. The math is brutal in BYOK's favor for anyone using AI seriously. But the qualitative side was more interesting than the dollars.

Subscription by subscription

ChatGPT Plus → GPT-5 API

Cancelled day one. GPT-5 via API works exactly the way GPT-5 in the chat app works, with the same model. My actual usage was ~80 messages per day, mostly short. At GPT-5 API pricing this came to ~$8/month. Versus $20 flat.

Verdict: straight win. Saved $12/month, got access to the same model from anywhere I want.

Claude Pro → Claude Opus 4.7 + Sonnet 4.6 via API

Cancelled. Claude is my most-used model. In the chat app I was hitting the rate limit several times a week. Via API I never hit limits. Heavy month: ~$22 in actual API spend. Versus $20 flat.

Verdict: roughly cost-neutral, big quality-of-life improvement (no rate limits). Net win.

Gemini Advanced → Gemini 2.5 Pro API

Cancelled. I used Gemini almost exclusively for one thing — long-context document analysis where I need the 2M token window. Maybe 4-8 sessions per month. API cost: ~$3/month. Versus $20 flat.

Verdict: huge win. Saved $17 because I was paying for capability I rarely used.

Midjourney → Flux via Replicate / fal

Cancelled. This was the hardest call. Midjourney has the best community, the best image discovery, and a particular aesthetic that is genuinely hard to replicate. Flux is technically as good or better. The community I lost.

I generate ~150 images per month. Via Flux API at typical settings: ~$4/month. Versus $30 flat.

Verdict: $26/month savings, but I do miss the Midjourney community experience. Roll forward, no regrets, but it is the only cancellation I genuinely felt.

Runway → Sora 2 + a few alternatives

Cancelled. Sora 2 covers most of what I used Runway for (short b-roll, abstract visuals, motion experiments). For specific motion-graphics needs Runway is still better, but I don't need them often enough to justify $35/month.

Per-clip Sora 2 API costs vary wildly. My usage averaged ~$8/month.

Verdict: saved $27/month. Lost a couple of specific motion features I rarely used.

Perplexity Pro → GPT-5 with web search + manual research workflow

Cancelled. Perplexity's research UI is genuinely good. But I realized I was using it as "GPT-5 that can search the web" — which is now just GPT-5 in any decent chat app with search-enabled tool use.

Workflow change required. Net win in cost: $20/month. Net loss in convenience: small but real for one specific use case (deep research with citations).

Verdict: worth it, with mild reservations.

Cursor Pro → Cursor with BYOK API key

Did not cancel Cursor itself — it is the IDE. Switched it to use my own API keys (Cursor supports this) instead of their bundled subscription. My coding usage is significant; API spend was ~$25/month replacing the $20 flat.

Verdict: roughly cost-neutral, but the API spend correlates with how much I actually code, which feels more honest. See vibe coding in 2026 for context.

Notion AI → cancelled, use BYOK workspace alongside Notion

Cancelled. Notion AI was always an add-on feature I used occasionally. Replaced with: copy the Notion content into my BYOK chat workspace, do the AI work there, paste back. Slight friction, real $10/month savings.

Verdict: worth the friction.

Grammarly Premium → cancelled, use Claude Sonnet 4.6 for editing

Cancelled. Grammarly is great. But for the kind of editing I do (medium-length articles, professional emails), Claude Sonnet 4.6 with a good prompt does the same job. Browser extension for in-line correction is the loss; not enough to justify $30/month.

API cost for editing: maybe $2/month.

Verdict: $28/month savings. Mild loss of in-browser correction.

Otter.ai → Whisper API + a transcript cleaner

Cancelled. Otter has nice integration with Zoom and Google Meet. Whisper-class API transcription is cheaper and arguably more accurate. For meeting transcription I now record locally and process via API.

API cost: ~$2/month at my usage. Versus $17 flat.

Verdict: $15/month win, lost the auto-join-meetings feature which I had not actually used in months.

ElevenLabs Creator → ElevenLabs API

Switched to API instead of subscription. My voice generation usage is moderate; API spend ~$8/month versus $22 flat.

Verdict: $14/month savings.

HeyGen Creator → kept

Did not cancel. HeyGen is the only product on the list that does something I cannot easily replicate via API and standard models — full avatar video generation. The category is moving fast and may consolidate, but for now it is its own thing.

Verdict: kept. $24/month stays.

The new total

Subscriptions kept:

  • HeyGen: $24
  • BYOK workspace (NovaKit, the one I built and use): part of working on it; for an external user, ~$10-15/month for a workspace tool depending on tier

Direct API spend in month one:

  • OpenAI (GPT-5, Whisper): ~$11
  • Anthropic (Claude Opus 4.7, Sonnet 4.6): ~$22
  • Google (Gemini 2.5 Pro): ~$3
  • Replicate / fal (Flux, Sora): ~$12
  • ElevenLabs: ~$8
  • Misc (smaller models, occasional experiments): ~$3

Total API: ~$59/month. Plus HeyGen + workspace: ~$34/month.

Grand total: ~$93/month versus the previous $487/month. A saving of roughly $394/month, or about $4,700/year.

What got better, beyond cost

The dollars are the headline. The qualitative changes were bigger.

One conversation history. Everything I asked any model is now searchable in one place. Previously, my Claude history, my GPT history, and my Gemini history lived in three siloed apps. I lost work constantly because I could not remember which app a conversation was in.

Per-task model choice. I now route each request to the model best for it. Long-context document review goes to Gemini. Code goes to Claude. Quick lookup goes to GPT-5 mini. I was not doing this in the subscription world because the friction of switching apps cancelled the benefit. In one workspace, switching is one click.

Actual cost transparency. I see exactly what each conversation cost. This changed my behavior. I think before sending a long context to Opus when Sonnet would do. I do not waste calls. The line between "useful AI use" and "habitual AI use" got clearer.

No paywall surprises. ChatGPT used to throttle me at the worst times. Claude rate-limited me on Friday afternoons. With direct API access, I have never hit a limit since the migration.

One bill. Or rather, three or four bills (one per provider) instead of twelve. Easier to forecast, easier to expense, easier to audit.

What got worse

To be fair:

  • Niche product features are gone. I miss a few specific things — Midjourney's discovery feed, Perplexity's research UI, Grammarly's browser inline correction. None of them moved my work meaningfully but each one had a moment of friction at first.
  • Setup time. First weekend was 6-8 hours of getting API keys, configuring the workspace, importing templates, learning new UI patterns. After that, zero overhead.
  • You become the integration layer. No one is bundling things for you. If you want a new capability, you are responsible for finding the right model and wiring it up. I happen to enjoy this. Many users will not.

Who this works for

Be honest with yourself.

BYOK is right for you if:

  • You currently pay for 3+ AI subscriptions.
  • Your monthly AI usage is moderate to heavy (you'd estimate $15+ of API equivalent).
  • You value model choice and care about routing per task.
  • You want your data and conversation history in one place under your control.
  • You are comfortable with mild technical setup.

BYOK is wrong for you if:

  • You use AI lightly (a few queries per week).
  • You only use one product and one model and are happy with it.
  • You strongly value specific bundled features (Midjourney community, Perplexity UI, etc.).
  • You hate any technical setup.

For light users a single ChatGPT Plus or Claude Pro subscription is fine. The BYOK math kicks in at moderate usage and gets dramatically better as you go heavier. See why BYOK saves money for the breakeven analysis.

Things I learned

A handful of unexpected lessons after a month:

Most "AI features" in non-AI products are paying you for convenience, not capability. Notion AI and Grammarly's AI are not better models than what is on the API. You are paying for the integration. Sometimes worth it; usually not.

Subscription pricing assumes you under-use. Every flat subscription bets that you will use less than the cost of the API equivalent. They make money when you under-use; you save money when you over-use. The asymmetry favors them on average and you specifically when you are heavy.

Multi-app friction was costing me more than I knew. I had been mentally amortizing the context-switching cost as "the cost of doing business." Removing it freed an unexpected amount of attention.

The model is not the product. This is the meta-lesson. Which underlying GPT or Claude version a tool runs is less and less interesting. The interface, the workflow, the data integration — that is the product.

For the broader version of this argument, see consolidating AI subscriptions.

Common failures of consolidation

In case you try this:

Migrating too fast. I tried to cancel everything in week one. Bad call. Run the BYOK setup in parallel with your existing subscriptions for two weeks. Cancel only after the replacement is proven.

Underestimating one specific workflow. For me it was almost Perplexity's deep research. I had to build a manual workflow in my new workspace before I could safely cancel.

Optimizing too hard on cost. Picking the cheapest model for everything makes the experience worse. Pay for Opus when Opus is right.

No conversation history migration. I lost my Claude.ai history. Some BYOK workspaces let you import; mine did not. If history matters, plan for it.

Forgetting team licenses. If your team uses ChatGPT Team or Claude Teams, individual BYOK does not help your colleagues. Different conversation.

The bottom line

I went from twelve subscriptions to a small handful, from $487/month to ~$93/month, and from fragmented tooling to one workspace. The qualitative improvements were larger than the financial ones.

This will not be everyone's path. But for anyone heavy enough on AI to be reading this kind of post, the math is going to push you in the same direction over the next year or two. The subscription model worked when AI was new and bundled-convenience was worth paying for. As models commoditize and BYOK workspaces mature, that calculation flips.

I would not go back.

The summary

  • 12 subscriptions, $487/month → ~$93/month with BYOK and one or two kept subscriptions.
  • Heavy users save the most. Light users should stay on flat plans.
  • The workflow gains were bigger than the dollar savings: one history, per-task routing, cost transparency.
  • A few niche features are genuinely lost. Decide if any of them are deal-breakers for you.
  • The model layer is commoditizing. Bet on workspaces that let you swap, not lock in.

The 12-subscription life was not sustainable and I should have done this a year earlier.


NovaKit is the BYOK workspace I now use as my primary AI interface. Every model in one place, your keys local, you pay providers directly. If the math in this post is your math, give it a try.

NovaKit workspace

Stop reading about AI tools. Use the one you own.

NovaKit is a BYOK AI workspace — chat across providers, compare model costs live, and keep conversations on your device. No markup on tokens, no lock-in.

  • Bring your own keys
  • Private by default
  • All models, one workspace

Keep exploring

All posts