On this page
- TL;DR
- The market shape, briefly
- What is happening to the model layer
- The categories that will eat the most value
- 1. Internal enterprise conversational interfaces
- 2. Vertical agents
- 3. Voice interfaces
- 4. Embedded copilots
- 5. Orchestration and workflow layers
- The categories that are overrated
- Another general-purpose consumer chat app
- Pure prompt engineering tools
- Wrapper apps with no defensibility
- Where founders should build
- Where users should buy
- The five-year picture
- What gets in the way
- The summary
TL;DR
- The global conversational AI market is on a path to roughly $80B by 2030, growing 25-30% per year. The headline number is real but hides a more interesting story underneath.
- The value is not going to general-purpose chatbots — those have collapsed into commodity model APIs. The value is going to vertical agents, voice interfaces, embedded copilots, and orchestration layers.
- The biggest underrated category: internal enterprise conversational interfaces that replace dashboards, forms, and SaaS UIs. Multi-billion dollar opportunity hidden inside boring-looking deployments.
- The biggest overrated category: another consumer chat app. The shelf is full and consumer attention has saturated.
- For builders: pick a wedge (vertical, modality, or distribution channel) and ride the inevitable model commoditization. The model layer is not your moat. Workflow, data, and integration are.
The market shape, briefly
A reasonable consensus from the analyst reports converging in early 2026 puts global conversational AI at $30-35B in 2025, on a path to roughly $75-85B by 2030. Call it $80B. Different reports slice categories differently, but the broad shape:
- Enterprise conversational platforms (contact centers, employee support, knowledge management): largest single slice, ~40% of the market.
- Voice AI (assistants, IVR replacement, in-vehicle, smart speakers): ~20%, fastest-growing modality.
- Consumer chat applications and assistants: ~15%, growth slowing.
- Vertical conversational agents (healthcare triage, legal intake, financial advisory): ~15%, fastest-growing category by value.
- Developer platforms and orchestration: ~10%, the picks-and-shovels layer.
The interesting thing is not the topline. It is the redistribution. Five years ago, "conversational AI" was almost entirely chatbots and IVR. By 2030 it is a much wider sport, and the leaderboard rearranges.
What is happening to the model layer
The foundation models are commoditizing. Not in the sense that they are getting worse — they are getting better. In the sense that the performance gap between top models is shrinking and switching costs are low.
In 2026, Claude Opus 4.7, GPT-5, and Gemini 2.5 Pro are within striking distance on most evals. For 80% of conversational use cases, all three are equivalent. For another 15%, one has a clear edge. For the remaining 5%, only one will do.
This means: the model is not the product. It is an input. The product is everything around it — the data layer, the workflow, the UX, the integrations, the trust model.
The companies that bet their valuation on owning the best model are in a brutal race. The companies that bet on owning the workflow around models are in a much better position. (See why BYOK saves money for the user-side version of this argument.)
The categories that will eat the most value
1. Internal enterprise conversational interfaces
The biggest underrated opportunity. Most large enterprises have dozens of internal SaaS tools — HR, expense, IT support, procurement, knowledge bases, ticketing — each with its own dashboard, login, and learning curve.
In 2026, more and more of those interactions are moving to a single conversational interface. "Approve John's expense report." "Show me the open headcount for Q3." "Find the vendor contract for Acme." The dashboards still exist, but they are no longer the primary interface for most users.
This is unsexy from the outside and enormous in revenue. Mid-market and enterprise deals at $100k-$5M ARR. Long sales cycles. Sticky once deployed.
Builders here win on integration depth (number and quality of system connectors) and trust (audit, security, role-based access). The model is interchangeable.
2. Vertical agents
Healthcare intake. Legal document review. Tax preparation. Real estate offers. Insurance claims. Each of these is a category where a domain-specialized conversational agent can replace 20-50% of human workflow with measurable cost savings.
The wedge here is regulatory and data depth. A general-purpose chatbot cannot handle a tax interview. An agent built specifically for tax interviews, with the right citations, the right workflow, the right liability handling, can.
Most of these will be acquired or partnered with the incumbent of the vertical (Intuit for tax, Epic for healthcare, etc.) but the building period is now and the ones that nail product-market fit get bought for serious multiples.
3. Voice interfaces
The voice gap closed in 2025. Latency is sub-300ms end-to-end. Naturalness is human-indistinguishable for short interactions. Multilingual is solved.
This means voice is now a real interface for things voice was never good at: long-form interaction, complex workflows, agentic tasks. Customer support is the obvious one (incumbents are scrambling), but interesting categories:
- In-car commerce and concierge.
- Field worker hands-free interfaces (construction, manufacturing, logistics).
- Eldercare and accessibility tools.
- Phone-based agentic services (book the appointment, negotiate the bill, follow up on the claim).
Voice has a different distribution model than chat. Less app store, more telephony / embedded / OEM. Different go-to-market.
4. Embedded copilots
Every SaaS product is adding a conversational copilot. Most are bolt-ons. The ones that work share a property: they are deeply integrated with the underlying data and actions of the host app.
Notion's AI is useful because it knows your workspace. Linear's AI is useful because it knows your tickets. Salesforce's Einstein is useful (sometimes) because it knows your pipeline.
The opportunity here is not building yet another copilot. It is building the infrastructure and components that let SaaS companies add good copilots quickly. This is a developer platform play and the early winners are already visible.
5. Orchestration and workflow layers
Connecting models, tools, and data into multi-step agents. This is the picks-and-shovels of the conversational AI economy.
Examples: agent runtimes that handle planning, tool use, error recovery, and human handoff. Memory layers that give agents continuity across sessions. Knowledge graph systems that let agents reason over structured enterprise data. Eval and observability platforms for agent quality.
Smaller market in dollar terms but high margin and central to everything else. The companies that own the orchestration layer for serious deployments will be acquired or IPO at large valuations.
The categories that are overrated
Another general-purpose consumer chat app
The consumer chat shelf is full. ChatGPT, Claude, Gemini, plus the dozen also-rans. Consumer attention has saturated. Switching costs are low. CAC is brutal. Margins thin to nothing because the model is the cost.
A new general-purpose chat app in 2026 is starting from behind on every dimension. Unless you have a clear distribution advantage (a billion-user existing product, an unfair platform position), this is not the play.
The exception: BYOK workspaces that consolidate access to many providers and let users own their data. That is a different product category and a different value proposition — see consolidating AI subscriptions.
Pure prompt engineering tools
The 2023 wave of "prompt management" platforms is largely gone or pivoting. Prompts are short, version-controlled in code, and most teams handle them in their existing dev workflow. There is a tiny niche for the largest enterprises with prompt sprawl. Not a category.
Wrapper apps with no defensibility
"AI for X" where X is generic and the only thing the product does is shape a model output is now table stakes for any incumbent. If a wrapper app's only moat is the prompt, the incumbent will replicate it in a quarter.
The wrappers that survived all moved up the stack into workflow, data, and distribution. The ones that did not are gone.
Where founders should build
If you are a builder thinking about the next 24 months in conversational AI, the durable advantages are:
- Distribution. Existing audience, embedded integration in a high-traffic product, a sales motion into a specific industry.
- Data. Proprietary or hard-to-replicate datasets that make your agent measurably better in your category.
- Workflow depth. Every step of the user's job mapped, integrated, and automated end to end. Not "give the user a chat input."
- Trust and compliance. For regulated industries, the model is 20% of the work. The audit trails, role-based access, data residency, and human-in-the-loop are 80%.
- Vertical specialization. A great healthcare agent beats a generic agent at healthcare every time. Pick a vertical and own it.
The non-durable advantages are:
- "We use the best model." The best model changes quarterly.
- "We have great prompts." Prompts are copy.
- "We are fastest." Speed is a moment, not a moat.
- "We are cheapest." Margin races are a death spiral when the underlying inputs are commoditizing.
Where users should buy
If you are an enterprise or a serious individual user, the meta-strategy is:
- Avoid lock-in. The model layer is moving fast. Bet on platforms that let you swap models, not platforms that lock you in.
- Own your data. Conversational interfaces produce vast logs of context, preferences, and history. That is your asset. Make sure it stays yours.
- Pay providers directly when you can. Subscription markups in this space are 5-30x the underlying API cost. BYOK workflows save real money for any heavy user.
- Think in terms of workflow, not chat. The value is in the multi-step task being completed, not the chat interface itself.
The five-year picture
A reasonable forecast for 2030:
- Foundation models continue to commoditize. The top 5 models will be functionally interchangeable for most uses.
- Voice will be the dominant modality for new conversational deployments outside the desktop.
- "Chat" as a UI primitive will disappear into other products. You will not "use a chatbot" — every app will have conversational interaction where it makes sense.
- Vertical agents will displace 20-40% of work in legal, accounting, healthcare admin, and customer support.
- A few platform-scale companies in orchestration and embedded copilots will emerge as durable winners. Many wrappers will have died.
- Total market: ~$80B. Distribution shifts heavily from horizontal chat to vertical and embedded.
This is not a science fiction forecast. It is straight-line extrapolation of trends already visible in 2026.
What gets in the way
Risks worth watching:
- Regulation. EU AI Act enforcement, US state-level rules, sectoral regulations in healthcare and finance. Could slow deployment in certain verticals significantly.
- Trust collapse. A high-profile agent failure (medical, legal, financial) could chill enterprise deployment for a year. The category recovers but slows.
- Energy and cost ceilings. Inference costs are dropping but datacenter capacity has limits. If the curve flattens, deployment economics in low-margin verticals get harder.
- Model regression. Frontier models hitting plateaus would change the economics. Possible but not the base case.
None of these kill the category. They reshape its growth path.
The summary
- ~$80B market by 2030. Real number. The mix matters more than the headline.
- Value is migrating away from horizontal chat and toward vertical agents, voice, embedded copilots, and orchestration.
- The model is commoditizing. Workflow, data, distribution, and trust are the moats.
- Builders: pick a wedge and own it. Do not bet on having the best model.
- Users: avoid lock-in. Own your data. Pay providers directly when the math works.
The next few years are not about who has the smartest model. They are about who builds the most useful workflow around models that everyone has equal access to.
NovaKit is a BYOK conversational AI workspace built on the thesis that the model is an input, not the product. Use any model, keep your data, pay providers directly.