On this page
- TL;DR
- Why "AI for PDFs" finally works in 2026
- The contenders
- NotebookLM (Google)
- ChatPDF
- Humata
- Claude Projects (Anthropic)
- Gemini (with file upload)
- ChatGPT (with file upload)
- NovaKit (BYOK)
- The matrix
- Picking the right tool
- "I just want to ask a question about this paper."
- "I'm reviewing a confidential contract."
- "I'm doing a literature review across 30 papers."
- "I need to extract structured data from invoices/receipts/forms."
- "I need to summarize a 600-page government document."
- "I want to build a chatbot over my company's PDFs."
- The thing nobody admits
- Privacy: the part most reviewers skip
- What we'd actually pick
TL;DR
- NotebookLM is the easiest "drop a PDF, ask questions" tool. Free, fast, good citations. Weak on bulk and on heavy reasoning.
- Claude Projects with Opus 4.7 is the best at deep understanding of long, dense PDFs. Pay-per-use unless you have Claude Pro.
- ChatPDF and Humata are fine consumer tools but feel dated next to 2026 alternatives. Use them only if you already pay.
- NovaKit (BYOK) wins for people who already have API keys, want every model in one place, and don't want to leak documents to a third party's storage.
- The boring truth: the model matters more than the wrapper. A good PDF tool with a bad model is worse than a basic tool with Claude Opus 4.7.
Why "AI for PDFs" finally works in 2026
A PDF is a hostile format. It was designed in 1993 to look the same on every printer, not to be queried. For thirty years, "extracting meaning from PDFs" was a dirty job done by ETL pipelines and OCR engineers.
Two things changed that:
- Long context windows. A 200-page paper now fits comfortably in Claude Opus 4.7 (1M tokens) or Gemini 2.5 Pro (2M tokens). No chunking required for most documents.
- Vision-capable models. Modern multimodal models can read tables, equations, and figures directly off the page image. No more "the OCR mangled the equation" excuses.
So in 2026, "chat with a PDF" is no longer a parlor trick. It's a real research tool. The question is which tool — and that depends on what you're trying to do.
The contenders
NotebookLM (Google)
What it is: Google's research notebook. Drop in PDFs, web pages, YouTube videos, audio. Ask questions. Get answers with inline citations back to the source.
The good:
- Free. Generous limits. No card required.
- Excellent citation UI — every claim links back to the exact passage.
- Audio Overviews ("podcast mode") are genuinely useful for skimming.
- Handles 50+ sources per notebook without falling over.
The bad:
- Uses Gemini under the hood. Strong, but not as good as Opus 4.7 for hard reasoning.
- You can't pick the model.
- Privacy: your documents go to Google. Fine for public papers, not fine for confidential drafts.
- No API. You can't automate it.
Use it for: literature reviews, course reading, podcast research, anything where you want fast Q&A with citations and the documents aren't sensitive.
ChatPDF
What it is: The original "chat with a PDF" tool. Upload, chat, done.
The good:
- Dead simple UI. Your grandparent could use it.
- Cheap. Free tier handles small documents.
The bad:
- Feels like a 2023 product. Underlying models are not the frontier.
- Citation quality is mediocre.
- Free tier is restrictive (page count, message count).
- You're handing your PDFs to a third party with weaker enterprise posture than the big platforms.
Use it for: quick one-off questions on a non-sensitive document when you don't want to log into anything fancy.
Humata
What it is: A more polished ChatPDF, aimed at researchers and students.
The good:
- Better citation handling than ChatPDF.
- Multi-document workspaces.
- Reasonable pricing for students.
The bad:
- Same problem as ChatPDF: model choice is hidden, and the underlying intelligence is behind the frontier.
- Yet another subscription.
Use it for: student research workflows where NotebookLM doesn't fit, and you want something more structured than a generic chat app.
Claude Projects (Anthropic)
What it is: Anthropic's "give Claude a project with files attached" feature inside Claude.ai.
The good:
- The model. Claude Opus 4.7 is the best at long, dense, technical PDFs. Period.
- 1M token context. A 400-page document is no problem.
- Genuine reasoning over the document, not just retrieval.
- Custom instructions per project — set the persona, the tone, the citation style.
The bad:
- Requires Claude Pro or Max. Not free.
- The Projects UI is fine but not citation-first like NotebookLM.
- Anthropic stores the documents. (Better posture than most, but it's still on their server.)
Use it for: anything where the quality of understanding matters more than free or fast — contracts, dense technical specs, long research papers, legal review.
Gemini (with file upload)
What it is: Drop PDFs into a Gemini conversation. With Gemini 2.5 Pro you get 2M tokens of context.
The good:
- Free tier is generous.
- Massive context. Drop your entire textbook in.
- Strong multimodal — handles diagrams and tables well.
- Native Google Drive integration.
The bad:
- Still occasionally weaker than Claude on multi-step reasoning.
- Same Google privacy posture.
- The chat UI doesn't have great document-management features for serious research.
Use it for: very large documents (>500 pages) where context size matters most, or when you live in Google Workspace already.
ChatGPT (with file upload)
What it is: Drop PDFs into ChatGPT. GPT-5 reads them.
The good:
- GPT-5 is excellent at structured extraction (tables, fields, schema-shaped output).
- Code interpreter can compute over the data.
- Memory across chats means your PDF context can carry forward.
The bad:
- Free tier limits PDF use sharply.
- Privacy posture for non-Enterprise tiers is weaker than Claude.
- See the ChatGPT enterprise privacy problem for the full picture.
Use it for: PDFs where you need structured extraction (invoices, statements, forms) and want to do math on the result.
NovaKit (BYOK)
What it is: A bring-your-own-key chat app where you can attach PDFs, build knowledge bases from them, and pick any model per question.
The good:
- You pick the model. Use Opus 4.7 for the hard question, Haiku 4.5 for the cheap follow-ups, Gemini 2.5 Pro for the giant document.
- Documents and embeddings live locally (or in your own backend if you self-host).
- Pay providers directly — no markup, no subscription.
- Build a persistent knowledge base across many PDFs. See how to build an AI knowledge base from PDFs.
The bad:
- BYOK means you need API keys from at least one provider.
- More setup than "drag and drop into NotebookLM."
- No magic Audio Overviews mode (yet).
Use it for: people who do this often, want full model choice, care about privacy, and want one workspace instead of five subscriptions.
The matrix
| Tool | Best for | Cost | Privacy | Model choice |
|---|---|---|---|---|
| NotebookLM | Quick Q&A with citations | Free | None | |
| ChatPDF | One-off questions | Freemium | Third-party | None |
| Humata | Student workflows | $$ | Third-party | None |
| Claude Projects | Deep reasoning | $$ | Anthropic | Anthropic only |
| Gemini | Huge documents | Free/$$ | Gemini family | |
| ChatGPT | Structured extraction | Freemium | OpenAI | OpenAI family |
| NovaKit | Power users, privacy | BYOK keys | Local-first | All |
Picking the right tool
"I just want to ask a question about this paper."
NotebookLM. Free, fast, cited. Done.
"I'm reviewing a confidential contract."
Claude Projects (if you trust Anthropic) or NovaKit with Claude Opus 4.7 via API (if you don't want it on Claude.ai's storage). Do not put a confidential contract into ChatPDF.
"I'm doing a literature review across 30 papers."
NotebookLM for the first pass — it's faster than anything else for citation-heavy multi-source Q&A. NovaKit if you want a permanent knowledge base you can keep extending.
"I need to extract structured data from invoices/receipts/forms."
ChatGPT with GPT-5 or NovaKit pointed at GPT-5. Both handle structured output cleanly.
"I need to summarize a 600-page government document."
Gemini 2.5 Pro (2M token context) or Claude Opus 4.7 in NovaKit. The document fits in one shot, no chunking required.
"I want to build a chatbot over my company's PDFs."
That's a RAG problem, not a chat-with-PDF problem. Different post, different tooling.
The thing nobody admits
Most of these tools share the same dirty secret: they wrap a foundation model and add a thin layer of UI, retrieval, and prompt engineering. The "intelligence" is the model.
That means the gap between a $20/month wrapper and a free chat with the right model is much smaller than the wrappers want you to believe. What you're paying for is:
- A nicer UI
- Persistent storage
- Citation styling
- Document management
All of which are real value — just be honest about what the value is.
If you do this every day, it's worth paying for the polish. If you do this occasionally, the free tools are fine.
Privacy: the part most reviewers skip
Every tool here, except NovaKit running fully local, sends your PDFs to a third party. The questions are:
- Who? Google, OpenAI, Anthropic, or a smaller startup with weaker posture?
- For how long? Some tools delete on demand; some retain "to improve the service."
- Do they train on it? Most paid tiers say no. Most free tiers say "maybe, see policy."
For confidential documents — board materials, legal drafts, customer data — assume anything uploaded to a free consumer tool is fair game for that vendor. If you can't accept that, you're choosing between Claude/OpenAI Enterprise (with proper DPAs), or a BYOK setup like NovaKit where the documents never leave your control.
See privacy-first AI workspace for the longer take.
What we'd actually pick
If we could only have one tool for the rest of 2026:
- For most people: NotebookLM. Free, excellent at the common case.
- For researchers and analysts: Claude Projects or NovaKit + Opus 4.7. Quality matters more than convenience.
- For privacy-sensitive work: NovaKit + your provider of choice. Don't ship sensitive PDFs to a wrapper.
- For very large documents: Gemini 2.5 Pro for the context, NovaKit if you want flexibility.
The honest summary: you don't need a dedicated PDF app in 2026. You need a good chat app, a good model, and the discipline to pick the right one for the job.
NovaKit lets you chat with PDFs using any model — Claude Opus 4.7, GPT-5, Gemini 2.5 Pro, DeepSeek V3 — with your own API keys. Documents stay in your browser. No subscription, no markup.