The Rise of Legal AI: Comparing Claude, ChatGPT, and Gemini (Formerly Bard) for Legal Applications

Published: July 16, 2023 • Substantially updated: July 2026 • By Sergei Tokmakov, Esq., CA Bar #279869
Substantially updated July 2026. The original version of this article compared Claude, ChatGPT, and Google Bard as they existed in July 2023. Bard no longer exists under that name (Google renamed it Gemini in February 2024), the context-window figures I quoted are off by orders of magnitude today, and one claim about Claude's training data did not hold up and has been retracted below. Rather than patch a fossil, I rewrote the comparison from scratch. The URL and publication date stay for the record; the analysis is current as of mid-2026 and, unlike the 2023 version, it is grounded in three years of running these models in my own working law practice.

In July 2023, comparing AI chatbots for legal work was an exercise in educated speculation: the tools were months old and almost nobody had built a practice around them. Three years later I can write the comparison I wish I had been able to write then, because I now use these model families daily, in production, with real clients and real consequences. This is that rewrite: what each model family is actually good at for legal work in 2026, where the 2023 version of this article went wrong, and how a lawyer or business should choose.

What the 2023 Version Got Wrong

I keep old posts up because pretending past analysis never existed is worse than owning it. Three corrections matter:

Retraction. The original article stated that "Claude's training data was carefully curated by Anthropic to include over 1 billion legal and business documents." Anthropic never made that specific claim, and I could not source it then or now. It was repeated from secondhand commentary and should not have run. It is retracted.

Bard is gone as a brand. The original treated Bard as a permanent fixture of the landscape. Google renamed Bard to Gemini in February 2024, and the Gemini models of 2026 are a fundamentally different, far more capable family than the product I reviewed. Every 2023 sentence about "Bard's 8,000 token ceiling" or "Bard only supports English" describes a product that no longer exists.

The numbers rotted fastest. In 2023 I compared context windows of roughly 100,000 tokens (Claude), 4,000 (ChatGPT), and 8,000 (Bard) as if those were durable product traits. Within about a year, every major vendor was shipping windows large enough to hold entire document sets, and the gaps I treated as decisive had closed or inverted. The lesson I now apply everywhere, including this rewrite: specific capability numbers belong in release notes, not in analysis meant to last. Where the 2023 article leaned on figures, this version leans on the structural differences that have stayed true.

The 2026 Model Landscape

The three vendors from 2023 are still the three that matter for legal work, but each now ships a family of models rather than a single chatbot:

  • Anthropic's Claude is now the Claude 5 family, with Opus 4.8 as the flagship reasoning model. Claude's distinctive strengths for legal users remain long-document discipline, careful instruction-following, and its coding agents (Claude Code), which matter more to law practice than most lawyers expect.
  • OpenAI's GPT-5.x generation powers ChatGPT and the API most legal tech is built on. Its reasoning modes, where the model spends deliberate compute on hard problems before answering, made a real difference for multi-step legal analysis compared with the 2023-era models.
  • Google's Gemini is the successor to Bard, tightly integrated with Google Workspace and search grounding. For firms living in Google Docs and Gmail, that integration is its practical pitch.

The other structural change: the interesting question in 2023 was "which chatbot answers legal questions best?" The interesting question in 2026 is "which model family do you build your workflow on?", because serious legal use now happens through documents, tools, and agents, not through a chat window alone.

Comparing the Model Families for Legal Work

Specific benchmark rankings shuffle with every release, so this table compares the durable characteristics I have observed across three years of daily use:

DimensionClaude (Claude 5 / Opus 4.8)ChatGPT (GPT-5.x)Gemini
Long-document legal analysisConsistently strong at holding a full contract stack in view and staying precise about which document says what.Strong, especially in reasoning modes; benefits from explicit structure in the prompt.Capable, with the advantage of native access to documents already living in Google Workspace.
Drafting disciplineFollows drafting instructions and house style closely; my pick for template and clause work.Excellent fluency; the strongest at persuasive and client-facing prose.Serviceable; shines when the draft starts from existing Workspace content.
Agentic and coding workClaude Code is the standout: it is how I maintain this site and build legal tools.Mature agent tooling and the largest third-party ecosystem.Improving quickly, strongest inside Google's own stack.
Citation behaviorConverged: all three families still occasionally produce plausible-but-wrong citations under pressure. No model has earned an exemption from primary-source verification, and none will. Treat this as a workflow requirement, not a model-selection criterion.
Confidentiality postureDepends on the tier, not the vendor: enterprise and API tiers with zero-retention terms are defensible for client data; consumer tiers that may train on inputs are not. Read the data terms for the specific product you buy.

The honest summary is that for a lawyer's core reading-and-writing work, all three families crossed the "genuinely useful" threshold long ago, and the differences that remain are workflow differences more than intelligence differences.

What I Actually Run, and Why

The 2023 article was written from vendor documentation. This section is written from my own infrastructure.

My production AI Legal Analyst, the attorney-supervised assistant that runs on every page of Terms.Law and triages visitor legal questions, runs on OpenAI's GPT-5.x reasoning models. That choice came from head-to-head testing on real legal fact patterns: for the specific job of careful, multi-turn legal triage with document uploads, the GPT-5.x reasoning tier has been the most dependable engine for my use case, with cheaper utility models handling routing and summaries around it.

My engineering and document tooling runs on Claude. Claude Code, powered by the Opus-class models, is how a solo attorney maintains a site of more than ten thousand pages, builds template generators and client workrooms, and ships fixes the same day problems surface. This division of labor, one model family as the client-facing legal engine, another as the build-and-maintain engine, was not something I planned in 2023; it emerged from testing and it is the single most useful practical finding I can report.

I document the full toolset, including what each tool is allowed to touch and the ethics rails around it, on my AI stack page, and the story of how the whole system came together is in my attorney-led AI practice case study. If you want the same evaluation done for your firm's stack, that is a defined flat-fee engagement on my AI implementation services page.

How to Choose for Your Firm or Business

Model selection questions from lawyers and founders usually resolve into four criteria, none of which is "which model is smartest this month":

  • Data terms first. The tier you buy determines whether client confidences can lawfully and ethically touch the system. An enterprise agreement with zero-retention terms changes the analysis completely compared with a consumer subscription. This is a contracts question before it is a technology question.
  • Where your documents live. Deep Workspace shops lean Gemini; Microsoft-centric firms often land on GPT-5.x via Copilot-style integrations; document-heavy custom workflows are where Claude tends to excel.
  • Chat versus infrastructure. If AI will only ever be a research-and-drafting sidekick, any frontier subscription works. If you want intake, matter workrooms, or document generation built around your practice, you are choosing an API platform and a build partner, not a chatbot.
  • Verification workflow. Whatever you pick, the citation-checking and review discipline is yours, not the vendor's. Budget for the workflow, not just the license.

Ethics, UPL, and Confidentiality

The regulatory questions the 2023 article posed as hypotheticals now have partial answers. The State Bar of California issued practical guidance on generative AI in late 2023; the ABA followed with Formal Opinion 512 in 2024. The framework that emerged maps the existing rules onto AI rather than inventing new ones: competence (Rule 1.1) requires understanding the tool's failure modes, confidentiality (Rule 1.6) governs what client information reaches which systems, and the supervision duties treat model output like nonlawyer work product that must be reviewed before it carries an attorney's name.

Unauthorized practice of law remains the live boundary for consumer-facing tools. A model explaining the law is information; a system applying the law to a specific person's facts and telling them what to do drifts toward advice, which is why every AI surface on this site is branded and built as an informational AI Legal Analyst with a licensed attorney behind it, never an "AI lawyer." Courts have also made the liability side concrete: sanctions decisions against lawyers who filed unverified AI-drafted material established early that delegation is no defense. I cover the compliance side of AI deployment, for law firms and for businesses shipping AI features, in my AI implementation attorney practice area.

FAQ

Is Bard still available for legal research?

No. Bard was renamed Gemini in February 2024, and the current Gemini models bear little resemblance to the 2023 product this article originally reviewed. Anything you read comparing "Bard" to current models is describing a discontinued brand.

Which AI model is best for legal work in 2026?

For core legal reading, analysis, and drafting, Claude 5, GPT-5.x, and Gemini are all past the threshold of genuine usefulness, and the ranking among them shifts release to release. The durable differentiators are data terms, workflow fit, and ecosystem, which is why my own practice runs two families side by side for different jobs.

Can I rely on any of these models for case citations?

You can rely on them to find and summarize authority faster than manual research; you cannot rely on any of them to be right without checking. Verification against the primary source before filing or sending is a professional obligation, and it is the single practice that separates safe AI-assisted lawyering from the sanctions cases.

Should a small firm build on one model or several?

Start with one subscription and a written confidentiality policy, then expand by job function rather than by hype cycle. In my experience the multi-model setup earns its complexity only once you are building infrastructure (intake, workrooms, document generation) rather than just chatting.

Need Legal Judgment, Not Just Model Output?

AI tools inform; they do not take responsibility. I offer a $240 Written Attorney Consultation (send your question and key documents, receive a written analysis of the issues, risks, and next steps) and a $400 1-Hour Zoom Strategy Session with screen sharing and preliminary document review. Real attorney, real accountability.

Request this package: $240 Written Consultation $400 Zoom Strategy Session Email me

Sergei Tokmakov, Esq. - CA Bar #279869

More from Terms.Law

This article is for informational purposes only and does not constitute legal advice. Reading it does not create an attorney-client relationship. Model capabilities and vendor terms change quickly; verify current product terms before relying on them. Sergei Tokmakov, Esq., California Bar #279869.
Live interactive demo

Try a live AI governance workroom

If you are choosing between Claude, ChatGPT, or Bard for legal work, this is how I govern that AI use in a contract. Flip a tool's account type or training posture in the AI vendor matrix and watch its rating react in real time, work the confidentiality and citation-verification checklists, sort the risk register, and click any clause of the locked AI Use Policy to comment or suggest an edit in track changes.

Open the live demo workroom How I build these for firms
Fictional demo data. Built by Sergei Tokmakov, Esq., California attorney and AI engineer.