Google Just Skipped the Chatbot Era. Your 2026 AI Plan Is Already Outdated.
The model isn’t the product anymore. The agent is. And Google now owns more of the surfaces it runs on than anyone else.
Google’s I/O 2026: From “Gemini Everywhere” to Agents That Actually Do the Work
TL;DR
A year ago, I told you Google had quietly become the company to beat in AI. After watching this week’s I/O, that argument feels almost too soft. Google didn’t show up with a faster model. They showed up making the case that the chatbot phase is over, and the next phase is agents doing your work for you, inside the apps you already use. Gemini 3.5 Flash. A 24/7 personal agent called Spark. Antigravity 2.0. A Search box rebuilt for the first time in a quarter century. Workspace going voice-first. Glasses this fall. And pricing finally pulled into line with Claude and ChatGPT at $100 and $200. From my seat, Google is turning its full stack into an agent runtime for consumers and the enterprise at the same time, and pricing it to win on volume. If you’re still planning your 2026 around chat interfaces, you’re planning for the wrong year.
The moment the keynote actually landed
I sat through most of the keynote waiting to feel it. New models, new benchmarks, new demos. We’ve all seen this movie. Then they brought Spark on stage, and the whole event clicked into focus.
Here’s what I mean. For most of 2024 and 2025, every AI keynote followed the same playbook. New model, benchmark chart, a demo where the thing writes a poem about somebody’s dog. Everybody clapped, closed their laptops, and went back to typing into chat windows like nothing had really changed.
This one was different. Google didn’t lead with intelligence. They led with action, which is honestly what most of us in this work have been waiting for. Spark working on its own VM with your laptop closed. Antigravity running parallel agents in the background. A Search box that takes a Chrome tab as input and books your dog groomer on the way out. Workspace where you just talk to your inbox.
The chatbot era didn’t end with a bang. Google decided to skip it.
What they’re really saying
Let me simplify this the way an operator needs it.
Gemini isn’t the product anymore. What Gemini does for you while you’re doing something else is the product. Every announcement on stage was a version of that one idea. The model is just the engine. The actual work, the part that matters for your firm, is everything you wire it into.
The model: frontier intelligence at a Flash price
Start here, because the economics drive everything downstream.
Gemini 3.5 Flash launched today, generally available everywhere. Google’s pitch is “frontier intelligence at Flash latency and Flash cost.” Their internal benchmarks have it beating Gemini 3.1 Pro (the flagship from three months ago) on most coding and agentic tasks, while running roughly four times faster than the comparable frontier models from other labs. Pricing is $1.50 per million input tokens and $9.00 per million output. Roughly half of Claude Sonnet. A fraction of GPT-5.5.
The number that should make you look up is GDPval-AA. It’s an independent benchmark maintained by the evaluation firm Artificial Analysis, built to measure “economically valuable knowledge work,” which is the closest thing the industry has to “AI doing real professional work.” Flash now sits at 1,656 Elo. Their last model was at 1,314. That’s a 342-point jump in three months on a benchmark Google doesn’t own. Which puts a Flash-tier model within throwing distance of the best models on the market at the price of a cheap one.
Think about what that does inside a firm. Cost stops being the thing that gates rollout. When the cheap fast model can hold its own on real work, you stop rationing AI to a handful of power users and start putting it into every workflow. For law firms specifically, that means associate-level research, intake summaries, document review, conflict checks, and a hundred other things that have been stuck in “we should pilot that” for two years become “let’s just turn it on this quarter.”
Pichai said the quiet part out loud during the keynote. If the biggest companies shifted 80% of their workloads to 3.5 Flash, they’d save over a billion dollars a year. That’s a Google scenario, not an audited number, but the direction is right. Google is forcing the market price of frontier-grade intelligence down. They can afford to because they own the chips.
Gemini 3.5 Pro is in internal testing and ships next month. Gemini Omni Flash, a new multimodal family that generates and edits video conversationally, started rolling out today across the Gemini app, Google Flow, and YouTube Shorts.
Spark: this is what I was waiting for
Of everything they announced, Spark is the one to actually study.
Spark is a personal AI agent that runs on its own Google Cloud VM, twenty-four hours a day, with native access to Gmail, Calendar, Drive, Docs, Sheets, Slides, YouTube, and Maps. Third-party integrations through MCP start with Canva, OpenTable, and Instacart. Beta opens next week for US Ultra subscribers. A Workspace version is “soon in preview” for business customers, whatever that ends up meaning in practice.
Here’s what makes it different. Spark doesn’t need your laptop open. You give it a task and it goes off and works on it, in the background, on Google’s hardware. It pings you for confirmation before doing anything high-stakes, like sending an email or paying for something or posting publicly. And it surfaces through a new Android status-bar element called Halo so you can see what your agent is doing without opening anything.
I’ve been using Claude as my primary tool for two years, and Cowork has been the most credible agentic experience I’ve had inside that stack. Spark is now the second product I’d put in that category. Both are real. Both are good. And the choice between them, for most firms, is going to come down to where you already live. If you’re a Workspace shop, Spark is going to feel like it grew out of your tenancy. If you’ve built around Anthropic or OpenAI, you have a better model quality argument and a more mature governance story, but you’ll have to wire up the integrations yourself.
The bigger point. Google isn’t asking you to pick a new chat app. They’re asking you to let an agent watch your inbox at night. That’s a much bigger ask, and it’s the right one for them to make if they want to win the next phase.
Antigravity is the developer side of the same idea
Antigravity 2.0 is Google’s agentic development platform. Desktop app, CLI, SDK, full integration with the Gemini Enterprise Agent Platform. You can run multiple agents in parallel, schedule background tasks, and pick up isolated sessions where you left off.
One thing worth flagging. Google is officially transitioning Gemini CLI users to Antigravity CLI. On June 18, 2026, Gemini CLI and the Gemini Code Assist IDE extensions stop serving requests for AI Pro, Ultra, and free individual users. Enterprise license holders on Gemini Code Assist Standard or Enterprise keep their access unchanged. If your firm has technical teams on individual or AI Pro seats running Gemini CLI in production, that migration is a calendar item. If you’re on the Enterprise license, you’re fine, but Antigravity is where Google is putting all the new development effort going forward.
For firms with real data residency rules, here’s the part that matters. Antigravity inherits Google Cloud’s privacy controls and runs inside your secure cloud boundary by default. That used to be a months-long conversation with IT and security. It just got a lot shorter.
Search and Workspace stop being apps and start being interfaces
Google rebuilt the Search box for the first time in a quarter century. The new one takes text, images, files, videos, and Chrome tabs as input. AI Mode runs on 3.5 Flash globally. They added information agents that watch the web for you 24/7, agentic booking for local services in the US this summer, and “Mini Apps,” which are custom dashboards Search generates on the fly using Antigravity.
The scale numbers matter here. AI Mode is now over a billion monthly users. AI Overviews is at 2.5 billion. The Gemini app is at 900 million MAU, up from 400 million a year ago. Across all surfaces, Google is processing 3.2 quadrillion tokens a month, seven times last year.
Workspace got the equally important update. Gmail Live lets you ask your inbox questions out loud. Docs Live turns a spoken brain dump into a structured draft, pulling from Drive and Chat with permission. Keep does the same for notes. AI Inbox just expanded from Ultra-only to Plus and Pro. Spark for Workspace is coming.
If you run a professional services firm, here’s the thing about voice-first work. The friction of writing prompts is what kept most partners and senior associates from actually using these tools day to day. Talking to your inbox is different. That’s a behavior change a 55-year-old equity partner will actually adopt, and the moment that happens, AI inside your firm stops being a thing the associates do and starts being a thing the firm does.
What sets Google apart now, and it’s not the model
A year ago, I argued the moat was vertical integration. Chips, cloud, models, distribution, all under one roof. That argument got stronger this week, but it also changed shape.
The new moat is that Google owns more of the surfaces where agents need to actually do things than anybody else on the planet. Search. Gmail. Docs. Calendar. Drive. Maps. Chrome. Android. YouTube. Workspace identity. Google Pay. And now glasses, with Samsung and Warby Parker and Gentle Monster, shipping this fall.
Agents don’t need apps. They need surfaces. An agent that can read your inbox, check your calendar, search the web, pay for the thing, and route the result to your phone is worth more than ten specialized agents that each do one of those badly. Google can hand all of that to its agents under a single identity layer. That’s a structural advantage nobody else can catch up to in eighteen months.
The competition has pieces of this, not the whole thing. OpenAI has the better consumer brand and a credible argument for the best frontier model. Anthropic, in my view, still has the better workhorse for serious reasoning and coding, and the most mature governance posture. Meta owns social. Microsoft has Windows and the enterprise install base.
None of them has the full surface area Google is now running agents on top of. From where I sit, that’s the real change since I wrote about Google’s stack in December.
The other thing that changed is pricing. Gemini Ultra now has a $100 tier and a $200 tier. So does Claude Max. So does ChatGPT Pro. Google had been the outlier at $250 with nothing below it, and they just stopped being the outlier. The strategic read is that Google wants the same seat-level conversation inside firms that Claude and ChatGPT have been having all year, and pricing won’t be the thing that makes a CFO blink anymore.
Risks I’d actually pay attention to
I’d be lying if I said this is all upside.
Quality drift matters more once the model is acting. A wrong answer in a chat is embarrassing. A wrong action in your inbox is something somebody has to clean up, and depending on what got sent or paid or posted, the cleanup may not be free. Google’s Spark gates high-stakes actions behind explicit confirmation, which is the right default. It’s also the kind of friction users start clicking through after the third day.
Data governance gets harder, not easier. When an agent is moving data across Gmail, Calendar, Drive, and third-party apps under one identity, your DLP and access controls have to catch up. Most firms’ AI policies are written for humans clicking buttons, not for agents executing on behalf of those humans. If you run a law firm, this is the conversation to have with your CIO and your general counsel this month, not in Q4.
Lock-in is real. The same thing that makes Spark feel effortless inside Workspace is what will make it painful to leave. Go all-in on Spark for personal productivity and Antigravity for enterprise agents and you’re committing to Google’s identity and data layer in a deeper way than you ever did with Search.
And the gated rollouts are still gated. Spark is US-only, Ultra-only, beta, English. Workspace Spark has no date. Glasses ship this fall with no price. The story is strong. The Monday-morning availability is uneven.
What to do this week
Add Gemini 3.5 Flash to your model evaluation portfolio. Run it against whatever you’re using today, on your actual work, not on a benchmark. The cost story only matters if the quality holds for your tasks.
Re-read your AI governance policy with one question: does it cover agents that act on a user’s behalf, not just users who chat with a model? If not, that’s the next revision.
If you sell to consumers, audit how your business shows up in the new Search box and the Universal Cart. If your product can’t be added to a Google-managed cart from a Gemini conversation, your distribution math just got worse.
The bottom line
Google didn’t win 2025 by shipping a better demo. They won by shipping defaults.
In 2026, they’re trying to pull the same trick at a higher level. The default for asking a question is becoming an agent. So is the default for booking a thing, and for getting through your inbox, and probably for half a dozen other things I haven’t thought of yet. Google is making Gemini the engine those defaults run on.
I said in December that demos don’t compound and defaults do. Agents are the next default. And heading into the back half of 2026, I’m planning like Google is the one most likely to own them.
Google didn’t win 2025 with a better demo. They won by shipping defaults, and the default they’re going after now is the agent. If you’re a firm leader trying to think through what this means for your operating model, your governance posture, or your 2026 AI budget, I’d be glad to talk it through. Reach me at steve@intelligencebyintent.com. The firms that get out ahead of this won’t be the ones with the smartest model. They’ll be the ones who rewired the work first.


