Skip to content

Start typing to find articles and guides.

Your cart is empty

AI

Google Retires Vertex AI, Bets Enterprise Future on Gemini Agent Platform

Google is collapsing its five-year-old Vertex AI brand into Gemini Enterprise Agent Platform, signaling that the enterprise AI market has moved from model-building to agent-orchestration

TL;DR

  • Vertex AI is dead. Google is retiring the five-year-old brand and merging all its capabilities into the new Gemini Enterprise Agent Platform
  • Agent-first architecture. The platform is organized around four verbs: Build, Scale, Govern, Optimize—with most new investment in the lower layers (governance, security, observability)
  • Cryptographic agent identity. Every agent gets a unique SPIFFE-based ID, enabling audit trails and least-privilege access policies
  • TPU 8t and 8i. New hardware generation: 8t for training (9,600-chip superpods), 8i for inference (80% better price-performance)
  • Migration required. All Vertex AI services and roadmap items will be delivered exclusively through the Agent Platform

What Actually Happened

At Google Cloud Next '26 on April 28, Google announced the Gemini Enterprise Agent Platform as the evolution of Vertex AI. The company is retiring the Vertex AI brand after five years and collapsing its agent tooling, model registry, and development kits into a unified surface for building, securing, running, governing, and observing AI agents at enterprise scale.

The rebrand is substantive, not cosmetic. According to Google Cloud's public sector blog: "All Vertex AI services and roadmap evolutions will be delivered exclusively through the Agent Platform, rather than as a standalone service." The platform combines model selection, model building, and agent building with new features like Agent Identity, which assigns every agent a unique cryptographic ID and well-defined authorization policies.

The architecture is organized around four layers:

  1. Build: Agent Development Kit (ADK) 2.0 with graph-based workflows, collaborative agents, and dynamic workflows; Agent Studio for low-code development
  2. Scale: Re-engineered Agent Runtime (GA) with sub-second cold starts, support for long-running agents up to seven days, and 3,000 agents per project
  3. Govern: Agent Identity (GA) based on CNCF SPIFFE standard, Agent Registry (Preview), Agent Gateway (Private Preview) for policy enforcement
  4. Optimize: Agent Simulation, evaluation tools, and performance monitoring

The hardware stack is also new. TPU 8t for training scales to 9,600 chips with 2 PB of shared memory. TPU 8i for inference delivers 80% better price-performance than the prior generation, enabling millions of concurrent agents.

What It Actually Means

This is Google acknowledging that the enterprise AI market has moved from "Can we build an agent?" to "How do we manage thousands of them?" The platform's heavy investment in governance, security, and observability layers—traditionally unglamorous but essential—suggests Google has been paying attention to what's actually hard about deploying agents at scale.

The Vertex AI retirement resolves brand confusion but creates API friction. As TWIML's analysis notes: "The distinct Gemini (google.genai) and Vertex AI (vertexai) APIs will continue to coexist, despite the developer confusion this has caused since the launch of the Gemini API." This means developers will still need to navigate two separate APIs even after the brand unification.

Agent Identity is the most significant technical innovation. By giving every agent a cryptographic identity, Google enables:

  • Least-privilege access policies at the agent level
  • Non-repudiable audit trails of every agent action
  • Integration with existing IAM infrastructure
  • Traceability across multi-agent workflows

This addresses the core enterprise concern: "How do we secure and govern autonomous systems that act on our behalf?"

The $750 million partner fund signals platform ambition. Google is betting that agents will be a third-party ecosystem play, not just a first-party service. The Agent Marketplace will offer partner-built agents from Adobe, Atlassian, ServiceNow, Workday, and others.

Hype Deconstruction

"Vertex AI is dead" is accurate but incomplete. The brand is retired, but the APIs persist. This is a marketing and product unification, not a technical discontinuation. Enterprises won't need to rewrite code immediately, but they will need to update documentation, training, and procurement processes.

"Agents replace apps" is aspirational, not current. Google's vision of a post-app world where conversational AI handles business tasks is compelling, but early reference customers (Comcast, L'Oreal, Macquarie Bank, PayPal) are still in the experimentation phase. Most enterprises remain cautious about giving AI systems access to sensitive data or authority to act inside business workflows.

The 75% adoption claim needs context. Google says nearly 75% of Google Cloud customers use its AI products, but this likely includes any usage of Gemini, Vertex AI, or pre-built APIs—not production agent deployments. The meaningful metric is how many are running agents with real business authority, and that number is much smaller.

TPU performance claims are for scale buyers. The 80% price-performance improvement on TPU 8i matters if you're running millions of concurrent agents. If you're running dozens, the economics are less compelling. Most enterprises will stick with GPU-based inference for the foreseeable future.

Stakeholder Landscape

Who benefits:

  • Google Cloud: Resolves brand confusion, positions for the agent era, creates new revenue streams through Agent Platform plans
  • Enterprise architects: Single platform for model and agent development with built-in governance
  • Security teams: Agent Identity provides the auditability and control mechanisms they've been demanding
  • Systems integrators: $750M partner fund and Agent Marketplace create implementation opportunities

Who faces disruption:

  • Vertex AI specialists: Need to retool around Gemini Enterprise concepts and Agent Platform APIs
  • Competing platforms: Microsoft Copilot Studio, AWS Bedrock Agents, Salesforce Agentforce now face a more cohesive Google offering
  • Traditional SaaS vendors: If agents truly replace apps, every business software category is vulnerable

Who is unaffected despite the noise:

  • Small businesses: The platform is enterprise-priced and enterprise-complex
  • Non-Google clouds: This is a Google Cloud story, not a multi-cloud story (though Agent Gateway supports some cross-cloud scenarios)
  • Academic researchers: The platform is optimized for production deployment, not research flexibility

Cross-Layer Implications

Commercial: The agent platform market is consolidating around three stacks: Google (Gemini Enterprise), Microsoft (Copilot + Azure AI), and AWS (Bedrock). The winner won't be determined by model quality alone but by governance, security, and ecosystem integration.

Security: Agent Identity and Agent Gateway establish a pattern that other platforms will need to match. Expect CNCF SPIFFE/SPIRE adoption to accelerate as enterprises demand cryptographic agent identities across vendors.

Talent: The GEAR (Gemini Enterprise Agent Ready) upskilling program indicates Google recognizes the talent bottleneck. Agent development requires different skills than traditional ML engineering—more systems thinking, more security awareness, more workflow design.

Regulatory: The platform's audit capabilities (Agent Identity, non-repudiable logs) position Google ahead of potential AI governance regulations. If the EU AI Act or similar frameworks require agent audit trails, Google customers are already equipped.

Recommendations

For Enterprise IT Leaders (CIO/CTO)

Immediate (this week):

  • Audit your current Vertex AI usage. Identify which services and APIs you're dependent on
  • Review the Gemini Enterprise Agent Platform pricing and plan tiers
  • Map your agent development roadmap against the Build-Scale-Govern-Optimize framework

Near-term (this month):

  • Pilot Agent Identity for any agents with production data access
  • Evaluate Agent Runtime for long-running agent use cases (reconciliation, monitoring, sequencing)
  • Begin migrating documentation and training from Vertex AI to Gemini Enterprise concepts

Strategic (this quarter):

  • Decide whether to commit to the Gemini Enterprise stack or maintain multi-platform flexibility
  • Assess partner agents in the Agent Marketplace for commodity tasks (HR, finance, IT ops)
  • Build agent governance policies using Google's framework as a baseline

For Developers and ML Engineers

Immediate:

  • Learn the ADK 2.0 primitives: graph-based workflows, collaborative agents, dynamic workflows
  • Understand the difference between the google.genai and vertexai APIs—both still active
  • Experiment with Agent Studio for rapid prototyping

Near-term:

  • Master the Agent Runtime deployment model and session management
  • Implement Agent Identity in any agent that touches production systems
  • Use the Agents CLI to integrate agent management into your existing dev workflow

Strategic:

  • Shift from model-centric thinking to agent-system thinking
  • Develop skills in agent orchestration, security, and observability
  • Consider the GEAR certification program for career development

For Security and Compliance Teams

Immediate:

  • Review Agent Identity documentation and understand SPIFFE integration
  • Map your current IAM policies to agent-specific authorization requirements
  • Assess whether Agent Gateway's preview status meets your production security bar

Near-term:

  • Implement agent activity logging and anomaly detection
  • Create agent-specific incident response playbooks
  • Evaluate Model Armor integration with Agent Gateway for prompt injection defense

Strategic:

  • Develop enterprise-wide agent governance policies
  • Establish agent risk assessment frameworks
  • Plan for regulatory compliance using the platform's audit capabilities

Uncertainty Ledger

Unresolved questions:

  • API unification timeline: Will Google eventually merge google.genai and vertexai APIs, or will developers need to manage both indefinitely?
  • Pricing evolution: How will agent-based pricing compare to traditional API call pricing as agents become more autonomous and long-running?
  • Multi-cloud reality: How well will Agent Gateway's cross-cloud promises work in practice for AWS and Azure data sources?
  • Breakage risk: Early agent platforms have struggled with multi-step workflows and edge cases. How robust is Google's implementation for real business complexity?

What would change the analysis:

  • If Microsoft or AWS match Google's governance capabilities within 90 days, the competitive advantage narrows
  • If agent adoption rates remain below 20% of enterprise use cases through 2026, the platform shift matters less
  • If regulatory requirements emerge that contradict Google's agent architecture, migration costs increase

Bottom Line

Google is placing a $750 million bet that the enterprise AI market is ready to move from experimental models to production agent fleets. The Gemini Enterprise Agent Platform is the most comprehensive agent infrastructure stack shipped to date, with genuine innovations in agent identity, governance, and runtime management. However, the persistent API bifurcation and the aspirational nature of "agents replace apps" means enterprises should adopt incrementally: pilot the governance layer now, scale the runtime layer next quarter, and maintain healthy skepticism about the full vision until real customer success stories emerge.

The Vertex AI retirement is the right strategic move but imposes a relearning tax. Budget 20-40 hours of team training time and expect 2-3 months of documentation confusion during the transition. The platform's value proposition is strongest for enterprises already committed to Google Cloud—multi-cloud organizations should evaluate agent governance requirements before committing.

Signal remains high because this is the first platform to treat agents as first-class security principals, not just API consumers. That architectural decision will influence the entire industry.


Sources:

  • Google Cloud Blog, "Welcome to the agentic era" (Tier 1)
  • Pure AI, "Google Packages Enterprise AI Agents into New Gemini Platform" (Tier 2)
  • TWIML, "Google Cloud Next '26: Delivering the Agentic Control Plane" (Tier 2)
  • Forrester, "Google Cloud Next 2026: The End Of The AI Pilot Era" (Tier 1)
  • Strategic Focus, "What Google Cloud announced in AI this month" (Tier 2)
Back to blog

Read Next

AI

Canada Unveils National AI Strategy

Canada has just made the boldest sovereign AI play of any G7 middle power — explicitly framing AI as critical...
I F ·7 MIN READ
AI

BEYOND Expo Macao: "AI Digital to Physical" Signals APAC's Embodied-AI Centre of Gravity

The story of AI in 2026 is no longer about which lab builds the smartest model — it is about...
I F ·4 MIN READ
AI

Siemens Intelligence Center X: industrial agentic AI gets a reference architecture

Siemens just made the agentic-AI-in-the-factory pitch concrete — and pinned its credibility to two customer metrics it cannot walk back.
I F ·8 MIN READ
FROM THE LIBRARY

Guides for getting better at the things that matter.

A growing collection of playbooks, frameworks, and deep dives.