The Anthropic Claude models on Vertex AI offer fully managed and
serverless models as APIs. To use a Claude model on Vertex AI, send
a request directly to the Vertex AI API endpoint. Because the Anthropic
Claude models use a managed API, there's no need to provision or manage
infrastructure. You can stream your Claude responses to reduce the end-user latency perception.
A streamed response uses server-sent events (SSE) to incrementally stream the
response. You pay for Claude models as you use them (pay as you go), or you pay a fixed
fee when using provisioned
throughput. For
pay-as-you-go pricing, see Anthropic Claude models on the Vertex AI
pricing page. The following models are available from Anthropic to use in Vertex AI. To
access a Claude model, go to its Model Garden model card. Anthropic's Claude models support Vertex AI request-response
logging. Enable 30-day request-response logging of your prompt and completion
activity to track any model misuse by your users. For more information, see Log
requests and responses. Claude Opus 4.1 is Anthropic's most intelligent model and an industry leader for
coding and agent capabilities, especially agentic search. It excels for
customers needing frontier intelligence: Claude Opus 4.1 is Anthropic's most intelligent model and an industry leader for
coding and agent capabilities, especially agentic search. It excels for
customers needing frontier intelligence: Go to the Claude Opus 4.1 model card Claude Opus 4 is a state-of-the-art model for coding and agent
capabilities, especially agentic search. It excels for customers needing
frontier intelligence: Go to the Claude Opus 4 model card Claude Sonnet 4 balances impressive performance for coding with the
right speed and cost for high-volume use cases: Go to the Claude Sonnet 4 model card Claude 3.7 Sonnet is Anthropic's most intelligent model to date
and the first Claude model to offer extended thinking—the ability to solve
complex problems with careful, step-by-step reasoning.
Claude 3.7 Sonnet is a single model where you can balance speed
and quality by choosing between standard thinking for near-instant responses or
extended thinking for advanced reasoning. For more information about extended thinking, see Anthropic's
documentation. Claude 3.7 Sonnet is optimized for the following use cases: Go to the Claude 3.7 Sonnet model card Claude 3.5 Sonnet v2 is a state-of-the-art model for real-world
software engineering tasks and agentic capabilities.
Claude 3.5 Sonnet v2 delivers these advancements at the same
price and speed as Claude 3.5 Sonnet. The upgraded Claude 3.5 Sonnet model is capable of interacting
with tools that can manipulate a computer desktop environment. For more
information, see the Anthropic documentation. Claude 3.5 Sonnet is optimized for the following use cases: Go to the Claude 3.5 Sonnet v2 model card Claude 3.5 Haiku, the next generation of Anthropic's fastest and
most cost-effective model, is optimal for use cases where speed and
affordability matter. It improves on its predecessor across every skill set.
Claude 3.5 Haiku is optimized for the following use cases: Go to the Claude 3.5 Haiku model card Anthropic's Claude 3 Haiku is Anthropic's fastest vision and text model for
near-instant responses to basic queries, meant for seamless AI experiences
mimicking human interactions. Live customer interactions and translations. Content moderation to catch suspicious behavior or customer requests. Cost-saving tasks, such as inventory management and knowledge extraction
from unstructured data. Vision tasks, such as processing images to return text output, analysis of
charts, graphs, technical diagrams, reports, and other visual content. Go to the Claude 3 Haiku model card Anthropic's Claude 3.5 Sonnet outperforms Claude 3 Opus on a wide
range of Anthropic's evaluations, with the speed and cost of Anthropic's
mid-tier Claude 3 Sonnet. Claude 3.5 Sonnet is
optimized for the following use cases: Coding, such as writing, editing, and running code with sophisticated
reasoning and troubleshooting capabilities. Handle complex queries from customer support by understanding user context
and orchestrating multi-step workflows. Data science and analysis by navigating unstructured data and leveraging
multiple tools to generate insights. Visual processing, such as interpreting charts and graphs that require
visual understanding. Writing content with a more natural, human-like tone. Go to the Claude 3.5 Sonnet model cardAvailable Claude models
Claude Opus 4.1
Claude Opus 4
Claude Sonnet 4
Claude 3.7 Sonnet
Claude 3.5 Sonnet v2
Claude 3.5 Haiku
Claude 3 Haiku
Claude 3.5 Sonnet
What's next
Anthropic's Claude models
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-21 UTC.