Google Vertex AI Provider
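Google Vertex AI is Google Cloud's managed AI platform. Through this provider you can access Google's Gemini and Veo models, as well as Anthropic's Claude models hosted on Vertex AI.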
Available Models
Veo 3.1
google
veo-3.1-generate-preview
Google Vertex AI
Context: 32.8k
Per Second Pricing
Video / Audio: $0.2 – $0.4/sec
Veo 3.1 Fast
google
veo-3.1-fast-generate-preview
Google Vertex AI
Context: 32.8k
Per Second Pricing
Video / Audio: $0.1 – $0.15/sec
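As a rough illustration of how the per-second ranges listed for the Veo models translate into a per-clip cost, the sketch below multiplies a clip duration by the low and high ends of a listed range. The function name and the 8-second duration are hypothetical; the rates are copied from the listings above, and which end of the range applies to a given request is not stated here.

```python
def video_cost_range(duration_sec, low_rate, high_rate):
    """Return (low, high) estimated cost in USD for a clip of the given length."""
    if duration_sec < 0:
        raise ValueError("duration_sec must be non-negative")
    # Round to 4 decimals to hide floating-point noise in the estimate.
    return round(duration_sec * low_rate, 4), round(duration_sec * high_rate, 4)


# Rates copied from the listing (USD per second of generated video/audio).
VEO_3_1 = (0.2, 0.4)
VEO_3_1_FAST = (0.1, 0.15)

if __name__ == "__main__":
    low, high = video_cost_range(8, *VEO_3_1)
    print(f"Veo 3.1, 8 s clip: ${low:.2f} to ${high:.2f}")
    low, high = video_cost_range(8, *VEO_3_1_FAST)
    print(f"Veo 3.1 Fast, 8 s clip: ${low:.2f} to ${high:.2f}")
```

Returning both ends of the range keeps the estimate honest when the exact rate for a request is unknown.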
Gemini 3.1 Flash Lite (Preview)
google
gemini-3.1-flash-lite-preview
Streaming, Vision, Tools, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $0.25 /M tokens
Cached: $0.025 /M tokens
Output: $1.5 /M tokens
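To make the per-million-token prices concrete, here is a minimal sketch of a cost estimate for one request. The function name and token counts are hypothetical, and the sketch assumes cached tokens are billed at the cached rate and are not also counted as regular input tokens; the page itself does not spell out how cached tokens are metered.

```python
def token_cost(input_tokens, output_tokens, cached_tokens=0, *,
               input_price, output_price, cached_price=None):
    """Estimate request cost in USD from prices quoted per million tokens.

    Assumption: cached_tokens are billed at cached_price and are NOT
    included in input_tokens.
    """
    per_token = 1 / 1_000_000
    cost = input_tokens * input_price * per_token
    cost += output_tokens * output_price * per_token
    if cached_tokens:
        if cached_price is None:
            # Some models in the listing show no cached price.
            raise ValueError("no cached price listed for this model")
        cost += cached_tokens * cached_price * per_token
    return round(cost, 6)


if __name__ == "__main__":
    # Prices copied from the gemini-3.1-flash-lite-preview listing.
    estimate = token_cost(100_000, 5_000, 20_000,
                          input_price=0.25, output_price=1.5, cached_price=0.025)
    print(f"Estimated cost: ${estimate:.4f}")
```

The same function works for any token-priced model on this page by swapping in that model's three prices.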
Gemini 3.1 Flash Image (Preview)
google
gemini-3.1-flash-image-preview
Streaming, Vision, JSON Output, JSON Schema, Image Generation
Google Vertex AI
Context: 65.5k
Input: $0.25 /M tokens
Cached: n/a
Output: $1.5 /M tokens
Image Pricing (est. per image)
Input, any size: ~$0.0001
Output, 0.5K: ~$0.0448
Output, 1K: ~$0.0672
Output, 2K: ~$0.1008
Output, 4K: ~$0.1512
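The per-image figures above are estimates, so any batch total derived from them is an estimate too. As a small sketch, the lookup below multiplies a listed output price by an image count; the dictionary and function names are hypothetical, and the prices are copied from the gemini-3.1-flash-image-preview listing.

```python
# Estimated output price per generated image (USD), keyed by output size.
OUTPUT_PRICE_PER_IMAGE = {
    "0.5K": 0.0448,
    "1K": 0.0672,
    "2K": 0.1008,
    "4K": 0.1512,
}


def image_batch_cost(size, count):
    """Return the estimated USD cost of generating `count` images at `size`."""
    if size not in OUTPUT_PRICE_PER_IMAGE:
        raise ValueError(f"unknown output size: {size!r}")
    if count < 0:
        raise ValueError("count must be non-negative")
    return round(OUTPUT_PRICE_PER_IMAGE[size] * count, 4)


if __name__ == "__main__":
    print(f"10 images at 2K: ~${image_batch_cost('2K', 10):.3f}")
```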
Gemini 3.1 Pro (Preview)
google
gemini-3.1-pro-preview
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $2 /M tokens
Cached: $0.2 /M tokens
Output: $12 /M tokens
+ $0.014 per search
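For models that list a per-search surcharge, the total for a request is the token cost plus the surcharge for each search. The sketch below assumes the surcharge is applied once per search performed during the request, which the listing implies but does not define; the function name and request sizes are hypothetical.

```python
def cost_with_search(input_tokens, output_tokens, searches, *,
                     input_price, output_price, per_search):
    """Estimate USD cost: per-million-token prices plus a flat fee per search."""
    token_part = (input_tokens * input_price
                  + output_tokens * output_price) / 1_000_000
    # Assumption: every search in the request incurs the flat fee once.
    search_part = searches * per_search
    return round(token_part + search_part, 6)


if __name__ == "__main__":
    # Prices copied from the gemini-3.1-pro-preview listing.
    total = cost_with_search(10_000, 2_000, 3,
                             input_price=2, output_price=12, per_search=0.014)
    print(f"Estimated cost with 3 searches: ${total:.3f}")
```

In this example the three searches (0.042 USD) cost about as much as the tokens themselves (0.044 USD), so search-heavy workloads are worth estimating separately.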
Claude Sonnet 4.6
anthropic
claude-sonnet-4-6
Streaming, Vision, Tools, Reasoning, JSON Schema
Google Vertex AI
Context: 200k
Input: $3 /M tokens
Cached: $0.3 /M tokens
Output: $15 /M tokens
Gemini 3 Flash (Preview)
google
gemini-3-flash-preview
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $0.5 /M tokens
Cached: $0.05 /M tokens
Output: $3 /M tokens
+ $0.014 per search
Claude Opus 4.5
anthropic
claude-opus-4-5-20251101
Streaming, Vision, Tools, Reasoning, JSON Schema
Google Vertex AI
Context: 200k
Input: $5 /M tokens
Cached: $0.5 /M tokens
Output: $25 /M tokens
Gemini 3 Pro Image (Preview)
google
gemini-3-pro-image-preview
Streaming, Vision, JSON Output, JSON Schema, Image Generation
Google Vertex AI
Context: 65.5k
Input: $2 /M tokens
Cached: $0.2 /M tokens
Output: $12 /M tokens
Image Pricing (est. per image)
Input, any size: ~$0.0011
Output, 1K: ~$0.1344
Output, 2K: ~$0.1344
Output, 4K: ~$0.2400
Gemini 3 Pro (Preview)
google
Model Deactivated
gemini-3-pro-preview
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Deprecated since Feb 27, 2026
Deactivated since Mar 26, 2026
Input: $2 /M tokens
Cached: $0.2 /M tokens
Output: $12 /M tokens
+ $0.014 per search
Gemini 2.5 Flash Image (Preview)
google
gemini-2.5-flash-image-preview
Streaming, Vision, JSON Output, JSON Schema, Image Generation
Google Vertex AI
Context: 32.8k
Input: $0.3 /M tokens
Cached: n/a
Output: $2.5 /M tokens
Image Pricing (est. per image)
Output, 1K: ~$0.0336
Output, 2K: ~$0.0336
Output, 4K: ~$0.0600
Gemini 2.5 Flash Image
google
gemini-2.5-flash-image
Streaming, Vision, JSON Output, JSON Schema, Image Generation
Google Vertex AI
Context: 32.8k
Input: $0.3 /M tokens
Cached: $0.03 /M tokens
Output: $30 /M tokens
Gemini 2.5 Flash Preview (09-2025)
google
Model Deactivated
gemini-2.5-flash-preview-09-2025
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Deactivated since Jan 27, 2026
Input: $0.3 /M tokens
Cached: $0.03 /M tokens
Output: $2.5 /M tokens
Gemini 2.5 Flash Lite Preview (09-2025)
google
gemini-2.5-flash-lite-preview-09-2025
Streaming, Vision, Tools, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $0.1 /M tokens
Cached: $0.01 /M tokens
Output: $0.4 /M tokens
Gemini 2.5 Flash
google
gemini-2.5-flash
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $0.3 /M tokens
Cached: $0.03 /M tokens
Output: $2.5 /M tokens
+ $0.035 per search
Gemini 2.5 Flash Lite
google
gemini-2.5-flash-lite
Streaming, Vision, Tools, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $0.1 /M tokens
Cached: $0.01 /M tokens
Output: $0.4 /M tokens
Gemini 2.5 Pro Preview (06-05)
google
Model Deactivated
gemini-2.5-pro-preview-06-05
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Jul 15, 2025
Input: $1.25 /M tokens
Cached: n/a
Output: $10 /M tokens
Gemini 2.5 Flash Preview (05-20)
google
Model Deactivated
gemini-2.5-flash-preview-05-20
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Jul 15, 2025
Input: $0.15 /M tokens
Cached: n/a
Output: $0.6 /M tokens
Gemini 2.5 Pro Preview (05-06)
google
Model Deactivated
gemini-2.5-pro-preview-05-06
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Jul 15, 2025
Input: $1.25 /M tokens
Cached: n/a
Output: $10 /M tokens
Gemini 2.5 Flash Preview (04-17)
google
Model Deactivated
gemini-2.5-flash-preview-04-17
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Jul 15, 2025
Input: $0.15 /M tokens
Cached: n/a
Output: $0.6 /M tokens
Gemini 2.5 Flash Preview Thinking (04-17)
google
Model Deactivated
gemini-2.5-flash-preview-04-17-thinking
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Jul 22, 2025
Input: $0.15 /M tokens
Cached: n/a
Output: $0.6 /M tokens
Gemini 2.5 Pro
google
gemini-2.5-pro
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Input: $1.25 /M tokens
Cached: $0.125 /M tokens
Output: $10 /M tokens
+ $0.035 per search
Gemini 2.0 Flash Lite
google
Scheduled for Deactivation
gemini-2.0-flash-lite
Streaming, Tools, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Deprecated since Feb 18, 2026
Deactivating on May 25, 2026
Input: $0.075 /M tokens
Cached: n/a
Output: $0.3 /M tokens
Gemini 2.0 Flash
google
Scheduled for Deactivation
gemini-2.0-flash
Streaming, Tools, JSON Output, JSON Schema
Google Vertex AI
Context: 1.0M
Deprecated since Feb 18, 2026
Deactivating on May 25, 2026
Input: $0.1 /M tokens
Cached: $0.025 /M tokens
Output: $0.4 /M tokens
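Because several models on this page carry deprecation and deactivation dates, it can help to check a model's status before routing traffic to it. The following sketch classifies a model from those two dates; the function name and status strings are hypothetical, and it assumes a model counts as deactivated from the start of its listed deactivation date.

```python
from datetime import date


def lifecycle_status(today, deprecated_on=None, deactivated_on=None):
    """Classify a model as 'active', 'deprecated', or 'deactivated'.

    Assumption: a model is treated as deactivated on and after
    deactivated_on, and as deprecated on and after deprecated_on.
    """
    if deactivated_on is not None and today >= deactivated_on:
        return "deactivated"
    if deprecated_on is not None and today >= deprecated_on:
        return "deprecated"
    return "active"


if __name__ == "__main__":
    # Dates copied from the gemini-2.0-flash listing.
    deprecated = date(2026, 2, 18)
    deactivating = date(2026, 5, 25)
    print(lifecycle_status(date(2026, 3, 1), deprecated, deactivating))
```

A check like this could run before each request, or once at startup, to fall back to a model that has no scheduled deactivation.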
Gemini 1.5 Flash 8B
google
Model Deactivated
gemini-1.5-flash-8b
Streaming, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Sep 20, 2025
Input: $0.0375 /M tokens
Cached: n/a
Output: $0.15 /M tokens
Gemini 1.5 Pro
google
Model Deactivated
gemini-1.5-pro
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Sep 20, 2025
Input: $2.5 /M tokens
Cached: n/a
Output: $10 /M tokens
Gemini 1.5 Flash
google
Model Deactivated
gemini-1.5-flash
Streaming, Vision, Tools, Reasoning, JSON Output, JSON Schema
Google Vertex AI
Context: 1M
Deactivated since Sep 20, 2025
Input: $0.0375 /M tokens
Cached: n/a
Output: $0.15 /M tokens