Gemini 1.5 Flash 8B

Compact 8B-parameter Gemini Flash model for lightweight inference.

gemini-1.5-flash-8b
STABLE
Model Deactivated
1,000,000 context
Starting at $0.0375/M input tokens
Starting at $0.15/M output tokens
Streaming
Tools
Reasoning
JSON Output

All Providers for Gemini 1.5 Flash 8B

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.
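
The selection step described above can be pictured roughly as follows. This is only a minimal sketch, not LLM Gateway's actual routing logic: the `Provider` class, the `eligible_providers` function, and the "cheapest input price first" tie-break are all hypothetical names and assumptions made for illustration. The provider names, context sizes, input prices, and deactivated status are taken from this page's listing.

```python
from dataclasses import dataclass


@dataclass
class Provider:
    # Hypothetical record type; field names are illustrative only.
    name: str
    context_tokens: int
    input_price_per_m: float  # USD per million input tokens
    active: bool


def eligible_providers(providers, prompt_tokens):
    """Keep active providers whose context window fits the prompt,
    ordered by input price (an assumed tie-break, cheapest first)."""
    candidates = [
        p for p in providers
        if p.active and p.context_tokens >= prompt_tokens
    ]
    return sorted(candidates, key=lambda p: p.input_price_per_m)


# Values from this page: both providers are deactivated.
providers = [
    Provider("Google AI Studio", 1_000_000, 0.0375, active=False),
    Provider("Google Vertex AI", 1_000_000, 0.0375, active=False),
]

print(eligible_providers(providers, prompt_tokens=50_000))  # prints []
```

Because both listed providers are deactivated, the filter returns an empty list under these assumptions, which matches the model's "Model Deactivated" status.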

Google AI Studio
Context: 1M
Deactivated since Sep 20, 2025
Input: $0.0375/M tokens
Cached: no price listed
Output: $0.15/M tokens

Google Vertex AI
Context: 1M
Deactivated since Sep 20, 2025
Input: $0.0375/M tokens
Cached: no price listed
Output: $0.15/M tokens
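
As a rough illustration of what the per-million-token prices mean for a single request, the arithmetic can be sketched as below. The function name `estimate_cost` and the example token counts are hypothetical. The prices are the input and output rates listed for both providers. Cached-token pricing is not shown on this page, so the sketch ignores it.

```python
# Prices listed on this page, in USD per million tokens.
INPUT_PRICE_PER_M = 0.0375
OUTPUT_PRICE_PER_M = 0.15


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Ignores cached-token pricing, which is not listed.
    """
    total = (
        input_tokens * INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / 1_000_000


# Example: a 100,000-token prompt with a 2,000-token reply.
cost = estimate_cost(100_000, 2_000)
print(f"${cost:.6f}")  # prints $0.004050
```

At these rates, a prompt that fills the full 1M-token context would cost about $0.0375 in input tokens alone.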