Gemini 1.5 Flash 8B

Compact 8B Gemini Flash for lightweight inference.

gemini-1.5-flash-8b

Stable. Model deactivated.

1,000,000 context

Starting at $0.0375/M input tokens

Starting at $0.15/M output tokens

Streaming

Tools

Reasoning

JSON Output

All Providers for Gemini 1.5 Flash 8B

LLM Gateway routes each request to the best provider that can handle your prompt size and parameters.

Google AI Studio

Context: 1M

Deactivated since Sep 20, 2025

Input: $0.0375/M tokens

Cached: not listed

Output: $0.15/M tokens

Google Vertex AI

Context: 1M

Deactivated since Sep 20, 2025

Input: $0.0375/M tokens

Cached: not listed

Output: $0.15/M tokens
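
To make the per-million-token prices concrete, here is a minimal sketch of the arithmetic. It assumes flat rates of $0.0375 per million input tokens and $0.15 per million output tokens, as listed for both providers. The "Starting at" wording suggests higher tiers may apply in some cases, and this sketch ignores them. The function name `estimate_cost` is hypothetical and is not part of any LLM Gateway API.

```python
from decimal import Decimal

# Listed per-million-token prices in USD, taken from the provider entries above.
# Assumption: flat rates; any tiered or cached pricing is not modeled.
INPUT_PRICE_PER_M = Decimal("0.0375")
OUTPUT_PRICE_PER_M = Decimal("0.15")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 1,000-token reply.
    print(estimate_cost(200_000, 1_000))
```

Decimal is used instead of float so that small per-token amounts add up without binary rounding noise. Since the model is listed as deactivated, these figures are useful mainly for comparing against historical usage.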