Gemma 4 26B A4B
Chat
gemma-4-26B-A4B-it
Gemma 4 26B A4B is a multimodal LLM from Google under the Gemma Terms license. FlexAI serves it through the OpenAI-compatible Token Factory, billed at the live market-tracked rate.
Context: 256K tokens
API endpoint: /v1/chat/completions
Compatibility: OpenAI
Parameters: 26B MoE (4B active)
License: Gemma Terms
Hardware: H100
Quantization: BF16
Estimate your monthly cost
Input: 10M tokens × $0.054/M = $0.54
Output: 2M tokens × $0.27/M = $0.54
Estimated monthly cost: $1.08
Estimate only, at the current market-tracked rate. Usage-based; no minimums.
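The estimate above is a per-direction multiplication followed by a sum. A minimal sketch that reproduces it is below; the function name `estimate_monthly_cost` and the constant names are illustrative, not part of any FlexAI SDK, and the rates are a snapshot of the figures shown here rather than fixed prices.

```python
from decimal import Decimal

# Rates copied from the estimate above (USD per million tokens). They track the
# market, so treat them as a snapshot and substitute the current values.
INPUT_RATE_PER_M = Decimal("0.054")
OUTPUT_RATE_PER_M = Decimal("0.27")


def estimate_monthly_cost(input_m_tokens, output_m_tokens,
                          input_rate=INPUT_RATE_PER_M,
                          output_rate=OUTPUT_RATE_PER_M):
    """Return (input_cost, output_cost, total) in USD.

    Token volumes are given in millions of tokens per month.
    """
    input_cost = Decimal(str(input_m_tokens)) * input_rate
    output_cost = Decimal(str(output_m_tokens)) * output_rate
    return input_cost, output_cost, input_cost + output_cost


# Same volumes as the estimate above: 10M input tokens, 2M output tokens.
input_cost, output_cost, total = estimate_monthly_cost(10, 2)
print(f"input ${input_cost:.2f} + output ${output_cost:.2f} = ${total:.2f}")
# prints: input $0.54 + output $0.54 = $1.08
```

`Decimal` is used instead of floats so that small currency amounts do not pick up binary rounding noise.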
Quick Start
gemma-4-26B-A4B-it
from openai import OpenAI

client = OpenAI(base_url="https://tokens.flex.ai/v1", api_key="your-api-key")

response = client.chat.completions.create(
    model="gemma-4-26B-A4B-it",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What's in this image?"},
            {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
        ],
    }],
)

print(response.choices[0].message.content)
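The example above points at a publicly hosted image. For a local file, the OpenAI chat format conventionally accepts a base64 `data:` URL in the same `image_url` field. The sketch below builds such a message; `image_message` is a hypothetical helper, and whether this endpoint accepts data URLs is an assumption to verify against the FlexAI documentation.

```python
import base64
import mimetypes


def image_message(prompt, image_bytes, filename="photo.jpg"):
    """Build a user message carrying text plus an inline base64-encoded image.

    The MIME type is guessed from the filename extension; unknown extensions
    fall back to a generic binary type.
    """
    mime_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url",
             "image_url": {"url": f"data:{mime_type};base64,{encoded}"}},
        ],
    }
```

To use it, read the file with `open(path, "rb").read()` and pass the result as `messages=[image_message("What's in this image?", data, path)]` to the same `client.chat.completions.create` call.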