Back to all models
Gemini

Gemini 2.0 Flash Image Generation

Google

experimental
new

Overview

An experimental Gemini model specialized in text-to-image generation capabilities with a 32,000-token context window. It converts detailed text prompts into high-quality, creative images with strong understanding of styles, concepts, and composition. This model bridges text understanding and visual creation, making it ideal for creative professionals, marketing content generation, and visual prototyping.

Key Strengths

Text-to-image generation
Style understanding
Visual creativity
Compositional awareness

Capabilities

Text Generation
Image Understanding
Reasoning

Categories

Multimodal
Image Generation

Specifications

Context Size

32,000 tokens

Pricing

Input$0.35 / 1M tokens
Output$1.05 / 1M tokens

Documentation

View Documentation