Back to all models
Gemini

Gemma 3 4B

Google

efficient
multimodal
open weights

Overview

A compact multimodal model from Google's Gemma 3 family with a 128,000-token context window, capable of processing both text and images. It provides efficient performance for everyday tasks with minimal computational requirements, making it ideal for lightweight applications, personal projects, and educational tools where deployment efficiency is prioritized over maximum capability.

Key Strengths

Deployment efficiency
Basic multimodal capabilities
Long context handling
Balanced performance

Capabilities

Text Generation
Image Understanding
Code Generation
Reasoning

Categories

General Purpose
Multimodal

Specifications

Context Size

128,000 tokens

Pricing

Input$0.1 / 1M tokens
Output$0.3 / 1M tokens

Documentation

View Documentation