Llama 3.2 Vision

by Meta
API Available

Multimodal model with vision capabilities available in 11B and 90B parameter sizes. Supports image understanding and reasoning.

Specifications

Context Window: 128,000 tokens
Released: September 2024

Pricing

Open source - hosting costs vary

Capabilities

Vision, Open source, Image understanding, Multimodal

Best For