Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

Rank: Unranked
Context Window: 128K
Min Input: $0.245 / 1M tokens
Min Output: $0.245 / 1M tokens

Model Info

Model ID: meta-llama-llama-3-2-11b-vision-instruct
Modality: text+image->text
Context Window: 128K tokens
Platforms: OpenRouter (OR)

Usage & Rankings

Rank: Unranked
Weekly Usage: N/A

Cross-Platform Pricing

Platform         Input /1M  Output /1M  Context  Model ID
OpenRouter (OR)  $0.245     $0.245      128K     meta-llama/llama-3.2-11b-vision-instruct
Input: $0.245 / 1M tokens
Output: $0.245 / 1M tokens
Round Trip: $0.49 / 1M tokens
Value Rating: 🟢 Economy, great for high volume

About Meta: Llama 3.2 11B Vision Instruct

Meta: Llama 3.2 11B Vision Instruct is a multimodal model (text and image input, text output) with 11 billion parameters and a 128K token context window.

The best available input price is $0.245/1M tokens and output price is $0.245/1M tokens, with a round-trip cost of $0.49/1M tokens.
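
The arithmetic behind these figures can be sketched in a few lines of Python. This is a minimal illustration, not an official calculator: the helper name `estimate_cost` is hypothetical, the prices are copied from the listing above, and it assumes "round trip" means one million input tokens plus one million output tokens, which is consistent with $0.245 + $0.245 = $0.49. How image inputs are counted toward tokens is not stated on this page, so the sketch covers token counts only.

```python
from decimal import Decimal

# Prices copied from the listing above (USD per 1M tokens on OpenRouter).
INPUT_PRICE_PER_M = Decimal("0.245")
OUTPUT_PRICE_PER_M = Decimal("0.245")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper, not a real API)."""
    input_cost = INPUT_PRICE_PER_M * Decimal(input_tokens) / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * Decimal(output_tokens) / TOKENS_PER_M
    return input_cost + output_cost


# Assumed definition of "round trip": 1M input tokens plus 1M output tokens.
round_trip = estimate_cost(1_000_000, 1_000_000)
print(f"Round trip: ${round_trip}")

# A smaller, made-up request: 2,000 prompt tokens and 500 completion tokens.
print(f"Single request: ${estimate_cost(2_000, 500)}")
```

`Decimal` is used instead of `float` so that sub-cent amounts add up exactly when many small requests are summed.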