Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

—

Rank

128K

Context Window

$0.24

Min Input /1M

$0.24

Min Output /1M

Model Info

Model ID	meta-llama-llama-3-2-11b-vision-instruct
Modality	text+image->text
Context Window	128K tokens
Platforms	OR

Usage & Rankings

Rank	Unranked
Weekly Usage	—

Cross-Platform Pricing

Platform	Input /1M	Output /1M	Context	Model ID
OR OpenRouter	$0.245	$0.245	128K	meta-llama/llama-3.2-11b-vision-instruct

Input

$0.245

/ 1M tokens

Output

$0.245

/ 1M tokens

Round Trip

$0.49

/ 1M tokens

Value Rating

🟢

🟢 Economy — great for high volume

About Meta: Llama 3.2 11B Vision Instruct

Meta: Llama 3.2 11B Vision Instruct is a large language model with a 128K token context window.

The best available input price is $0.245/1M tokens and output price is $0.245/1M tokens, with a round-trip cost of $0.49/1M tokens.