Meta: Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Unranked
Rank
128K
Context Window
$0.24
Min Input /1M
$0.24
Min Output /1M
Model Info
| Model ID | meta-llama-llama-3-2-11b-vision-instruct |
| Modality | text+image->text |
| Context Window | 128K tokens |
| Platforms | OR |
Usage & Rankings
| Rank | Unranked |
| Weekly Usage | N/A |
Input
$0.245
/ 1M tokens
Output
$0.245
/ 1M tokens
Round Trip
$0.49
/ 1M tokens
Value Rating
Economy: great for high volume
About Meta: Llama 3.2 11B Vision Instruct
Meta: Llama 3.2 11B Vision Instruct is a multimodal model (text and image input, text output) with 11 billion parameters and a 128K token context window.
The best available price is $0.245 per 1M input tokens and $0.245 per 1M output tokens, which gives a round-trip cost of $0.49 for 1M tokens in plus 1M tokens out.
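As a rough illustration of how these per-million prices translate into per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the example token counts are hypothetical. The prices are the ones listed above and may differ by provider or change over time. How image inputs are converted to billable tokens is not stated on this page, so the sketch assumes you already know the token counts.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, taken from this page.
INPUT_PRICE_PER_M = Decimal("0.245")
OUTPUT_PRICE_PER_M = Decimal("0.245")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed prices.

    Hypothetical helper: token counts are assumed to be known in advance,
    including whatever tokens an image input is billed as.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Round trip as listed: 1M input tokens plus 1M output tokens.
round_trip = estimate_cost(1_000_000, 1_000_000)

# A small example request: 2,000 input tokens and 500 output tokens.
small_request = estimate_cost(2_000, 500)
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed over many requests.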