Pricing
per 1M tokens
| Input / 1M | $0.11 |
| Output / 1M | $0.34 |
Specifications
| API Model ID | meta-llama/llama-4-scout-17b-16e-instruct |
| Context Window | 128K tokens |
| Max Output | 8K tokens |
Modalities
textimage
Capabilities
tool-usestreamingjson-modevisioncode
Other Groq Text / Chat Models
| Model | Input / 1M | Output / 1M | Cache Read / 1M | Cache Write / 1M |
|---|---|---|---|---|
| Llama 3.1 8B Instant | $0.05 | $0.08 | — | — |
| GPT OSS 20B | $0.07 | $0.30 | — | — |
| GPT OSS 120B | $0.15 | $0.60 | — | — |
| Llama 4 Maverick | $0.20 | $0.60 | — | — |
| Qwen3 32B | $0.29 | $0.59 | — | — |
| Llama 3.3 70B Versatile | $0.59 | $0.79 | — | — |
| Kimi K2 | $1.00 | $3.00 | — | — |