Pricing
per 1M tokens
| Input / 1M | $0.10 |
| Output / 1M | $0.10 |
Specifications
| API Model ID | llama3.1-8b |
| Context Window | 128K tokens |
| Max Output | 8K tokens |
Modalities
text
Capabilities
tool-usestreamingjson-modecode
Other Cerebras Text / Chat Models
| Model | Input / 1M | Output / 1M | Cache Read / 1M | Cache Write / 1M |
|---|---|---|---|---|
| GPT OSS 120B | $0.35 | $0.75 | — | — |