GPT OSS 120B

Cerebras

Large open-source model running on Cerebras wafer-scale hardware for high-performance inference at scale

Pricing

per 1M tokens

Input / 1M: $0.35
Output / 1M: $0.75
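As a rough illustration of how the per-million rates above turn into a per-request cost, here is a minimal sketch. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen for this example. It assumes billing is linear in token count, with no minimums, rounding rules, or cache discounts, none of which are stated on this page.

```python
from decimal import Decimal

# Listed rates for gpt-oss-120b, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.35")
OUTPUT_USD_PER_M = Decimal("0.75")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper).

    Assumes purely linear pricing; actual billing may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 100,000 input tokens and 8,000 output tokens.
    print(estimate_cost_usd(100_000, 8_000))
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed over many requests.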

Specifications

API Model ID: gpt-oss-120b
Context Window: 128K tokens
Max Output: 8K tokens
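The two limits above interact when sizing a request. The sketch below shows one way to pick a safe output budget. It rests on several assumptions not confirmed by this page: that "128K" and "8K" mean 128,000 and 8,000 tokens (they could equally mean 131,072 and 8,192), and that prompt and output tokens share the same context window. The name `output_budget` is hypothetical.

```python
# Assumed numeric values for the listed limits (see lead-in).
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 8_000


def output_budget(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return how many output tokens can be requested (hypothetical helper).

    Assumes the prompt and the generated output share one context window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        return 0  # the prompt alone fills or exceeds the window
    return min(requested, MAX_OUTPUT, remaining)


if __name__ == "__main__":
    print(output_budget(1_000))    # short prompt: capped by the max output limit
    print(output_budget(125_000))  # long prompt: capped by the remaining window
```

A result of 0 signals that the prompt should be shortened before sending anything.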

Modalities

text

Capabilities

tool-use, streaming, json-mode, code

Other Cerebras Text / Chat Models

Model | Input / 1M | Output / 1M | Cache Read / 1M | Cache Write / 1M
Llama 3.1 8B | $0.10 | $0.10 | |