GPT OSS 120B

Groq

Large-scale open-source model running on Groq's LPU silicon for high-performance inference at scale

Pricing

per 1M tokens

Input / 1M$0.15
Output / 1M$0.60

Specifications

API Model IDllama-3.1-405b-instruct
Context Window128K tokens
Max Output8K tokens

Modalities

text

Capabilities

tool-usestreamingjson-modecode

Other Groq Text / Chat Models

ModelInput / 1MOutput / 1MCache Read / 1MCache Write / 1M
Llama 3.1 8B Instant$0.05$0.08
GPT OSS 20B$0.07$0.30
Llama 4 Scout$0.11$0.34
Llama 4 Maverick$0.20$0.60
Qwen3 32B$0.29$0.59
Llama 3.3 70B Versatile$0.59$0.79
Kimi K2$1.00$3.00