
GPT-OSS-120B
OpenAI
GPT-OSS-120B is OpenAI's larger open-weight language model with 117 billion parameters (5.1B active per token), optimized for deployment on a single 80GB GPU. Released under Apache 2.0 license, it achieves near-parity with o4-mini on reasoning benchmarks with exceptional MMLU (90%) and AIME performance.
Model Information
Detailed specifications and technical details
Release Details
Model Architecture
Context Window
Features & Capabilities
Core functionality and supported features
Features
Tools
Tools supported when using the Responses API
Modalities
Text
Image
Audio
Performance Benchmarks
Focus on quantitative capabilities of the model across reasoning, math, coding, etc.
MathLiveBench
Mathematical ability
CodeLiveBench
Coding ability
Jailbreaking & Red Teaming Analysis
Comprehensive safety evaluation and red teaming analysis
Overall Safety Analysis
97%
(292 out of 300)
3%
(8 out of 300)
Jailbreaking Resistance
92%
(92 out of 100 attempts)
These Red Teaming audits were conducted using standardized testing protocols and adversarial prompts to assess model safety and robustness.
Cost Calculator
Interactive cost calculator and token pricing
No Pricing Information Available
Pricing data is not available for this model.
Providers
Compare pricing and features across different AI providers
Provider | Input $/1M | Output $/1M | Latency | Throughput |
---|---|---|---|---|
![]() AWS Bedrock | $0.15 | $0.60 | 1.2 ms | 3000 tokens/s |
C Chutes | $73.00 | $290.00 | 2.28 ms | 237 tokens/s |
![]() DeepInfra | $90.00 | $450.00 | 0.56 ms | 117.5 tokens/s |
![]() nCompass | $100.00 | $450.00 | 1.59 ms | 39.3 tokens/s |
![]() Baseten | $100.00 | $500.00 | 0.27 ms | 305.1 tokens/s |
![]() NovitaAI | $100.00 | $500.00 | 0.45 ms | 121.4 tokens/s |
![]() AtlasCloud | $100.00 | $500.00 | 0.44 ms | 139.8 tokens/s |
![]() Crusoe | $150.00 | $500.00 | 0.56 ms | 151 tokens/s |
![]() Fireworks | $150.00 | $600.00 | 0.82 ms | 221 tokens/s |
![]() Together | $150.00 | $600.00 | 0.37 ms | 81.95 tokens/s |
![]() Parasail | $150.00 | $600.00 | 0.73 ms | 68.02 tokens/s |
![]() Nebius AI Studio | $150.00 | $600.00 | 0.72 ms | 55.65 tokens/s |
![]() Groq | $150.00 | $750.00 | 0.23 ms | 1000 tokens/s |
![]() Cerebras | $250.00 | $690.00 | 0.44 ms | 4041 tokens/s |
Business Decision Guide
Key factors to consider when adopting this model for enterprise use
Safety Profile
Strong safety measures with good compliance rates. Suitable for enterprise use.
Safety Rank: #5Performance Metrics
Solid performance across key metrics. Good for general business applications.
Cost Efficiency
Highly cost-effective with excellent context handling.
$0.00/mo (avg. use)Business Use Cases
Optimize your workflows with tailored AI solutions
Chatbot
Create conversational AI assistants
- High resilience against manipulation
- Cost-effective for high volume
Best for:
Customer engagement, website assistants
Customer Service
Automate support and improve response times
- Competent customer support
- Scalable solution
Best for:
Support teams, customer success departments
Code Generation
Create and debug programming code
- Strong coding capabilities
Best for:
Development teams, engineering departments
Research Assistant
Analyze information and support research
- Strong analytical capabilities
Best for:
R&D departments, data analysis teams
Content Creation
Generate articles, blogs, and marketing copy
- Consistent brand voice alignment
Best for:
Marketing teams, publishers, content agencies
Creative Projects
Generate ideas, stories, and creative content
- Logical creativity
Best for:
Design teams, storytellers, game developers
This data is generated based on the model benchmarks available in public documentation.
OpenAI Models Comparison
Compare metrics across different OpenAI models