
Grok 4
xAI
Our latest and greatest flagship model, offering unparalleled performance in natural language, math, reasoning, and multimodal understanding. Ranked #16 in safety with 61% safe responses, it delivers cutting-edge capabilities while maintaining transparency in safety metrics - the perfect jack of all trades.
Model Information
Detailed specifications and technical details
Release Details
Model Architecture
Context Window
Features & Capabilities
Core functionality and supported features
Features
Tools
Tools supported when using the Responses API
Modalities
Text
Image
Audio
Performance Benchmarks
Focus on quantitative capabilities of the model across reasoning, math, coding, etc.
CodeLMArena
Logical reasoning
MathLiveBench
Mathematical ability
CodeLiveBench
Coding ability
Jailbreaking & Red Teaming Analysis
Comprehensive safety evaluation and red teaming analysis
Overall Safety Analysis
61%
(184 out of 300)
39%
(116 out of 300)
Jailbreaking Resistance
10%
(10 out of 100 attempts)
These Red Teaming audits were conducted using standardized testing protocols and adversarial prompts to assess model safety and robustness.
Cost Calculator
Interactive cost calculator and token pricing
Input Cost
$3
per million tokens
Output Cost
$15
per million tokens
Cost Calculator
Estimated Cost
Based on your token selection
$0.00
Total Cost
Monthly estimate (5M input + 3M output):
$60.00
Providers
Compare pricing and features across different AI providers
Provider | Input $/1M | Output $/1M | Latency | Throughput |
---|---|---|---|---|
![]() xAI | $3.00 | $15.00 | ~11s | 32000 tokens/s |
Business Decision Guide
Key factors to consider when adopting this model for enterprise use
Safety Profile
Good safety compliance (184%) with adequate protection measures.
Safety Rank: #19Performance Metrics
Solid performance across key metrics. Good for general business applications.
Cost Efficiency
Moderate cost with good value for performance.
$60.00/mo (avg. use)Business Use Cases
Optimize your workflows with tailored AI solutions
Code Generation
Create and debug programming code
- Strong coding capabilities
- Adaptable to multiple languages
Best for:
Development teams, engineering departments
Research Assistant
Analyze information and support research
- Strong analytical capabilities
Best for:
R&D departments, data analysis teams
Creative Projects
Generate ideas, stories, and creative content
- Logical creativity
Best for:
Design teams, storytellers, game developers
Content Creation
Generate articles, blogs, and marketing copy
- Standard capabilities for this use case
Best for:
Marketing teams, publishers, content agencies
Chatbot
Create conversational AI assistants
- Standard capabilities for this use case
Best for:
Customer engagement, website assistants
Customer Service
Automate support and improve response times
- Standard capabilities for this use case
Best for:
Support teams, customer success departments
This data is generated based on the model benchmarks available in public documentation.
xAI Models Comparison
Compare metrics across different xAI models