โ Back to all models
Reasoning-specialized model trained with reinforcement learning. Shows chain-of-thought reasoning transparently.
#reasoning#rl-trained#open-weight#chain-of-thought
๐งฎ
671B
Parameters
๐
128K tokens
Context Window
๐
Open Source
License
๐
Jan 20, 2025
Released
๐ฐ Pricing
Input
$0.55
per 1M tokens
Output
$2.19
per 1M tokens
โก Strengths
โTransparent reasoning chains
โStrong at math/logic
โOpen weights
โRL-trained
๐ฏ Use Cases
Math competitionsLogic puzzlesScientific reasoningResearch
๐
API Available
This model is accessible via API for integration into your applications.
โญ Related Models
๐ง
Claude 4 Opus
Anthropic
๐ฅ 98
Anthropic's most powerful reasoning model with extended thinking. Excels at complex analysis, multi-step math, advanced coding, and nuanced writing.
๐ proprietary๐ 200K ctx๐ฐ $15
Sep 1, 2025View details โ
๐ง
Claude 4 Sonnet
Anthropic
๐ฅ 94
Balanced intelligence and speed. Strong reasoning with faster response times and lower cost than Opus.
๐ proprietary๐ 200K ctx๐ฐ $3
Jun 1, 2025View details โ
๐ง
o3
OpenAI
๐ฅ 91
OpenAI's most powerful reasoning model. Uses chain-of-thought to solve complex math, science, and coding problems.
๐ proprietary๐ 200K ctx๐ฐ $10
Apr 16, 2025View details โ