← Back to all models
456B parameter MoE model with 4M token context — one of the longest context windows available. Strong at long-doc tasks.
#moe#ultra-long-context#open-weight#4M-context
🧮
456B
Parameters
📏
4.1M tokens
Context Window
🔓
Open Source
License
📅
Jan 15, 2025
Released
💰 Pricing
Input
$0.20
per 1M tokens
Output
$1.10
per 1M tokens
⚡ Strengths
✓4M token context
✓Open weights
✓Lightning attention
✓Good performance
🎯 Use Cases
Extremely long docsBook analysisCode reposResearch
🔌
API Available
This model is accessible via API for integration into your applications.
⭐ Related Models
💬
Claude 3.5 Sonnet
Anthropic
🔥 85
The model that defined a generation. Fast, smart, and incredibly capable across coding, analysis, and creative tasks.
🔒 proprietary📏 200K ctx💰 $3
Oct 22, 2024View details →
💬
Claude 3.5 Haiku
Anthropic
🔥 72
Ultra-fast and cost-effective. Best for high-volume tasks where speed matters more than peak intelligence.
🔒 proprietary📏 200K ctx💰 $0.80
Nov 4, 2024View details →
💬
GPT-4o Mini
OpenAI
🔥 70
Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.
🔒 proprietary📏 128K ctx💰 $0.15
Jul 18, 2024View details →