TNG: DeepSeek R1T2 Chimera
DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20 % faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60 k tokens in standard use (tested to ~130 k) and maintains consistent <think> token behaviour, making it suitable for long-context analysis, dialogue and other open-ended generation tasks.
Undisclosed
Parameters
164K tokens
Context Window
Proprietary
License
Jul 8, 2025
Released
💰 Pricing
Input
$0.30
per 1M tokens
Output
$1.10
per 1M tokens
API Available
This model is accessible via API for integration into your applications.
⭐ Related Models
Claude 4 Opus
Anthropic
Anthropic's most powerful reasoning model with extended thinking. Excels at complex analysis, multi-step math, advanced coding, and nuanced writing.
Claude 4 Sonnet
Anthropic
Balanced intelligence and speed. Strong reasoning with faster response times and lower cost than Opus.
o3
OpenAI
OpenAI's most powerful reasoning model. Uses chain-of-thought to solve complex math, science, and coding problems.