Qwen: Qwen3 235B A22B
Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks, and a "non-thinking" mode for general conversational efficiency. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction-following, and agent tool-calling capabilities. It natively handles a 32K token context window and extends up to 131K tokens using YaRN-based scaling.
Undisclosed
Parameters
131K tokens
Context Window
Proprietary
License
Apr 28, 2025
Released
๐ฐ Pricing
Input
$0.45
per 1M tokens
Output
$1.82
per 1M tokens
API Available
This model is accessible via API for integration into your applications.
โญ Related Models
Claude 3.5 Sonnet
Anthropic
The model that defined a generation. Fast, smart, and incredibly capable across coding, analysis, and creative tasks.
Claude 3.5 Haiku
Anthropic
Ultra-fast and cost-effective. Best for high-volume tasks where speed matters more than peak intelligence.
GPT-4o Mini
OpenAI
Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.