โ† Back to all models
๐Ÿ’ฌ

Qwen: Qwen3 235B A22B Instruct 2507

AlibabaยทText Generation
๐Ÿ”ฅ 70trending

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.

#text->text#top-provider
๐Ÿงฎ

Undisclosed

Parameters

๐Ÿ“

262K tokens

Context Window

๐Ÿ”’

Proprietary

License

๐Ÿ“…

Jul 21, 2025

Released

๐Ÿ’ฐ Pricing

Input

$0.07

per 1M tokens

Output

$0.10

per 1M tokens

๐Ÿ”Œ

API Available

This model is accessible via API for integration into your applications.