โ† Back to all models
๐Ÿ’ฌ

Qwen: Qwen3.5-Flash

Alibaba · Text Generation

The Qwen3.5 Flash models are native vision-language models built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design for higher inference efficiency. Compared with the Qwen3 series, they deliver a marked improvement on both pure-text and multimodal tasks while keeping response times fast.

#text+image+video->text #top-provider
Parameters: Undisclosed

Context Window: 1.0M tokens

License: Proprietary

Released: Feb 25, 2026

Pricing

Input: $0.07 per 1M tokens

Output: $0.26 per 1M tokens
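
As a rough illustration of how the listed rates ($0.07 per 1M input tokens, $0.26 per 1M output tokens) translate into a per-request cost, here is a minimal sketch. The function name and the example workload are hypothetical. It assumes billing is strictly linear in token count, with no minimum charge, tiering, or separate rate for image or video input, none of which the listing confirms.

```python
# Rates copied from the listing (USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.07
OUTPUT_USD_PER_MILLION = 0.26


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 10,000 output tokens.
    print(f"${estimate_cost_usd(200_000, 10_000):.4f}")
```

Under these assumptions, a request that fills the entire 1.0M-token context window with input would cost about $0.07 before any output is generated.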

API Available

This model is accessible via API for integration into your applications.