💬

Qwen: Qwen3.5-Flash

Alibaba·Text Generation

🔥 72 trending

The Qwen3.5 Flash models are native vision-language models built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts (MoE) design to improve inference efficiency. Compared with the Qwen3 series, they perform better on both text-only and multimodal tasks while keeping response times fast.

#text+image+video->text #top-provider

🧮 Parameters: Undisclosed

📏 Context Window: 1.0M tokens

🔒 License: Proprietary

📅 Released: Feb 25, 2026

💰 Pricing

Input: $0.07 per 1M tokens

Output: $0.26 per 1M tokens
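
As a rough illustration of how the per-token prices listed above translate into a per-request cost, here is a minimal Python sketch. It assumes billing is strictly linear in token count at the listed rates, with no caching discounts, tiered pricing, or separate rates for image or video input, none of which this page confirms. The function name `estimate_cost_usd` and the constants are hypothetical and exist only for this example.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, taken from the pricing section of this page.
INPUT_USD_PER_MILLION = Decimal("0.07")
OUTPUT_USD_PER_MILLION = Decimal("0.26")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD.

    Assumption: cost scales linearly with token counts at the listed rates.
    Actual billing rules may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 200,000-token prompt that produces a 4,000-token reply.
    print(estimate_cost_usd(200_000, 4_000))
```

Under these assumptions, a 200,000-token prompt with a 4,000-token reply costs about $0.015, and a request that fills the entire 1.0M-token context window costs $0.07 in input alone. `Decimal` is used instead of floats so that fractions of a cent do not pick up rounding noise.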

🔌

API Available

This model is accessible via API for integration into your applications.

⭐ Related Models

Claude 3.5 Sonnet

Anthropic

The model that defined a generation. Fast, smart, and incredibly capable across coding, analysis, and creative tasks.

🔒 proprietary 📏 200K ctx 💰 $3

Oct 22, 2024

Claude 3.5 Haiku

Anthropic

Ultra-fast and cost-effective. Best for high-volume tasks where speed matters more than peak intelligence.

🔒 proprietary 📏 200K ctx 💰 $0.80

Nov 4, 2024

GPT-4o Mini

OpenAI

Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.

🔒 proprietary 📏 128K ctx 💰 $0.15

Jul 18, 2024