Best Free AI Models &amp; APIs

Qwen: Qwen Plus 0728

Alibaba

🔒 proprietary📏 1.0M ctx💰 $0.26

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

Sep 8, 2025View details →

💻

Qwen: Qwen2.5 Coder 7B Instruct

Alibaba

🔒 proprietary📏 33K ctx💰 $0.03

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE, SwiGLU, RMSNorm, and GQA attention with support for up to 128K tokens using YaRN-based extrapolation. It is trained on a large corpus of source code, synthetic data, and text-code grounding, providing robust performance across programming languages and agentic coding workflows. This model is part of the Qwen2.5-Coder family and offers strong compatibility with tools like vLLM for efficient deployment. Released under the Apache 2.0 license.

Apr 15, 2025View details →

Qwen: Qwen2.5 VL 32B Instruct

Alibaba

🔒 proprietary📏 128K ctx💰 $0.20

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual interpretation within images, and precise event localization in extended videos. Qwen2.5-VL-32B demonstrates state-of-the-art performance across multimodal benchmarks such as MMMU, MathVista, and VideoMME, while maintaining strong reasoning and clarity in text-based tasks like MMLU, mathematical problem-solving, and code generation.

Mar 24, 2025View details →

Reka Flash 3

rekaai

🔒 proprietary📏 66K ctx💰 $0.10

Reka Flash 3 is a general-purpose, instruction-tuned large language model with 21 billion parameters, developed by Reka. It excels at general chat, coding tasks, instruction-following, and function calling. Featuring a 32K context length and optimized through reinforcement learning (RLOO), it provides competitive performance comparable to proprietary models within a smaller parameter footprint. Ideal for low-latency, local, or on-device deployments, Reka Flash 3 is compact, supports efficient quantization (down to 11GB at 4-bit precision), and employs explicit reasoning tags ("<reasoning>") to indicate its internal thought process. Reka Flash 3 is primarily an English model with limited multilingual understanding capabilities. The model weights are released under the Apache 2.0 license.

Mar 12, 2025View details →

Anthropic: Claude 3.5 Haiku

Anthropic

🔒 proprietary📏 200K ctx💰 $0.80

Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic tasks such as chat interactions and immediate coding suggestions. This makes it highly suitable for environments that demand both speed and precision, such as software development, customer service bots, and data management systems. This model is currently pointing to [Claude 3.5 Haiku (2024-10-22)](/anthropic/claude-3-5-haiku-20241022).

Nov 4, 2024View details →

Meta: Llama 3 8B Instruct

Mistral Large 3

Mistral

📊 123B🔓 open📏 128K ctx💰 $2

Mistral's flagship with strong multilingual support, function calling, and code generation. Open weights.

Jul 1, 2025View details →

💻

Kwaipilot: KAT-Coder-Pro V2

kwaipilot

🔒 proprietary📏 256K ctx💰 $0.30

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions, with a focus on large-scale production environments, multi-system coordination, and seamless integration across modern software stacks, while also supporting web aesthetics generation to produce production-grade landing pages and presentation decks.

Mar 27, 2026View details →

Upstage: Solar Pro 3

upstage

🔒 proprietary📏 128K ctx💰 $0.15

Solar Pro 3 is Upstage's powerful Mixture-of-Experts (MoE) language model. With 102B total parameters and 12B active parameters per forward pass, it delivers exceptional performance while maintaining computational efficiency. Optimized for Korean with English and Japanese support.

Jan 27, 2026View details →

Qwen: Qwen3 VL 32B Instruct

Alibaba