Best AI Reasoning Models

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

🔒 proprietary📏 1.0M ctx💰 $0.26

Sep 8, 2025View details →

🧠

Grok 3

xAI

🔥 77

xAI's conversational AI with real-time X/Twitter integration, strong reasoning, and a personality.

🔒 proprietary📏 128K ctx💰 $3

Feb 17, 2025View details →

🧠

Qwen: Qwen3 VL 235B A22B Thinking

Alibaba

🔥 76

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math. The series emphasizes robust perception (recognition of diverse real-world and synthetic categories), spatial understanding (2D/3D grounding), and long-form visual comprehension, with competitive results on public multimodal benchmarks for both perception and reasoning. Beyond analysis, Qwen3-VL supports agentic interaction and tool use: it can follow complex instructions over multi-image, multi-turn dialogues; align text to video timelines for precise temporal queries; and operate GUI elements for automation tasks. The models also enable visual coding workflows, turning sketches or mockups into code and assisting with UI debugging, while maintaining strong text-only performance comparable to the flagship Qwen3 language models. This makes Qwen3-VL suitable for production scenarios spanning document AI, multilingual OCR, software/UI assistance, spatial/embodied tasks, and research on vision-language agents.

🔒 proprietary📏 131K ctx💰 $0.26

Sep 23, 2025View details →

🧠

Sonar Pro

Perplexity

🔥 74

Perplexity's search-augmented model. Answers questions with real-time web data and source citations.

🔒 proprietary📏 200K ctx💰 $3

Feb 1, 2025View details →

🧠

Qwen: QwQ 32B

Alibaba

🔥 74

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.

🔒 proprietary📏 131K ctx💰 $0.15

Mar 5, 2025View details →

🧠

Anthropic: Claude 3.7 Sonnet (thinking)

Anthropic

🔥 73

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and extended, step-by-step processing for complex tasks. The model demonstrates notable improvements in coding, particularly in front-end development and full-stack updates, and excels in agentic workflows, where it can autonomously navigate multi-step processes. Claude 3.7 Sonnet maintains performance parity with its predecessor in standard mode while offering an extended reasoning mode for enhanced accuracy in math, coding, and instruction-following tasks. Read more at the [blog post here](https://www.anthropic.com/news/claude-3-7-sonnet)

🔒 proprietary📏 200K ctx💰 $3.00

Feb 24, 2025View details →

🧠

DeepSeek: R1 0528

DeepSeek

🔥 72

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully open-source model.

🔒 proprietary📏 164K ctx💰 $0.45

May 28, 2025View details →

🧠

AllenAI: Olmo 3.1 32B Think

allenai

🔥 71

Olmo 3.1 32B Think is a large-scale, 32-billion-parameter model designed for deep reasoning, complex multi-step logic, and advanced instruction following. Building on the Olmo 3 series, version 3.1 delivers refined reasoning behavior and stronger performance across demanding evaluations and nuanced conversational tasks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Think continues the Olmo initiative’s commitment to openness, providing full transparency across model weights, code, and training methodology.

🔒 proprietary📏 66K ctx💰 $0.15

Dec 16, 2025View details →

🧠

02

2026

🔥 68

![Qwen-Image-2.0: Professional infographics, exquisite photorealism](https://img.alicdn.com/imgextra/i1/O1CN01IKBGxr1EqAL8Px7tj_!!6000000000402-2-tps-1590-954.png)

🔓 open

View details →

🧠

03

2026

🔥 66

![Qwen3.5-Max-Preview Now Available on Arena](https://img.alicdn.com/imgextra/i1/O1CN01oNNHZp1u2RCpQ3Kyj_!!6000000005979-2-tps-1590-954.png)

🔓 open

View details →

🧠

Arcee AI: Trinity Large Thinking

arcee-ai

🔥 63

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. It is free in open claw for the first five days. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7

🔒 proprietary📏 262K ctx💰 $0.25

Apr 1, 2026View details →

🧠

Perplexity: Sonar Reasoning Pro

Perplexity

🔥 61

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for advanced use cases, it supports in-depth, multi-step queries with a larger context window and can surface more citations per search, enabling more comprehensive and extensible responses.

🔒 proprietary📏 128K ctx💰 $2.00

Mar 7, 2025View details →

Browse More

Text Generation Models Multimodal AI Models AI Models for Coding AI Image Generation Models AI Video Generation Models AI Audio & Speech Models Embedding Models for RAG Open Source AI Models Free AI Models & APIs Cheapest AI Model APIs Long Context AI Models