AI models with the largest context windows — 128K tokens and above.
Last updated: April 2026
Anthropic
Anthropic's most powerful reasoning model with extended thinking. Excels at complex analysis, multi-step math, advanced coding, and nuanced writing.
OpenAI
OpenAI's latest flagship. Advanced multimodal capabilities with native tool use, improved reasoning, and massive knowledge.
Balanced intelligence and speed. Strong reasoning with faster response times and lower cost than Opus.
Google
Google's most capable model with 1M+ token context, native multimodal understanding, and deep research capabilities.
OpenAI's most powerful reasoning model. Uses chain-of-thought to solve complex math, science, and coding problems.
Meta
Meta's flagship open-weight MoE model with 400B total parameters and 17B active. Strong multilingual and coding performance.
DeepSeek
671B MoE model with only 37B active parameters. Open-weight, excels at math, coding, and Chinese language tasks.
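The total-vs-active parameter split above comes from mixture-of-experts routing: a small gating network picks a few experts per token, so only those experts' weights run. Here is a minimal top-k routing sketch with toy sizes; the expert count, hidden size, and top-k value are illustrative assumptions, not the real model's configuration:

```python
import numpy as np

rng = np.random.default_rng(0)

N_EXPERTS = 8   # toy expert count (assumption, not the real model's)
TOP_K = 2       # experts activated per token
D = 16          # toy hidden size

# One tiny two-layer feed-forward "expert" per slot.
experts = [
    (rng.standard_normal((D, D)) * 0.1, rng.standard_normal((D, D)) * 0.1)
    for _ in range(N_EXPERTS)
]
gate = rng.standard_normal((D, N_EXPERTS)) * 0.1  # router weights


def moe_layer(x):
    """Route each token to its TOP_K highest-scoring experts."""
    logits = x @ gate                              # (tokens, N_EXPERTS)
    top = np.argsort(logits, axis=-1)[:, -TOP_K:]  # chosen expert ids
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        # Softmax over only the selected experts' scores.
        w = np.exp(logits[t, top[t]])
        w /= w.sum()
        for weight, e in zip(w, top[t]):
            w1, w2 = experts[e]
            out[t] += weight * (np.maximum(x[t] @ w1, 0.0) @ w2)
    return out, top


x = rng.standard_normal((4, D))
y, chosen = moe_layer(x)
# Only TOP_K of N_EXPERTS experts run per token (here 2/8 = 25% of expert
# parameters "active"), the same idea as the 37B-of-671B split above.
print(y.shape, chosen.shape)
```

Because each token touches only two of the eight experts, compute per token scales with active parameters rather than total parameters, which is why a 671B-parameter model can run at roughly 37B-parameter cost.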
Alibaba
Alibaba's latest open-weight model family with hybrid thinking modes. Strong across coding, math, and multilingual tasks.
Reasoning-specialized model trained with reinforcement learning. Shows chain-of-thought reasoning transparently.
The model that defined a generation. Fast, smart, and incredibly capable across coding, analysis, and creative tasks.
Ultra-fast multimodal model optimized for speed. Great balance of capability and latency.
Omni model supporting text, vision, and audio natively. Fast and capable with strong multimodal understanding.
Efficient MoE model with 109B total parameters. Fits on a single H100 GPU while delivering strong performance.
Moonshot AI
MoE model with over 1 trillion total parameters, strong long-context handling, and multi-step reasoning. Open weights, competitive with top proprietary models.
Cost-effective reasoning model. Provides strong reasoning capabilities at a fraction of o3's cost.
Reasoning-focused model that thinks step by step. Open-weight alternative to o1/o3 for reasoning tasks.
Perplexity
Available exclusively through the OpenRouter API, Sonar Pro Search is Perplexity's most advanced agentic search system and powers the Pro Search mode on the Perplexity platform. It adds autonomous, multi-step reasoning to Sonar Pro: instead of a single query plus synthesis, it plans and executes entire research workflows using tools, making it suited to deeper reasoning and analysis. Pricing is based on tokens plus $18 per thousand requests.
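The per-request pricing described above is simple to estimate. The $18 per thousand requests figure comes from the text; the token rates below are placeholders for illustration only, not Perplexity's actual prices:

```python
REQUEST_FEE_PER_1K = 18.00  # USD per 1,000 requests (from the text)
# Hypothetical token rates, for illustration only:
INPUT_PER_1M = 3.00         # USD per 1M input tokens (placeholder)
OUTPUT_PER_1M = 15.00       # USD per 1M output tokens (placeholder)


def estimate_cost(requests, input_tokens, output_tokens):
    """Estimated bill: token charges plus the flat per-request fee."""
    token_cost = (input_tokens / 1e6) * INPUT_PER_1M \
               + (output_tokens / 1e6) * OUTPUT_PER_1M
    request_cost = (requests / 1000) * REQUEST_FEE_PER_1K
    return token_cost + request_cost


# e.g. 500 searches averaging 2K input / 1K output tokens each:
print(round(estimate_cost(500, 500 * 2000, 500 * 1000), 2))  # prints 19.5
```

Note how the flat request fee dominates at small token counts, so batching work into fewer, larger requests can lower the effective cost under this pricing model.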
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon temporal reasoning, DeepStack for fine-grained visual-text alignment, and text-timestamp alignment for precise event localization. The model supports a native 256K-token context window, extensible to 1M tokens, and handles both static and dynamic media inputs for tasks like document parsing, visual question answering, spatial reasoning, and GUI control. It achieves text understanding comparable to leading LLMs while expanding OCR coverage to 32 languages and enhancing robustness under varied visual conditions.
Qwen Plus 0728, built on the Qwen3 foundation model, is a hybrid reasoning model with a 1-million-token context window that balances performance, speed, and cost.
Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual interpretation within images, and precise event localization in extended videos. Qwen2.5-VL-32B demonstrates state-of-the-art performance across multimodal benchmarks such as MMMU, MathVista, and VideoMME, while maintaining strong reasoning and clarity in text-based tasks like MMLU, mathematical problem-solving, and code generation.