โ Back to all models
Efficient MoE model with 109B total parameters. Fits on a single H100 GPU while delivering strong performance.
#moe#open-weight#efficient
๐งฎ
109B
Parameters
๐
512K tokens
Context Window
๐
Open Source
License
๐
Apr 5, 2025
Released
โก Strengths
โFits on single GPU
โOpen weights
โ512K context
โCost-effective deployment
๐ฏ Use Cases
Self-hostingCost-sensitive appsFine-tuningEdge deployment
๐
API Available
This model is accessible via API for integration into your applications.
๐ ๏ธ Tools Built on Llama 4 Scout
โญ Related Models
๐ฌ
Claude 3.5 Sonnet
Anthropic
๐ฅ 85
The model that defined a generation. Fast, smart, and incredibly capable across coding, analysis, and creative tasks.
๐ proprietary๐ 200K ctx๐ฐ $3
Oct 22, 2024View details โ
๐ฌ
Claude 3.5 Haiku
Anthropic
๐ฅ 72
Ultra-fast and cost-effective. Best for high-volume tasks where speed matters more than peak intelligence.
๐ proprietary๐ 200K ctx๐ฐ $0.80
Nov 4, 2024View details โ
๐ฌ
GPT-4o Mini
OpenAI
๐ฅ 70
Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.
๐ proprietary๐ 128K ctx๐ฐ $0.15
Jul 18, 2024View details โ