โ Back to all models
Nvidia's optimized model for enterprise deployment. Tuned for NVIDIA hardware with TensorRT-LLM integration.
#nvidia-optimized#enterprise#tensorrt
๐งฎ
70B
Parameters
๐
128K tokens
Context Window
๐
Open Source
License
๐
Oct 1, 2024
Released
โก Strengths
โOptimized for NVIDIA GPUs
โTensorRT integration
โEnterprise-ready
โOpen weights
๐ฏ Use Cases
Enterprise AINVIDIA deploymentsInference optimizationFine-tuning
๐
API Available
This model is accessible via API for integration into your applications.
โญ Related Models
๐ฌ
Claude 3.5 Sonnet
Anthropic
๐ฅ 85
The model that defined a generation. Fast, smart, and incredibly capable across coding, analysis, and creative tasks.
๐ proprietary๐ 200K ctx๐ฐ $3
Oct 22, 2024View details โ
๐ฌ
Claude 3.5 Haiku
Anthropic
๐ฅ 72
Ultra-fast and cost-effective. Best for high-volume tasks where speed matters more than peak intelligence.
๐ proprietary๐ 200K ctx๐ฐ $0.80
Nov 4, 2024View details โ
๐ฌ
GPT-4o Mini
OpenAI
๐ฅ 70
Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.
๐ proprietary๐ 128K ctx๐ฐ $0.15
Jul 18, 2024View details โ