Foundation vision model for detection, segmentation, captioning, and OCR in a single unified model.
770M
Parameters
N/A
Context Window
Open Source
License
Jun 1, 2024
Released
โก Strengths
๐ฏ Use Cases
โญ Related Models
Phi-4
Microsoft
Small but mighty. Microsoft's compact model punches well above its weight class in reasoning and coding.
Microsoft: Phi 4
Microsoft
[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion parameters, it was trained on a mix of high-quality synthetic datasets, data from curated websites, and academic materials. It has undergone careful improvement to follow instructions accurately and maintain strong safety standards. It works best with English language inputs. For more information, please see [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)
Mistral: Pixtral Large 2411
Mistral
Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is available under the Mistral Research License (MRL) for research and educational use, and the Mistral Commercial License for experimentation, testing, and production for commercial purposes.