👁️

Florence 2

Name: Florence 2
Author: Microsoft

Microsoft·Vision

🔥 58trending

Visit ↗

Foundation vision model for detection, segmentation, captioning, and OCR in a single unified model.

#detection#segmentation#ocr#open-weight

🧮

770M

Parameters

📏

N/A

Context Window

🔓

Open Source

License

📅

Jun 1, 2024

Released

⚡ Strengths

✓Unified vision tasks

✓Open weights

✓Small and fast

✓Good OCR

🎯 Use Cases

Object detectionImage captioningOCRVisual search

⭐ Related Models

💬

Phi-4

Microsoft

🔥 69

Small but mighty. Microsoft's compact model punches well above its weight class in reasoning and coding.

📊 14B🔓 open📏 16K ctx

Dec 12, 2024View details →

💬

Microsoft: Phi 4

Microsoft

🔥 63

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion parameters, it was trained on a mix of high-quality synthetic datasets, data from curated websites, and academic materials. It has undergone careful improvement to follow instructions accurately and maintain strong safety standards. It works best with English language inputs. For more information, please see [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)

🔒 proprietary📏 16K ctx💰 $0.07

Jan 10, 2025View details →

👁️

Mistral: Pixtral Large 2411

Mistral

🔥 71

Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is available under the Mistral Research License (MRL) for research and educational use, and the Mistral Commercial License for experimentation, testing, and production for commercial purposes.

🔒 proprietary📏 131K ctx💰 $2.00

Nov 19, 2024View details →