Florence-2-base-ft
Florence-2 is a powerful visual model for image description, detection, and segmentation, excelling in zero-shot and fine-tuned tasks.
What is Florence-2-base-ft
Florence-2-base-ft is an advanced visual foundation model developed by Microsoft. It excels in various vision and vision-language tasks using a sequence-to-sequence architecture. This model is adept at handling tasks like image description, object detection, and segmentation with high performance in zero-shot and fine-tuned settings. It leverages the FLD-5B dataset containing 54 billion annotations across 126 million images, making it a powerful tool for researchers and developers.