florence-2

Here are 35 public repositories matching this topic...

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

transformers vqa objectdetection captioning fine-tuning multimodal vision-and-language phi-3-vision paligemma florence-2 qwen2-vl

Updated Jul 23, 2025
Python

jhc13 / taggui

Star

Tag manager and captioner for image datasets

image-captioning image-tagging tag-manager pyside6 stable-diffusion llava cogvlm florence-2

Updated May 21, 2025
Python

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly PyQt6 interface.

dataset-creation inpainting watermark-remover lama-cleaner florence-2

Updated May 17, 2025
Python

autodistill / autodistill-grounded-sam-2

Star

Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.

grounded-sam autodistill florence-2 segment-anything-2

Updated Aug 7, 2024
Python

Ravi-Teja-konda / Surveillance_Video_Summarizer

Star

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.

video ai summarization gradio vlm vision-and-language huggingface surviellance gpt-4 chatgpt gradio-python-llm florence-2

Updated Jun 6, 2025
Python

anyantudre / Florence-2-Vision-Language-Model

Star

Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

computer-vision deep-learning huggingface vision-language vision-transformer vision-transformer-models vision-language-model florence-2

Updated Jul 3, 2024
Jupyter Notebook

Damarcreative / rem-wm

Sponsor

Star

Watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

watermark lama-cleaner florence-2

Updated Jan 28, 2025
Python

retkowsky / florence-2

Star

Florence-2

azure florence-2

Updated Feb 13, 2025
Jupyter Notebook

autodistill / autodistill-florence-2

Star

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

object-detection zero-shot-object-detection autodistill florence-2

Updated Aug 15, 2024
Python

sayedmohamedscu / Vision-language-models-VLM

Star

vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)

computer-vision medical-imaging lora vlm florence finetuning multimodal colab-notebook qlora finetune-llms paligemma florence-2 visionlanguage florence-finetuning medgemma

Updated Jul 5, 2025
Jupyter Notebook

fireicewolf / wd-llm-caption-cli

Star

A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.

image-caption wd14 llama3-vision florence-2 qwen2-vl joy-caption

Updated Mar 18, 2025
Python

Iteranya / AktivaAI

Sponsor

Star

Local LLM Discord Bot

ai chatbot discord-bot roleplay llama florence multimodal koboldcpp florence-2

Updated Jun 20, 2025
Python

jacobmarks / fiftyone_florence2_plugin

Star

Run SOTA Vision-Language Model Florence-2 on your data!

computer-vision ml transformer datacentric fiftyone-datasets vision-language-model florence-2

Updated Mar 27, 2025
Jupyter Notebook

mithunparab / text2segment_video

Star

Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.

raft video-summarization optical-flow segment-anything florence-2 sam2

Updated Feb 20, 2025
Python

regiellis / ecko-cli

Star

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and ONNX

cli ai image-processing image-classification onnxruntime huggingface-transformers generative-ai ecko florence-2 ecko-cli

Updated Jul 21, 2025
Python

nguyennpa412 / simple-multimodal-ai

Star

Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features

docker text-to-speech computer-vision gradio vlm visual-question-answering llm mllm vision-foundation-model image-text-to-text florence-2 xtts-v2 mini-internvl

Updated Aug 16, 2024
Python

Rm1n90 / Florence2Onnx

Star

ONNX deploys for Florence 2 visual multimodal

inference onnx onnxruntime onnxruntime-gpu florence-2

Updated Feb 11, 2025
Python

sitamgithub-MSIT / TextSnap

Star

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.

python artificial-intelligence optical-character-recognition gradio ocr-text-reader huggingface-transformers gradio-interface huggingface-spaces vision-language-model florence-2

Updated Apr 13, 2025
Python

PRITHIVSAKTHIUR / Florence-2-Image-Caption

Star

This application utilizes the powerful Florence-2 vision-language model from Microsoft to generate comprehensive captions for images. The model is capable of understanding visual content and expressing it in natural language.

image-processing transformers pillow torch image-captioning gradio florence huggingface timm vision-language-model florence-2

Updated Jul 11, 2025
Python

Ambruk-chan / DiscordBot

Star

The Ultimate Local LLM Discord Bot!!!

ai discord-bot roleplay llm koboldcpp gbnf florence-2

Updated Dec 6, 2024
Python

Improve this page

Add a description, image, and links to the florence-2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the florence-2 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

florence-2

Here are 35 public repositories matching this topic...

roboflow / maestro

jhc13 / taggui

D-Ogi / WatermarkRemover-AI

autodistill / autodistill-grounded-sam-2

Ravi-Teja-konda / Surveillance_Video_Summarizer

anyantudre / Florence-2-Vision-Language-Model

Damarcreative / rem-wm

retkowsky / florence-2

autodistill / autodistill-florence-2

sayedmohamedscu / Vision-language-models-VLM

fireicewolf / wd-llm-caption-cli

Iteranya / AktivaAI

jacobmarks / fiftyone_florence2_plugin

mithunparab / text2segment_video

regiellis / ecko-cli

nguyennpa412 / simple-multimodal-ai

Rm1n90 / Florence2Onnx

sitamgithub-MSIT / TextSnap

PRITHIVSAKTHIUR / Florence-2-Image-Caption

Ambruk-chan / DiscordBot

Improve this page

Add this topic to your repo