Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…

Python 51,494 8,415 Updated Jul 11, 2025

NVlabs / describe-anything

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,248 71 Updated Jun 26, 2025

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 6,004 554 Updated Jan 22, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 8,544 659 Updated May 29, 2025

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,159 1,126 Updated Jun 10, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 29,792 3,666 Updated Jul 23, 2024

AssafSinger94 / dino-tracker

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)

Python 495 46 Updated Nov 23, 2024

George-Zhuang / NetTrack

Official code for NetTrack [CVPR 2024]

Python 93 9 Updated Mar 17, 2024

NVIDIA / flownet2-pytorch

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Python 3,239 747 Updated May 28, 2023

nwojke / deep_sort

Simple Online Realtime Tracking with a Deep Association Metric

Python 5,801 1,543 Updated Mar 2, 2025

abewley / sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Python 4,206 1,127 Updated Nov 28, 2023

IDEA-Research / DINO-X-API

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,121 46 Updated Jun 20, 2025

freeCodeCamp / freeCodeCamp

freeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.

TypeScript 422,787 40,732 Updated Jul 12, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 58,832 8,199 Updated Jul 10, 2025

yformer / EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,372 152 Updated Dec 24, 2024

yformer / EfficientTAM

Efficient Track Anything

Python 586 20 Updated Jan 6, 2025

vivekhsridhar / tracktor

Python and OpenCV based object tracking software

Jupyter Notebook 123 43 Updated Sep 15, 2022

HumanSignal / awesome-data-labeling

A curated list of awesome data labeling tools

4,091 453 Updated Jun 17, 2024

cvat-ai / cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 14,043 3,282 Updated Jul 11, 2025

JonathonLuiten / TrackEval

HOTA (and other) evaluation metrics for Multi-Object Tracking (MOT).

Python 1,112 277 Updated Jul 3, 2024

obss / sahi

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,662 666 Updated Jul 13, 2025

facebookresearch / dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,975 962 Updated Jul 3, 2024

Ye Bi yebigithub

Starred repositories

OpenGVLab / DCNv4

mattjj / pyhsmm

Jiahao-Ma / MultiviewC

Anil-Bhujel / Public-Computer-Vision-Dataset-A-Systematic-Survey

SysCV / sam-hq

facebookresearch / vggt

yangchris11 / samurai

JaidedAI / EasyOCR

PaddlePaddle / PaddleOCR