Stars
Migrate a project from Poetry/Pipenv/pip-tools/pip to uv package manager
👤🔄 | Face re-identification using FAISS, ArcFace & SCRFD | ONNX Runtime Inference
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM.
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
A framework for few-shot evaluation of language models.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Official inference framework for 1-bit LLMs
Production-ready Inference, Ingestion and Indexing built in Rust 🦀
A game theoretic approach to explain the output of any machine learning model.
A guidance language for controlling large language models.
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
Official Implementation of DINO-Foresight: Looking into the Future with DINO
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Open Source Data Annotation & Labeling Tools
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
🐍 Geometric Computer Vision Library for Spatial AI
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).