Stars
Stay on top of trending topics on social media and the web with AI
Demonstration of truss external packages feature
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Build cross-browser extensions with native HMR and zero-config setup
Real-time webcam demo with SmolVLM and llama.cpp server
Files storage explorer and photo gallery for Azure Storage Account
[NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning
Official Repository of OmniCaptioner
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Learn how to use Ocular Foundry to fine-tune or train powerful state-of-the-art models like YOLOv11 for real-time object detection, SAM 2 for image segmentation, Florence-2 for visual reasoning tas…
[CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
Open-Sora: Democratizing Efficient Video Production for All
zero-shot voice conversion & singing voice conversion, with real-time support
An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.
3D object detection using YOLO and depth estimation
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors
Empower the Web community and invite more to build across platforms.
real time face swap and one-click video deepfake with only a single image
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.