Stars
Open-source framework for conversational voice AI agents
💾 Create minimalist, Ubuntu based images for the Nvidia jetson boards
FreeRTOS kernel files only, submoduled into https://github.com/FreeRTOS/FreeRTOS and various other repos.
A simulation platform for versatile Embodied AI research and developments.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Designed to quickly operations like paste among mac/win/Linux.
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
🔊 Text-Prompted Generative Audio Model
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
A tool for generating fake code signing certificates or signing real ones
System Clipboard interfacing library in Rust
Vue drag-and-drop component based on Sortable.js
A lightweight, customizable Vue UI library for mobile web apps.
For the one who is finding a customizable chatbot UI.
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
OpenVideo specializes in the domain of text-to-video generation, with the goal of providing high-quality and diverse video datasets to AI researchers globally.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Parse sb3 blocks, and generate scratchblocks formatted code.
Make pictures of Scratch blocks from text.
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding