Stars
Model swapping for llama.cpp (or any local OpenAI API compatible server)
llama.cpp fork with additional SOTA quants and improved performance
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Get your documents ready for gen AI
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
🚀 The fast, Pythonic way to build MCP servers and clients
🤗 smolagents: a barebones library for agents that think in code.
Visual testing tool for MCP servers
⌛ easy to use progress-bar for command-line/terminal applications
Display images in the terminal
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
Concatenate a directory full of files into a single prompt for use with LLMs
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from vari…
Port of OpenAI's Whisper model in C/C++
Use your locally running AI models to assist you in your web browsing
Rembg is a tool to remove images background
A collection of common interactive command line user interfaces.
node.js command-line interfaces made easy
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Context management for long-context LLMs, agents, and vibe coding. Instantly build context for an entire repo, selected files, folders, and GitHub issues to generate structured AI-XML context with …
JS tokenizer for LLaMA 3 and LLaMA 3.1