wuxmax
💭
boing boing
  • Berlin


Get your documents ready for gen AI

Python 41,866 2,989 Updated Oct 17, 2025

A fast, local neural text to speech system

C++ 10,151 840 Updated Aug 26, 2025

Python 2,502 317 Updated Oct 17, 2025

This repo is a fork of microsoft/markitdown with magika removed, so onnxruntime is not included.

Python 6 Updated Sep 12, 2025

Python tool for converting files and office documents to Markdown.

Python 81,831 4,565 Updated Sep 8, 2025

Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

Python 41,441 4,430 Updated Oct 18, 2025

Typed argument parser for Python

Python 595 45 Updated Oct 18, 2025

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 12,974 1,064 Updated Oct 15, 2025

PyMuPDF4LLM

Python 1,080 156 Updated Oct 1, 2025

A programming framework for agentic AI

Python 50,889 7,767 Updated Oct 8, 2025

Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.

Python 2,637 155 Updated Aug 10, 2024

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 44,790 6,457 Updated Oct 18, 2025

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 8,284 651 Updated Oct 16, 2025

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 14,813 1,110 Updated Oct 19, 2025

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

Rust 13,866 816 Updated Oct 17, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,358 80 Updated Sep 8, 2025

A Collection of BM25 Algorithms in Python

Python 1,251 99 Updated Oct 8, 2024

pgvector support for Python

Python 1,347 87 Updated Oct 10, 2025

Open-source vector similarity search for Postgres

C 17,997 921 Updated Sep 27, 2025

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Python 16,968 773 Updated Oct 16, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 70,349 10,301 Updated Oct 13, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 31,498 2,192 Updated Oct 16, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,064 3,090 Updated Oct 19, 2025

Structured Outputs

Python 12,718 642 Updated Oct 15, 2025

The #1 open-source voice interface for desktop, mobile, and ESP32 chips.

Python 5,092 540 Updated Nov 1, 2024

A natural language interface for computers

Python 60,661 5,196 Updated Aug 6, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 70,324 2,131 Updated Oct 19, 2025

RayLLM - LLMs on Ray (Archived). Read README for more info.

1,263 93 Updated Mar 13, 2025

A proxy server for multiple ollama instances with Key security

Python 506 82 Updated Oct 15, 2025