Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.

Python 56,940 8,836 Updated Oct 6, 2025

madmaze / pytesseract

A Python wrapper for Google Tesseract

Python 6,236 744 Updated Oct 6, 2025

heplex / douyin-rtmp

GUI tool for obtaining Douyin live-stream RTMP push URLs

Python 227 50 Updated Aug 20, 2025

putyy / res-downloader

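Downloader for common web resources: WeChat Channels, mini programs, Douyin, Kuaishou, Xiaohongshu, live streams, m3u8, KuGou, QQ Music, and more!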

Go 10,644 1,276 Updated Sep 23, 2025

cv-cat / DouYin_Spider

Douyin reverse engineering, Douyin crawler, all Douyin APIs, and live-room monitoring

JavaScript 696 168 Updated Jun 7, 2025

ape-byte / DouyinBarrageGrab

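A system-proxy-based program for capturing Douyin live comments (danmaku) over WSS. It can capture from all data sources, including Chrome and Douyin Live Companion, and supports process filtering.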

C# 1,371 211 Updated Apr 30, 2025

facefusion / facefusion

Industry leading face manipulation platform

Python 25,423 4,023 Updated Oct 10, 2025

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 4,992 813 Updated Jun 20, 2025

TMElyralab / MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Python 4,821 655 Updated Sep 26, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,472 288 Updated Aug 14, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 10,538 1,563 Updated Oct 9, 2025

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,459 950 Updated Oct 10, 2025

FFmpeg / FFmpeg

Mirror of https://git.ffmpeg.org/ffmpeg.git

C 53,796 13,109 Updated Oct 11, 2025

ShujiaHuang / Cpp-Primer-Plus-6th

Original book code, exercise answers, and personal notes for C++ Primer Plus, 6th Edition (Chinese edition); for learning and exchange only.

C++ 2,951 599 Updated Mar 10, 2025

voicepaw / so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Python 9,144 1,220 Updated Oct 11, 2025

jianchang512 / gptsovits-api

API interface for calling GPT-SoVITS

Python 314 39 Updated Mar 7, 2024

murang / potato

Tiny game server framework written in Golang

C# 50 9 Updated Oct 11, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 51,462 5,660 Updated Sep 10, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,390 1,959 Updated Oct 9, 2025

Elektordi / obs-websocket-py

Python library to communicate with an obs-websocket server (for OBS Studio)

Python 263 62 Updated Aug 9, 2024

obsproject / obs-websocket

Remote-control of OBS Studio through WebSocket

C++ 4,166 733 Updated Sep 17, 2025

Breakthrough / PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,230 462 Updated Oct 5, 2025

0x90d / videoduplicatefinder

Video Duplicate Finder - Crossplatform

C# 2,622 237 Updated Sep 30, 2025

fawdlstty / FawCourse_FFmpeg

FFmpeg tutorial (non-command-line usage)

C++ 260 68 Updated Dec 5, 2022

heifengli001 / video-de-clip-moviepy

A program that uses moviepy to edit videos for deduplication

Python 65 7 Updated May 18, 2024

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 13,199 1,419 Updated Oct 10, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 37,946 4,108 Updated Jul 6, 2025

thingsboard / thingsboard-gateway

Open-source IoT Gateway - integrates devices connected to legacy and third-party systems with ThingsBoard IoT Platform using Modbus, CAN bus, BACnet, BLE, OPC-UA, MQTT, ODBC and REST protocols

Python 1,986 941 Updated Oct 9, 2025

leoluz / nvim-dap-go

An extension for nvim-dap providing configurations for launching go debugger (delve) and debugging individual tests

Lua 578 92 Updated Jul 11, 2025

quii / learn-go-with-tests

Learn Go with test-driven development

Go 23,213 2,904 Updated Aug 21, 2025

x14n XianCH

Starred repositories

Spring Boot

Java