Stars
Adult live stream downloader for advanced people. I could have chosen a better name.
Trigger Circle to Search on any Android 9–15 device
DigitalPlat FreeDomain: Free Domain For Everyone
A yt-dlp plugin that attempts to generate POT with the phantomjs Javascript Interpreter.
An open-source AI agent that brings the power of Gemini directly into your terminal.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
Automagically synchronize subtitles with video.
这个实时字幕翻译插件将OpenAI的ChatGPT API(或任何具有相同API调用方法的模型)集成到PotPlayer中。它使你在观看视频时能够实时翻译字幕,从而打破语言障碍,提升你的观看体验。 This real-time subtitle translation plugin integrates OpenAI's ChatGPT API (or any model with the …
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
Fast and accurate automatic speech recognition (ASR) for edge devices
(Experimental) Instagram live stream downloader
samuelchristlie / PyInstaLive
Forked from dvingerh/PyInstaLivePython script to download Instagram livestreams.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Multilingual Voice Understanding Model
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Groq-Powered Real-Time Voice Assistant
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
一款专注于Ai翻译的工具,一键自动翻译RPG SLG游戏,Epub TXT小说,Srt Vtt Lrc字幕,Word MD文档等等复杂长文本。
Automatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle'…
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…
CF-workers/pages代理脚本【Vless与Trojan】:支持nat64自动生成proxyip,一键自建proxyip与CF反代IP,CF优选官方IP三地区应用脚本,自动输出美、亚、欧最佳优选IP
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine