Stars
tfw when you when your lid when uhh angle your lid sensor
Transparent Image Layer Diffusion using Latent Transparency
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
https://textbehindimage.rexanwong.xyz - create text behind image designs easily
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
A program synthesis agent that autonomously fixes its output by running tests!
Discomfort: Control ComfyUI with Python
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Pyth…
A minimal library that helps filter out NSFW images.
The ultimate training toolkit for finetuning diffusion models
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) …
Python script to download models from CIVITAI using an API key
The prototype of an online midi keyboard with various functionalities
Software that extracts line chart data from images.
A simple react app with a canvas to draw an image. Real time image-to-image inference using Stable Diffusion XL Turbo and Modal
A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
OneTrainer docker images for use in GPU cloud and local environments. Includes AI-Dock KDE Plasma desktop with GPU acceleration and audio for authentication and improved user experience.
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
[ECCV-2024] This is the official implementation of ZeST.