Stars
🔥[IJCAI 2022, Official Code] for paper "Rethinking Image Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. The first aesthetics assessment dataset, algorithm, and benchmark for multi-theme scenarios.
AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendl…
Watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.
🔥 CNN for Watermark Removal using Deep Image Prior with PyTorch 🔥.
Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
An early exploration that introduces Interleaving Reasoning to the text-to-image generation field and achieves SoTA benchmark performance. It also significantly improves the quality, fine-grain…
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, versi…
A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…
Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…
Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…
🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
Video Chain of Thought: code for the ICML 2024 paper "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
The official implementation of the paper "Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing"
Official Repository of Lumen: Consistent Video Relighting and Harmonious Background Replacement
[NOTE] I do not have enough resources to maintain VMS, please use Ostris's AI-Toolkit instead
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema