+
Skip to content
View songkq's full-sized avatar

Block or report songkq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥[IJCAI 2022, Official Code] for paper "Rethinking Image Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. 首个面向多主题场景的美学评估数据集、算法和benchmark.

Python 356 20 Updated Sep 25, 2025

实验室【外部】美学课题组入门学习材料,加入课题组后,会有更详细的内部学习资料。

63 3 Updated Sep 10, 2025

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendl…

Python 720 131 Updated Aug 1, 2025

Watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

Python 90 14 Updated Jan 28, 2025

🔥 CNN for Watermark Removal using Deep Image Prior with Pytorch 🔥.

Jupyter Notebook 1,093 167 Updated Oct 15, 2024

Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding

2 Updated Aug 14, 2025
Python 274 13 Updated Sep 15, 2025

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,114 35 Updated Oct 11, 2025

fast random access on videos over http

C++ 5 Updated Sep 3, 2025

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…

Python 8,864 684 Updated Oct 10, 2025

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

182 8 Updated Sep 10, 2025

[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Python 406 24 Updated Sep 18, 2025

This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark performance. It also significantly improves the quality, fine-grain…

Python 62 Updated Sep 14, 2025

The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, versi…

TypeScript 243 76 Updated Sep 17, 2025

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

14,483 1,509 Updated Sep 24, 2025

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 7,548 1,419 Updated May 26, 2025

Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…

JavaScript 6,871 680 Updated Sep 8, 2025
Python 236 16 Updated Apr 10, 2024

🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Python 147 2 Updated Sep 15, 2025

[CVPR' 25] Interleaved-Modal Chain-of-Thought

Python 89 4 Updated Oct 7, 2025

Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"

Python 167 7 Updated Feb 25, 2025

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 263 21 Updated May 9, 2025

Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types

Python 31 Updated Jul 16, 2025

The official implementation of the paper "Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing"

Python 16 Updated Oct 8, 2025
Python 676 12 Updated Sep 24, 2025

免费AI去水印在线工具汇总:一键去除图片和视频水印

253 15 Updated Apr 15, 2025

Official Repository of Lumen: Consistent Video Relighting and Harmonious Background Replacement

Python 47 3 Updated Aug 20, 2025

[NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead

Python 39 2 Updated Oct 3, 2025

Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema

TypeScript 2,281 224 Updated Aug 1, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载