Highlights
- Pro
Stars
ShellCheck, a static analysis tool for shell scripts
Fully open reproduction of DeepSeek-R1
Python tool for converting files and office documents to Markdown.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Official inference repo for FLUX.1 models
[ICML 2024] See More Details: Efficient Image Super-Resolution by Experts Mining
You don't need an animation library to add a simple effect to your SwiftUI app. Create it yourself with SwiftUI. This repo inspires you to add helpful and expressive SwiftUI animations like loading…
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
💡 PseudoDiffusers: paper/code review and experimental findings related to computer vision generation and diffusion-based models
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Generate text line images for training deep learning OCR models
A command-line utility for taking automated screenshots of websites
Modeling, training, eval, and inference code for OLMo
Master the command line, in one page
A single-file library for working with Apple Live Photos
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
High-Resolution Image Synthesis with Latent Diffusion Models
A curated list of Best Artificial Intelligence Resources
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
A latent text-to-image diffusion model
A holistic way of understanding how WebRTC and its protocols run in practice, with code and detailed documentation.