+
Skip to content

🌍 Join the Pruna AI community!

Twitter GitHub LinkedIn Discord Reddit


💜 Simply make AI models faster, cheaper, smaller, greener!

Pruna AI makes AI models faster, cheaper, smaller, greener with the pruna package.

  • It supports various models including CV, NLP, audio, graphs for predictive and generative AI.
  • It supports various hardware including GPU, CPU, Edge.
  • It supports various compression algortihms including quantization, pruning, distillation, caching, recovery, compilation that can be combined together.
  • You can either play on your own with smash/compression configurations or let the smashing/compressing agent find the optimal configuration [Pro].
  • You can evaluate reliable quality and efficiency metrics of your base vs smashed/compressed models. You can set it up in minutes and compress your first models in few lines of code!

⏩ How to get started?

You can smash your own models by installing pruna with:

pip install pruna

You can start with simple notebooks to experience efficiency gains with:

Use Case Free Notebooks
3x Faster Stable Diffusion Models Smash for free
Making your LLMs 4x smaller Smash for free
Smash your model with a CPU only Smash for free
Transcribe 2 hours of audio in less than 2 minutes with Whisper Smash for free
100% faster Whisper Transcription Smash for free
Run your Flux model without an A100 Smash for free
x2 smaller Sana in action Smash for free

For more details about installation and tutorials, you can check the Pruna AI documentation.


Pinned Loading

  1. pruna pruna Public

    Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

    Python 912 68

Repositories

Showing 10 of 18 repositories
  • pruna Public

    Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

    PrunaAI/pruna’s past year of commit activity
    Python 912 Apache-2.0 68 44 (23 issues need help) 21 Updated Oct 17, 2025
  • VBench Public Forked from Vchitect/VBench

    Benchmarking Suite for Video Generation Evaluation

    PrunaAI/VBench’s past year of commit activity
    Python 0 Apache-2.0 77 0 0 Updated Oct 14, 2025
  • ai-efficiency-courses Public

    Courses on building, compressing, evaluating, and deploying efficient AI models.

    PrunaAI/ai-efficiency-courses’s past year of commit activity
    Jupyter Notebook 43 Apache-2.0 1 0 1 Updated Oct 6, 2025
  • awesome-ai-efficiency Public

    A curated list of materials on AI efficiency

    PrunaAI/awesome-ai-efficiency’s past year of commit activity
    173 MIT 15 0 1 Updated Oct 4, 2025
  • ComfyUI_pruna Public

    This is a ComfyUI node that integrates pruna

    PrunaAI/ComfyUI_pruna’s past year of commit activity
    Python 64 MIT 2 7 0 Updated Sep 8, 2025
  • runpod-worker-FLUX.1-dev Public Forked from runpod-workers/worker-sdxl

    RunPod worker for FLUX.1-dev

    PrunaAI/runpod-worker-FLUX.1-dev’s past year of commit activity
    Python 5 MIT 75 0 0 Updated Aug 19, 2025
  • HPSv2 Public Forked from tgxs002/HPSv2

    Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

    PrunaAI/HPSv2’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 22 0 0 Updated Aug 14, 2025
  • modal-example Public

    This repository shows how to smash Pruna models using Modal.

    PrunaAI/modal-example’s past year of commit activity
    Jupyter Notebook 2 Apache-2.0 1 0 0 Updated Aug 1, 2025
  • aws-example Public
    PrunaAI/aws-example’s past year of commit activity
    Jupyter Notebook 1 Apache-2.0 0 0 0 Updated Jun 23, 2025
  • diffusers Public Forked from huggingface/diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

    PrunaAI/diffusers’s past year of commit activity
    Python 0 Apache-2.0 6,490 0 0 Updated Jun 12, 2025

Most used topics

Loading…

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载