Repositories list
42 repositories
LLMs-from-scratch (Public)
GOT-OCR2.0 (Public)
llm-awq (Public)
Awesome-LLM-Strawberry (Public)
CAG (Public)
mlc-llm (Public)
awesome-llm-apps (Public)
SPaR (Public)
kvpress (Public)
nano-sparse-attention (Public)
LLMSuperWeight (Public)
spiritlm (Public)
MMLU-Pro (Public)
Aria (Public)
tensorrtllm_backend (Public)
TensorRT-LLM (Public)
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Awesome-LLM (Public)
llm.c (Public)
Open-O1 (Public)
embeddedllm (Public)
OpenStrawberry (Public)
Awesome-LLM-Compression (Public)
MRAG (Public)
CheckEmbed (Public)
LongWriter (Public)
graph-of-thoughts (Public)
Awesome-Efficient-LLM (Public)
quivr (Public)
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users! Efficient retrieval augmented generation framework
OpenLLM (Public)
Awesome-Code-LLM (Public)