Stars
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
An open source DevOps tool from the CNCF for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
Modeling, training, eval, and inference code for OLMo
21 Lessons, Get Started Building with Generative AI
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
📄🧠 PageIndex: Document Index for Reasoning-based RAG
A calculator to estimate the memory footprint, capacity, and latency on VMware Private AI with NVIDIA.
A lightweight data processing framework built on DuckDB and 3FS.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Documentation and best practices for using Cline
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Large Concept Models: Language modeling in a sentence representation space
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Synthetic data curation for post-training and structured data extraction
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
⏰ AI conference deadline countdowns
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LeonVouk / lighteval
Forked from huggingface/lightevalLightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
DSPy: The framework for programming—not prompting—language models
Speech to Text Alignment tool implemented with Python and Kaldi
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models