Stars
csabakecskemeti / llm_qlora
Forked from georgesung/llm_qloraFine-tuning LLMs using QLoRA
A new way to communicate with LLM by sharing a portion of your screen instead of typing.
Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.
This open source benchmarking framework allows you to build your own P2P learning algorithm and evaluate it in a simulated but realistic -- where you can model message delay, drop or churn -- netwo…
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
A library of reinforcement learning components and agents
Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research