- Shanghai
-
18:31
(UTC +08:00) - https://cr7258.github.io
Lists (5)
Sort Name ascending (A-Z)
Stars
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.
derive(Error) for struct and enum error types
GPU Programming with C++ and CUDA, published by Packt
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
Free and Open Source, Distributed, RESTful Search Engine
Ultrafast serverless GPU inference, sandboxes, and background jobs
The glamourous AI coding agent for your favourite terminal 💘
Local development against a remote Kubernetes or OpenShift cluster
Professionally crafted React & Figma components for building beautiful products or starting your own design system
The official Go SDK for Model Context Protocol servers and clients. Maintained in collaboration with Google.
Supercharge Your LLM with the Fastest KV Cache Layer
Run cloud native workloads on NVIDIA GPUs
Offline optimization of your disaggregated Dynamo graph
A cd command that learns - easily navigate directories from the command line
List of projects that provide terminal user interfaces
A modular graph-based Retrieval-Augmented Generation (RAG) system
Python composable command line interface toolkit
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
A Datacenter Scale Distributed Inference Serving Framework
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
This repository contains examples for Elastic Observability