Set up and run a local LLM and chatbot using consumer-grade hardware.
LLM Performance Testing: a tiny toolkit for load testing and benchmarking OpenAI-like inference endpoints using K6, Grafana, and InfluxDB.
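A minimal k6 load-test script for an OpenAI-like chat endpoint might look like the sketch below; the endpoint URL, model name, and load profile are placeholder assumptions, not values from the toolkit itself. (k6 scripts run under the k6 runtime, not Node.)

```javascript
import http from 'k6/http';
import { check } from 'k6';

// Placeholder load profile: 5 virtual users for 30 seconds.
export const options = { vus: 5, duration: '30s' };

export default function () {
  // Hypothetical OpenAI-compatible endpoint and model name; adjust for your setup.
  const res = http.post(
    'http://localhost:8000/v1/chat/completions',
    JSON.stringify({
      model: 'my-model',
      messages: [{ role: 'user', content: 'Hello' }],
    }),
    { headers: { 'Content-Type': 'application/json' } }
  );
  // Basic correctness check; k6 reports latency percentiles automatically.
  check(res, { 'status is 200': (r) => r.status === 200 });
}
```

Run with `k6 run script.js`; pointing k6's output at InfluxDB lets Grafana chart the results.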
A proxy for vLLM that exposes token usage metrics.
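One way such a proxy might derive per-request metrics is by reading the `usage` block of each OpenAI-compatible response body (which vLLM emits in non-streaming mode). A small sketch under that assumption; `usageMetrics` is a hypothetical helper, not part of any listed project:

```javascript
// Extract the token-usage counters from an OpenAI-style completion response.
// Field names follow the OpenAI-compatible schema that vLLM also produces.
function usageMetrics(response) {
  const usage = response.usage || {};
  return {
    prompt_tokens: usage.prompt_tokens || 0,
    completion_tokens: usage.completion_tokens || 0,
    total_tokens: usage.total_tokens || 0,
  };
}

// Example: a proxy would call this on each upstream response body
// before forwarding it, accumulating the counts into its metrics store.
const sample = {
  usage: { prompt_tokens: 12, completion_tokens: 34, total_tokens: 46 },
};
console.log(usageMetrics(sample));
```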