This service enables Large Language Models (LLMs) to "study" web pages.
By routing your OpenAI API chat completion requests through this service, you can enable the following workflow:
- From your chat interface, the `#study site.com` command allows you to crawl and process web pages.
- In subsequent chat conversations, relevant context from the studied pages is automatically incorporated into your prompts before they are sent to the OpenAI API.
This service intercepts OpenAI API calls specifically for chat completion requests, and only when the model name in the request matches the model specified by the `BIG_MODEL` variable.
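To make the workflow concrete, here is a rough sketch of the two steps using `curl`. The host, port, and `/v1/chat/completions` route are assumptions for illustration; in practice you point your chat interface's OpenAI API base URL at the agent service, and the model name must match `BIG_MODEL` (here `deepseek-r1:14b`, as in the example `.env` below).

```sh
# Sketch only: host, port, and route are assumptions; point this at wherever
# your chat interface normally sends its OpenAI API requests (the agent service).
AGENT_URL=http://localhost:8000

# Ask the agent to crawl and process a site.
curl "$AGENT_URL/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $API_KEY_OPENAI" \
  -d '{
    "model": "deepseek-r1:14b",
    "messages": [{"role": "user", "content": "#study site.com"}]
  }'

# Later questions sent through the same endpoint are enriched with relevant
# context from the studied pages before they reach the OpenAI API.
curl "$AGENT_URL/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $API_KEY_OPENAI" \
  -d '{
    "model": "deepseek-r1:14b",
    "messages": [{"role": "user", "content": "Summarize what site.com says about pricing."}]
  }'
```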
This project uses Docker Compose for easy local deployment.
To get started:
- Clone the repository:

  ```sh
  git clone https://github.com/raul3820/rag-agent.git
  ```
- Create an `.env` file in the repository root directory and define the following environment variables:

  ```
  POSTGRES_PASSWORD=...
  API_URL_OPENAI=http://ollama:11434  # or https://api.openai.com, or any other OpenAI-compatible API
  API_KEY_OPENAI=...
  BIG_MODEL=deepseek-r1:14b  # Model for general chat interactions
  SMALL_MODEL=llama3.2:3b  # Model for smart parsing, query generation, and tool utilization
  ```
- Run the services:

  You can choose to run only the essential services:

  ```sh
  docker compose up agent postgres infinity ...
  ```

  Alternatively, to run all services locally, use:

  ```sh
  docker compose up
  ```
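Once the containers are up, a quick sanity check is to list the running services and follow the agent's logs (using the same service names as in the compose commands above):

```sh
# Show the Compose services and their current status.
docker compose ps

# Follow the agent's logs to confirm it started and is accepting requests.
docker compose logs -f agent
```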
If you are on Ubuntu and want to use Docker with NVIDIA GPUs, follow these steps:
- Install Docker:

  Add Docker's official GPG key and apt repository:

  ```sh
  sudo apt-get update
  sudo apt-get install ca-certificates curl
  sudo install -m 0755 -d /etc/apt/keyrings
  sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg -o /etc/apt/keyrings/docker.asc
  sudo chmod a+r /etc/apt/keyrings/docker.asc

  echo \
    "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.asc] https://download.docker.com/linux/ubuntu \
    $(. /etc/os-release && echo "$VERSION_CODENAME") stable" | \
    sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
  ```

  Install Docker Engine:

  ```sh
  sudo apt-get update
  sudo apt-get install docker-ce docker-ce-cli containerd.io docker-compose-plugin
  ```

  Start and enable Docker:

  ```sh
  sudo systemctl start docker
  sudo systemctl enable docker
  ```
- Install NVIDIA Container Toolkit:

  For details, refer to the official NVIDIA Container Toolkit Installation Guide. Execute the following commands to install it:

  ```sh
  curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
    && curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
      sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
      sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list

  sudo apt-get update
  sudo apt-get install -y nvidia-container-toolkit
  ```
- Configure Docker to use the NVIDIA runtime:

  ```sh
  sudo nvidia-ctk runtime configure --runtime=docker
  sudo systemctl restart docker
  ```
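  The `nvidia-ctk` command registers an `nvidia` runtime entry in Docker's daemon configuration (by default `/etc/docker/daemon.json`). If you want to confirm the change took effect, inspect that file:

  ```sh
  # Print Docker's daemon configuration; after nvidia-ctk runs, it should
  # contain a "runtimes" -> "nvidia" entry.
  cat /etc/docker/daemon.json
  ```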
- Verify the NVIDIA Docker setup:

  ```sh
  sudo docker run --rm --runtime=nvidia --gpus all ubuntu nvidia-smi
  ```

  If everything is configured correctly, this prints the `nvidia-smi` table listing your GPU(s).
This is an open work in progress; feel free to fork it or propose changes.
This project was inspired by https://github.com/coleam00