This repo provides Retrieval-Augmented Generation (RAG) over WOO documents using Apache Solr for vector search. It retrieves the most relevant chunks from Solr and lets an LLM generate the final answer.
See the Solr demo with a small test set in `notebooks/docs_to_solr.ipynb`.
- Python 3.10+
- Install dependencies (with `uv`):

  ```bash
  # install uv if needed
  curl -LsSf https://astral.sh/uv/install.sh | sh
  # create venv and install project deps
  uv sync
  ```

  If you run in Jupyter, also install `ipywidgets` for nicer progress bars:

  ```bash
  uv add ipywidgets
  ```
- Credentials are passed per request to the API or function calls. Do not store Solr or Fireworks secrets in `.env`.
- Optional environment variables for non-secrets:

  ```bash
  LOG_LEVEL=INFO
  TOP_K=5
  MODEL_NAME=intfloat/multilingual-e5-small
  LLM_NAME=llama-v3p3-70b-instruct
  ```
Prompts are loaded from `src/prompts/prompts.yaml`. The default chat model is `llama-v3p3-70b-instruct` if `LLM_NAME` is not set.
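These settings are plain environment lookups with the defaults listed above. A minimal sketch of how they could be resolved (the variable names and defaults come from this README; the actual loading code in `src/` may differ):

```python
import os

# Non-secret settings with the defaults documented in this README.
# Illustrative only -- the real loading code in src/ may differ.
LOG_LEVEL = os.getenv("LOG_LEVEL", "INFO")
TOP_K = int(os.getenv("TOP_K", "5"))
MODEL_NAME = os.getenv("MODEL_NAME", "intfloat/multilingual-e5-small")
LLM_NAME = os.getenv("LLM_NAME", "llama-v3p3-70b-instruct")  # default chat model
```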
From Python (run from repo root):

```python
from src.rag import rag

result = rag(
    "Wat is er besproken over afval?",
    solr_url="http://<solr-host>/solr/<chunks_collection>",
    solr_username="<username>",
    solr_password="<password>",
    fireworks_api_key="<fireworks_api_key>",
)
print(result["answer"])   # Final answer
print(result["sources"])  # Attributed sources
```
Notes:

- Retrieval queries Solr (KNN over the `emb` field) and then calls the LLM using prompts from `src/prompts/prompts.yaml`; a sketch of the underlying KNN query follows below.
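For reference, a Solr 9 KNN query over a dense vector field named `emb` looks roughly like the sketch below. The `{!knn}` query parser and the JSON Request API are standard Solr; embedding the query with `sentence-transformers` and the default `MODEL_NAME` is an assumption about how `src/embeddings.py` works, and the `<...>` placeholders mirror the ones used elsewhere in this README:

```python
import requests
from sentence_transformers import SentenceTransformer

# Assumption: queries are embedded with the default MODEL_NAME from this README.
# The "query: " prefix follows the e5 model family's convention.
model = SentenceTransformer("intfloat/multilingual-e5-small")
vector = model.encode("query: Wat is er besproken over afval?").tolist()

# Standard Solr {!knn} query against the dense vector field `emb`.
resp = requests.post(
    "http://<solr-host>/solr/<chunks_collection>/select",
    auth=("<username>", "<password>"),
    json={
        "query": f"{{!knn f=emb topK=5}}{vector}",
        "fields": "id,text,score",
    },
)
for doc in resp.json()["response"]["docs"]:
    print(doc["id"], doc["score"])
```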
Run the FastAPI server (from repo root):

```bash
uv run -- uvicorn app:app --reload --app-dir .
```
Call the endpoint (credentials in the request body):

```bash
curl -X POST http://localhost:8000/process \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Wat is er besproken over afval?",
    "fireworks_api_key": "<fireworks_api_key>",
    "solr_url": "http://<solr-host>/solr/<chunks_collection>",
    "solr_username": "<username>",
    "solr_password": "<password>"
  }'
```
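The same call from Python, for scripting against the API (a small sketch; the payload fields mirror the curl example above, and the `answer`/`sources` keys in the response are an assumption based on what `rag()` returns):

```python
import requests

payload = {
    "text": "Wat is er besproken over afval?",
    "fireworks_api_key": "<fireworks_api_key>",
    "solr_url": "http://<solr-host>/solr/<chunks_collection>",
    "solr_username": "<username>",
    "solr_password": "<password>",
}
resp = requests.post("http://localhost:8000/process", json=payload, timeout=120)
resp.raise_for_status()
body = resp.json()
print(body["answer"])   # assumed to mirror rag()'s "answer" key
print(body["sources"])  # assumed to mirror rag()'s "sources" key
```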
For incremental tokens, use the streaming endpoint, which emits NDJSON lines:

```bash
curl -N -X POST http://localhost:8000/process_stream \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Wat is er besproken over afval?",
    "fireworks_api_key": "<fireworks_api_key>",
    "solr_url": "http://<solr-host>/solr/<chunks_collection>",
    "solr_username": "<username>",
    "solr_password": "<password>"
  }'
```
Behavior (see the consumer sketch after this list):

- Each line is a JSON object ending with a newline
- Lines look like `{"delta":"..."}` for partial content
- The last line is `{"event":"done"}`
Build and run the API server with Docker:

```bash
docker build -t docusearch-api .
docker run --rm -p 8000:8000 docusearch-api
```

Or with Docker Compose (also mounts a Hugging Face cache volume):

```bash
docker compose up --build
```
Then call the API as shown above. Credentials are always provided in the request payload; they are not read from container environment variables.
- `notebooks/docs_to_solr.ipynb`: step-by-step Solr ingestion and KNN search
```text
app.py                    # FastAPI server
src/
  rag.py                  # RAG entrypoints (sync + streaming, session-aware variants)
  llm.py                  # Fireworks LLM helpers and JSON parsing
  embeddings.py           # Solr KNN search utilities
  prompts/prompts.yaml    # Prompt templates
notebooks/                # Demos and ingestion
Dockerfile
docker-compose.yml
pyproject.toml
```
- Ensure Solr is reachable and the `emb` vector field is configured for KNN (see the reachability check below)
- If LLM calls fail, verify the Fireworks API key used in your request payload
- If the embedding model fails to load, verify `MODEL_NAME` and network access for model weights