GitHub - al-how/minima: On-premises conversational RAG with configurable containers

Minima is an open-source, on-premises RAG system that runs in containers and can integrate with ChatGPT and MCP. Minima can also be used as a fully local RAG.

Minima currently supports three modes:

Isolated installation – Operate fully on-premises with containers, free from external dependencies such as ChatGPT or Claude. All neural networks (LLM, reranker, embedding) run on your cloud or PC, so your data stays on infrastructure you control.
Custom GPT – Query your local documents from the ChatGPT app or web using custom GPTs. The indexer runs on your cloud or local PC, while the primary LLM remains ChatGPT.
Anthropic Claude – Use the Anthropic Claude app to query your local documents. The indexer runs on your local PC, while Anthropic Claude serves as the primary LLM.

Running as containers

Create a .env file in the project’s root directory, next to .env.sample, and copy all environment variables from .env.sample into it.
Ensure your .env file includes the following variables:

LOCAL_FILES_PATH
EMBEDDING_MODEL_ID
EMBEDDING_SIZE
OLLAMA_MODEL
RERANKER_MODEL
USER_ID
PASSWORD
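A quick way to catch a missing variable before starting the containers is to parse the file and compare it against the list above. The sketch below is illustrative only: `parse_env` and `missing_vars` are hypothetical helper names, not part of Minima, and the parser assumes simple KEY=VALUE lines with optional trailing `#` comments.

```python
REQUIRED_VARS = [
    "LOCAL_FILES_PATH",
    "EMBEDDING_MODEL_ID",
    "EMBEDDING_SIZE",
    "OLLAMA_MODEL",
    "RERANKER_MODEL",
    "USER_ID",
    "PASSWORD",
]


def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines; skip blanks and full-line comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop a trailing " # comment" (assumes values never contain " #").
        values[key.strip()] = value.split(" #", 1)[0].strip()
    return values


def missing_vars(env: dict, required=REQUIRED_VARS) -> list:
    """Return the required names that are absent or empty, in list order."""
    return [name for name in required if not env.get(name)]


# Made-up sample content, not a real configuration.
sample = """
LOCAL_FILES_PATH=/path/to/docs/
EMBEDDING_MODEL_ID=sentence-transformers/all-mpnet-base-v2
EMBEDDING_SIZE=768 # must match the model
"""
print(missing_vars(parse_env(sample)))
```

To check a real file, the same functions can be applied to the text read from .env. Not every mode needs every variable, so a shorter list can be passed as `required`.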

For a fully local installation, use: docker compose -f docker-compose-ollama.yml --env-file .env up --build
For a ChatGPT-enabled installation, use: docker compose -f docker-compose-chatgpt.yml --env-file .env up --build
For MCP integration (Anthropic Desktop app usage), use: docker compose -f docker-compose-mcp.yml --env-file .env up --build
For a ChatGPT-enabled installation, copy the OTP from the terminal where you launched Docker and use the Minima GPT.
If you use Anthropic Claude, add the following to ~/Library/Application\ Support/Claude/claude_desktop_config.json (macOS):

{
  "mcpServers": {
    "minima": {
      "command": "uv",
      "args": [
        "--directory",
        "/path_to_cloned_minima_project/mcp-server",
        "run",
        "minima"
      ]
    }
  }
}
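If the Claude config file already lists other MCP servers, editing it by hand risks overwriting them. The sketch below merges the "minima" entry shown above into an existing file. `add_minima_server` is a hypothetical helper, not something Minima ships, and the demo writes to a temporary file rather than the real config.

```python
import json
import tempfile
from pathlib import Path


def add_minima_server(config_path: Path, mcp_server_dir: str) -> dict:
    """Merge a "minima" entry into mcpServers, keeping any other servers."""
    config = {}
    if config_path.exists():
        text = config_path.read_text(encoding="utf-8")
        config = json.loads(text) if text.strip() else {}
    servers = config.setdefault("mcpServers", {})
    servers["minima"] = {
        "command": "uv",
        "args": ["--directory", mcp_server_dir, "run", "minima"],
    }
    config_path.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return config


# Demo against a throwaway file that already contains another server.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "claude_desktop_config.json"
    path.write_text('{"mcpServers": {"other": {"command": "x"}}}', encoding="utf-8")
    result = add_minima_server(path, "/path_to_cloned_minima_project/mcp-server")

print(sorted(result["mcpServers"]))
```

The directory argument is the same placeholder used in the JSON above and has to be replaced with the actual location of the cloned mcp-server folder.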

To use the fully local installation, run cd electron, then npm install and npm start, which launches the Minima Electron app.
Ask anything, and you'll get answers based on the local files in the {LOCAL_FILES_PATH} folder.
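The three docker compose commands in this section differ only in the compose file they name. As a rough sketch, a small wrapper could select the file by mode. The mode labels ("ollama", "chatgpt", "mcp") and the function name are assumptions made for this example; only the file names and flags come from the commands above.

```python
# Mode label -> compose file. The labels are hypothetical; the file names
# are the ones used in the commands above.
COMPOSE_FILES = {
    "ollama": "docker-compose-ollama.yml",    # fully local installation
    "chatgpt": "docker-compose-chatgpt.yml",  # ChatGPT-enabled installation
    "mcp": "docker-compose-mcp.yml",          # MCP integration
}


def compose_command(mode: str, env_file: str = ".env") -> list:
    """Build the docker compose argument list for the given mode."""
    if mode not in COMPOSE_FILES:
        raise ValueError(f"unknown mode {mode!r}; expected one of {sorted(COMPOSE_FILES)}")
    return [
        "docker", "compose",
        "-f", COMPOSE_FILES[mode],
        "--env-file", env_file,
        "up", "--build",
    ]


print(" ".join(compose_command("ollama")))
```

Passing the returned list to `subprocess.run(..., check=True)` from the project root would start the corresponding stack.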

Explanation of Variables:

LOCAL_FILES_PATH: Specify the root folder for indexing (on your cloud or local PC). Indexing is recursive, so all documents in subfolders of this root folder are indexed as well. Supported file types: .pdf, .xls, .docx, .txt, .md, .csv.
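To preview which files a recursive scan of this kind would pick up, the sketch below walks a folder and keeps only the extensions listed above. It is not Minima's indexer: `find_indexable_files` is a hypothetical name, and matching extensions case-insensitively is an assumption of this example.

```python
import tempfile
from pathlib import Path

# Extensions taken from the LOCAL_FILES_PATH description above.
SUPPORTED_EXTENSIONS = {".pdf", ".xls", ".docx", ".txt", ".md", ".csv"}


def find_indexable_files(root: Path) -> list:
    """Recursively collect files under root that have a supported extension."""
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    )


# Demo on a throwaway folder tree.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "sub" / "deeper").mkdir(parents=True)
    (root / "a.pdf").touch()
    (root / "sub" / "notes.MD").touch()
    (root / "sub" / "deeper" / "image.png").touch()
    found = [p.relative_to(root).as_posix() for p in find_indexable_files(root)]

print(found)
```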

EMBEDDING_MODEL_ID: Specify the embedding model to use. Currently, only Sentence Transformer models are supported. Testing has been done with sentence-transformers/all-mpnet-base-v2, but other Sentence Transformer models can be used.

EMBEDDING_SIZE: Define the embedding dimension provided by the model, which is needed to configure Qdrant vector storage. Ensure this value matches the actual embedding size of the specified EMBEDDING_MODEL_ID.
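One way to confirm that EMBEDDING_SIZE matches the model is to ask the model for its output dimension. The sketch below assumes the `sentence-transformers` package and its `get_sentence_embedding_dimension()` method. The helper names and `StubModel` are hypothetical; the stub only stands in for a real model so the comparison can be shown without downloading anything.

```python
def embedding_size_matches(model, expected_size: int) -> bool:
    """True if the model's output dimension equals the configured size."""
    return model.get_sentence_embedding_dimension() == expected_size


def check_model(model_id: str, expected_size: int) -> bool:
    """Load a Sentence Transformer model (downloads it) and compare sizes."""
    # Imported lazily so the comparison helper works without the package.
    from sentence_transformers import SentenceTransformer

    return embedding_size_matches(SentenceTransformer(model_id), expected_size)


class StubModel:
    """Stand-in that mimics the one method used above (illustration only)."""

    def __init__(self, dim: int):
        self._dim = dim

    def get_sentence_embedding_dimension(self) -> int:
        return self._dim


print(embedding_size_matches(StubModel(768), 768))
print(embedding_size_matches(StubModel(768), 384))
```

With the package installed, `check_model("sentence-transformers/all-mpnet-base-v2", 768)` would perform the same comparison against the model named in this README.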

OLLAMA_MODEL: Set the Ollama model, using an ID available on the Ollama site. Use an LLM here, not an embedding model.

RERANKER_MODEL: Specify the reranker model. Currently, we have tested with BAAI rerankers. You can explore all available rerankers using this link.

USER_ID: Use your email address here; it is needed to authenticate the custom GPT so it can search your data.

PASSWORD: Use any password here; it is used to create a Firebase account for the email specified above.

Text Chunking Configuration

Minima provides several options to customize how documents are split into chunks for indexing:

CHUNK_SIZE: Default chunk size (in characters) for text segmentation. Default: 500

CHUNK_OVERLAP: Number of characters that should overlap between chunks. Default: 200

AUTO_CHUNKING: When set to "true" (default), the system automatically uses optimized chunking strategies for different file types. Set to "false" to use a single strategy for all files.

DEFAULT_CHUNK_STRATEGY: Default strategy to use when AUTO_CHUNKING is disabled. Options:

default: Standard chunking based on paragraphs and sentences
markdown_aware: Optimized for Markdown documents, respects headers and structure
sentence_aware: Prioritizes keeping sentences intact
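To make the interaction between CHUNK_SIZE and CHUNK_OVERLAP concrete, here is a minimal fixed-window sketch. It is not one of Minima's strategies, which are described as paragraph-, sentence-, or Markdown-aware. It only illustrates how an overlap shortens the step between consecutive chunks, and `chunk_text` is a hypothetical name.

```python
def chunk_text(text: str, chunk_size: int = 500, chunk_overlap: int = 200) -> list:
    """Split text into windows of chunk_size characters that overlap."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - chunk_overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the current window already reaches the end of the text
    return chunks


print(chunk_text("abcdefghij", chunk_size=4, chunk_overlap=2))
```

With the documented defaults (500 and 200), each chunk in this scheme would start 300 characters after the previous one.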

File-specific chunk sizes:

MARKDOWN_CHUNK_SIZE: Specific chunk size for markdown (.md) files
PDF_CHUNK_SIZE: Specific chunk size for PDF files
DOC_CHUNK_SIZE: Specific chunk size for Word documents (.doc, .docx)

When AUTO_CHUNKING is enabled, the system applies these optimizations:

Markdown files (.md): Uses header-aware chunking that respects document structure
PDF files (.pdf): Uses paragraph-focused chunking with custom chunk size
Word documents (.doc, .docx): Uses paragraph-focused chunking with custom chunk size
Text files (.txt): Uses sentence-focused chunking to preserve meaning
Data files (.csv, .xls, .xlsx): Uses row-based chunking to keep data rows together
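The selection rules above can be summarized as a lookup from file extension to strategy and chunk size. The sketch below is an illustration under several assumptions: the labels "paragraph" and "row" are invented for this example, mapping .txt to sentence_aware is a guess based on the description, and applying file-specific sizes regardless of AUTO_CHUNKING is not confirmed by the text. `chunk_settings` is a hypothetical helper.

```python
import os
from pathlib import Path

# Extension -> strategy label used when AUTO_CHUNKING is enabled.
# "paragraph" and "row" are made-up labels for this sketch.
AUTO_STRATEGIES = {
    ".md": "markdown_aware",
    ".pdf": "paragraph",
    ".doc": "paragraph",
    ".docx": "paragraph",
    ".txt": "sentence_aware",
    ".csv": "row",
    ".xls": "row",
    ".xlsx": "row",
}

# Extension -> environment variable holding a file-specific chunk size.
SIZE_VARS = {
    ".md": "MARKDOWN_CHUNK_SIZE",
    ".pdf": "PDF_CHUNK_SIZE",
    ".doc": "DOC_CHUNK_SIZE",
    ".docx": "DOC_CHUNK_SIZE",
}


def chunk_settings(filename: str, env=None) -> tuple:
    """Return (strategy, chunk_size) for a file, read from env-style settings."""
    env = os.environ if env is None else env
    ext = Path(filename).suffix.lower()

    auto = env.get("AUTO_CHUNKING", "true").lower() == "true"
    fallback = env.get("DEFAULT_CHUNK_STRATEGY", "default")
    strategy = AUTO_STRATEGIES.get(ext, fallback) if auto else fallback

    # Use the file-specific size if set, otherwise CHUNK_SIZE (default 500).
    default_size = env.get("CHUNK_SIZE", "500")
    size = int(env.get(SIZE_VARS.get(ext, "CHUNK_SIZE"), default_size))
    return strategy, size


settings = {"CHUNK_SIZE": "1000", "MARKDOWN_CHUNK_SIZE": "1500", "PDF_CHUNK_SIZE": "800"}
for name in ("notes.md", "report.pdf", "data.csv"):
    print(name, chunk_settings(name, settings))
```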

Example of .env file for on-premises/local usage:

LOCAL_FILES_PATH=/Users/davidmayboroda/Downloads/PDFs/
EMBEDDING_MODEL_ID=sentence-transformers/all-mpnet-base-v2
EMBEDDING_SIZE=768
OLLAMA_MODEL=qwen2:0.5b # must be LLM model id from Ollama models page
RERANKER_MODEL=BAAI/bge-reranker-base # please, choose any BAAI reranker model

# Chunking configuration (optional)
CHUNK_SIZE=1000 # increase default chunk size to 1000 characters
CHUNK_OVERLAP=200
AUTO_CHUNKING=true # automatically use optimized strategies per file type
DEFAULT_CHUNK_STRATEGY=default # only used when AUTO_CHUNKING=false
MARKDOWN_CHUNK_SIZE=1500 # larger chunks for markdown files
PDF_CHUNK_SIZE=800 # custom size for PDFs

To use the chat UI, navigate to http://localhost:3000

Example of .env file for Claude app:

LOCAL_FILES_PATH=/Users/davidmayboroda/Downloads/PDFs/
EMBEDDING_MODEL_ID=sentence-transformers/all-mpnet-base-v2
EMBEDDING_SIZE=768

For the Claude app, please apply the changes to the claude_desktop_config.json file as outlined above.

Example of .env file for ChatGPT custom GPT usage:

LOCAL_FILES_PATH=/Users/davidmayboroda/Downloads/PDFs/
EMBEDDING_MODEL_ID=sentence-transformers/all-mpnet-base-v2
EMBEDDING_SIZE=768
USER_ID=user@gmail.com # your real email
PASSWORD=password # any password you choose

You can also run Minima using run.sh.

Installing via Smithery (MCP usage)

To install Minima for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install minima --client claude

For MCP usage, make sure your local machine's Python version is >=3.10 and that 'uv' is installed.

Minima (https://github.com/dmayboroda/minima) is licensed under the Mozilla Public License v2.0 (MPLv2).
