
Conversation

@timothycarambat
Member

@timothycarambat timothycarambat commented May 21, 2024

resolves #1240

Pull Request Type

  • ✨ feat
  • πŸ› fix
  • ♻️ refactor
  • πŸ’„ style
  • πŸ”¨ chore
  • πŸ“ docs

Warning

This PR enables query mode to work with chat history, but only in the main app; behavior in the embed is unchanged.

In an effort to improve RAG results without adding unwarranted or "conditional" middleware like re-rankers, there is still plenty of room to improve the traditional RAG pipeline AnythingLLM uses. This PR adds the ability to backfill sources during a chat session.

What is backfilling?

Currently, AnythingLLM appends context snippets to the system prompt on each chat. These contexts come from the attached VectorStore's `.similaritySearch` method. While this works for direct questions, it does not scale well as chats become higher-context or more vague. That leads to a bad RAG experience in a traditional chat, and it is far worse in query mode, where a follow-up query that is contextually relevant gets deemed irrelevant.
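As a rough illustration of that per-message flow: a minimal sketch only. `.similaritySearch` is the one name taken from this PR; every other name and shape below is an assumption for illustration, not AnythingLLM's actual interface.

```ts
// Sketch of the traditional per-message RAG flow. Only `.similaritySearch`
// is named in the PR; all other identifiers here are stand-ins.
type Snippet = { text: string; sourceId: string };

interface VectorStore {
  similaritySearch(query: string, topN: number): Promise<Snippet[]>;
}

interface LLM {
  chat(systemPrompt: string, userPrompt: string): Promise<string>;
}

async function answerWithRag(
  store: VectorStore,
  llm: LLM,
  userPrompt: string,
  topN = 4
): Promise<string> {
  // Each message runs a fresh similarity search against the raw prompt only,
  // so a vague follow-up like "Tell me some features" may match nothing useful.
  const snippets = await store.similaritySearch(userPrompt, topN);
  const context = snippets.map((s) => s.text).join("\n---\n");
  const systemPrompt = `Answer using the context below.\n\nContext:\n${context}`;
  return llm.chat(systemPrompt, userPrompt);
}
```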

Example

prompt 1: "What is anythingllm?"

  • Possibly get 4 good sources and a good LLM response, since the model is operating with high context.

prompt 2: "Tell me some features"

  • This question is appropriate, topical, and a decent inquiry, but standalone it is quite vague when viewed outside the context of the chat.
  • Possibly get 0–1 maybe-relevant sources.
  • We then have to rely on whatever snippet may exist and hope the previous LLM response has something worth looking at.

Introduced in this PR

prompt 1: "What is anythingllm?"

  • Possibly get 4 good sources and a good LLM response, since the model is operating with high context.

prompt 2: "Tell me some features"

  • Possibly get 0–1 maybe-relevant sources.
  • We can now backfill the context window in this order of priority (see the sketch below):
    • Pinned documents
    • This query's search results
    • Chat-history citations, worked backwards, to fill the source window up to the workspace's assigned snippet window value

With this, we can now ensure that follow-up questions will be more relevant and contextually pertinent as a conversation continues.
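A minimal sketch of that backfill order, assuming simple shapes for sources and history; this is illustrative only, not the PR's actual implementation.

```ts
// Sketch of the source-backfill priority described above. Data shapes and
// function names are assumptions, not AnythingLLM's actual code.
type Source = { id: string; text: string };

function fillSourceWindow(
  pinnedDocs: Source[],        // 1. pinned documents always come first
  searchResults: Source[],     // 2. this query's similarity-search hits
  historyCitations: Source[],  // 3. citations from prior chats, newest first
  snippetWindow: number        // workspace's assigned snippet window value
): Source[] {
  const seen = new Set<string>();
  const window: Source[] = [];
  // Walk the tiers in priority order, deduplicating by source id, and stop
  // once the workspace's snippet window is full.
  for (const tier of [pinnedDocs, searchResults, historyCitations]) {
    for (const src of tier) {
      if (window.length >= snippetWindow) return window;
      if (seen.has(src.id)) continue;
      seen.add(src.id);
      window.push(src);
    }
  }
  return window;
}
```

Because the current query's search results sit above chat history in the tier order, a genuine topic change naturally displaces stale historical sources, which is exactly the behavior discussed next.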

"What if the contextual topic changes"?

  • This is handled as we prioritize sources from search results first, then backfill in by more recent chats with historical references
  • If you have a relevant contextual change in the chat, those sources will take precedence over the historical ones and will become the new historical reference

@shatfield4
Collaborator

LGTM

@timothycarambat timothycarambat merged commit 13fb639 into master May 23, 2024
@timothycarambat timothycarambat deleted the patch/rag-improvements-via-src-backfilling branch May 23, 2024 16:56
@Propheticus

Pulled the latest Docker image. Chat in query mode looks like it works:
[screenshot: query-mode chat answering a follow-up with no citations shown]
As designed, no citations are shown (because there were no hits on the last query), but an answer is still given based on earlier context in the conversation. πŸ’―

(also confirmed by checking LM Studio logs)

CrackerCat pushed a commit to CrackerCat/anything-llm that referenced this pull request Jul 31, 2024

* Improve RAG responses via source backfilling

* Hide irrelevant citations from UI
cabwds pushed a commit to cabwds/anything-llm that referenced this pull request Jul 3, 2025
* Improve RAG responses via source backfilling

* Hide irrelevant citations from UI
Linked issue: [CHORE]: Improve RAG retention via history context (#1240)