improve native embedder handling of large files #584
Merged
Pull Request Type
What is in this change?
Improve the native embedder to handle large files (>100K words) without the memory accrual that results in OOM on resource-constrained devices.
Additional Information
There could be a similar improvement for other models as well: the native model now relies on a local tmp write log for storing the response, while other models still keep the full embedding array in memory as they iterate (a rough sketch of the approach is shown below).
Note: this does not improve the speed at which native embeddings are generated; it only keeps very large file embeds from crashing the entire instance.
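The tmp write log idea can be sketched roughly as follows. This is a minimal TypeScript illustration under stated assumptions, not the actual native embedder code: the names (`embedChunk`, `embedLargeDocument`) and the JSONL log format are hypothetical, and the real implementation would call the local embedding pipeline where the stub is.

```ts
import * as fs from "fs";
import * as os from "os";
import * as path from "path";

// Hypothetical stand-in for the real local embedding call; returns a dummy vector.
async function embedChunk(textChunk: string): Promise<number[]> {
  return new Array<number>(384).fill(textChunk.length % 7);
}

// Embed a large document chunk-by-chunk, spilling each result to a tmp JSONL
// log instead of accumulating every vector in memory during the loop.
async function embedLargeDocument(chunks: string[]): Promise<number[][]> {
  const tmpFile = path.join(os.tmpdir(), `embed-${process.pid}-${Date.now()}.jsonl`);
  const log = fs.createWriteStream(tmpFile, { flags: "a" });

  for (const chunk of chunks) {
    const vector = await embedChunk(chunk);
    // One JSON array per line; the vector can be garbage-collected after this write.
    log.write(JSON.stringify(vector) + "\n");
  }
  await new Promise<void>((resolve) => log.end(resolve));

  // Single read-back at the end to build the response, then clean up the log.
  const embeddings = fs
    .readFileSync(tmpFile, "utf8")
    .trim()
    .split("\n")
    .map((line) => JSON.parse(line) as number[]);
  fs.unlinkSync(tmpFile);
  return embeddings;
}
```

The final read-back still assembles the full array once, but no intermediate vectors are held in memory while the model iterates over chunks, which is where the OOM on constrained devices occurred.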
Developer Validations
yarn lint from the root of the repo & committed changes