这是indexloc提供的服务,不要输入任何密码
Skip to content

[FEAT]: Add multilingual support for ocr mudule #3314

@doodle777

Description

@doodle777

What would you like to see?

Currently, OCRLoader only supports English and needs to add some multilingual support.

this.log(`Starting OCR of ${documentTitle}`);
      const startTime = Date.now();
      const { createWorker, OEM } = require("tesseract.js");
      worker = await createWorker("eng", OEM.LSTM_ONLY, {
        cachePath: this.cacheDir,
      });

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions