这是indexloc提供的服务,不要输入任何密码
Skip to content

Conversation

@timothycarambat
Copy link
Member

resolves #298

@review-agent-prime
Copy link

collector/scripts/watch/convert/as_pdf.py

Instead of using a for loop to concatenate strings, it's more efficient and readable to use a list comprehension and then join the list into a single string. This is because strings in Python are immutable, so every time you concatenate strings, a new string is created. This can lead to significant overhead for large amounts of data.
Create Issue
See the diff
Checkout the fix

    page_content = ''.join([str(page.get_text('text')) for page in fitz.open(fullpath)])
git fetch origin && git checkout -b ReviewBot/Impro-mtmwd20 origin/ReviewBot/Impro-mtmwd20

@timothycarambat timothycarambat merged commit da0cec7 into master Dec 13, 2023
@timothycarambat timothycarambat deleted the 298-multi-lang-support branch December 13, 2023 19:54
cabwds pushed a commit to cabwds/anything-llm that referenced this pull request Jul 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

citation not encoding as UTF8?

2 participants