+
Skip to content
View xli-github's full-sized avatar

Highlights

  • Pro

Block or report xli-github

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ocr

7 repositories

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,719 677 Updated Feb 10, 2025

Get your documents ready for gen AI

Python 34,165 2,282 Updated Jul 11, 2025

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)

Python 430 34 Updated Jun 11, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,786 1,191 Updated Jul 11, 2025

A Python wrapper for Google Tesseract

Python 6,161 735 Updated Jun 23, 2025

Python tool for converting files and office documents to Markdown.

Python 60,408 3,188 Updated Jun 4, 2025

Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…

Python 51,505 8,416 Updated Jul 11, 2025
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载