Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.
TableBank: A Benchmark Dataset for Table Detection and Recognition
binjie09 / chatgpt-web
Forked from Chanzhaoyu/chatgpt-web使用 express 和 vue3 搭建的 ChartGPT 演示网页
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
zoujuny / TableCell
Forked from weidafeng/TableCell在TableBank的基础上,进一步标注到单元格精度,利用目标检测/分割实现单元格定位。
Label Studio is a multi-type data labeling and annotation tool with standardized output format
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
量化资源和教程请访问 ke.quantide.cn 和公众号🫶Quantide
润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。
Describe past Kaggle solutions
Opencv4.0 with python (English&中文), and will keep the update ! 👊
LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
微调预训练语言模型(BERT、Roberta、XLBert等),用于计算两个文本之间的相似度(通过句子对分类任务转换),适用于中文文本
微调预训练语言模型,解决多标签分类任务(可加载BERT、Roberta、Bert-wwm以及albert等知名开源tf格式的模型)
各大网站逆向demo。企名片、震坤行工业超市、天翼云登录、物超所值、瓜子二手车、马蜂窝、中华诗词库、澳门彩票、药智网、福建省招标投标在线监管平台、全国公共资源交易平台、问卷星、中国人民银行条法司、中华人民共和国公安部、AqiStudy、巨量星图、HeyTap、掌上高考、船讯网、百度指数、今日头条、知乎、七麦数据、途牛、七猫小说、企查查、同花顺、网易云音乐、拉勾招聘、玩物得志、房天下
Jupyter Notebooks to help you get hands-on with Pinecone vector databases
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
自然语言处理NLP在中文文本上的一些应用,如文本分类、情感分析、命名实体识别等
jieba analysis plugin for elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0,5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.