Stars
TaxoNERD: recognizing taxonomic entities using deep models
JanusGraph: an open-source, distributed graph database
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Hands-On Graph Neural Networks Using Python, published by Packt
A modular graph-based Retrieval-Augmented Generation (RAG) system
LLMs4OM: Matching Ontologies with Large Language Models
Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
A Unified Toolkit for Deep Learning Based Document Image Analysis
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
HiPlot makes understanding high dimensional data easy
Neo4j Graph Data Science with Graph ML & GNNs
Tools and utilities to enable loading data and building graph applications with Amazon Neptune.
Samples and documentation for using the Amazon Neptune graph database service
Multi-class Classification with fine-tuned BERT & GNN