+
Skip to content
Change the repository type filter

All

    Repositories list

    • MinerU

      Public
      A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
      Python
      GNU Affero General Public License v3.0
      3.2k39k1136Updated Jul 8, 2025Jul 8, 2025
    • Vis3

      Public
      OSS browser based on s3. 一个基于 S3 的数据(json / jsonl / html / md等)可视化工具。👇 Try online.
      TypeScript
      Apache License 2.0
      43601Updated Jul 8, 2025Jul 8, 2025
    • labelU

      Public
      Data annotation toolbox supports image, audio and video data.
      Python
      Apache License 2.0
      1321.3k231Updated Jul 8, 2025Jul 8, 2025
    • Data annotation component library --provided as NPM packages
      TypeScript
      Apache License 2.0
      3511121Updated Jul 8, 2025Jul 8, 2025
    • [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
      Python
      Apache License 2.0
      50590312Updated Jul 4, 2025Jul 4, 2025
    • 2400Updated Jul 2, 2025Jul 2, 2025
    • datasets resource
      1111730Updated Jul 1, 2025Jul 1, 2025
    • .github

      Public
      2100Updated Jul 1, 2025Jul 1, 2025
    • OHR-Bench

      Public
      (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
      Python
      127910Updated Jul 1, 2025Jul 1, 2025
    • [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
      Python
      Apache License 2.0
      25640Updated Jun 27, 2025Jun 27, 2025
    • UniMERNet

      Public
      UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
      Python
      Apache License 2.0
      31374232Updated Jun 16, 2025Jun 16, 2025
    • [ACL 2025] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models"
      Python
      0400Updated Jun 15, 2025Jun 15, 2025
    • LEGION

      Public
      The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"
      Python
      44240Updated Jun 11, 2025Jun 11, 2025
    • FakeVLM

      Public
      FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis
      Python
      Apache License 2.0
      25460Updated Jun 11, 2025Jun 11, 2025
    • ProverGen

      Public
      [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation"
      Python
      42900Updated Jun 11, 2025Jun 11, 2025
    • HTML
      0000Updated May 16, 2025May 16, 2025
    • PM4Bench

      Public
      Python
      01300Updated May 16, 2025May 16, 2025
    • GRAIT

      Public
      [NAACL25 findings] Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation
      Python
      0100Updated Apr 28, 2025Apr 28, 2025
    • DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
      Python
      GNU Affero General Public License v3.0
      1131.4k273Updated Apr 14, 2025Apr 14, 2025
    • UrBench

      Public
      [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
      Python
      Other
      53110Updated Apr 10, 2025Apr 10, 2025
    • Python
      Apache License 2.0
      0710Updated Mar 31, 2025Mar 31, 2025
    • LOKI

      Public
      [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
      Python
      415410Updated Mar 31, 2025Mar 31, 2025
    • Python
      Apache License 2.0
      4047690Updated Mar 13, 2025Mar 13, 2025
    • VHM

      Public
      VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
      Python
      Apache License 2.0
      69200Updated Feb 19, 2025Feb 19, 2025
    • LabelLLM

      Public
      The Open-Source Data Annotation Platform
      TypeScript
      Apache License 2.0
      91865141Updated Feb 19, 2025Feb 19, 2025
    • WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据规模均超过150GB
      MIT License
      13000Updated Feb 13, 2025Feb 13, 2025
    • A Comprehensive Toolkit for High-Quality PDF Content Extraction
      Python
      GNU Affero General Public License v3.0
      5938.1k879Updated Jan 3, 2025Jan 3, 2025
    • CRaFT

      Public
      [AAAI25] Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
      Python
      0400Updated Jan 1, 2025Jan 1, 2025
    • Miner-PDF-Benchmark

      Public archive
      MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
      Python
      Apache License 2.0
      52300Updated Dec 11, 2024Dec 11, 2024
    • ECCV2024_Parrot Captions Teach CLIP to Spot Text
      Python
      Apache License 2.0
      16630Updated Sep 6, 2024Sep 6, 2024
    点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载