+
Skip to content
@swiss-ai

swiss-ai

Popular repositories Loading

  1. mmore mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets …

    Python 148 30

  2. apertus-tech-report apertus-tech-report Public

    Tech Report of the Apertus LLM Suite

    118 4

  3. pretrain-data pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    Python 94 4

  4. Megatron-LM Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 33 13

  5. MoE MoE Public

    some mixture of experts architecture implementations

    Python 22 3

  6. parity-aware-bpe parity-aware-bpe Public

    Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [arXiv 2025]

    Python 15 3

Repositories

Showing 10 of 50 repositories
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    swiss-ai/Megatron-LM’s past year of commit activity
    Python 33 3,226 6 18 Updated Oct 14, 2025
  • Pai-Megatron-Patch Public Forked from alibaba/Pai-Megatron-Patch

    The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

    swiss-ai/Pai-Megatron-Patch’s past year of commit activity
    Python 0 Apache-2.0 200 0 0 Updated Oct 14, 2025
  • verl Public Forked from volcengine/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    swiss-ai/verl’s past year of commit activity
    Python 0 Apache-2.0 2,261 0 0 Updated Oct 10, 2025
  • pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    swiss-ai/pretrain-data’s past year of commit activity
    Python 94 Apache-2.0 4 0 1 Updated Oct 9, 2025
  • swiss-ai/reasoning_getting-started’s past year of commit activity
    Shell 1 0 0 0 Updated Oct 8, 2025
  • mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!

    swiss-ai/mmore’s past year of commit activity
    Python 148 Apache-2.0 30 23 8 Updated Oct 8, 2025
  • swiss-ai/model-spinning’s past year of commit activity
    Python 7 2 0 0 Updated Oct 8, 2025
  • swiss-ai/posttraining-data’s past year of commit activity
    Python 1 0 1 0 Updated Sep 30, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    swiss-ai/lm-evaluation-harness’s past year of commit activity
    Python 2 MIT 2,798 0 2 Updated Sep 25, 2025
  • pretrain-code Public

    Pretraining codebase for Apertus models, based on Megatron-LM

    swiss-ai/pretrain-code’s past year of commit activity
    Shell 13 Apache-2.0 2 0 0 Updated Sep 25, 2025

Most used topics

Loading…

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载