这是indexloc提供的服务,不要输入任何密码
Skip to content

Support for RoAD: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability #2672

@ppetrushkov

Description

@ppetrushkov

Feature request

RoAD is a Parameter-efficient Fine-Tuning technique that is especially well-suited for efficient inference with mixed adapters in a batch, while still providing high output quality with very small parameter count.

RoAD learns 2D rotation matrices that are applied to the layer output which can be written using only element-wise multiplication (rather than matrix multiplication), enabling very fast inference with adapters in unmerged state. It is somewhat related to Orthogonal Finetuning (OFT) method.

Paper: https://arxiv.org/pdf/2409.00119
PEFT implementation: https://github.com/ppetrushkov/peft/tree/road (not from paper authors)

Your contribution

The PEFT implementation should be ready and tested. I created this feature request following the contributing guidelines, if the maintainers are not against including this method, I can proceed with creating a pull request and add the missing documentation and examples.

I also have a fork of VLLM where I implemented and tested the efficiency of this method which shows significantly better performance than LoRA https://github.com/ppetrushkov/vllm/tree/v0.9.1-road

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions