The great deduplication #2771
Conversation
For now, no deprecation
- set_adapter not touched yet
- went from 10739 to 8494 duplicated lines (quick check)
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Make it deal with quantized weights by default, so that special treatment in certain PEFT methods can be removed.
While working on this, I also deduplicated almost identical TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING constants by copying them from LoRA and only overriding a few values that differ. Moreover, some PEFT methods didn't have their own TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING but used the one from LoRA instead. They now each have their own constant, which is a copy from the one from LoRA.
It's only used there, no need to have it on BaseTuner.
It's confusing otherwise
It is too specific to really be used as a standalone function.
Not that generally useful
Also improve some existing docstrings, e.g. for inject_adapter_in_model.
sayakpaul
left a comment
All tests passing gives enough confidence that nothing was broken in this PR. I would also probably run the integration/slow tests just to confirm no disruption.
But apart from that, this is really cool!
| "qwen2": ["q_proj", "v_proj"], | ||
| "qwen3": ["q_proj", "v_proj"], | ||
| } | ||
|
|
How can this be deleted? A small comment in-line here would be helpful.
As these mappings were all identical or almost identical to the one for LoRA, I just made them copies of the LoRA mappings, e.g. https://github.com/huggingface/peft/pull/2771/files#diff-b054446713795a79a89f2285d66068573d69f0b701ee31419bb223d1601e7844R108 for C3A.
I mentioned that in the PR description:
While working on this PR, I also deduplicated almost identical TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING constants by copying them from LoRA and only overriding a few values that differ. Moreover, some PEFT methods didn't have their own TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING but used the one from LoRA instead. They now each have their own constant, which is a copy from the one from LoRA.
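For illustration, here is a minimal sketch of that deduplication pattern. The qwen2/qwen3 entries come from the diff context above; the constant names follow the naming used in the PR description (not necessarily the exact names in the code), and the override value is made up:

```python
# Illustrative sketch only; names follow the PR description and the override
# value below is hypothetical, not taken from the actual C3A mapping.
TRANSFORMERS_MODULES_TO_LORA_TARGET_MODULES_MAPPING = {
    # ... many more architectures ...
    "qwen2": ["q_proj", "v_proj"],
    "qwen3": ["q_proj", "v_proj"],
}

# Instead of re-listing every architecture, a method like C3A starts from a
# copy of the LoRA mapping and overrides only the entries that differ:
TRANSFORMERS_MODULES_TO_C3A_TARGET_MODULES_MAPPING = {
    **TRANSFORMERS_MODULES_TO_LORA_TARGET_MODULES_MAPPING,
    "gpt2": ["c_attn"],  # hypothetical override for illustration
}
```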
BenjaminBossan
left a comment
Thanks for your review, Sayak.
All tests passing gives enough confidence that nothing was broken in this PR. I would also probably run the integration/slow tests just to confirm no disruption.
Good idea about running slow tests; unfortunately, they're not set up to run on forks :-/ I ran a subset of those locally and they pass.
But I think it's not a big deal: once it's merged, they will run during the night, and we can fix anything that breaks then. Generally, the stuff that is changed here is best covered by the normal tests anyway.
Btw. one unfortunate side effect of this PR is that the test coverage declines slightly (by 1%) because there are fewer duplicated lines being tested :D
| "qwen2": ["q_proj", "v_proj"], | ||
| "qwen3": ["q_proj", "v_proj"], | ||
| } | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As these mappings were all identical or almost identical to the one for LoRA, I just made them copies of the LoRA mappings, e.g. https://github.com/huggingface/peft/pull/2771/files#diff-b054446713795a79a89f2285d66068573d69f0b701ee31419bb223d1601e7844R108 for C3A.
I mentioned that in the PR description:
While working on this PR, I also deduplicated almost identical TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING constants by copying them from LoRA and only overriding a few values that differ. Moreover, some PEFT methods didn't have their own TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING but used the one from LoRA instead. They now each have their own constant, which is a copy from the one from LoRA.
Exactly why regular software engineering coverage often cannot be applied one-on-one in these libraries. I am totally fine with that.
githubnemo
left a comment
Minor comments / questions. Looks good otherwise :)
```python
def _unloading_checks(self, adapter_names: Optional[list[str]]):
    adapters_to_consider = adapter_names or self.active_adapters
    is_modules_to_save_available = any(
        self.peft_config[adapter].modules_to_save for adapter in adapters_to_consider
    )
    if is_modules_to_save_available and len(adapters_to_consider) > 1:
        raise ValueError("Cannot unload multiple adapters that specify `modules_to_save`.")
```
is this a remnant and can be removed? doesn't seem to be used.
Hmm, good question. So I agree that this check doesn't make too much sense, but the general idea is right: If we have multiple modules_to_save, then merge_and_unload doesn't really work. Right now, we just let it pass:
```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model
from copy import deepcopy

model_id = "facebook/opt-125m"
model = AutoModelForCausalLM.from_pretrained(model_id)
config = LoraConfig(modules_to_save=["lm_head"])
model = get_peft_model(model, config)
model.add_adapter("adapter2", config)
model.base_model.model.lm_head.modules_to_save.default.weight.data.fill_(111)
model.base_model.model.lm_head.modules_to_save.adapter2.weight.data.fill_(222)
unloaded = deepcopy(model).merge_and_unload(adapter_names=["default", "adapter2"])
print(unloaded.lm_head.weight)  # is [[111, 111, ...]]
```

I'd suggest leaving this situation as-is for now and handling it in a separate PR. WDYT?
Agreed, let's leave it as-is for now.
* IA³ merge checks for bnb
* Typos
BenjaminBossan
left a comment
Thanks for the detailed review @githubnemo, I resolved the flagged issues or commented on them. Please check again.
During work on #2771, the usage of set_auxiliary_adapters became obsolete, except in one place, which was missed. This has now been cleaned up and the obsolete method is removed.
There is a lot of code duplication in PEFT, especially when it comes to the `model.py` files of the different PEFT methods. The reason for this is that for new PEFT methods, contributors will often copy an existing one and adapt it where necessary (which is as it should be). For lack of abstractions, this resulted in dozens of methods being copied 1:1.

At the start, when there were still few PEFT methods, we couldn't know yet which functions could stay the same across different PEFT methods and which ones would need adjusting. Therefore, it wasn't a bad choice to avoid prematurely abstracting those away. Now that we have a much better picture, it is time to take this step. If a PEFT method requires special treatment, it can still override these methods, so we don't lose flexibility. The refactored methods are:
- `merge_and_unload`
- `unload`
- `delete_adapter`
- `set_adapter`
- `enable_adapter_layers`
- `disable_adapter_layers`
- `_replace_module`
- `_unload_and_optionally_merge`
- `_mark_only_adapters_as_trainable`
- `_check_new_adapter_config`
- `_check_target_module_exists`
- `_prepare_adapter_config`
- `__getattr__`
- `get_peft_config_as_dict` (fully deleted)

These methods were dealt with in separate commits for easier review.
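For illustration, here is a toy sketch of the pattern, not PEFT's actual code: the shared implementation lives once on the base tuner class, and an individual PEFT method only overrides what genuinely differs.

```python
# Toy sketch of the deduplication pattern; this is not PEFT's actual
# implementation, only an illustration of "shared logic on the base class,
# overrides only where a method really differs".
class BaseTunerSketch:
    def delete_adapter(self, adapter_name: str) -> None:
        # Generic logic that previously was copied into every method's model.py.
        print(f"deleting adapter {adapter_name!r} via the shared code path")

    def _check_new_adapter_config(self, config: dict) -> None:
        # Generic sanity checks shared by all methods; subclasses may extend.
        print("running shared config checks")


class SomeMethodModel(BaseTunerSketch):
    # delete_adapter is simply inherited; nothing needs to be copied anymore.

    def _check_new_adapter_config(self, config: dict) -> None:
        # A method that needs special treatment overrides the hook and adds
        # only its own checks on top of the shared ones.
        super()._check_new_adapter_config(config)
        print("running method-specific config checks")
```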
This PR results in over 3000 fewer lines of code. We went from ~10700 lines counted as duplicated to ~8000. The 90th percentile of the duplication ratio for methods/functions went from 100% to 97%. This may seem like little, but it means that there are significantly fewer functions now that are 100% duplicates.
As a concrete example, the `boft/model.py` file now contains 70 lines of code where previously it contained 227 (counted using `cloc`). As this is a fairly standard model file for a PEFT method, it means future contributors can skip ~2/3 of the model code, allowing them to better focus on the actual new code.

The remaining, very similar functions are almost all in `layer.py`. So e.g. for LoRA, `Linear.merge` is 95% identical to `_ConvNd.merge`. But since there are differences, the functions cannot be easily deduplicated (and it's questionable whether we even want to deduplicate them at all).
I introduced a new module, `functional.py`, which contains functions (just reimported from elsewhere) that can be useful for libraries that want to integrate PEFT. I would suggest that we treat them as public API and thus guarantee backwards compatibility. When refactoring the methods for deduplication, I considered whether they can be used on any model (like transformers) or if they only make sense on a `PeftModel`. If it was the former, I made them standalone functions and added them to `functional.py`. If they depend on `PeftModel` instances, they just became normal methods. E.g., `disable_adapter_layers` has `for active_adapter in self.active_adapters`, so it presupposes `self.active_adapters`.
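As a rough sketch of the kind of integration use case this is aimed at: the example below uses `inject_adapter_in_model`, which already exists as a public PEFT function (and whose docstring this PR also improves); whether it is among the functions re-exported in `functional.py` is an assumption here, not taken from the diff.

```python
# Sketch: adding LoRA layers to a plain transformers model without wrapping it
# in a PeftModel, which is the kind of use case functional.py is aimed at.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, inject_adapter_in_model

base_model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
lora_config = LoraConfig(target_modules=["q_proj", "v_proj"])
# The adapter layers are injected directly into the transformers model; the
# model is modified in place and also returned.
model = inject_adapter_in_model(lora_config, base_model)
```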
While working on this PR, I also deduplicated almost identical `TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING` constants by copying them from LoRA and only overriding a few values that differ. Moreover, some PEFT methods didn't have their own `TRANSFORMERS_MODULES_TO_XXX_TARGET_MODULES_MAPPING` but used the one from LoRA instead. They now each have their own constant, which is a copy of the one from LoRA.