
Conversation

BenjaminBossan (Member) commented on Aug 14, 2025

This PR contains the changes required for the 0.17.1 patch release:

Note: Windows tests are failing because #2724 is not included, but since that only affects tests, I see no reason to cherry-pick that commit into the release.

BenjaminBossan and others added 2 commits August 14, 2025 11:38
There are a few issues with target_parameters that are fixed in this PR.

Existing parametrizations

When using target_parameters with LoRA, after the forward call finishes,
the LoRA parametrization is removed. However, this also used to remove
all other parametrizations on the same parameter, which is bad. With
this PR, only the LoRA parametrization is removed.
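
As a minimal sketch of the idea (not the actual PEFT code; the class names are placeholders), a single parametrization can be removed while preserving the others registered on the same parameter by collecting the others and re-registering them after removal:

```python
# Minimal sketch, not the PEFT implementation: remove only one parametrization
# from a parameter while keeping the others. Class names are placeholders.
from torch import nn
from torch.nn.utils import parametrize


class Scale(nn.Module):  # stand-in for a pre-existing user parametrization
    def forward(self, weight):
        return 2 * weight


class LoraLike(nn.Module):  # stand-in for the temporary LoRA parametrization
    def forward(self, weight):
        return weight + 0.01


linear = nn.Linear(4, 4)
parametrize.register_parametrization(linear, "weight", Scale())
parametrize.register_parametrization(linear, "weight", LoraLike())

# remove_parametrizations() drops *all* parametrizations on "weight", which is
# the problem described above. To keep the others, collect them first and
# re-register them after the removal.
others = [p for p in linear.parametrizations["weight"] if not isinstance(p, LoraLike)]
parametrize.remove_parametrizations(linear, "weight", leave_parametrized=False)
for p in others:
    parametrize.register_parametrization(linear, "weight", p)

assert parametrize.is_parametrized(linear, "weight")
assert not any(isinstance(p, LoraLike) for p in linear.parametrizations["weight"])
```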

Module repr

This PR also extends the __repr__ of lora.ParamWrapper to contain the
parameter name, which makes it more useful.
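
For illustration only (the attribute and class names are assumptions, not the actual PEFT internals), a wrapper's __repr__ can surface the targeted parameter name like this:

```python
# Illustrative sketch of a repr that includes the targeted parameter name;
# attribute and class names are assumptions, not the real lora.ParamWrapper.
from torch import nn


class ParamWrapperSketch(nn.Module):
    def __init__(self, base_layer: nn.Module, parameter_name: str):
        super().__init__()
        self.base_layer = base_layer
        self.parameter_name = parameter_name

    def __repr__(self) -> str:
        # Include the targeted parameter name so that printing the model makes
        # clear which nn.Parameter of the base layer is being adapted.
        return (
            f"{self.__class__.__name__}(parameter_name={self.parameter_name!r}, "
            f"base_layer={self.base_layer!r})"
        )


print(ParamWrapperSketch(nn.Linear(4, 4), "weight"))
```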

Extend testing

Added a tiny gpt-oss model to the target_parameters test suite.

Multiple LoRA adapters with target_parameters

There is an issue when adding a second LoRA adapter with
target_parameters, where the second adapter would not actually be
applied correctly. The corresponding unit test was too lax to catch the
bug. This is not easy to fix, so for now we forbid adding a second
adapter with target_parameters. This is very strict, but it's better than
having silent errors.
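
Roughly, this is the situation the new restriction covers. The model id, the targeted parameter names, and the exact exception type below are assumptions for illustration, not guaranteed by the release:

```python
# Rough usage sketch of the new restriction. The model id, the targeted
# parameter names, and the concrete exception type are assumptions.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("some/tiny-moe-model")  # placeholder id
config = LoraConfig(target_parameters=["mlp.experts.gate_up_proj"])  # placeholder targets
peft_model = get_peft_model(model, config)

# Previously, a second adapter with target_parameters appeared to work but was
# not applied correctly; now it is rejected outright.
second = LoraConfig(target_parameters=["mlp.experts.gate_up_proj"])
try:
    peft_model.add_adapter("second", second)
except ValueError as exc:  # the exact error type is an assumption
    print("adding a second target_parameters adapter is forbidden:", exc)
```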

Although it was possible to fix that specific issue, the solution
resulted in ever more deeply nested adapters (i.e. with multiple
.base_layer levels). This in turn causes those infixes to become part of
the state_dict keys. But then we cannot load the individual adapters
correctly, unless the model is restored in exactly the same order in
which it was previously created. This is not normally a requirement in
PEFT (e.g. I can create a model with two adapters and later decide to
load only one of them).
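
A toy, PEFT-free illustration of the problem: wrapping a wrapper leaks repeated .base_layer infixes into the state_dict keys, so the keys depend on the order in which the adapters were added.

```python
# Toy illustration (no PEFT involved) of how nested wrapping leaks repeated
# ".base_layer" infixes into state_dict keys.
from torch import nn


class WrapperSketch(nn.Module):
    def __init__(self, base_layer: nn.Module):
        super().__init__()
        self.base_layer = base_layer
        self.lora_A = nn.Linear(4, 2, bias=False)  # stand-in for adapter weights


layer = nn.Linear(4, 4)
first = WrapperSketch(layer)   # first adapter wraps the base layer
second = WrapperSketch(first)  # second adapter wraps the first wrapper

print(list(second.state_dict().keys()))
# ['base_layer.base_layer.weight', 'base_layer.base_layer.bias',
#  'base_layer.lora_A.weight', 'lora_A.weight']
# The first adapter's keys now carry a ".base_layer." infix and only make
# sense if the wrappers are re-created in exactly the same order when loading.
```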

In the long run, we need to think about solutions that would allow this.
It may require some form of normalization of the layers to prevent ever
deeper nesting. Another wart right now is that, since the LoRA adapter
lives on a module but actually targets one of possibly multiple
parameters, the LoRA weights don't reference that parameter in any of
their names. That means that, purely from the state_dict, it is unclear
which parameter a LoRA weight belongs to. Ideally, this should be encoded
in the LoRA weight key.
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

BenjaminBossan merged commit 53c25fe into huggingface:release/0.17.1 on Aug 21, 2025
10 of 27 checks passed
BenjaminBossan deleted the release/0.17.1-changes branch on Aug 21, 2025 at 09:06