
Update dependency peft to v0.15.2 #61


Open

wants to merge 1 commit into base: konflux-poc

Conversation

red-hat-konflux[bot]

This PR contains the following updates:

Package: peft
Update: minor
Change: ==0.3.0 -> ==0.15.2

Warning

Some dependencies could not be looked up. Check the warning logs for more information.


Release Notes

huggingface/peft (peft)

v0.15.2

Compare Source

This patch fixes a bug that caused prompt learning methods like P-tuning to stop working (#2477).

v0.15.1

Compare Source

This patch includes a fix for #2450. In this bug, modules_to_save was not handled correctly when used in conjunction with DeepSpeed ZeRO stage 3, which resulted in those modules being saved as placeholder values in the checkpoints.

Full Changelog: huggingface/peft@v0.15.0...v0.15.1

v0.15.0

Compare Source

Highlights


New Methods

CorDA: Context-Oriented Decomposition Adaptation

@iboing and @5eqn contributed CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning. This task-driven initialization method has two modes, knowledge-preservation and instruction-preservation, both of which use external data to select ranks intelligently. The former can be used to select those ranks that correspond to weights not affiliated with knowledge from, say, a QA dataset. The latter can be used to select those ranks that correspond most to the task at hand (e.g., a classification task). (#2231)

Trainable Tokens: Selective token update

The new Trainable Tokens tuner allows for selective training of tokens without re-training the full embedding matrix, e.g. when adding support for reasoning / thinking tokens. This is a lot more memory efficient and the saved checkpoint is much smaller. It can be used standalone or in conjunction with LoRA adapters by passing trainable_token_indices to LoraConfig. (#​2376)
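
A minimal sketch of how this might be wired up, assuming a transformers causal LM; the model name, target module names, and token indices below are placeholders for illustration, not values from the release notes:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Placeholder base model; substitute your own checkpoint.
base_model = AutoModelForCausalLM.from_pretrained("my-org/my-base-model")

config = LoraConfig(
    r=8,
    target_modules=["q_proj", "v_proj"],       # assumed attention module names
    # Hypothetical indices of newly added thinking/reasoning tokens; only these
    # embedding rows are trained instead of the full embedding matrix.
    trainable_token_indices=[128001, 128002],
)
peft_model = get_peft_model(base_model, config)
```

The saved checkpoint then contains only the LoRA weights plus the handful of trained token embeddings, rather than a full copy of the embedding matrix.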

Enhancements

LoRA now supports targeting multihead attention modules (for now only those with _qkv_same_embed_dim=True). These modules were tricky because they may expose linear submodules without using their forward methods, so they need explicit support. (#1324)

Hotswapping now allows different alpha scalings and ranks without recompilation of the model when the model is prepared using a call to prepare_model_for_compiled_hotswap() before compiling the model. (#​2177)
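
A hedged sketch of the intended workflow, based on the peft hotswap utilities; the import path and keyword arguments may differ slightly between versions, and all paths are placeholders:

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel
from peft.utils.hotswap import hotswap_adapter, prepare_model_for_compiled_hotswap

base_model = AutoModelForCausalLM.from_pretrained("my-org/my-base-model")  # placeholder
model = PeftModel.from_pretrained(base_model, "path/to/adapter-A")

# Pad LoRA ranks/scalings up front so adapters with different r/alpha fit later.
prepare_model_for_compiled_hotswap(model, target_rank=32)
model = torch.compile(model)

# Swap in a second adapter's weights in place; no recompilation is triggered.
hotswap_adapter(model, "path/to/adapter-B", adapter_name="default")
```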

GPTQModel support was added in #2247 as a replacement for AutoGPTQ, which is no longer maintained.

Changes

  • It's now possible to use all-linear as target_modules for custom (non-transformers) models (#​2267). With this change comes a bugfix where it was possible that non-linear layers were selected when they shared the same name with a linear layer (e.g., bar.foo and baz.foo).
  • The internal tuner API was refactored to make method registration easier. With this change, registering a new method requires only a single register_peft_method() call instead of changes to numerous files. (#2282)
  • PEFT_TYPE_TO_MODEL_MAPPING is now deprecated and should not be relied upon. Use PEFT_TYPE_TO_TUNER_MAPPING instead. (#​2282)
  • Mixed adapter batches can now be used in conjunction with beam search. (#​2287)
  • Fixed an issue where modules_to_save keys could wrongly match parts of the state dict if the key was a substring of another key (e.g., classifier and classifier2). (#2334)
  • Auto-casting of the input dtype to the LoRA adapter dtype can now be disabled via disable_input_dtype_casting=True. (#​2353)
  • The config parameters rank_pattern and alpha_pattern used by many adapters now also support matching full paths by prefixing the pattern with a caret, for example ^foo to target model.foo but not model.bar.foo (see the sketch after this list). (#2419)
  • AutoPeftModels no longer reduce the embedding size if the tokenizer size differs from the embedding size. The embedding matrix is only resized if the tokenizer has more tokens than the matrix. This prevents resizing of embedding matrices in models that have 'spare' tokens built in. (#2427)
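
For example, the caret-anchored pattern mentioned above might be used like this (a sketch; "foo" is the placeholder module name from the release note, not a real layer):

```python
from peft import LoraConfig

config = LoraConfig(
    r=8,
    lora_alpha=16,
    # "^foo" matches the module named exactly "foo" at the top level (model.foo),
    # but not nested modules such as model.bar.foo.
    rank_pattern={"^foo": 32},
    alpha_pattern={"^foo": 64},
)
```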

What's Changed

New Contributors

Full Changelog: huggingface/peft@v0.14.0...v0.15.0

v0.14.0: EVA, Context-aware Prompt Tuning, Bone, and more

Compare Source

Highlights


New Methods

Context-aware Prompt Tuning

@tsachiblau added a new soft prompt method called Context-aware Prompt Tuning (CPT), a combination of In-Context Learning and Prompt Tuning in the sense that, for each training sample, it builds a learnable context from training examples in addition to the single training sample. CPT allows for sample- and parameter-efficient few-shot classification and addresses recency bias.

Explained Variance Adaptation

@​sirluk contributed a new LoRA initialization method called Explained Variance Adaptation (EVA). Instead of randomly initializing LoRA weights, this method uses SVD on minibatches of finetuning data to initialize the LoRA weights and is also able to re-allocate the ranks of the adapter based on the explained variance ratio (derived from SVD). Thus, this initialization method can yield better initial values and better rank distribution.
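
A rough sketch of how EVA initialization is wired up, based on the peft docs for this release; names such as EvaConfig and initialize_lora_eva_weights and their exact signatures should be verified against the installed version, and the base model, module names, and dataloader are placeholders:

```python
from peft import EvaConfig, LoraConfig, get_peft_model, initialize_lora_eva_weights

config = LoraConfig(
    r=16,
    target_modules=["q_proj", "v_proj"],   # assumed module names
    init_lora_weights="eva",               # request EVA instead of random init
    eva_config=EvaConfig(rho=2.0),         # rho bounds how much ranks can be redistributed
)
peft_model = get_peft_model(base_model, config)      # base_model: your transformers model
initialize_lora_eva_weights(peft_model, dataloader)  # SVD over minibatches of finetuning data
```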

Bone

@​JL-er added an implementation for Block Affine (Bone) Adaptation which utilizes presumed sparsity in the base layer weights to divide them into multiple sub-spaces that share a single low-rank matrix for updates. Compared to LoRA, Bone has the potential to significantly reduce memory usage and achieve faster computation.

Enhancements

PEFT now supports LoRAs for int8 torchao quantized models (see the accompanying example notebooks). In addition, VeRA can now be used with 4- and 8-bit bitsandbytes quantization thanks to @ZiadHelal.

Hot-swapping of LoRA adapters is now possible using the hotswap_adapter function. You can load one LoRA and replace its weights in place with the LoRA weights of another adapter, which, in general, should be faster than deleting one adapter and loading the other in its place. The feature is built so that no re-compilation of the model is necessary if torch.compile was called on the model (right now, this requires ranks and alphas to be the same for the adapters).
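
In its simplest form (no torch.compile involved), the swap might look like this; the model name, paths, and adapter name are placeholders:

```python
from transformers import AutoModelForCausalLM
from peft import PeftModel
from peft.utils.hotswap import hotswap_adapter

base_model = AutoModelForCausalLM.from_pretrained("my-org/my-base-model")  # placeholder
model = PeftModel.from_pretrained(base_model, "path/to/lora-A")

# ... run inference with LoRA A ...

# Replace the loaded weights in place with LoRA B (same ranks/alphas in this release).
hotswap_adapter(model, "path/to/lora-B", adapter_name="default")
```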

LoRA and IA³ now support Conv3d layers thanks to @​jsilter, and @​JINO-ROHIT added a notebook showcasing PEFT model evaluation using lm-eval-harness toolkit.

With the target_modules argument, you can specify which layers to target with the adapter (e.g. LoRA). Now you can also specify which modules not to target by using the exclude_modules parameter (thanks @​JINO-ROHIT).
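
For instance (a sketch; the excluded module name is an assumption for a typical causal LM):

```python
from peft import LoraConfig

config = LoraConfig(
    target_modules="all-linear",    # adapt every linear layer ...
    exclude_modules=["lm_head"],    # ... except the output head
)
```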

Changes

  • Several fixes were made to the OFT implementation, among other things to fix merging, which makes adapter weights trained with PEFT versions prior to this release incompatible (see #1996 for details).
  • Adapter configs are now forward-compatible by accepting unknown keys.
  • Prefix tuning was adapted to the DynamicCache caching infrastructure of transformers (see #2096). If you are using this PEFT version and a recent version of transformers with an old prefix tuning checkpoint, you should double-check that it still works correctly and retrain it if it doesn't.
  • Added a lora_bias parameter to LoRA layers to enable a bias on the LoRA B matrix. This is useful when extracting LoRA weights from fully fine-tuned parameters with bias vectors so that these can be taken into account (see the sketch after this list).
  • #​2180 provided a couple of bug fixes to LoKr (thanks @​yaswanth19). If you're using LoKr, your old checkpoints should still work but it's recommended to retrain your adapter.
  • from_pretrained now warns the user if PEFT keys are missing.
  • Attribute access to modules in modules_to_save is now properly and transparently handled.
  • PEFT supports the changes to bitsandbytes 8-bit quantization from the recent v0.45.0 release. To benefit from these improvements, we recommend upgrading bitsandbytes if you're using QLoRA. Expect slight numerical differences in model outputs if you're using QLoRA with 8-bit bitsandbytes quantization.
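
The lora_bias option mentioned above could be enabled like this (a sketch; module names are assumptions):

```python
from peft import LoraConfig

config = LoraConfig(
    r=16,
    target_modules=["q_proj", "v_proj"],  # assumed module names
    lora_bias=True,                       # adds a trainable bias term to the LoRA B matrix
)
```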

What's Changed

New Contributors

Full Changelog: huggingface/peft@v0.13.2...v0.14.0

v0.13.2: Small patch release

Compare Source

This patch release contains a small bug fix for an issue that prevented some LoRA checkpoints from being loaded correctly (mostly concerning stable diffusion checkpoints not trained with PEFT when loaded in diffusers, #2144).

Full Changelog: huggingface/peft@v0.13.1...v0.13.2

v0.13.1: Small patch release

Compare Source

This patch release contains a small bug fix for the low_cpu_mem_usage=True option (#​2113).

Full Changelog: huggingface/peft@v0.13.0...v0.13.1

v0.13.0: LoRA+, VB-LoRA, and more

Compare Source


Highlights

New methods

LoRA+

@kallewoof added LoRA+ to PEFT (#1915). This function initializes an optimizer with settings that are better suited to training a LoRA adapter.
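
A hedged sketch based on the peft docs; the import path peft.optimizers and the argument names are worth verifying against your installed version, and peft_model is a model you have already wrapped with get_peft_model:

```python
import torch
from peft.optimizers import create_loraplus_optimizer

optimizer = create_loraplus_optimizer(
    model=peft_model,
    optimizer_cls=torch.optim.AdamW,
    lr=5e-5,
    loraplus_lr_ratio=16,   # LoRA B parameters are trained with lr * ratio
)
```

The resulting optimizer can then be passed to your own training loop, or to the transformers Trainer via its optimizers argument.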

VB-LoRA

@​leo-yangli added a new method to PEFT called VB-LoRA (#​2039). The idea is to have LoRA layers be composed from a single vector bank (hence "VB") that is shared among all layers. This makes VB-LoRA extremely parameter efficient and the checkpoints especially small (comparable to the VeRA method), while still promising good fine-tuning performance. Check the VB-LoRA docs and example.

Enhancements

New Hugging Face team member @​ariG23498 added the helper function rescale_adapter_scale to PEFT (#​1951). Use this context manager to temporarily increase or decrease the scaling of the LoRA adapter of a model. It also works for PEFT adapters loaded directly into a transformers or diffusers model.
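
A small usage sketch, assuming the helper lives in peft.helpers as in the docs; the multiplier value and the inference call are placeholders:

```python
from peft.helpers import rescale_adapter_scale

# Temporarily halve the LoRA contribution for this block of inference calls.
with rescale_adapter_scale(peft_model, multiplier=0.5):
    outputs = peft_model.generate(**inputs)
# The original scaling is restored when the context manager exits.
```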

@​ariG23498 also added DoRA support for embedding layers (#​2006). So if you're using the use_dora=True option in the LoraConfig, you can now also target embedding layers.

For some time now, we have supported inference with batches that use different adapters for different samples, e.g. samples 1-5 use "adapter1" and samples 6-10 use "adapter2". However, this only worked for LoRA layers so far. @saeid93 extended this to also work with layers targeted by modules_to_save (#1990).
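
In practice this is driven by the adapter_names argument at call time; a sketch where both adapters are already loaded on peft_model, batch is a tokenized input dict, and "__base__" selects the plain base model:

```python
# One forward pass with per-sample adapter selection (list length == batch size).
adapter_names = ["adapter1", "adapter1", "adapter2", "__base__"]
outputs = peft_model(**batch, adapter_names=adapter_names)
```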

When loading a PEFT adapter, you now have the option to pass low_cpu_mem_usage=True (#1961). This initializes the adapter with empty weights (on the "meta" device) before loading the weights, instead of initializing on CPU or GPU, which can speed up loading PEFT adapters. The option is especially useful if you have a lot of adapters to load at the same time or if the adapters are very big. Please let us know if you encounter issues with this option, as we may make this the default in the future.
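
For example (model name and paths are placeholders):

```python
from transformers import AutoModelForCausalLM
from peft import PeftModel

base_model = AutoModelForCausalLM.from_pretrained("my-org/my-base-model")  # placeholder

# Adapter weights are created on the "meta" device first, then filled from the checkpoint.
model = PeftModel.from_pretrained(base_model, "path/to/adapter", low_cpu_mem_usage=True)

# The same flag works when loading additional adapters onto an existing PEFT model.
model.load_adapter("path/to/another-adapter", adapter_name="other", low_cpu_mem_usage=True)
```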

Changes

Safe loading of PyTorch weights

Unless indicated otherwise, PEFT adapters are saved and loaded using the secure safetensors format. However, we also support the PyTorch format for checkpoints, which relies on the inherently insecure pickle protocol from Python. In the future, PyTorch will be stricter when loading these files to improve security by making weights_only=True the default. This is generally recommended and should not cause any trouble with PEFT checkpoints, which is why PEFT enables it by default with this release. Please open an issue if this causes trouble.
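
For reference, the stricter behavior corresponds to the weights_only flag of torch.load, which PEFT now sets when it has to read PyTorch-format (pickle) checkpoints; the file name below is a placeholder:

```python
import torch

# Refuses to unpickle arbitrary Python objects; only tensor data is deserialized.
state_dict = torch.load("adapter_model.bin", weights_only=True, map_location="cpu")
```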

What's Changed


Configuration

📅 Schedule: Branch creation - "after 5am on saturday" (UTC), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about these updates again.


  • If you want to rebase/retry this PR, check this box

To execute skipped test pipelines, write the comment /ok-to-test.

This PR has been generated by MintMaker (powered by Renovate Bot).

Signed-off-by: red-hat-konflux <126015336+red-hat-konflux[bot]@users.noreply.github.com>
@coveralls

Pull Request Test Coverage Report for Build 15089424447

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall first build on konflux/mintmaker/konflux-poc/peft-0.x at 93.407%

Totals Coverage Status
Change from base Build 15020007478: 93.4%
Covered Lines: 85
Relevant Lines: 91

💛 - Coveralls
