validated cpt, ephemeral_gpu_offloading and eva finetuning on XPU #2694
Conversation
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
## Leveraging multiple accelerators

- EVA initialization can be parallelized across multiple GPUs. In this case inputs from multiple GPUs are gathered before computing the SVD for the batch. This requires that the model is wrapped in a `torch.nn.DataParallel` or `torch.nn.DistributedDataParallel` class. An example of how to use this can be found in [eva_finetuning_multi_gpu.py](https://github.com/huggingface/peft/blob/main/examples/eva_finetuning/eva_finetuning_multi_gpu.py).
+ EVA initialization can be parallelized across multiple accelerators. In this case inputs from multiple accelerators are gathered before computing the SVD for the batch. This requires that the model is wrapped in a `torch.nn.DataParallel` or `torch.nn.DistributedDataParallel` class. An example of how to use this can be found in [eva_finetuning_multi_accelerator.py](https://github.com/huggingface/peft/blob/main/examples/eva_finetuning/eva_finetuning_multi_accelerator.py).
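For context, a condensed sketch of the multi-accelerator EVA flow described above. This is not a verbatim copy of the linked example; the model id, calibration texts, and LoRA hyperparameters below are placeholders.

```python
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader
from transformers import AutoModelForCausalLM, AutoTokenizer, default_data_collator
from peft import EvaConfig, LoraConfig, get_peft_model, initialize_lora_eva_weights

model_id = "meta-llama/Llama-3.2-1B"  # placeholder; any causal LM works

dist.init_process_group()  # backend auto-selected for the current accelerator
local_rank = int(os.environ["LOCAL_RANK"])
device = (
    torch.device(f"xpu:{local_rank}")
    if torch.xpu.is_available()
    else torch.device(f"cuda:{local_rank}")
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id).to(device)

# Tiny toy calibration set; in practice use a slice of your training data.
enc = tokenizer(["example calibration text"] * 8, return_tensors="pt")
dataset = [{k: v[i] for k, v in enc.items()} for i in range(8)]
dataloader = DataLoader(dataset, batch_size=4, collate_fn=default_data_collator)

peft_config = LoraConfig(
    r=16,
    target_modules=["q_proj", "v_proj"],
    init_lora_weights="eva",
    eva_config=EvaConfig(rho=2.0),
)
peft_model = get_peft_model(model, peft_config)

# The DDP wrapper is what lets EVA gather inputs from all ranks before
# computing the SVD for each batch.
peft_model = DDP(peft_model, device_ids=[device])
initialize_lora_eva_weights(peft_model, dataloader)
```

Launched with e.g. `torchrun --nproc-per-node=2 script.py`; see the linked example file for the complete, tested version.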
Just wondering: Are Intel accelerators not still technically "GPUs"? Is it really necessary to replace "GPU" in the docs and comments with "accelerator"?
Yes, Intel XPUs are still technically GPU architectures, which are general-purpose and programmable. I am using the term "accelerator" following PyTorch's vocabulary (here). PyTorch is supporting more and more accelerators, so if someone validates another new device later, they won't need to change the README, or even eva_finetuning.py. Please let me know your opinion, thanks.
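For illustration only, device-agnostic code in that spirit could look like the sketch below, assuming PyTorch 2.6+ where the `torch.accelerator` namespace is available:

```python
import torch

# Pick whatever accelerator the current build exposes (cuda, xpu, mps, ...),
# falling back to CPU; no vendor-specific branch needed.
if hasattr(torch, "accelerator") and torch.accelerator.is_available():
    device = torch.accelerator.current_accelerator()
else:
    device = torch.device("cpu")

x = torch.ones(2, 2, device=device)
print(x.device)
```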
That makes sense, thanks for explaining.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
BenjaminBossan left a comment
Thanks for checking that CPT, EVA, and GPU offloading work with XPU.
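For readers landing here, a minimal sketch of the ephemeral GPU offloading path named in the PR title, assuming PEFT's `ephemeral_gpu_offload` flag on `PeftModel.from_pretrained`; the model and adapter ids are placeholders:

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B",  # placeholder model
    torch_dtype=torch.float16,
)

# With ephemeral GPU offloading, heavy init-time operations (e.g. the DoRA
# norm computation) are executed on the accelerator and the results moved
# back, even when the weights themselves live on CPU.
model = PeftModel.from_pretrained(
    base,
    "user/dora-adapter",  # placeholder adapter id
    ephemeral_gpu_offload=True,
)
```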
@BenjaminBossan, please help review, thanks very much.