Conversation

@yao-matrix
Contributor

@BenjaminBossan, please help review, thanks very much.

Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
Member

@BenjaminBossan BenjaminBossan left a comment

Thanks for updating the text generation benchmark, this was definitely on our list to do soon. Unfortunately, I ran into an issue when trying it out. Could you please check?

Also, once you're finished with your changes, please run `ruff` on `method_comparison/text_generation_benchmark/` (this directory is not automatically checked).
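(For reference, a typical invocation, assuming `ruff` is installed in the environment, would look something like the following; `--fix` is optional.)

```sh
ruff check --fix method_comparison/text_generation_benchmark/
ruff format method_comparison/text_generation_benchmark/
```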

    accelerator_reserved_mb = 0.0

-   return ram_usage_mb, gpu_allocated_mb, gpu_reserved_mb
+   return ram_usage_mb, accelerator_allocated_mb, accelerator_reserved_mb
Member

For some reason, I'm getting incorrect results. When I run `python run_base.py -v --force` on this branch on a 4090, I get 0 MB memory usage in the report. To debug this, you can set a breakpoint at this line and run the aforementioned command. The first time we get here, `accelerator_allocated_mb` and `accelerator_reserved_mb` are 0, which is expected. After this, the model is loaded onto the accelerator. Thus, when we get here the second time, these values should be > 0, but I still get 0. When running the same command on the main branch, I get correct reports.

I could not determine where this difference comes from: `nvidia-smi` shows the same output for the main branch and this branch, `torch.cuda.is_available()` always returns `True`, and the `model_kwargs` are also identical. Can you replicate this, and do you have an idea what the issue could be?

Contributor Author

@yao-matrix yao-matrix Aug 8, 2025

@BenjaminBossan oh, my fault, the chain should be a single if-elif-else, but I broke it into two separate checks (an if, followed by an if-else), so a CUDA GPU takes the first if and then the second else (which sets the values to 0). Sorry, and thanks for checking.
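For anyone reading along, here is a minimal sketch of that control-flow bug and its fix. This is illustrative only, not the benchmark's actual code: the function names are made up, and it assumes a recent PyTorch build that exposes `torch.xpu` memory statistics.

```python
import torch

# Broken variant: two independent checks instead of one chain.
def accelerator_memory_mb_broken():
    allocated_mb = reserved_mb = 0.0
    if torch.cuda.is_available():
        # The CUDA branch fills in the values ...
        allocated_mb = torch.cuda.memory_allocated() / 1024**2
        reserved_mb = torch.cuda.memory_reserved() / 1024**2
    if torch.xpu.is_available():
        allocated_mb = torch.xpu.memory_allocated() / 1024**2
        reserved_mb = torch.xpu.memory_reserved() / 1024**2
    else:
        # ... but on a CUDA machine torch.xpu.is_available() is False,
        # so this else runs and overwrites them with 0.
        allocated_mb = reserved_mb = 0.0
    return allocated_mb, reserved_mb

# Fixed variant: a single if-elif-else, so exactly one branch runs.
def accelerator_memory_mb_fixed():
    if torch.cuda.is_available():
        allocated_mb = torch.cuda.memory_allocated() / 1024**2
        reserved_mb = torch.cuda.memory_reserved() / 1024**2
    elif torch.xpu.is_available():
        allocated_mb = torch.xpu.memory_allocated() / 1024**2
        reserved_mb = torch.xpu.memory_reserved() / 1024**2
    else:
        allocated_mb = reserved_mb = 0.0
    return allocated_mb, reserved_mb
```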

Member

Ah yes, makes sense.

yao-matrix and others added 3 commits August 8, 2025 17:23
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@BenjaminBossan BenjaminBossan left a comment

Thanks for fixing the issue with CUDA. I think the script still has a small error for XPU, which I flagged, please check.

    accelerator_reserved_mb = 0.0

-   return ram_usage_mb, gpu_allocated_mb, gpu_reserved_mb
+   return ram_usage_mb, accelerator_allocated_mb, accelerator_reserved_mb
Member

Ah yes, makes sense.

-       return gpu_allocated, gpu_reserved
+       _, accelerator_allocated, accelerator_reserved = get_memory_usage()
+       return accelerator_allocated, accelerator_reserved
+   elif torch.xpu.is_available():
Member

Here, elif is not necessary because the previous branch returns, but it also doesn't hurt.
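A tiny illustration of that point (hypothetical function, not taken from the PR): since the first branch returns, control can never fall through to the next check, so `elif` and a plain `if` are equivalent here.

```python
import torch

def pick_backend() -> str:
    if torch.cuda.is_available():
        return "cuda"  # the function exits here, so nothing falls through
    elif torch.xpu.is_available():  # a plain `if` would behave identically
        return "xpu"
    return "cpu"
```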

Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
Member

@BenjaminBossan BenjaminBossan left a comment

Thanks for making the text generation benchmark XPU compatible.

@BenjaminBossan BenjaminBossan merged commit 95df499 into huggingface:main Aug 12, 2025
2 of 14 checks passed
@yao-matrix yao-matrix deleted the tgb-xpu branch August 12, 2025 20:57