
Conversation

mashehu
Contributor

@mashehu mashehu commented Nov 7, 2024

Close #1383

@mashehu mashehu requested a review from sateeshperi November 7, 2024 13:26

codecov bot commented Nov 7, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 76.50%. Comparing base (71b26a4) to head (f630d39).
Report is 9 commits behind head on dev.


@mashehu mashehu requested a review from nictru November 7, 2024 15:37
nictru and others added 2 commits November 7, 2024 16:38
Co-authored-by: Sateesh_Peri <33637490+sateeshperi@users.noreply.github.com>
Co-authored-by: Sateesh_Peri <33637490+sateeshperi@users.noreply.github.com>
@nictru
Contributor

nictru commented Nov 7, 2024

So the flow of information in the scdownstream pipeline looks as follows:

  1. Users provide the gpu profile
  2. The gpu profile prepares the containerization tools for GPU mounting and sets the hidden pipeline parameter use_gpu to true -> can be used for if-clauses in workflows
  3. Processes with GPU support get the process_gpu label -> can be used to handle all GPU-enabled processes in a certain way. Different executors need different tweaking to handle tasks from these processes correctly. I added a bit of documentation here.
  4. This section makes sure that processes have an ext variable which reflects whether they should use GPU or not. This can be useful if the same module supports usage both with and without GPU. Example: cellbender

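The flow in steps 1 and 2 can be sketched as a config profile. This is a hedged sketch based on the scdownstream implementation; the exact runtime options are assumptions and may differ from the final template:

```groovy
// Sketch of the gpu profile from steps 1-2 (illustrative, modelled on
// scdownstream's nextflow.config; option values are assumptions).
profiles {
    gpu {
        use_gpu                = true                               // hidden flag from step 2
        docker.runOptions      = '-u $(id -u):$(id -g) --gpus all'  // expose GPUs to Docker
        apptainer.runOptions   = '--nv'                             // NVIDIA support in Apptainer
        singularity.runOptions = '--nv'                             // NVIDIA support in Singularity
    }
}
```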
This implementation is the best I could come up with so far, but not many really "senior" people have had a look at it AFAIK. Maybe we can discuss which elements of the suggested approach should be part of the template and where we can perform some improvements.

@nictru
Contributor

nictru commented Nov 7, 2024

Another thought: Should this not be an optional part of the template?

@mashehu
Contributor Author

mashehu commented Nov 7, 2024

It's teeny-tiny enough that we might keep it in, imo, similar to arm... but also not 100% sure

@nictru
Contributor

nictru commented Nov 7, 2024

But is the goal to only add the profile to the template?

I think it would at least be nice to have a common structure for all GPU-enabled modules, even if it only means adding the process_gpu label

@sateeshperi sateeshperi requested a review from GallVp November 7, 2024 18:12
@GallVp
Member

GallVp commented Nov 7, 2024

Thank you @nictru

  1. Users provide the gpu profile

The gpu profile being added to the template does not have the use_gpu configuration variable. I can see that you have added it to the profile in the pipeline: https://github.com/nf-core/scdownstream/blob/3231971f309d1ac025e7180b69852c3637c975dd/nextflow.config#L182

I don't think we need a separate configuration variable.

  1. The gpu profile prepares the containerization tools for GPU mounting and sets the hidden pipeline parameter use_gpu to true -> can be used for if-clauses in workflows

In the scdownstream pipeline, use_gpu is defined as a pipeline parameter (https://github.com/nf-core/scdownstream/blob/3231971f309d1ac025e7180b69852c3637c975dd/nextflow.config#L40) and a configuration variable (https://github.com/nf-core/scdownstream/blob/3231971f309d1ac025e7180b69852c3637c975dd/nextflow.config#L182). These two are different, so setting gpu in the -profile will not set the use_gpu pipeline parameter. It will only set the configuration variable.

I think we only need the use_gpu pipeline parameter and should get rid of the use_gpu configuration variable. The use_gpu label in the base.config file can be changed to,

process {
    withLabel: 'use_gpu' {
        ext.use_gpu = { params.use_gpu }
    }
}

At pipeline execution, the user must set the gpu profile and the use_gpu pipeline parameter to utilise the GPU-based tools.

  1. Processes with GPU support get the process_gpu label -> can be used to handle all GPU-enabled processes in a certain way. Different executors need different tweaking to handle tasks from these processes correctly. I added a bit of documentation here.

Thank you. This is very helpful.

  1. This section makes sure that processes have an ext variable which reflects if they should use GPU or not. This can be useful if the same module supports usage both with and without GPU. Example: cellbender

This is quite clever. Nice!
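The ext variable pattern described above can be sketched as a module that branches on task.ext.use_gpu. This is a hypothetical module sketch, not the actual cellbender module code; the process name and command-line flag are illustrative:

```groovy
// Hypothetical sketch: a module that supports both CPU and GPU execution,
// branching on the ext.use_gpu value set in base.config.
process CELLBENDER_REMOVEBACKGROUND {
    label 'process_gpu'

    input:
    path h5

    script:
    // task.ext.use_gpu is resolved from the withLabel block in base.config
    def device_flag = task.ext.use_gpu ? '--cuda' : ''
    """
    cellbender remove-background --input ${h5} ${device_flag}
    """
}
```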

@nictru
Contributor

nictru commented Nov 8, 2024

Hey @GallVp, thanks for the input.

Some notes:

It might sound stupid, but I did not know there was a difference between pipeline parameters and configuration variables. I agree that we should only have one.

I am not fully aware of what you can and can't do with configuration variables, and I was not able to find any documentation about them at all. I only used the configuration variable in scdownstream. Not sure if one can use configuration variables for workflow if-clauses?

  • If no, then the pipeline parameter should be preferred, but it should still be set to true by the gpu profile and hidden from the parameter documentation
  • If yes, I think then the configuration variable is better, because it prevents users from using one without the other

At pipeline execution, the user must set the gpu profile and the use_gpu pipeline parameter to utilise the GPU-based tools.

I would prefer if setting the gpu profile would be sufficient. Keeping them separate will probably lead to a lot of users using one without the other, which almost certainly will lead to errors. I am not aware of any use cases where this would be an advantage, but I'm eager to hear.

Contributor

@nictru nictru left a comment


Not approving yet, because we should first decide what the implementation should look like and add all necessary parts

@GallVp
Member

GallVp commented Nov 9, 2024

Not sure if one can use configuration variables for workflow if clauses?

No, configuration variables are not available in the workflows. Maybe, there is a backdoor that I am not aware of.

How about we get rid of both the variable and the parameter? Instead, we modify the profile as,

gpu {
    process {
        withLabel: process_gpu {
            ext.use_gpu = true
        }
    }
    docker.runOptions = '-u $(id -u):$(id -g) --gpus all'
    apptainer.runOptions = '--nv'
    singularity.runOptions = '--nv'
}

The workflows can use workflow.profile.contains('gpu') in place of the use_gpu parameter.

The above solution, however, does not take care of the arm profile. So, we need to have a gpu_arm profile as well.
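The workflow.profile.contains('gpu') check suggested above could be used in a workflow branch like this. A hedged sketch with hypothetical process and channel names, not code from the PR:

```groovy
// Illustrative if-clause replacing the use_gpu parameter with a profile
// check; CELLBENDER_GPU/CELLBENDER_CPU are hypothetical process names.
workflow DENOISE {
    take:
    ch_input

    main:
    if (workflow.profile.contains('gpu')) {
        CELLBENDER_GPU(ch_input)
    } else {
        CELLBENDER_CPU(ch_input)
    }
}
```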

@nictru
Contributor

nictru commented Nov 13, 2024

Hey, I just tested the suggested approach and it seems to work as expected.
I think we could do it like this, but I would prefer having the withLabel block in the base.config - just to make sure that all the withLabel configs are collected in one place.

I did this using the following:

withLabel: process_gpu {
    ext.use_gpu = {workflow.profile.contains('gpu')}
}

So that we don't need a config variable. But this is more a cosmetic topic and not a hill-to-die-on.

@mashehu
Contributor Author

mashehu commented Nov 13, 2024

feel free to add the changes, @nictru

Co-authored-by: Nico Trummer <nictru32@gmail.com>
@GallVp
Member

GallVp commented Nov 13, 2024

I did this using the following:

withLabel: process_gpu {
    ext.use_gpu = { workflow.profile.contains('gpu') }
}

I agree. This is more in line with the existing infrastructure. Thank you!

Contributor

@nictru nictru left a comment


I'm happy now :)

@ewels ewels modified the milestones: 3.1, 3.2 Nov 21, 2024
@ewels
Member

ewels commented Nov 21, 2024

Needs opt-in functionality in the nf-core pipelines create. See also #3261

@mirpedrol mirpedrol removed this from the 3.2 milestone Jan 27, 2025
@mirpedrol mirpedrol added this to the 3.3.0 milestone Jan 27, 2025
@mirpedrol
Member

🧹 spring cleaning message 🌷

What is the status of this PR? It looks ready after resolving the conflicts. Are you planning to finish it?

@mashehu
Contributor Author

mashehu commented Mar 11, 2025

still need to opt-in template features first

@jfy133
Member

jfy133 commented Apr 8, 2025

This PR is in demand @mirpedrol @mashehu 🙏

https://nfcore.slack.com/archives/C043UU89KKQ/p1743749138969479

Note that @adamrtalbot is running through a few use cases of how it would be used in configs, if this helps in any way: https://nfcore.slack.com/archives/C043UU89KKQ/p1744102571110729?thread_ts=1744098920.100049&cid=C043UU89KKQ

    maxRetries = 2
}
withLabel: process_gpu {
    ext.use_gpu = {workflow.profile.contains('gpu')}
Member


Suggested change
ext.use_gpu = {workflow.profile.contains('gpu')}
ext.use_gpu = { workflow.profile.contains('gpu') }

Member

@JoseEspinosa JoseEspinosa Apr 8, 2025


Then to set the accelerator for the individual processes as here, you suggest using something such as:

process {
    withName: 'RUN_ALPHAFOLD2' {
        accelerator = workflow.profile.contains('gpu') ? 1 : 0
    }
}

Instead of using a parameter?

Contributor


Yes, that's the idea.

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
@mirpedrol
Member

This PR is in demand @mirpedrol @mashehu 🙏

In order to merge this, we need to add an opt-in functionality when creating the pipeline template.
Until then, pipelines that support GPUs can add these changes 👍 This will also serve as a POC and debugging before we add this to the template.

@jfy133
Member

jfy133 commented Apr 9, 2025

Out of curiosity, what exactly needs to be 'opt in' here?

Is there any harm having the label in the template so modules can start using it?

The label could also be optionally empty for now, but it would at least allow custom/institutional config-level settings

@mashehu
Contributor Author

mashehu commented Apr 10, 2025

we will also add gpu CI with that opt-in feature.

@mirpedrol
Member

What do you think about merging this now and adding it to all pipelines with the template update? But we agree on not adding GPU CI until we have the opt-in functionality. This profile will be included in that opt-in feature later, so it will be removed from all pipelines that don't select it.
Note that this will need a bit of planning and communication, otherwise some pipelines might need it and get it removed with the template update if they don't notice the new opt-in, so it's worth giving it a thought before merging this PR.

@jfy133
Member

jfy133 commented Apr 10, 2025

Could also do that. However I think the less 'breaky' method would be adding just the label as a valid label.

People can then already use that in their configs to 'replicate' the GPU profile in this PR themselves, until the GPU profile itself goes into the template. That way it doesn't get 'removed'.

i.e., the label is cheap, it can be in there without having to be opt-in, and it won't bother anyone

@mashehu
Contributor Author

mashehu commented Apr 10, 2025

sorry, I don't understand why having an empty label would help or what you mean with valid labels.

@jfy133
Member

jfy133 commented Apr 10, 2025

Because then modules can be validly labelled as having GPU support, and configs can actually do something with it

Then I can set in my institutional config

process {
    withLabel: process_gpu {
        queue = 'my_cluster_gpu_queue'
    }
}

Or whatever


@jfy133
Member

jfy133 commented Apr 11, 2025

See the examples here: nf-core/configs#876 (review)

@mirpedrol
Member

This would be at the modules guidelines level, no? We don't need the label in the template to be able to use it in the modules

@jfy133
Member

jfy133 commented Apr 11, 2025

This would be at the modules guidelines level, no? We don't need the label in the template to be able to use it in the modules

It can go there, but the documentation of what the valid labels are points to the pipeline template... (and I assume that's where your fixed list comes from when selecting this in the dropdown in modules create)

@mirpedrol
Member

I didn't think about the dropdown from modules create, that's true. But currently we only allow one label, so if we add this it would be a bigger change. I think it is better to allow people to add the label manually and not modify how modules create works.

@jfy133
Member

jfy133 commented Apr 11, 2025

I didn't think about the dropdown from modules create, that's true. But currently we only allow one label, so if we add this it would be a bigger change. I think it is better to allow people to add the label manually and not modify how modules create works.

Good point. But I still think it makes sense to have this in the pipeline template already, so we can refer to our 'standard', i.e. a single process_gpu rather than just gpu, gpu_low, or process_gpu_low, all of which we are now seeing crop up in configs 😅

@mashehu
Contributor Author

mashehu commented Apr 24, 2025

@jfy133 are you okay with merging this PR as it is now? The rest we discussed we can do later.

@jfy133
Member

jfy133 commented Apr 24, 2025

The bit in base.config is probably sufficient for now 👍

We may just want to consider whether it should be process_gpu_single, process_gpu_low, etc. for adjusting the number of GPUs, but I think we can assume those can be added later.
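Such sizing labels could look like the sketch below, by analogy with the existing process_single/process_low labels. This is speculative and not part of this PR; the label names and accelerator counts are assumptions:

```groovy
// Speculative GPU sizing labels (not in this PR); counts are illustrative.
process {
    withLabel: process_gpu_single { accelerator = 1 }
    withLabel: process_gpu_low    { accelerator = 2 }
}
```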

@mashehu mashehu merged commit 49f761d into nf-core:dev Apr 24, 2025
96 checks passed