
Repeated files and reusing input and filtered data #312

@jamesaliba

Description

I have inputs IPed for several targets. Because I need to run the pipeline separately for every target, the same inputs are re-aligned and re-filtered each time, wasting time, computational power, and disk space. Space in general is also an issue: every run generates 300 GB of files. Could you implement an option that deletes some files after use? Because I cannot run all the .json configs one after the other automatically, I run out of space, so I have to run each JSON alone, delete hundreds of GBs, and then run the next one. As a result I am stuck babysitting the pipeline when I would rather set it and forget it.

TLDR:
1. Run all JSONs combined, so identical samples are processed only once.
2. Auto-delete redundant files as the pipeline goes (it also looks like the same files are being duplicated to serve as inputs for the many jobs, one copy per job).
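As an interim workaround for point 2, the run-delete-run cycle described above can be automated with a small wrapper. This is a minimal sketch, assuming the pipeline is launched once per JSON config and writes its intermediates to a single known directory; the `runner` callable and the `work_dir` layout are placeholders, not the pipeline's real interface.

```python
import shutil
from pathlib import Path

def run_all(configs, runner, work_dir):
    """Run `runner` on each JSON config in sequence, deleting
    `work_dir` (the pipeline's intermediate output) between runs
    so disk usage never accumulates across configs.

    `runner` is a placeholder for the real pipeline invocation,
    e.g. subprocess.run(["pipeline", str(cfg)], check=True).
    """
    for cfg in configs:
        runner(cfg)  # process one target's JSON config
        # Reclaim the intermediate files (the ~300 GB per run)
        # before starting the next config.
        shutil.rmtree(work_dir, ignore_errors=True)
```

This only automates the manual delete step between runs; it does not address point 1 (re-aligning identical input samples), which needs support inside the pipeline itself.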
