
Support for Read Lengths Not in {50, 100, 150, 200} #408

@MikeKazz

Description of the bug

Hello,

I am using the nf-core/atacseq pipeline and need to specify the --read_length parameter for my FASTQ files, which have a read length of 76 bp. However, the pipeline currently only allows --read_length values of 50, 100, 150, or 200, making it unclear how to proceed with read lengths that fall outside these values.

According to the documentation, the --read_length parameter is used to define the --sjdbOverhang parameter in STAR as (read_length - 1), which is critical for accurate alignment. For my case, the correct sjdbOverhang should be 75, but there is no direct way to specify this in the pipeline.

Proposed Solutions

  1. Allow Users to Directly Specify sjdbOverhang
     • Introduce a new optional parameter (e.g., --sjdbOverhang) that lets users override the value instead of relying on --read_length.
  2. Expand --read_length Options
     • Instead of restricting --read_length to specific values, allow users to input any valid read length.
  3. Auto-Detect Read Length from Input FASTQs
     • The pipeline could infer the read length from the input FASTQ files and set sjdbOverhang accordingly.
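
To illustrate the third option, here is a rough sketch of how read length could be inferred from a FASTQ file and turned into an sjdbOverhang value. It is not pipeline code: the function names `infer_max_read_length` and `sjdb_overhang` and the `max_reads` sampling limit are hypothetical, and it assumes standard four-line FASTQ records (no wrapped sequence lines). It takes the longest sampled read, since the STAR manual suggests max(ReadLength) - 1 when read lengths vary.

```python
import gzip
from pathlib import Path


def infer_max_read_length(fastq_path, max_reads=10000):
    """Return the longest read length among the first `max_reads` records.

    Assumes four-line FASTQ records; plain and gzip-compressed files are supported.
    """
    path = Path(fastq_path)
    opener = gzip.open if path.suffix == ".gz" else open
    longest = 0
    with opener(path, "rt") as handle:
        for line_number, line in enumerate(handle):
            if line_number // 4 >= max_reads:
                break
            # The sequence is the second line of each four-line record.
            if line_number % 4 == 1:
                longest = max(longest, len(line.strip()))
    if longest == 0:
        raise ValueError(f"no reads found in {fastq_path}")
    return longest


def sjdb_overhang(read_length):
    """Compute sjdbOverhang as (read_length - 1), as described in this issue."""
    if read_length < 2:
        raise ValueError("read length must be at least 2")
    return read_length - 1
```

For the 76 bp reads described in this issue, `sjdb_overhang(76)` gives 75. Sampling only the first records keeps the check cheap, at the cost of possibly missing a longer read later in the file.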

Would it be possible to implement one of these solutions in a future release? Let me know if there's any workaround in the meantime.

Thanks for your help!

Command used and terminal output

Relevant files

No response

System information

No response

Labels: enhancement
