8000 Canu 'cormhap' is at 100 terabytes and going · Issue #2380 · marbl/canu · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Canu 'cormhap' is at 100 terabytes and going #2380
Closed
@melian98

Description

@melian98

Hello everyone, I am using canu to assemble reads that were sequenced by pacbio. I am running on an hpc cluster and my command is as follows
canu -p <project_name> genomeSize=900m -pacbio-raw corMhapFilterThreshold=0.0000000002 corMhapOptions="--threshold 0.80 --num-hashes 512 --num-min-matches 3 --ordered-sketch-size 1000 --ordered-kmer-size 14 --min-olap-length 2000 --repeat-idf-scale 50" mhapMemory=60g mhapBlockSize=500 ovlMerDistinct=0.975 gridEngineArrayMaxJobs=99 <sample1>.fastq.gz <sample2>.fastq.gz <sample3>.fastq.gz

Most of the settings you see above were obtained from the faq in the canu website (https://canu.readthedocs.io/en/latest/faq.html#my-assembly-is-running-out-of-space-is-too-slow) but even after implementing all these steps, my assembly has been running for days and is at 100 terabytes and counting. I was wondering if there is any other way to speed the process up and reduce storage space. Thank you all for your help

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0