Skip to content

biosustain/dsp_nf-assembly-to-gene-calling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 

Repository files navigation

dsp_nf-assembly-to-gene-calling

This nextflow pipeline can take sequencing data from Oxford Nanopore Technologies (fastq or gzipped fastq) and conduct required steps in order to attain gene prediction as a gff3 file.

The pipeline relies on the following software that have been integrated in this nextflow pipeline:

currently only takes the following input parameters:

  • input: *.fastq.gz file
  • threads: integer - how many threads to parallelize processes that support it
  • outdir - to be used as publishDir
  • species: species to be used by Augustus (lower case, underscore instead of space)
  • genome_size - for Flye
  • assembly_coverage - for Flye

example to initiate run:

nextflow run https://github.com/biosustain/dsp_nf-assembly-to-gene-calling --input my_data.fq.gz \
--outdir results --species aspergillus_oryzae --genome_size 37m --assembly_coverage 30 --threads 16 -resume

Note: we ran a ~3gb fastq.gz file on an the Azure virtual machine: Standard E16as v5 (16 vcpus, 128 GiB memory), with 16 threads the total run time was just below 25 hours.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published