Advanced usage

Use in HPC clusters

The supported installation of Dadaist2 is via Bioconda:

  • The package can be natively installed via conda in the user's home
  • A singularity (or Docker) image can be generated as suggested in the installation page.

Dadaist2 as a module provider

Dadaist2 has been released as a set of wrappers to allow implementing some of them in existing pipelines.

A minimal example is provided in the nextflow/simple.nf Nextflow script (link), where we delegate to Nextflow the parallelisation of the input reads trimming with cutadapt.

Example

Running a workflow using Dadaist tools:

nextflow run simple.nf  -with-singularity dadaist2.simg \
  --reads "data/16S/*_R{1,2}_001.fastq.gz" --ref "refs/SILVA_SSU_r138_2019.RData"  

The minimal workflow runs parallelising adaptor trimming and collecting the results for Dadaist main module:

N E X T F L O W  ~  version 19.10.0
Launching `simple.nf` [angry_celsius] - revision: 053df3283c
Example pipeline
 =======================================
 taxonomy db  : refs/SILVA_SSU_r138_2019.RData
 reads        : data/16S/*_R{1,2}_001.fastq.gz
 outdir       : dadaist
executor >  slurm (4)
[81/39e622] process > cutadapt (F99_S0_L001) [100%] 3 of 3, cached: 3 ✔
[19/e589db] process > dada (1)               [100%] 1 of 1, cached: 1 ✔
[5a/893a7d] process > phyloseq (1)           [100%] 1 of 1 ✔
[ae/bd2ecb] process > normalize-alpha (1)    [100%] 1 of 1 ✔

Note that -with-singularity makes use of our container that provides:

  • All Dadaist wrappers
  • R with commonly used libraries such as DADA2, PhyloSeq, Microbiome, vegan…
  • Tools like VSEARCH, cutadapt, fastp, seqfu, multiqc…