A pipeline specifically to process small sequences generated from mutation accumulation experiments. Credit to Nathaniel Sharp and Jacob Roman-Fredette for the general snakemake rule layout and read group shell script.
This pipeline currently does not support multiple-lane sequencing runs for now.
fastqc, BBMap, Trimmomatic, fastq_screen, bwa-mem2, samtools, picard, gatk4
- Install the dependencies using conda.
- In your working directory, create a
pipelines
directory and put the pipeline config, readgroup shell script, your fastq_screen config, and the pipeline in there. - Create a directory called
sequences
and move your raw sequencing outputs into a new directory underneath namedreads
. - Create another directory under
sequences
namedadapter
, and move the adapters that you used under there. - Next, download your species' reference genome under a new directory titled
alignment_reference
. - If you want to screen against suspected contaminants, download reference genomes into a directory named
screens
. - Run the pipeline on -np mode to check if the expected files will be generated.
- Run the pipeline with the following command: snakemake -s pipeline.smk --cores .