Releases: assemblerflow/flowcraft
Releases · assemblerflow/flowcraft
Spring cleaning
New features
- Added support for the report system to:
maxbin2
- Added new
manifest.config
with the pipeline metadata
New components
Kraken2
: Taxonomic identification on FastQ files
Bug fixes
- Fix bug in
momps
component related to added in the introduction of the clear input parameter - Fixed bug with the
-ft
parameters not retrieving the dockerhub tags for
all the components. - Fixed bug in the
megahit
process where the fastg mode would break the process - Fix inspect and report mode to fetch the nextflow file independently of its
position in thenextflow run
command inside the .nextflow.log file. - Fix parsing of .nextflow.log file when searching for
nextflow run
command. - Fixed bug between mash_sketch_fasta and mash_dist.
Minor/Other changes
- Added option to
dengue_typing
to retrieve closest reference sequence and link it
with a secondary channel intomafft
- New version of DEN-IM recipe
- Now prints an ordered list of components
- Moved taxonomy results from
results/annotation/
to `results/taxonomy/
Now prints an ordered list of components
Moved taxonomy results from results/annotation/ to results/taxonomy/
Grandma's recipes
New features
- Added new
recipe
system to flowcraft along with 6 starting recipes.
Recipes are pre-made and curated pipelines that address specific questions.
To create a recipe, the-r <recipe_name>
can be used. To list available
recipes, the--recipe-list
and--recipe-list-short
options were added. - Added
-ft
or--fetch-tags
which allows to retrieve all DockerHub
container tags. - Added function to collect all the components from the components classes,
replacing the current process_map dictionary implementation. Now, it will be
generated from the engine rather than hardcoded into the dict.
Components changes
- Added new
disableRR
param in thespades
component that disables repeat
resolution - The
abyss
andspades
components emit GFA in a secondary channel. - The new
bandage
component can accept either FASTA from a primary channel
or GFA from a secondary channel. - Updated skesa to version 2.3.0.
- Updated mash based components for the latest version - 1.6.0-1.
New components
- Added component
abyss
. - Added component
bandage
. - Added component
unicycler
. - Added component
prokka
. - Added component
bcalm
. - Added component
diamond
.
Minor/Other changes
- Added removal of duplicate IDs from
reads_download
component input. - Added seed parameter to
downsample_fastq
component. - Added bacmet database to
abricate
component. - Added default docker option to avoid docker permission errors.
- Changed the default URL generated by inspect and report commands.
- Added directives to
-L
parameter of build module.
Bug fixes
- Fixed forks with same source process name.
- Fixed
inspect
issue when tasks took more than a day in duration. - Added hardware address to
inpsect
andreport
hash.
Clear!
Features
- Added a new
clearInput
parameter to components that change their input.
The aim of this option is to allow the controlled removal of temporary files,
which is particularly useful in very large workflows.
Components changes
- Updated images for components
mash_dist
,mash_screen
and
mapping_patlas
.
New components
- Added component
fast_ani
.
Minor/Other changes
- Added
--export-directives
option tobuild
mode to export component's
directives in JSON format to standard output. - Added more date information in
inspect
mode, including the year and the
locale of the executing system.
Report!!
Introducing the new Report system that is hosted by the Flowcraft web application. With the new report
mode, it is now possible to visualize the reports of a flowcraft pipeline in the browser, either at the end of the pipeline run or in real-time while the pipeline is in execution.
Features
- Added
report
run mode to Flowcraft that displays the report of any given
pipeline in the Flowcraft's web application. Thereport
mode can be executed
after a pipeline ended or during the pipeline execution using the--watch
option. - Added standalone report HTML at the end of the pipeline execution.
- Components with support for the new report system:
abricate
assembly_mapping
check_coverage
chewbbaca
dengue_typing
fastqc
fastqc_trimmomatic
integrity_coverage
mlst
patho_typing
pilon
process_mapping
process_newick
process_skesa
process_spades
process_viral_assembly
seq_typing
trimmomatic
true_coverage
Minor/Other changes
- Refactored report json for components
mash_dist
,mash_screen
and
mapping_patlas
Bug fixes
- Fixed issue where
seq_typing
andpatho_typing
processes were not feeding
report data to report compiler. - Fixed fail messages for
process_assembly
andprocess_viral_assembly
components
Indriya
Components changes
mapping_patlas
: refactored to remove temporary files used to create
sam and bam files and added data to .report.json. Updated databases to pATLAS
version 1.5.2.mash_screen
andmash_dist
: added data to .report.json. Updated databases
to pATLAS version 1.5.2.
New components
- Added component
fasterq_dump
- Added component
mash_sketch_fasta
- Added component
mash_sketch_fastq
- Added component
downsample_fastq
for FastQ read sub sampling using seqtk - Added component
momps
for typing of Legionella pneumophila - Added component
split_assembly
- Added component
mafft
- Added component
raxml
- Added component
viral_assembly
- Added component
progressive_mauve
- Added component
dengue_typing
Minor/Other changes
- Added check for
params.accessions
that enables to report a proper
error when it is set tonull
. - Added
build
option to export component parameters information in JSON format. - Fixed minor issue preventing the
maxbin2
andsplit_assembly
components
from being used multiples times in a pipeline - Added a catch to the
filter_poly
process for cases where the input file is empty. - spades template now reports the exit code of spades' execution
Bug fixes
- Removed the need for the nf process templates to have an empty line
at the beginning of the template files. - Fixed issue when the
inspect
mode was executed on a pipeline directory
with failed processes but with the work directory removed (the log files
where no longer available). - Fixed issue when the
inspect
mode was executed on a pipeline without the
memory directory defined. - Fixed issue in the
inspect
mode, where there is a rare race condition between
tags in the log and trace files. - Fixed bug on
midas_species
process where the output file was not being
linked correctly, causing the process to fail - Fixed bug on
bowtie
where the reference parameter was missing the pid - Fixed bug on
filter_poly
where the tag was missing
Param Param!
Improvements
- The parameter system has been revamped, and parameters are now component-specific
and independent by default. This allows a better fine-tuning of the parameters
and also the execution of the same component multiple times (for instance in a fork)
with different parameters. The old parameter system that merged identical parameters
is still available by using the--merge-params
flag when building the pipeline. - Added a global
--clearAtCheckpoint
parameter that, when set to true, will remove
temporary files that are no longer necessary for downstream steps of the pipeline
from the work directory. This option is currently supported for thetrimmomatic
,
fastqc_trimmomatic
,skesa
andspades
components.
New components
maxbin2
: An automatic tool for binning metagenomic sequences.bowtie2
: Align short paired-end sequencing reads to long reference
sequences.retrieve_mapped
: Retrieves the mapped reads of a previous bowtie2 mapping process.
New recipes
plasmids
: A recipe to perform mapping, mash screen on reads
and also mash dist for assembly based approaches (all to detect
plasmids). This also includes annotation with abricate for the assembly.plasmids_mapping
: A recipe to perform mapping for plasmids.plasmids_mash
: A recipe to perform mash screen for plasmids.plasmids_assembly
: A recipe to perform mash dist for plasmid
assemblies.
Minor/Other changes
- Added "smart" check when the user provides a typo in pipeline string
for a given process, outputting some "educated" guesses to the
terminal.http://192.92.149.169:80/inspect/5b585070ed1cb0dbb9a696c724b7f9ed7844e1afe5188ef3aa44c7c3cc5b539d - Added "-cr" option to show current recipe
pipeline_string
. - Changed the way recipes were being parsed by
proc_collector
for the
usage of-l
and-L
options. - Added check for non-ascii characters in colored_print.
- Fixed log when a file with the pipeline is provided to -t option
instead of a string.
Bug fixes
- Fixed pipeline names that contain new line characters.
- Fixed pipeline generation when automatic dependencies were added right after a fork
- Template: sistr.nf: Fixed comparison that determined process status.
- Fixed issue with
--version
option.
Inspector gadget
New components
card_rgi
: Anti-microbial gene screening for assembliesfilter_poly
: Runs PrinSeq on paired-end FastQ files to remove low complexity sequenceskraken
: Taxonomical identification of FastQ filesmegahit
: Metagenomic assembler for paired-end FastQ filesmetaprob
: Preforms read binning on metagenomic FastQ filesmetamlst
: Checks the Sequence Type of metagenomic FastQ reads using Multilocus Sequence Typingmetaspades
: Metagenomic assembler for paired-end FastQ filesmidas_species
: Taxonomical identification of FastQ files on the species levelremove host
: Read mapping with Bowtie2 against the host genome (default hg19) and removes the mapping readssistr
: Salmonella in silico typing component for assemblies.
Features
- Added
inspect
run mode to flowcraft for displaying the progress overview
during a nextflow run. This run mode has anoverview
andbroadcast
options
for viewing the progress of a pipeline.
Minor/Other changes
- Changed
mapping_patlas
docker container tag and variable
(PR #76). - The
env
scope of nextflow.config now extends thePYTHONPATH
environmental variable. - Updated indexes for both
mapping_patlas
andmash
based processes. - New logo!
Bug Fixes
- Template: fastqc_report.py: Added fix to trim range evaluation.
- Script: merge_json.py: Fixed chewbbaca JSON merge function.
Press Start!
First official release of assemblerflow.