Thomas Sandmann’s blog - QuantSeq RNAseq analysis (1): configuring the nf-core/rnaseq workflow #4
Replies: 2 comments 2 replies
-
Hi Thomas, first of all all thank you for the work that you did trying to establish a nice and straightforward workflow to perform QuantSeq 3' RNA-sequencing analysis using the nf-core pipeline. I am also working a lot on this and only recently (last week or so) I came across this page via the nf-core/rnaseq Slack channel. I already played around different parameters to customize the pipeline for QuantSeq 3' data, in particular to implement polyA trimming. I am working with RNA material coming from FFPE tumor tissue, which is highly degraded, and a great fraction of my reads comes with polyA tail + adapter. The main issue I faced was the removal of both adapter and polyA using Trimgalore. Indeed, while with Cutadapt I was able to implement it (specifying two adapter's sequences and the argument -n 2), within the nf-core pipeline Cutadapt is not customizable but only Trimaglore is. Therefore I decided to perform quality and adapter trimming on raw reads using Cutadapt through an external shell script and provide the adapter trimmed reads in input to the nf-core pipeline to undergo poly A removal with Trimgalore. Coming across your blog, it has come to my attention that it is possible to perform polyA removal also with STAR, which would save me from using the external shell script. I therefore performed a trial where I compared my configuration (i.e. external script for adapter and quality trimming and polyA removal with Trimgalore) with the one mentioned here (i.e. adapter and quality trimming with Trimgalore and polyA removal with STAR) but results are not comparable. In particular, in terms of % of aligned reads, which is the parameter that I am monitoring, with the second configuration I obtain a lower % of aligned reads (e.g. 65% vs 80% obtained with the first configuration). From my side the main issue is that I am not sure of how STAR performs polyA removal and how the number of "A"s specified through --clip3pAdapterSeq influences the trimming. It would be great if we could confront each other to find the final configuration that best suits QuantSeq 3' data! Thank you! Miriam |
Beta Was this translation helpful? Give feedback.
-
I tried to ask Alex Dobin (alexdobin/STAR#1774) to have more info on the behavior of the parameter --clip3pAdapterSeq. In particular, it would be nice to know whether the number of nucleotides trimmed is exactly as the number of As specified or not (e.g. if we have a read tail with 18 As and we only specify 10 in --clip3pAdapterSeq, does it mean that 8 will remain there? If so, this can explain the lower alignment rate). Unfortunately on the STAR manual, as well as on the web, I was not able to find anything that answers my question. |
Beta Was this translation helpful? Give feedback.
-
Thomas Sandmann’s blog - QuantSeq RNAseq analysis (1): configuring the nf-core/rnaseq workflow
https://tomsing1.github.io/blog/posts/nextflow-core-quantseq-1-settings/
Beta Was this translation helpful? Give feedback.
All reactions