Alignment and Relative Abundance Question #125
-
I want to determine the relative DNA abundance for a set of 217 genomes across the 115 experiments (sequenced with Illumina or PacBio). I first ran MiniMap2 on each of sequencing files separately based on technology (Illumina or PacBio) to optimize the MiniMap2 settings. Example commands used were:
After converting to BAM and sorting, I ran CoverM on each BAM file using all the 217 FASTA files in the genome list. I used the following command:
I compiled the data and wanted to make sure I was interpreting it correctly. I have a few questions I'm hoping you can help me with.
Thanks in advance for your help (and thanks for all your previous help)! I really appreciate it! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi,
Thanks again for kind words. |
Beta Was this translation helpful? Give feedback.
Hi,
I suspect this might have to do with the fact that reads mapped to genomes which ultimately have <10% covered_fraction are discarded. Does that make sense?
Yes that is right, no additional calcs required. However, CoverM makes the assumption though that the fraction of reads that are unmapped is the fraction of the community not in the reference set. If recovered genomes are smaller than the ones that are not recovered for instance (or if there is euk host contamination), then it will be off.
Thanks again for kind words.