You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Building off a use case identified in https://gitlab.labmed.uw.edu/molmicro/NGS16S/issues/155, in some instances involving amplicon sequencing, the chemistry may not be sufficient to guarantee that paired reads have overlap on which to merge. This scenario has been identified in some species of Campylobacter and Helicobacter, where insertions in the V1V2 region of 16S result in longer than expected amplicons and no overlap in 250 cycle paired end sequencing.
To accommodate this situation, it would be useful for the pipeline to output sequences and weights of all unmerged SVs (i.e. inferred sequences post denoising) which fit the 16S model for downstream classification and analysis. In conversation, we have identified unmerged forward SVs as the primary target for now (for simplifiying abundance assessments relative to merged reads), although unmerged reverse SVs may also be of interest and could be included for sake of completeness.
A separate issue will be raised to incorporate test data reflecting the scenario into this project.
The text was updated successfully, but these errors were encountered:
It seems like easier access to unmerged sequences would enable the less computationally facile members of the team (like me) to troubleshoot incidents like this ourselves https://gitlab.labmed.uw.edu/molmicro/NGS16S/-/issues/173.
Building off a use case identified in https://gitlab.labmed.uw.edu/molmicro/NGS16S/issues/155, in some instances involving amplicon sequencing, the chemistry may not be sufficient to guarantee that paired reads have overlap on which to merge. This scenario has been identified in some species of Campylobacter and Helicobacter, where insertions in the V1V2 region of 16S result in longer than expected amplicons and no overlap in 250 cycle paired end sequencing.
To accommodate this situation, it would be useful for the pipeline to output sequences and weights of all unmerged SVs (i.e. inferred sequences post denoising) which fit the 16S model for downstream classification and analysis. In conversation, we have identified unmerged forward SVs as the primary target for now (for simplifiying abundance assessments relative to merged reads), although unmerged reverse SVs may also be of interest and could be included for sake of completeness.
A separate issue will be raised to incorporate test data reflecting the scenario into this project.
The text was updated successfully, but these errors were encountered: