You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Flagged by @rneherslack message, the Dengue virus DENVx genotypes dataset could be further improved in its clade assignments. For example for DENV1:
DENV2 samples that align are correctly placed onto the outgroup node and marked as unassigned. (good!)
However, DENV1 samples that don't belong to an annotated genotype are also marked as unassigned, which is arguably incorrect. (This could be improved!) An example shown below:
Description
These samples should be assigned to the DENV1 serotype without a specific genotype, rather than being marked as unassigned. To illustrate this group of samples visually, we aim to reduce the samples in the magenta region of the table:
Possible solution
To ensure accurate serotype assignment while allowing for true-negative genotype assignments. I'm currrently planning the following steps:
In the dengue/all tree, identify the amino acid mutations from the dengue/all reconstructed root to the reconstructed root of each serotype.
In each dengue/denv* tree, locate the amino acid mutations from the serotype reconstructed root to the outgroup dengue/all reconstructed root, and correct the coordinates accordingly.
Add the corrected coordinates of the amino acid mutations to each of the clades_genotype_denv*.tsv files, using the serotype name (e.g., DENV1) as the identifier.
After implementing these changes:
All DENV1 samples should be assigned to the DENV1 serotype, even if they don't belong to a specific genotype.
Samples from other serotypes (e.g., DENV2) should still be correctly marked as unassigned.
Of course, open to other suggestions or guidance here.
The text was updated successfully, but these errors were encountered:
Context
Flagged by @rneher slack message, the Dengue virus DENVx genotypes dataset could be further improved in its clade assignments. For example for DENV1:
Description
These samples should be assigned to the DENV1 serotype without a specific genotype, rather than being marked as unassigned. To illustrate this group of samples visually, we aim to reduce the samples in the magenta region of the table:
Possible solution
To ensure accurate serotype assignment while allowing for true-negative genotype assignments. I'm currrently planning the following steps:
dengue/all
tree, identify the amino acid mutations from thedengue/all
reconstructed root to the reconstructed root of each serotype.dengue/denv*
tree, locate the amino acid mutations from the serotype reconstructed root to the outgroupdengue/all
reconstructed root, and correct the coordinates accordingly.clades_genotype_denv*.tsv
files, using the serotype name (e.g.,DENV1
) as the identifier.After implementing these changes:
Of course, open to other suggestions or guidance here.
The text was updated successfully, but these errors were encountered: