predict a single class for all bases in a non-repeat subsequence #53

williamstark01 · 2022-08-24T15:32:52Z

I notice that the model does a good job of predicting a repeat but struggles to replicate the surrounding sequence. Here is the beginning of the subsequences from a sample prediction:

AGAACCTATTATTTGCATGA🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑TAGAAGAAACCTGTATTTTTTTCATCA
CGAAATTTATTATTTATATA🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑TAAAAAAAATTTATATTTTTTTTATTA

I realize that we don't need this functionality from the model, as we only need it to indicate the absence of a repeat in these subsequences. Would it make sense, then, to predict a single additional class for all bases in non-repeat subsequences, so that the model's prediction and output look like this?

AGAACCTATTATTTGCATGA🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑TAGAAGAAACCTGTATTTTTTTCATCA
____________________🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑🥑___________________________

(Or any other character to represent the absence of a repeat.)
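To make the idea concrete, here is a minimal sketch of how the target strings could be relabeled before training. It assumes targets are plain strings in which 🥑 marks repeat positions, as in the example above; the function name `collapse_non_repeat` and the `_` placeholder are hypothetical and not taken from the existing code.

```python
REPEAT = "🥑"     # repeat marker, as used in the example above
NON_REPEAT = "_"  # hypothetical single class for every non-repeat base


def collapse_non_repeat(annotated: str, repeat: str = REPEAT,
                        non_repeat: str = NON_REPEAT) -> str:
    """Replace every character that is not the repeat marker with one class."""
    return "".join(repeat if ch == repeat else non_repeat for ch in annotated)


# Example target built in the same shape as the sample above.
target = "AGAACCTATTATTTGCATGA" + REPEAT * 13 + "TAGAAGAAACCTGTATTTTTTTCATCA"
collapsed = collapse_non_repeat(target)
print(collapsed)
```

The collapsed string keeps the same length as the original, so repeat positions still line up with the input sequence, and the model would no longer be penalized for failing to reproduce individual bases.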

Would that be easy to test?
