Small sample sizes give imprecise results, but as the sample size approaches 100,000, the cross-validation results come very close to those reported so far in this notebook. This makes sense, since the signal is encoded mostly in the relative positions of the nucleotides rather than in their exact sequence.
In total, 81 of the 109 sequences (about 74%) show signal for the combination of the three-way junction module and the T-loop under cross-validation.
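
The sample-size effect noted above can be illustrated with a generic Monte Carlo sketch. This is not the sampling procedure used in this notebook: the function `estimate_spread` and the probability `true_p = 0.3` are hypothetical and chosen only for illustration. It simply shows that repeated estimates of a fixed probability scatter less as the number of samples grows, roughly in proportion to 1/sqrt(n).

```python
import random
import statistics

def estimate_spread(true_p, n_samples, n_repeats=10, seed=0):
    """Standard deviation of repeated Monte Carlo estimates of true_p.

    Hypothetical helper for illustration only; true_p is an arbitrary
    probability, not a value taken from the notebook's data.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_repeats):
        # Count how many of n_samples random draws fall below true_p.
        hits = sum(rng.random() < true_p for _ in range(n_samples))
        estimates.append(hits / n_samples)
    return statistics.stdev(estimates)

# Compare the scatter of the estimate across increasing sample sizes.
spreads = {n: estimate_spread(0.3, n) for n in (100, 1_000, 10_000, 100_000)}
for n, s in spreads.items():
    print(f"n={n:>7}: spread of estimates = {s:.4f}")
```

With only 100 samples the estimates can differ by several percentage points between runs, whereas at 100,000 samples they agree to within a fraction of a percentage point, which is consistent with the stabilisation described above.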