Hi! I have some reads from sequencing a viral genome. We are analyzing readings with two matches, and we calculate the existing distance between both matches, for which we carry out histograms. I would like to know how can I distinguish if the regions that do not match (cuts) are due to chance or not?
It is about Avibirnavirus, it has its own polymerase called VP1. It seems that the areas that are not replicated are due to the fact that they are attached to another viral protein called VP3 , which has a protective action on the genome. VP3 requires a minimum size of 9bp to join.
Initially we used blast on the reads from the sequencing, and we are currently working with the reads that were discarded after using blast. From these discarded readings we are only using those that had two matches. We have calculated the distance between both matches and made histograms of these in which it is observed a density function with linear probability with a maximum in 0 and a minimum in the length of the genome.
However, as the multiplicity of infection of this virus increases, the distribution becomes irregular. For this reason, we think that it is possible that the reorganization of the genome after replication is carried out randomly.
It is about Avibirnavirus, it has its own polymerase called VP1. It seems that the areas that are not replicated are due to the fact that they are attached to another viral protein called VP3 , which has a protective action on the genome. VP3 requires a minimum size of 9bp to join.
Initially we used blast on the reads from the sequencing, and we are currently working with the reads that were discarded after using blast. From these discarded readings we are only using those that had two matches. We have calculated the distance between both matches and made histograms of these in which it is observed a density function with linear probability with a maximum in 0 and a minimum in the length of the genome.
However, as the multiplicity of infection of this virus increases, the distribution becomes irregular. For this reason, we think that it is possible that the reorganization of the genome after replication is carried out randomly.
Comment