why aren't they using mutation pictures
typically you get a set of m organisms with similar RNA or DNA of length n1
and you want to determine the relationship, the mutations.
The p1 positions where all (or all but c) are the same are not interesting
and can be omitted. Let n=n1-p1.
Then you choose one reference sequence, the common ancestor, typically
the average of the sequences.
Then you make the binary m*n mutation-matrix whose rows are the
sequences,organisms , whose columns are the positions and there
is a 1 ar position (i,j) iff organism i is different from
the reference at position j.
Then you draw the mutation picture where 1s are black pixels and 0 are white
pixels like this:
Then the rows and columns can be re-ordered to show the best grouping
of the pixels into connected areas. Related organisms and mutations are
placed next to each other.
This is straightforward, isn't it ? How else could it be done.How to visualize
the evolution of that set.
Yet I never saw it.I don't even know how others do call this "mutation picture"
Instead people are doing "phylo"-trees to assign the organisms to groups.
with lines and sequence names in it. But this takes more space, is not
rectangular and doesn't so well visualize the distances between the groups.
What's the reason that we don't see the mutation pictures in papers
or webpages ? Is it the way how our science is organized with grants,papers
and peer-reviews and there is just no money to be made from it ?
I don't understand.
(I'm mainly doing influenza sequences)
-------------------------------------------------
how to name it, how to find it , have you better suggestions ?
what are suitable keywords to find it
in fact a google image search for "mutation picture" matrix positions sequences
gave this as top hit, my own previous post
typically you get a set of m organisms with similar RNA or DNA of length n1
and you want to determine the relationship, the mutations.
The p1 positions where all (or all but c) are the same are not interesting
and can be omitted. Let n=n1-p1.
Then you choose one reference sequence, the common ancestor, typically
the average of the sequences.
Then you make the binary m*n mutation-matrix whose rows are the
sequences,organisms , whose columns are the positions and there
is a 1 ar position (i,j) iff organism i is different from
the reference at position j.
Then you draw the mutation picture where 1s are black pixels and 0 are white
pixels like this:
Then the rows and columns can be re-ordered to show the best grouping
of the pixels into connected areas. Related organisms and mutations are
placed next to each other.
This is straightforward, isn't it ? How else could it be done.How to visualize
the evolution of that set.
Yet I never saw it.I don't even know how others do call this "mutation picture"
Instead people are doing "phylo"-trees to assign the organisms to groups.
with lines and sequence names in it. But this takes more space, is not
rectangular and doesn't so well visualize the distances between the groups.
What's the reason that we don't see the mutation pictures in papers
or webpages ? Is it the way how our science is organized with grants,papers
and peer-reviews and there is just no money to be made from it ?
I don't understand.
(I'm mainly doing influenza sequences)
-------------------------------------------------
how to name it, how to find it , have you better suggestions ?
what are suitable keywords to find it
in fact a google image search for "mutation picture" matrix positions sequences
gave this as top hit, my own previous post
Comment