Okay, so I've finally got my bearings with TopHat (I think).
First, I was able to shoehorn my Helicos RNA-Seq data so that it can be used with TopHat (thank you for the help Cole). Then I was finally able to work around the issues that I was having with getting a good gff3 file (thanks to the forum members who helped).
Now I have an additional questions, is there a way to *only* report junctions that are found, but aren't in the supplied gff3 file? I generated a list of junctions with and without the --no-novel-juncs parameter so I know that I have about a thousand novel junctions by comparing the two output junction files.
It wouldn't be too hard to write a script to compare the outputs and only keep junctions that are novel, but I'm lazy and would prefer not to do if there is another way.
Thanks,
Sam
First, I was able to shoehorn my Helicos RNA-Seq data so that it can be used with TopHat (thank you for the help Cole). Then I was finally able to work around the issues that I was having with getting a good gff3 file (thanks to the forum members who helped).
Now I have an additional questions, is there a way to *only* report junctions that are found, but aren't in the supplied gff3 file? I generated a list of junctions with and without the --no-novel-juncs parameter so I know that I have about a thousand novel junctions by comparing the two output junction files.
It wouldn't be too hard to write a script to compare the outputs and only keep junctions that are novel, but I'm lazy and would prefer not to do if there is another way.
Thanks,
Sam
Comment