Seqanswers Leaderboard Ad

**kopi-o** · 03-27-2012, 08:47 AM

With 99.12% alignment rate, there is hardly any room for improvement! Is it a prokaryote?

In theory, you should use TopHat for RNA-seq because it considers splicing. Bowtie2 does not do gapped alignment in that sense (spliced alignment), although it allows for short gaps. Of course, for simpler organisms with no introns, there is not much point in using TopHat.

**amitm** · 03-27-2012, 09:27 AM

Originally posted by kopi-o View Post

With 99.12% alignment rate, there is hardly any room for improvement! Is it a prokaryote?

In theory, you should use TopHat for RNA-seq because it considers splicing. Bowtie2 does not do gapped alignment in that sense (spliced alignment), although it allows for short gaps. Of course, for simpler organisms with no introns, there is not much point in using TopHat.

hi,
na, its human cell line RNA. Yep, thats what I have been thinking but since I am interested in transcript isoform quantification, I would want to ensure the efficacy of the pipeline. I have also visualized the BAM file on IGV, looks fine.

But what I may be missing out for not using TopHat has been nagging me. I have put up aliignment using TopHat and would compare the two results. Would update if I find any changes in the BAM files.

thanks

**NateP** · 03-27-2012, 12:03 PM

I think the main difference between Tophat and Bowtie2 is this:

Say you have a read that spans two exons.

With Tophat, that read will be mapped two both exons in the mapping to splice junctions phase.

With Bowtie2 (--local setting i believe?), that read will be soft trimmed until it maps to only one of the two exons, which ever gives the higher mapping score.

Someone please correct me if I'm mistaken there.

**Jon_Keats** · 03-27-2012, 01:09 PM

Something is fishy. There is no way you should get that high of alignment with 100x100 human RNA sequencing using bowtie2 unless the library is messed up. The IGV plot you show is highly biased to the 3' exon and in the top sample the exonic regions are not easily distinguished from the introns.

**peromhc** · 03-27-2012, 05:26 PM

Originally posted by Jon_Keats View Post

Something is fishy. There is no way you should get that high of alignment with 100x100 human RNA sequencing using bowtie2 unless the library is messed up. The IGV plot you show is highly biased to the 3' exon and in the top sample the exonic regions are not easily distinguished from the introns.

Along these lines, >40% multiply mapped reads is likely one of the problems. Have you looked at read quality-- kmer frequency, etc?

**sdriscoll** · 12-05-2012, 04:35 PM

If you run bowtie2 in local mode it will absolutely align over 90% of your data.

As others have mentioned, Tophat was not made because bowtie could not do gapped alignments, it was made because there was no aligner that could align reads to the genome across splice junctions. Tophat does this which is separate from gapped alignments, which Tophat will now also report thanks to bowtie2.

If you do not use Tophat in your cufflinks pipeline cufflinks will be missing valuable information about how the aligned reads are joining exons (in fact joining transcripts) together. cufflinks was designed to make use of that information. The only way to get bowtie2 to generate those type of alignments would be to align to a transcriptome and then converte the alignments back to genomic coordinates (something that Tophat does as part of its alignment pipeline). Then you'd be missing out on novel alignment information, though.

If you want to get the best results out of your pipeline it's not 100% alignment you should be going for but for alignments to the genome that include spliced alignments. Those alignmets are the most powerful thing for assembling transcripts and for providing evidence of new exons and alternative splicing. For example you might see coverage that looks like a new exon from bowtie2 but only with Tophat would you also be able to see if reads aligning to that new exon also have junctions with annotated exons from a nearby gene.

**carmeyeii** · 12-14-2012, 12:30 PM

So you can supply TopHat with a GTF file of annotated transcripts, which, using the --GTF option, will be the first place where reads are mapped, followed by the whole genome, with or without novel junction discovery in this second stage. As I understand it, this is after TopHat 1.4.
I'm curious to know how t was before 1.4. I think you could already give TopHat a GTF file, but it used it second. Am I right? If so, what is the difference between using it [the GTF file] first and using it second after the genome?

Carmen

**sdriscoll** · 12-14-2012, 12:50 PM

I don't think it ever did a transcriptome alignment stage back then. I was never entirely sure what including the GTF was doing back then because of that. I think they looked at it as a guide to help resolve messy/unclear junction conditions.

**carmeyeii** · 12-14-2012, 12:53 PM

Hmmmm.... So I'm guessing it used it to validate the potential junctions it had found in its initial mapping to the genome?

But then it would never find new stuff :/

Or maybe it looked for junctions close enough to what it had found to correct those to "perfection"... ?

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

bowtie2 vs. TopHat

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News