**Brian Bushnell** · 02-14-2016, 04:29 PM

TopHat is not very tolerant of errors in data. I'd recommend avoiding the whole Tuxedo pipeline where possible; I've always found it to be slow and unstable. DESeq and edgeR seem to give more accurate results, anyway.

**tonup69** · 02-14-2016, 04:49 PM

Brian - I am just talking about mapping right now. Even if I switched to DESeq or edgeR (which I could do on our local install), I would still need a gapped aligner to map the reads first. Do you use RNA-STAR?

**tonup69** · 02-15-2016, 09:52 AM

Here are the actual numbers on the same fastqsanger file (trimmed to 200bp QS>20).

Tophat2:
Reads:
Input: 39248980
Mapped: 20971700 (53.4% of input)
of these: 3246961 (15.5%) have multiple alignments (5659 have >20)
53.4% overall read alignment rate.

RNA-STAR:
Started job on | Feb 15 12:26:35
Started mapping on | Feb 15 12:26:38
Finished on | Feb 15 12:35:13
Mapping speed, Million of reads per hour | 274.36

Number of input reads | 39248980
Average input read length | 166
UNIQUE READS:
Uniquely mapped reads number | 32172637
Uniquely mapped reads % | 81.97%
Average mapped length | 164.84
Number of splices: Total | 7873494
Number of splices: Annotated (sjdb) | 0
Number of splices: GT/AG | 7824667
Number of splices: GC/AG | 46067
Number of splices: AT/AC | 2760
Number of splices: Non-canonical | 0
Mismatch rate per base, % | 0.47%
Deletion rate per base | 0.14%
Deletion average length | 1.17
Insertion rate per base | 0.24%
Insertion average length | 1.15
MULTI-MAPPING READS:
Number of reads mapped to multiple loci | 5162521
% of reads mapped to multiple loci | 13.15%
Number of reads mapped to too many loci | 59135
% of reads mapped to too many loci | 0.15%
UNMAPPED READS:
% of reads unmapped: too many mismatches | 0.00%
% of reads unmapped: too short | 4.42%
% of reads unmapped: other | 0.30%
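
As a quick sanity check on the two summaries above, the overall mapped fractions can be recomputed from the raw counts. This is only a minimal sketch: it assumes that "mapped" for STAR means uniquely mapped plus multi-mapped reads (excluding reads mapped to too many loci), and the variable and function names are made up for illustration.

```python
# Counts copied from the TopHat2 and RNA-STAR summaries in this post.
INPUT_READS = 39_248_980
TOPHAT_MAPPED = 20_971_700
STAR_UNIQUE = 32_172_637
STAR_MULTI = 5_162_521


def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


# Assumption: STAR "mapped" = unique + multi-mapped reads.
tophat_rate = percent(TOPHAT_MAPPED, INPUT_READS)
star_rate = percent(STAR_UNIQUE + STAR_MULTI, INPUT_READS)
fold = star_rate / tophat_rate

print(f"TopHat2 mapped:  {tophat_rate:.2f}%")
print(f"RNA-STAR mapped: {star_rate:.2f}%")
print(f"Fold difference: {fold:.2f}x")
```

Under that definition, RNA-STAR maps about 95% of the input against about 53% for TopHat2, a difference of roughly 1.8-fold.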

**Brian Bushnell** · 02-15-2016, 08:09 PM

Originally posted by tonup69:

Brian - I am just talking about mapping right now. Even if I switched to DESeq or edgeR (which I could do on our local install), I would still need a gapped aligner to map the reads first. Do you use RNA-STAR?

Nope, I use BBMap.

I have heard good things about STAR, but I've never benchmarked it.

**Michael.Ante** · 02-16-2016, 12:54 AM

Hi Tonup69,

Check your alignments with RSeQC; the clipping profile will be especially interesting.
Your alignment rate with STAR may be very good, but the aligned length might be quite short...
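
If RSeQC is not available, the idea behind a clipping check can be sketched with the standard library alone by reading the CIGAR string of each alignment. This is a rough illustration, not RSeQC's implementation; the function names `soft_clipped_bases` and `aligned_fraction` are hypothetical, and the CIGAR strings are made-up examples rather than reads from this dataset.

```python
import re

# One CIGAR operation: a length followed by an operation code (SAM format).
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

# Operations that consume bases of the read sequence.
QUERY_CONSUMING = set("MIS=X")


def parse_cigar(cigar):
    """Split a CIGAR string into (length, op) pairs, dropping hard clips."""
    return [(int(n), op) for n, op in CIGAR_OP.findall(cigar) if op != "H"]


def soft_clipped_bases(cigar):
    """Return (left, right) soft-clipped base counts for one alignment."""
    ops = parse_cigar(cigar)
    if not ops:
        return (0, 0)
    left = ops[0][0] if ops[0][1] == "S" else 0
    right = ops[-1][0] if len(ops) > 1 and ops[-1][1] == "S" else 0
    return (left, right)


def aligned_fraction(cigar):
    """Fraction of read bases that are not soft-clipped, or None if unknown."""
    ops = parse_cigar(cigar)
    read_len = sum(n for n, op in ops if op in QUERY_CONSUMING)
    if read_len == 0:
        return None  # e.g. CIGAR "*" for an unmapped read
    return (read_len - sum(soft_clipped_bases(cigar))) / read_len


# Made-up spliced alignment: 10 bases clipped on the left, 6 on the right.
example = "10S80M1000N70M6S"
print(soft_clipped_bases(example))
print(aligned_fraction(example))
```

Running `aligned_fraction` over the CIGAR column (field 6) of a SAM file and averaging the result would give a rough picture of how much of each read the aligner actually placed.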

**tonup69** · 02-16-2016, 07:32 AM

Well, it's a nice thought, but we don't have either of those wrappers on our local install and I am not the admin for the box. I will bring it up, but I am limited to what we have on our server.

TOPHAT2 vs RNA-STAR 2X the mapped %.
