Seqanswers Leaderboard Ad

**sphil** · 12-14-2010, 02:03 AM

hey,

probably the distance between your paired-ends is to high such that TopHat isn't able to map it accurate to the source sequence. This could result of a high standard deviation in the sample prep. of the reads you use (i.e. too large clone libraries).
If you map the read on their own they could be mapped because the information of mate pairs doesn't really matter in such a case. Try to enlarge the possible gaps while using TopHat and review the results.

Don't know if it really helps but i guess that this could be a reason.

cheers

phil

**arrchi** · 05-26-2011, 10:30 AM

Hi adarshjose,

Did you solve your problem? I would be very interested in how you solved the discrepancy.

-a

**arrchi** · 05-26-2011, 10:31 AM

Hi adarshjose,

Did you solve your problem? I would be very interested in how you solved the discrepancy.

-a

**jameslz** · 06-16-2011, 09:44 PM

The reads may be trimmed....

**anurag.gautam** · 03-20-2012, 11:06 PM

Hi ,
I tried to map illumina ~2 million reads to Oryza sativa indica reference genome with its reference gtf file using different versions of Tophat 1.1.4, 1.3.0, 1.3.1, 1.3.2, 1.3.3 and the current one 1.4.1 .
I used the defalut options just to check if the mapping statistics really gets affected. As a result, I got the following stats:
Reads Used Reads Mapped
Tophat1.1.4 2,000,000 2,27,554
Tophat1.3.0 2,000,000 2,30,817
Tophat1.3.1 2,000,000 2,31,935
Tophat1.3.2 2,000,000 4,517
Tophat1.3.3 2,000,000 2,31,935
Tophat1.4.1 2,000,000 1,37,724

I wanted to know why the number of reads mapped is varying in each version even though using the same data. Secondly, why there is a drastic change in the mapping stats with version 1.3.2 and 1.4.1 as compared with other versions? Can please anybody throw some light on this matter

**pbluescript** · 03-21-2012, 04:09 AM

Originally posted by anurag.gautam View Post

Hi ,
I tried to map illumina ~2 million reads to Oryza sativa indica reference genome with its reference gtf file using different versions of Tophat 1.1.4, 1.3.0, 1.3.1, 1.3.2, 1.3.3 and the current one 1.4.1 .
I used the defalut options just to check if the mapping statistics really gets affected. As a result, I got the following stats:
Reads Used Reads Mapped
Tophat1.1.4 2,000,000 2,27,554
Tophat1.3.0 2,000,000 2,30,817
Tophat1.3.1 2,000,000 2,31,935
Tophat1.3.2 2,000,000 4,517
Tophat1.3.3 2,000,000 2,31,935
Tophat1.4.1 2,000,000 1,37,724

I wanted to know why the number of reads mapped is varying in each version even though using the same data. Secondly, why there is a drastic change in the mapping stats with version 1.3.2 and 1.4.1 as compared with other versions? Can please anybody throw some light on this matter

Could you fix your comma placement? I don't know how many alignments Tophat gave you. Does 2,27,554 mean 227,554?

**anurag.gautam** · 03-21-2012, 04:21 AM

Yes both are same
Tophat1.1.4 2,000,000 227,554
Tophat1.3.0 2,000,000 230,817
Tophat1.3.1 2,000,000 231,935
Tophat1.3.2 2,000,000 4,517
Tophat1.3.3 2,000,000 231,935
Tophat1.4.1 2,000,000 137,724

**pbluescript** · 03-21-2012, 04:58 AM

Originally posted by anurag.gautam View Post

Yes both are same
Tophat1.1.4 2,000,000 227,554
Tophat1.3.0 2,000,000 230,817
Tophat1.3.1 2,000,000 231,935
Tophat1.3.2 2,000,000 4,517
Tophat1.3.3 2,000,000 231,935
Tophat1.4.1 2,000,000 137,724

That's not a lot of mapped reads. Either something went wrong with the library prep, sequencing, or mapping method. How good is the reference genome for Oryza sativa indica?

**anurag.gautam** · 03-21-2012, 05:23 AM

Reference genome of ORyza sativa indica is of good quality and has good coverage. The reads are also of higher quality. , But still the question remains the same , why different mapping stats?

**zun** · 06-12-2012, 06:15 PM

hello anurag.gautam,

I also have used tophat series with same O.sativa reads since 2010,
but I haven't encountered the same situation as yours.
In fact the number of mapped reads varied a little, but not drastically like your case.....hmm I don't know the reason why, sorry...

> adarshjose
I had a same problem before, and realized that was because tophat abandoned the mate pairs which mapped on different chromosomes when uniting the left/right reads mapped by bowtie.
but tophat2 has a option called "--report-discordant-pair-alignment" which allows mate pairs to map to different chromosomes.
so you will get higher mapping rate with tophat2...
hope this will help you....

zun

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 18 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

TopHat -paired end vs single end reads

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News