Tophat --segment-length

Pepe

Member

Join Date: Mar 2009

Posts: 28
- Share
- Tweet
#1

Tophat --segment-length

05-31-2010, 10:33 PM

Hi,

I hope the developers see this post. I'll post it here so I can attach a figure that will help me making my point.
I think I've found a bug in Tophat (v1.0.13).

Reads that are spliced in segments that match multiples of the --segment-length parameter automatically are assigned a skipped region of the size of --segment-length.
No mismatches are assigned to the read and counts as valid, (which would be if the skipped region was assigned the real value)

In the figure attached (obtained from IGV, colored positions are mismatches to the reference) , there are clearly 5 reads that are wrongly spliced (I only allow 2 mismatches to the reference), the --segment-length parameter was set to 20. The cigar strings and NM tags for those 5 reads are:
40M20N20M, NM:i:0
40M20N20M, NM:i:0
20M20N40M, NM:i:0
20M20N40M, NM:i:0

If I align the reads setting --segment-length to 25 I will find many reads with cigar: 25M25N50M and 50M25N25M,
for --segment-length = 21: 21M21N42M and 42M21N21M

I hope this helps.
Attached Files

igv_snapshot.jpg (19.5 KB, 71 views)
Tags: tophat cigar bug

Previous template Next

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, Yesterday, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin Yesterday, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 160 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad