**jameslz** · 06-15-2011, 08:42 PM

One more question:
Can't TopHat handle reads of different lengths?

**maubp** · 06-16-2011, 02:00 AM

The SAM files look OK to me. Notice that in tophat 1.3.0 one of the quality strings is a single * which means data not available, while with tophat 1.2.0 you do get a full quality string which happens to start with a * character (which is valid).

So, that could be a bug in tophat 1.3.0 (using * for missing qualities when it probably does know them), and a separate bug in htseq-count failing to accept * for missing qualities.
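To make the distinction concrete, here is a minimal sketch of how a parser could tell the two cases apart. It assumes only the standard SAM column layout (SEQ is the 10th tab-separated field and QUAL the 11th). The function name `classify_qual` and the helper `make_record` are made up for illustration, the example records are invented, and this is not how HTSeq itself does it.

```python
def classify_qual(sam_line):
    """Classify the QUAL field of one SAM alignment line.

    Returns 'missing' for a lone '*', 'ok' when QUAL is as long as SEQ,
    and 'length_mismatch' otherwise.
    """
    fields = sam_line.rstrip("\n").split("\t")
    seq, qual = fields[9], fields[10]  # SAM columns 10 (SEQ) and 11 (QUAL)
    if qual == "*":
        # A single '*' means "qualities not stored".
        # (For a 1-base read this is ambiguous; treated as missing here.)
        return "missing"
    if seq != "*" and len(qual) != len(seq):
        return "length_mismatch"
    # A full-length string is fine even if its first character is '*'.
    return "ok"


def make_record(seq, qual):
    """Build an invented SAM line for illustration (hypothetical helper)."""
    cols = ["read1", "0", "chr1", "100", "255", f"{len(seq)}M",
            "*", "0", "0", seq, qual]
    return "\t".join(cols)


if __name__ == "__main__":
    print(classify_qual(make_record("ACGT", "*")))     # qualities absent
    print(classify_qual(make_record("ACGT", "*IIH")))  # starts with '*', full length
```

A parser that only checks the first character of QUAL would confuse these two cases, which is why the comparison is against the whole field.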

What version of htseq-count are you using? I had a quick look at HTSeq-0.5.1p2.tar.gz file src/HTSeq/_HTSeq.pyx and there is no obvious sign that they cope with this situation (but I didn't fully explore their code).

P.S. I've emailed Simon Anders about this possible HTSeq issue.

**jameslz** · 06-16-2011, 02:15 AM

Thanks for your answer.
I am using the latest version, HTSeq-0.5.1p2.

**jameslz** · 06-16-2011, 05:22 PM

Following up on my earlier question about whether TopHat can handle reads of different lengths:

If I trim the low-quality bases from the 3' end, I can map 80% of the paired reads to the reference. If I don't trim, only 70% of the paired reads map to the genome.

Can anyone help me?
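For readers unsure what 3' trimming involves, here is a minimal sketch that drops trailing bases whose quality falls below a cutoff, which is why trimmed reads end up with different lengths. The function name `trim_3prime`, the cutoff of 20, and the Phred+33 offset are all assumptions for illustration (Illumina data from this period was often Phred+64), and this is not the algorithm of any particular trimming tool.

```python
def trim_3prime(seq, qual, min_q=20, offset=33):
    """Remove trailing bases with Phred quality below min_q.

    seq and qual are the FASTQ sequence and quality strings of one read.
    offset is the ASCII offset of the quality encoding (assumed Phred+33).
    """
    end = len(seq)
    # Walk back from the 3' end while the last kept base is low quality.
    while end > 0 and ord(qual[end - 1]) - offset < min_q:
        end -= 1
    return seq[:end], qual[:end]


if __name__ == "__main__":
    # 'I' is Q40 and '#' is Q2 under Phred+33, so the last two bases go.
    print(trim_3prime("ACGTAC", "IIII##"))
```

With paired-end data, both mates need to stay in the same order in their files after trimming, even when one mate becomes much shorter than the other.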


tophat 1.3.0 sam output quality string problem
