**EGrassi** · 01-15-2013, 02:08 AM

A quick look at the htseq-count code tells me that it never uses the reads' "mate_aligned" attribute and simply considers all of the paired reads. Am I the only one who finds this behaviour strange? I don't see a check anywhere on whether the two reads fall at a sensible distance from each other to be reliably included in the counts.

**Simon Anders** · 01-16-2013, 12:24 AM

The "mate_aligned" bit in the FLAG field indicates, in my reading of the SAM spec, that an alignment for the mate is given in the SAM file, not that this alignment is considered plausible. If TopHat really changes the mate_aligned field according to the distance, I'd consider this a very odd behaviour. In my opinion, it should set the alignment quality (5th field in the SAM file) to a low value to indicate that an alignment is reported but should not be trusted.
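To make the distinction between "mate has an alignment" and "pair is considered plausible" concrete, here is a minimal sketch that decodes the pairing-related bits of a FLAG value. The bit masks are the ones defined in the SAM specification; the function name `decode_pair_flags` and the dictionary keys are made up for illustration, and the sketch makes no claim about which bit HTSeq exposes as `mate_aligned`.

```python
# Bit masks for the FLAG field (2nd column), as defined in the SAM spec.
PAIRED = 0x1         # template has multiple segments (read is paired)
PROPER_PAIR = 0x2    # aligner considers the pair properly aligned
UNMAPPED = 0x4       # this read is unmapped
MATE_UNMAPPED = 0x8  # the mate is unmapped


def decode_pair_flags(flag):
    """Return the pairing-related bits of a SAM FLAG as booleans.

    Hypothetical helper for illustration only.
    """
    paired = bool(flag & PAIRED)
    return {
        "paired": paired,
        # Bits 0x2 and 0x8 are only meaningful when 0x1 is set.
        "proper_pair": paired and bool(flag & PROPER_PAIR),
        "read_mapped": not flag & UNMAPPED,
        "mate_mapped": paired and not flag & MATE_UNMAPPED,
    }
```

For example, a FLAG of 65 (0x1 + 0x40) decodes to a paired read whose mate is mapped but which is not marked as a proper pair, whereas 99 (0x1 + 0x2 + 0x20 + 0x40) is marked as a proper pair.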

htseq-count, by the way, filters by the alignment quality only if you use the -a option. I guess I should change this to be the default.
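As a rough illustration of what filtering on alignment quality amounts to, the sketch below drops SAM alignment lines whose MAPQ (5th field) is below a threshold. This is not htseq-count's actual code; the function names and the default threshold of 10 are assumptions chosen for the example.

```python
def passes_mapq(sam_line, min_mapq):
    """Return True if the alignment's MAPQ (5th tab-separated field) is >= min_mapq."""
    fields = sam_line.rstrip("\n").split("\t")
    return int(fields[4]) >= min_mapq


def filter_by_mapq(lines, min_mapq=10):
    """Yield alignment lines that meet the MAPQ threshold.

    Header lines (starting with '@') are skipped. The default of 10 is
    an arbitrary value for illustration, not a recommendation.
    """
    for line in lines:
        if line.startswith("@"):
            continue
        if passes_mapq(line, min_mapq):
            yield line
```

With a threshold of 10, an alignment that the aligner reported with MAPQ 3 is dropped while one with MAPQ 50 is kept.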

**EGrassi** · 01-16-2013, 12:30 AM

Originally posted by Simon Anders:

The "mate_aligned" bit in the FLAG field indicates, in my reading of the SAM spec, that an alignment for the mate is given in the SAM file, not that this alignment is considered plausible. If TopHat really changes the mate_aligned field according to the distance, I'd consider this a very odd behaviour. In my opinion, it should set the alignment quality (5th field in the SAM file) to a low value to indicate that an alignment is reported but should not be trusted.

Since the percentage of properly paired reads that samtools flagstat reports on accepted_hits changed when I set TopHat's -r parameter, I assumed that the reads not reported as properly paired were in the SAM file but should not be treated as aligned in the analyses.

(Filtering on quality only with an option is fine in my opinion, by the way.)


Tophat + htseq_count
