Problem with HTSeq-count tool.

a.kmg

Member

Join Date: Aug 2014

Posts: 15
- Share
- Tweet
#1

Problem with HTSeq-count tool.

09-18-2014, 06:36 AM

Hello,

I use TopHat2 to generate the alignment of my paired-end fastq files.

The following "align_summary.txt" is created :

Left reads:
Input : 47466605
Mapped : 41265217 (86.9% of input)
of these: 5205466 (12.6%) have multiple alignments (141994 have >20)

Right reads:
Input : 47466605
Mapped : 41580477 (87.6% of input)
of these: 5255753 (12.6%) have multiple alignments (143224 have >20) 87.3% overall read mapping rate.

Aligned pairs: 39852098
of these: 4959709 (12.4%) have multiple alignments
408853 ( 1.0%) are discordant alignments
83.1% concordant pair alignment rate.

how to determine the number of uniquely aligned reads ?

If I do :

39,852,098-4,959,709-408,853=34,483,536 uniquely aligned reads.

But when I run htseq on this sam file, I obtained:

reads assigned to a gene : 30,247,445
no_feature : 6,495,866
ambiguous : 748,775

If I do not talk nonsense, the sum of these three numbers should equal the number of uniquely aligned reads (since HTseq deals only these reads). But 37,492,086 reads are processed and not 34,483,536.

So my calculation of unique reads is false?

Does anyone has an idea how to interpret this "align_summary.txt" file?

Thanks at all !

Last edited by a.kmg; 09-24-2014, 01:07 AM.
Tags: None

Previous template Next

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, Yesterday, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin Yesterday, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 24 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 159 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad