**dpryan** · 01-20-2014, 01:38 AM

Perhaps you have a lot of immature mRNAs or a lot of expressed repeat regions. The general idea is to look at some of the alignments in IGV and see if they really don't match anything. Also ensure that the chromosome names in the BAM file and GTF file match (that probably causes this sort of thing half the time).

**sindrle** · 01-20-2014, 01:38 PM

Hi again!

I have now tested HTSeq with all three modes, upgraded to Python 2.7.6, and inspected the alignments in IGV.

Here there are 4 reads in total: one with mapping quality 50 and three with mapping quality 3.
I used the HTSeq option -a 0, so they should all have been picked up.

All three modes count only 1 read. How can this be?

**dpryan** · 01-20-2014, 01:42 PM

HTSeq-count also looks at the NH auxiliary tag. With a MAPQ of 3, it's likely that three of those are multimappers (this will be the case if you used tophat2) and would be (properly) ignored.
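To make the point about the NH tag concrete, here is a minimal sketch of that kind of filtering in plain Python. It is not HTSeq's actual code: the function names, the category labels, and the four example records (made up to mimic the reads described above, with an assumed NH:i:2 for the MAPQ 3 reads) are only illustrative.

```python
from collections import Counter

def get_nh(fields):
    """Return the NH tag (number of reported alignments) as an int, or None if absent."""
    for tag in fields[11:]:          # optional fields start at column 12
        if tag.startswith("NH:i:"):
            return int(tag[5:])
    return None

def classify_alignment(sam_line, min_mapq=0):
    """Roughly mimic the checks discussed here (an approximation, not HTSeq itself)."""
    fields = sam_line.rstrip("\n").split("\t")
    nh = get_nh(fields)
    if nh is not None and nh > 1:
        return "alignment_not_unique"   # multimapper: skipped regardless of -a
    if int(fields[4]) < min_mapq:       # column 5 is MAPQ
        return "too_low_aQual"
    return "candidate"                  # would go on to feature assignment

# Hypothetical records: one unique read (MAPQ 50) and three multimappers (MAPQ 3).
example_lines = [
    "read1\t0\tchr1\t100\t50\t50M\t*\t0\t0\t*\t*\tNH:i:1",
    "read2\t0\tchr1\t105\t3\t50M\t*\t0\t0\t*\t*\tNH:i:2",
    "read3\t0\tchr1\t110\t3\t50M\t*\t0\t0\t*\t*\tNH:i:2",
    "read4\t16\tchr1\t120\t3\t50M\t*\t0\t0\t*\t*\tNH:i:2",
]

counts = Counter(classify_alignment(line, min_mapq=0) for line in example_lines)
print(counts)
```

Even with the quality threshold at 0, only the read carrying NH:i:1 survives in this sketch, which matches the single counted read.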

**sindrle** · 01-20-2014, 02:26 PM

Oh yeah, that makes sense.

How come you know so much about everything? Where did you learn it all?

But it's pretty certain that something is wrong here, right, so I should keep looking? I have checked my GTF, and the chromosome names are the same.

**sindrle** · 01-20-2014, 06:00 PM

I have made a SAM file with the --samout option and checked around a bit.

From Tophat.log I get 21.8 million total kept reads.
In my SAM I get a total of 20.9 million; I wonder where the rest are?

Of the 20.9 million I have:

17.7 million with NH:i:1, of which 158,000 are ambiguous
I also get 3.4 million alignment_not_unique and
1.6 million no_feature

Looking at the HTSeq output file I get:

no_feature: 3.8 million
ambiguous: 158,000
alignment_not_unique: 3.4 million

So the SAM has about 1 million fewer reads than the BAM. Also, the "no_feature" count differs between the SAM and the HTSeq output.
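For anyone wanting to repeat this kind of check: htseq-count annotates each line of the SAM output with an XF:Z: tag, so the categories can be tallied directly. This is a rough sketch, not part of HTSeq. The function name and the example records are made up, and it assumes the special labels may appear with or without a leading double underscore depending on the HTSeq version.

```python
from collections import Counter

# Special counters reported by htseq-count (compared after stripping leading underscores).
SPECIAL = {"no_feature", "ambiguous", "alignment_not_unique",
           "too_low_aQual", "not_aligned"}

def tally_xf(lines):
    """Count SAM lines per XF category; header lines are skipped."""
    tally = Counter()
    for line in lines:
        if line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        xf = next((f[5:] for f in fields[11:] if f.startswith("XF:Z:")), None)
        if xf is None:
            tally["(no XF tag)"] += 1
            continue
        # Handle both "no_feature" and "__no_feature", and "ambiguous[geneA+geneB]".
        base = xf.lstrip("_").split("[", 1)[0]
        tally[base if base in SPECIAL else "assigned_to_feature"] += 1
    return tally

# Hypothetical records; with a real file one would pass the open file handle instead,
# e.g. tally_xf(open("htseq_out.sam")) where the file name is a placeholder.
example_sam = [
    "@HD\tVN:1.0\tSO:coordinate",
    "r1\t0\tchr1\t100\t50\t50M\t*\t0\t0\t*\t*\tNH:i:1\tXF:Z:geneA",
    "r2\t0\tchr1\t200\t50\t50M\t*\t0\t0\t*\t*\tNH:i:1\tXF:Z:no_feature",
    "r3\t0\tchr2\t300\t3\t50M\t*\t0\t0\t*\t*\tNH:i:2\tXF:Z:__alignment_not_unique",
    "r4\t0\tchr2\t400\t50\t50M\t*\t0\t0\t*\t*\tNH:i:1\tXF:Z:__ambiguous[geneB+geneC]",
    "r5\t0\tchr3\t500\t50\t50M\t*\t0\t0\t*\t*\tNH:i:1\tXF:Z:geneD",
]

tally = tally_xf(example_sam)
for category, n in sorted(tally.items()):
    print(category, n)
```

Note that this counts SAM lines (alignments), so the totals are only comparable with numbers that were also counted per alignment.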

I tried to look at specific reads in IGV, but selecting reads by read name (right-click the BAM track and choose "select by name") does not change the view. Annoying.

Does anyone have something to add on this?


HTseq: Very few counts recognised
