Hi czelin,
This is unlikely, since reads that overlap to many exons should be counted once per each exon. You could check whether the reads that are ignored by the script are properly paired or if they are mapping to multiple regions on the genome?
Alejhandro
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Dear all,
I have recently started with exon-wise analysis and would appreciate your help.
I have paired 100bp reads. I have prepared the annotation file with DEXSeq python scripts. What I have realized is that when I have a shorter exon (e.g. 150bp) the number of tags is 0. In IGV I can see lots (>500) of reads spanning this exon. Somehow few of the reads were counted for the control samples and none for comparison and this exon was reported as significantly DE.
I suspect that is because the two read pairs are longer than this exon and they overlapped adjacent exon[s]. This caused the reads be considered as "_ambiguous_readpair_position". If my understanding is correct, is there any way to solve the short-exon issue? If my understanding is wrong, could you please correct me?
Leave a comment:
-
-
I just noticed the chromosome name of the gff file doesn't contain "chr":
Code:1 Homo_sapiens.GRCh37.71.gtf exonic_part 11869 11871 . + . transcripts "ENST00000456328"; exonic_part_number "001"; gene_id "ENSG00000223972"
Leave a comment:
-
By the way, these are the commands I used:
Code:samtools index 21722_mapped_hg19/accepted_hits.bam samtools view 21722_mapped_hg19/accepted_hits.bam >21722_accepted_hits.sam sort -k1,1 -k2,2n 21722_accepted_hits.sam >21722_accepted_hits_sorted.sam python ~/scripts_64/dexseq_count.py -p yes -s no ~/scripts_64/Homo_sapiens.GRCh37.71.DEXSeq.chr.gff 21722_accepted_hits_sorted.sam KARPAS299_CEP.txt
Last edited by metheuse; 04-17-2013, 08:54 PM.
Leave a comment:
-
Originally posted by Simon Anders View PostHav you checked your alignments with a genome browser? Load the SAM file and the GFF file produced by dexseq_prepare in, e.g., IGV, and look at one of the loci with zero counts. If there really are no reads, you experiment has failed (or you are using a wrong annotation file).
This resulted in 71 intersecting reads. Here are the first 10 of them:
Code:chr1 29379725 29391495 HWI-ST1235:101:C1WW9ACXX:6:1312:1770:71045/1 50 + chr1 29379725 29391495 HWI-ST1235:101:C1WW9ACXX:6:2116:17025:56846/2 50 - chr1 29379731 29391501 HWI-ST1235:101:C1WW9ACXX:6:2307:18153:8715/1 50 + chr1 29379734 29391504 HWI-ST1235:101:C1WW9ACXX:6:2314:2163:52123/2 50 + chr1 29379736 29391506 HWI-ST1235:101:C1WW9ACXX:6:2314:2163:52123/1 50 - chr1 29379741 29391511 HWI-ST1235:101:C1WW9ACXX:6:2313:2799:76858/1 50 + chr1 29379742 29391512 HWI-ST1235:101:C1WW9ACXX:6:2210:9177:71165/1 50 + chr1 29379742 29391512 HWI-ST1235:101:C1WW9ACXX:6:1101:15865:15952/2 50 - chr1 29379742 29391512 HWI-ST1235:101:C1WW9ACXX:6:1307:5858:86759/2 50 - chr1 29379742 29391512 HWI-ST1235:101:C1WW9ACXX:6:1308:20389:32047/1 50 -
Last edited by metheuse; 04-17-2013, 05:12 PM.
Leave a comment:
-
Originally posted by metheuse View PostI got the same problem. It's all zeros in the second column of the output of dexseq_count.py
I saw you said you realized later you didn't follow the manual exactly. Could you tell me what's the problem? Thanks.
Leave a comment:
-
Originally posted by oliviera View PostOnly 92 with adjp < 0.01.
With DEseq I get ~2000 genes at this cut off. I think I made something wrong
Do you really need to make sure that your list of differentially used exons do not contain more than 1% false positives? In most use cases, 10% are considered acceptable.
And: Of course, you get much more genes than exons. Detecting differential expression is needs to see much less information in the data than detecting differential exon usage.
Leave a comment:
-
I got the same problem. It's all zeros in the second column of the output of dexseq_count.py
I saw you said you realized later you didn't follow the manual exactly. Could you tell me what's the problem? Thanks.
Leave a comment:
-
Only 92 with adjp < 0.01.
With DEseq I get ~2000 genes at this cut off. I think I made something wrong
Leave a comment:
-
Hi Alejandro,
Here is the graph to check dispersion estimate. What do you think?
meanvalues<- rowMeans(counts(ecs))
plot(meanvalues, fData(ecs)$dispBeforeSharing, log="xy", main="mean vs CR dispersion")
x<- 0.01:max(meanvalues)
y<- ecs@dispFitCoefs[1] + ecs@dispFitCoefs[2] / x
lines(x, y, col="red")
Leave a comment:
-
Hi oliviera,
This warning sometimes happens when your data is sparsed. Have you done the "fit diagnostics" as indicated in the vignette? As it is just a warning, you can go on with your analysis, maybe you will loose some power if the fit is "above" most of the points...
Alejandro
Leave a comment:
-
Dear all,
I try to use DExseq but got the following error when I call the function
ecs<- fitDispersionFunction(ecs)
Warning message:
In glmgam.fit(mm, disps[good], coef.start = coefs) :
Too much damping - convergence tolerance not achievable
Here is the version in R I use
R version 2.15.1 (2012-06-22)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)
locale:
[1] C/UTF-8/C/C/C/C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] DEXSeq_1.4.0 Biobase_2.18.0 BiocGenerics_0.4.0
loaded via a namespace (and not attached):
[1] RCurl_1.95-1.1 XML_3.95-0.1 biomaRt_2.14.0 hwriter_1.3 plyr_1.7.1
[6] statmod_1.4.16 stringr_0.6.1
Any suggestion on what is going on?
Cheers
Olivier
Leave a comment:
-
Thank you!
I will try it with additional replicates. I only used one per condition because I wanted to work my way through the GSNAP -> DEXSeq workflow successfully before I ran all the replicates through GSNAP (~12 hours per replicate). I will let you know how it goes tomorrow.
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...-
Channel: Articles
02-24-2025, 06:31 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 03-03-2025, 01:15 PM
|
0 responses
169 views
0 likes
|
Last Post
by seqadmin
03-03-2025, 01:15 PM
|
||
Started by seqadmin, 02-28-2025, 12:58 PM
|
0 responses
259 views
0 likes
|
Last Post
by seqadmin
02-28-2025, 12:58 PM
|
||
Started by seqadmin, 02-24-2025, 02:48 PM
|
0 responses
641 views
0 likes
|
Last Post
by seqadmin
02-24-2025, 02:48 PM
|
||
Started by seqadmin, 02-21-2025, 02:46 PM
|
0 responses
265 views
0 likes
|
Last Post
by seqadmin
02-21-2025, 02:46 PM
|
Leave a comment: