Unconfigured Ad

**GenoMax** · 01-05-2014, 05:17 PM

You can use the coverageBed from the bedtools program to get the coverage information: http://bedtools.readthedocs.org/en/l.../coverage.html

**lnzzz** · 01-07-2014, 02:25 AM

Thank you very much.
I tried coverageBed. I got a file with the read coverage for each position in the genome. I succeeded in extracting the distribution along one particular transcript. However, what I really need, it is an estimation of the global read distribution along all transcripts. I don't really how I can get it with coverageBed

.

**GenoMax** · 01-07-2014, 04:36 AM

Can you post the command you used for coverageBed?

BTW: Did you try the test dataset for RSeQC with your local install? Is that generating an error? Looking at your file paths it appears that you are using OS X (10.8?)

**lnzzz** · 01-07-2014, 05:38 AM

Yes I used a bed file with the coordinate of my genomes. Here is the command I use for coverage:
/Users/bedtools2/bin/coverageBed -d -s -split -abam 621_hits.bam -b TAIR10.bed >coverage.txt

This command gave me a very big file (20Go)... As this file obtained with coveragebed was huge (20Go), I also use the genomeCoverageBed and obtained the depth at each position:

/Users/bedtools2/bin/genomeCoverageBed -bg -split -trackline -ibam 621_hits.bam -g TAIR10.bed > coverage2.txt

What I would like is the global read distribution along transcript. That is to say, what is the read percentage in the first 10% bases of the transcripts, in the next 10%… (Enclosed an example of what I could get for one transcript and I would to obtain for all the transcripts.)

Attached Files

readdensity.png (80.1 KB, 28 views)

**GenoMax** · 01-07-2014, 07:49 AM

The -d option reports coverage at each position. Can you try the coverageBed without the -d and -split? Your transcriptome BED file is start/stop positions of exons? If you are set on the 10% intervals then you may need to create a custom BED file.

**lnzzz** · 01-07-2014, 08:03 AM

Yes, I think I need to create custom bed file.

Thank you for your help

**swbarnes2** · 01-07-2014, 10:25 AM

Quick and dirty way
This works best if you align to a list of transcripts, instead of genome. Sure, it's not quite as accurate as aligning to genome with TopHat, but you don't need exact figures, just a ballpark.

1) get a list of your transcripts, and how long each one is. (If you align to a list of transcripts, samtools idxstats will tell you this)
2) take line of your sam file, and associate it with a transcript (If you align to a list of transcripts, each line will already have that info)
3) Go through each line of the .sam, and change the alignment position to a corrected integer position that is the position / total length of the transcript
4) Bin up all your new positions.

Topics	Statistics	Last Post
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, Today, 11:58 AM	0 responses 9 views 0 reactions	Last Post by SEQadmin2 Today, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 25 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 34 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM
Long-Read RNA Sequencing Uncovers a Hidden Layer of Immune Cell Regulation by SEQadmin2 Started by SEQadmin2, 06-02-2026, 12:03 PM	0 responses 56 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 12:03 PM

Unconfigured Ad

RNA-seq : 5’ to 3’ transcript coverage

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News