**r_j_p** · 10-29-2013, 08:07 AM

How big is your region of interest, and do you want one row for every position? If you do, it could be a huge spreadsheet that Excel might struggle with.

A VCF file (for example, as generated by samtools) has similar information for sites with SNPs and indels, which you can then filter to help distinguish true variants from sequencing errors.

**gringer** · 10-29-2013, 12:48 PM

I would use bedtools (which can use BAM files as input). Use genomecov to get the coverage all across the genome:

http://bedtools.readthedocs.org/en/latest/content/tools/genomecov.html

Or just coverage if you have a specific gff/bed file with the region that you want to determine the coverage for:

http://bedtools.readthedocs.org/en/latest/content/tools/coverage.html

Edit: sorry, that's just for the base-pair coverage (independent of which base is present). I suspect that getting per-nucleotide counts in a particular region is straying into "needs a custom script to work it out from the samtools mpileup output" territory.
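
For anyone wanting a starting point for that custom script, here is a minimal sketch rather than a tested tool. It assumes the standard six-column `samtools mpileup` text output produced with a reference (`-f ref.fa`), where `.` and `,` mark matches to the reference base. The function name `pileup_base_counts` is hypothetical; it combines both strands and ignores indels, deletion placeholders and reference skips.

```shell
# Hypothetical helper: reads `samtools mpileup` lines on stdin and prints
# chrom, pos, depth, then counts of A, C, G, T, N (both strands combined).
pileup_base_counts() {
  awk -F '\t' 'BEGIN { OFS = "\t" }
  {
    ref = toupper($3); s = $5; n = length(s)
    c["A"] = c["C"] = c["G"] = c["T"] = c["N"] = 0
    i = 1
    while (i <= n) {
      ch = substr(s, i, 1)
      if (ch == "^") { i += 2; continue }   # read-start marker plus mapping quality character
      if (ch == "+" || ch == "-") {         # indel: sign, length digits, then that many bases
        i++; len = ""
        while (i <= n && substr(s, i, 1) ~ /[0-9]/) { len = len substr(s, i, 1); i++ }
        i += len
        continue
      }
      if (ch == "." || ch == ",") ch = ref  # match to the reference base
      ch = toupper(ch)
      if (ch in c) c[ch]++                  # anything else ($, *, <, >) is skipped
      i++
    }
    print $1, $2, $4, c["A"], c["C"], c["G"], c["T"], c["N"]
  }'
}
```

It could be used as `samtools mpileup -f ref.fa -r sequence:bpRange input.bam | pileup_base_counts`, where `ref.fa` and `input.bam` are placeholder file names. The tab-separated output has one row per position, which should open in a spreadsheet.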

**geneart** · 10-31-2013, 08:28 AM

Hello all,
I have a related question about extracting chromosome positions and depth per region. I used bedtools to convert my BAM files to BED, and
I got:
Chr1 11419 11425 HWUSI-# 1 + 1
Chr1 13877 13892 HWUSI-# 0 - 2
Chr1 14714 14721 HWUSI-# 1 + 3
Chr1 16155 16163 HWUSI-# 1 - 4
Chr1 18239 18249 HWUSI-# 1 + 5
I looked up the bedtools manual, which indicates that column 5 holds scores from 1 to 1000. I tried looking into it further, but I cannot work out what those 1s and 0s stand for specifically; I only know that they are scores. Further down, this column also contains values such as 32 and 42, not just 1 or 0. I have pasted only the first 5 lines of output.
Also, the last column looks like a number assigned to each cluster. Is this correct?
Could someone please clarify this for me?
Also, if I need to keep the first 4 columns as they are, and add the sequence itself in the next column and the tag counts in the columns after that, can I use bedtools to do it? If so, how? Or do I need to write a script so that I get the following from my BAM files:
Chr# start end name seq # of tags pos/neg strand cluster#
Any help would be appreciated.

Thanks in advance
geneart.

**Coru S** · 12-13-2013, 04:33 AM

Hi All,

Thank you very much for your comments and hints.
I found a computer scientist who wrote a small program that can do the job.

Greetings
Rayk

**gray_cambs** · 01-14-2014, 03:52 AM

Coru S,
Would you be able to post the code? I would also be interested in doing this search.

Cheers

Gray

**Coru S** · 01-15-2014, 08:31 AM

Well, I don't have the source code, just the final files of the program.

But I can ask the guy who wrote it, if he is willing to share the code.

I'll let you know...

**Coru S** · 01-17-2014, 08:11 AM

Hi Gray,

I contacted him, but unfortunately he doesn't want to release the code or the final files publicly at the moment, since the program is still in beta.

When he has extended the program he will make it publicly available in the final version.

Greetings

**gringer** · 01-17-2014, 02:52 PM

Originally posted by Coru S:

I contacted him, but unfortunately he doesn't want to release the code or the final files publicly at the moment, since the program is still in beta.

Okay, well I've realised you can do this just using samtools and common Linux command-line programs. Here are base counts:

Code:

#forward
samtools view -F 0x10 sampled_lib19_extended.bam YAL005C_mRNA-51+51bp:500-3000 | awk -F '\t' '{print $10}' | fold -w 1 | sort | uniq -c
#reverse
samtools view -f 0x10 sampled_lib19_extended.bam YAL005C_mRNA-51+51bp:500-3000 | awk -F '\t' '{print $10}' | fold -w 1 | sort | uniq -c
#generic
samtools view [filter] input.bam sequence:bpRange | awk -F '\t' '{print $10}' | fold -w 1 | sort | uniq -c

[These one-liners can be easily modified to put sequence names at the start of the lines if necessary]
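
One possible way to do that is sketched below, as an illustration only. The function name `base_counts_by_ref` is hypothetical; it reads SAM records on stdin, takes the reference name from column 3 and the read sequence from column 10, and prints one tab-separated line per reference name and base. Like the one-liners above, it counts every base of each read that overlaps the region, not only the bases inside it.

```shell
# Hypothetical helper: reads SAM records on stdin (e.g. from `samtools view`)
# and prints reference name, base, count for the read sequences in column 10.
base_counts_by_ref() {
  awk -F '\t' 'BEGIN { OFS = "\t" }
  !/^@/ {                                   # skip header lines if present
    n = length($10)
    for (i = 1; i <= n; i++) count[$3 OFS substr($10, i, 1)]++
  }
  END { for (k in count) print k, count[k] }' | sort
}
```

Usage would look like `samtools view -F 0x10 input.bam sequence:bpRange | base_counts_by_ref`, with the same placeholders as the generic one-liner.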

For a quick INDEL count, you can use the CIGAR column (#6):

Code:

#generic (use -F 0x10 or -f 0x10 for forward/reverse filter as necessary)
samtools view [filter] input.bam sequence:bpRange | awk -F '\t' '{print $6}' | fold -w 1 | grep '[ID]' | sort | uniq -c
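
The one-liner above counts I and D operations, not the number of inserted or deleted bases. If both are wanted, a rough sketch along the following lines might help. The function name `cigar_indel_summary` is hypothetical; it parses the CIGAR string in column 6 of SAM records read from stdin and prints the operation, the number of events and the total bases for insertions and deletions.

```shell
# Hypothetical helper: reads SAM records on stdin and prints, for I and D,
# the operation, the number of events and the total number of bases involved.
cigar_indel_summary() {
  awk -F '\t' 'BEGIN { OFS = "\t" }
  !/^@/ {
    cigar = $6
    # Peel one length+operation pair off the front of the CIGAR at a time
    while (match(cigar, /^[0-9]+[MIDNSHP=X]/)) {
      op = substr(cigar, RLENGTH, 1)
      len = substr(cigar, 1, RLENGTH - 1) + 0
      if (op == "I" || op == "D") { events[op]++; bases[op] += len }
      cigar = substr(cigar, RLENGTH + 1)
    }
  }
  END {
    print "I", events["I"] + 0, bases["I"] + 0
    print "D", events["D"] + 0, bases["D"] + 0
  }'
}
```

Usage would be `samtools view [filter] input.bam sequence:bpRange | cigar_indel_summary`.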

**yl01** · 01-27-2014, 01:07 PM

The igvtools (a command-line tool associated with IGV) count command can do this task easily. It produces a text file in wig format. However, it may not be easy to open in Excel as desired.

**Katherine_B** · 01-03-2019, 04:50 PM

Originally posted by yl01:

The igvtools (a command-line tool associated with IGV) count command can do this task easily. It produces a text file in wig format. However, it may not be easy to open in Excel as desired.

Could you possibly share the igvtools command line to do this? I have been trying to figure out how to get the base coverage values at each position from my BAM files. I'd want to know which base is at a position and how many reads produce that base so I can create some custom scripts using that information.
I tried:

igv count --bases input_sorted.bam output_basecounts.wig

But my .wig files are empty. Should I be saving the output differently??

**HESmith** · 01-03-2019, 05:53 PM

Use BEDtools 'genomecov' command with '-d' flag to obtain per-base depth of coverage.
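
As a hedged illustration of that suggestion: `bedtools genomecov -d -ibam input_sorted.bam` writes one line per genome position (chromosome, 1-based position, depth), which can be large. The helper below, with the hypothetical name `depth_in_region`, is a small sketch that keeps only the rows for one region. Note that this reports depth only, not which base is present.

```shell
# Hypothetical helper: keeps rows of `bedtools genomecov -d` output
# (chrom, 1-based position, depth) that fall inside one region.
depth_in_region() {
  # $1 = chromosome name, $2 = first position, $3 = last position (inclusive)
  awk -F '\t' -v chrom="$1" -v first="$2" -v last="$3" \
    '$1 == chrom && $2 >= first && $2 <= last'
}
```

For example, `bedtools genomecov -d -ibam input_sorted.bam | depth_in_region chr1 500 3000 > region_depth.tsv`, where the chromosome name, coordinates and output file name are placeholders.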

How to extract base coverage from bam/bai-files
