Unconfigured Ad

**swbarnes2** · 08-22-2012, 11:52 AM

Pipe the mpileup output to awk (or cut would work too), and then as it gets each line, it will cut out the parts you want, and only output that.

**westerman** · 08-23-2012, 04:47 AM

'cut' is so much more simple than 'awk' (in my opinion, of course). To use 'cut' just do:

mpileup -p reference.fasta my_file.bam | cut -f 1,2,4

**bfantinatti** · 08-27-2012, 06:10 AM

Hello

I used pipe awk and worked very well, thank you for the two answers.
Despite generated the files as I wanted, I detected a little problem in results.
The columns of the coverage shows some wrong nombers compared to the grafical view of the assembly.
I'm using Tablet (http://bioinf.scri.ac.uk/tablet/) and IGV (http://www.broadinstitute.org/igv/) for visualize the reads. And looking to both visual and mpileup output, some positions shows a different coverage number, for example: The positions 1-4 is has exactly the same coverage in Tablet, IGV and mpileup. But the position 5-8 shows me one coverage point more in tablet and IGV than in mpileup (was this clear for you?).
Is there some error in Tablet and IGV or in mpileup output? Or is the mpileup disregarding some reads because some quality problem or other stuff, resulting in diferences in coverage value?
Thank you

**swbarnes2** · 08-27-2012, 08:14 AM

The two softwares might differ in how they treat anamalous reads, or reads with zero mapq. For instance, the default on mpileup is to ignore anamlous pairs, and you change that with the command line option -A. I bet IGV counts them all.

**bfantinatti** · 08-27-2012, 09:01 AM

Hello

I tried using -A and worked very well

Thank you for the answer.
One more question:
When I have some gap on assembly, mpileup jumps directly to the next position presenting a coverage:
scaffold_0 1 4
scaffold_0 2 4
scaffold_0 3 4
scaffold_0 7 8
scaffold_0 8 8
scaffold_0 9 8

I need the positions with 0 coverage also be included on mpileup output. Something like this:
scaffold_0 1 4
scaffold_0 2 4
scaffold_0 3 4
scaffold_0 4 0
scaffold_0 5 0
scaffold_0 6 0
scaffold_0 7 8
scaffold_0 8 8
scaffold_0 9 8

**dnusol** · 11-07-2012, 03:22 AM

hi bfantinatti,

did you manage to get positions with 0 coverage in your mpileup output?

cheers,

D.

**bfantinatti** · 11-07-2012, 03:42 AM

Yes

Hello dnusol, yes i did. Sorry, I forgot to post the solution here. I got the solution on annother forum related to bash issues.
The solution was to apply the following code:

awk '($2-p2)>1{
for(i=p2+1;i<$2;i++)
print $1,i,0
}
{p2=$2}1' file

This will add lines where its lacks, keeping the sequence of the second column, and adding 0 on the respective 3rd column.

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, Yesterday, 06:09 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 Yesterday, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 34 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 41 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 48 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Mpileup output

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News