**jkbonfield** · 06-03-2009, 12:53 AM

I'm not sure if it's the case here, but I've noticed the CIGAR string has major issues if you attempt to include gaps in the clipped sequence.

Or rather CIGAR works fine I assume, but samtools does not. (It's not really a big issue as the only time I've seen this happen is someone manually trimming an alignment back.)

**mikyatope** · 06-03-2009, 08:00 AM

Originally posted by zee:

Is there a way to convert a SAM consensus output (using -c option for pileup) to the old maq-style .cns consensus?

I have some maq-based pipelines I would like to use on my BWA results.

Maybe this is related:

Is it possible to get the consensus sequence in simple FASTA format with SAMtools?

**jess** · 06-03-2009, 08:09 AM

I tried using the -c option, but the pileup output is the same even without this option! I gave a command something like this:

samtools pileup -f ref.fasta aln_sorted.bam -s -c -v >test.pileup

Let me know where I am going wrong!

**jess** · 06-03-2009, 11:23 AM

OK! So I know where I was going wrong...
The alignment file should be put last, after all the options.

**lh3** · 06-04-2009, 04:09 AM

samtools.pl is now updated in SVN:

http://samtools.svn.sourceforge.net/viewvc/samtools/trunk/samtools/misc/samtools.pl?view=markup

pileup2fq is implemented, similar to maq's cns2fq. Please note that samtools.pl filters based on the RMS mapping quality (-Q), while maq's cns2fq filters on the maximum mapping quality. Also, pileup2fq masks a small region around a potential indel, but maq's cns2fq does not. The overall accuracy looks similar to maq, though.

**jess** · 06-04-2009, 05:54 AM

Thanks Heng. I will try and let you know if I get stuck in something.

**corthay** · 06-04-2009, 09:36 PM

Thank you for your speedy response.

I have one more question. I got the following results using bwa (0.4.9), my favorite.
seq-name#0 69 * 0 0 * * 0 0 (sequence) (quality)
seq-name#0 133 * 0 0 * * 0 0 (sequence) (quality)

Neither read is mapped, but the "mate is unmapped" flag bit is 0 in both.
How should I interpret this?

**lh3** · 06-05-2009, 01:17 AM

This is a flaw in bwa when generating SAM. I will fix it.

It is not so easy to generate absolutely correct SAM due to the dependency between fields and between mates. We tried to minimize the dependency in design, but reducing dependency causes inconvenience in other cases. There is always a balance.
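
The two flags from the question can be decoded bit by bit. The following is a minimal sketch, assuming the standard FLAG bit definitions from the SAM specification; the `decode_flag` helper and the label strings are illustrative names, not part of samtools or bwa.

```python
# FLAG bits as defined in the SAM specification (lower eight bits only).
# The label strings are illustrative wording, not official names.
FLAG_BITS = {
    0x1: "read is paired",
    0x2: "mapped in a proper pair",
    0x4: "read itself is unmapped",
    0x8: "mate is unmapped",
    0x10: "read on reverse strand",
    0x20: "mate on reverse strand",
    0x40: "first read in pair",
    0x80: "second read in pair",
}


def decode_flag(flag):
    """Return the labels of all bits set in a SAM FLAG value."""
    return [label for bit, label in FLAG_BITS.items() if flag & bit]


for flag in (69, 133):
    print(flag, decode_flag(flag))
    # Setting the missing "mate is unmapped" bit gives the consistent value.
    print("  with mate-unmapped bit set:", flag | 0x8)
```

Under these definitions, 69 and 133 both have the "read itself is unmapped" bit set but not the "mate is unmapped" bit, which is the inconsistency described above; setting bit 0x8 as well would give 77 and 141.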

**corthay** · 06-05-2009, 01:31 AM

I appreciate that you immediately replied to my question.
I would like to handle the sam format files.

**krawitzp** · 06-09-2009, 07:20 AM

genome likelihood format

Hi,
where can I find further documentation on the genome likelihood format 3.0 ?
thanks,
peter

**ElMichael** · 06-25-2009, 07:36 AM

Hi,
could anybody please explain the output format of the wgsim_eval.pl script?
I used this script to evaluate an aln.sam file after aligning with BWA.

06x 1654169 / 3308330 3308330 5.000e-01
05x 31765 / 63530 3371860 5.000e-01
04x 4938 / 9872 3381732 5.000e-01
03x 163891 / 327252 3708984 5.001e-01
02x 65120 / 129918 3838902 5.001e-01
01x 2669 / 5090 3843992 5.001e-01
00x 113748 / 141416 3985408 5.109e-01

BTW, the BWA manual says: "These reads are mapped with bowtie, bwa, maq and soap... The resultant alignments were then evaluated with wgsim_eval.pl script."
How could I use this script for alignments from other programs such as bowtie or soap?
thanks,
Mike.
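
The numbers quoted above are at least internally consistent with one reading of the columns, which can be checked with a short script. This is a sketch under an assumption, not documentation of wgsim_eval.pl: it assumes the second and third columns are per-bin counts (plausibly wrongly placed reads and total reads in that mapping-quality bin), the fourth is the running total of the third, and the fifth is the running sum of the second divided by the fourth. Only the arithmetic is verified here; the meaning of the counts is inferred.

```python
# Rows copied from the post: label, count A, count B, reported cumulative B,
# reported ratio (kept as a string so it can be compared after formatting).
ROWS = [
    ("06x", 1654169, 3308330, 3308330, "5.000e-01"),
    ("05x", 31765, 63530, 3371860, "5.000e-01"),
    ("04x", 4938, 9872, 3381732, "5.000e-01"),
    ("03x", 163891, 327252, 3708984, "5.001e-01"),
    ("02x", 65120, 129918, 3838902, "5.001e-01"),
    ("01x", 2669, 5090, 3843992, "5.001e-01"),
    ("00x", 113748, 141416, 3985408, "5.109e-01"),
]


def check_rows(rows):
    """Check the assumed relation: col4 = running sum of col3,
    col5 = running sum of col2 / col4."""
    cum_a = 0
    cum_b = 0
    results = []
    for label, a, b, reported_cum, reported_ratio in rows:
        cum_a += a
        cum_b += b
        total_ok = cum_b == reported_cum
        ratio_ok = f"{cum_a / cum_b:.3e}" == reported_ratio
        results.append((label, total_ok, ratio_ok))
    return results


for label, total_ok, ratio_ok in check_rows(ROWS):
    print(label, "cumulative total matches:", total_ok, "ratio matches:", ratio_ok)
```

If the assumed reading is right, the last column is a cumulative fraction over all reads at or above each bin, so a value near 0.5 in every row would mean about half of the evaluated reads are counted in the second column.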

**gcrdb** · 06-30-2009, 10:35 AM

hi, I have trouble converting SAM to BAM. I tried both:

samtools import ref .fai in.sam out.bam
got error:
[sam_header_read2] 22 sequences loaded.
[sam_read1] reference '-143963499' is recognized as '*'.
Parse error at line 1: invalid CIGAR operation
Aborted

samtools view -bt ref .fai -o in.sam out.bam
and got similar error:
[sam_header_read2] 22 sequences loaded.
[sam_read1] reference '' is recognized as '*'.
[main_samview] truncated file.

thanks,
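
An "invalid CIGAR operation" error at line 1 may mean that the fields of the first record are not where the parser expects them. A quick way to look for that is a small sanity check over the SAM text. This is a hypothetical diagnostic sketch, not what samtools does internally; `check_sam_lines` is an illustrative name, and the check only covers the field count and the CIGAR column.

```python
import re

# CIGAR is either "*" or one or more <length><operation> pairs.
CIGAR_RE = re.compile(r"^(\*|(\d+[MIDNSHP=X])+)$")


def check_sam_lines(lines):
    """Return (line_number, problem) pairs for records that look malformed."""
    problems = []
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if not line or line.startswith("@"):
            continue  # skip blank lines and header lines
        fields = line.split("\t")
        if len(fields) < 11:
            problems.append(
                (number, f"expected at least 11 tab-separated fields, found {len(fields)}")
            )
            continue
        if not CIGAR_RE.match(fields[5]):
            problems.append((number, f"suspicious CIGAR: {fields[5]!r}"))
    return problems


if __name__ == "__main__":
    import sys

    # Usage (assumed file name): python check_sam.py in.sam
    if len(sys.argv) > 1:
        with open(sys.argv[1]) as handle:
            for number, problem in check_sam_lines(handle):
                print(f"line {number}: {problem}")
```

A record that was split on spaces instead of tabs, or that lost a column, would show up here as a field-count problem on its line.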

**lh3** · 07-07-2009, 07:32 AM

Lincoln has released SAM/BAM perl APIs a few days (weeks?) ago. It is here:

http://search.cpan.org/~lds/Bio-SamTools-1.00/lib/Bio/DB/Sam.pm

Compiling this module requires the samtools C source code. Bio::DB::Sam is known to work with samtools-0.1.4 and 0.1.5 (released today).

BTW, the latest samtools supports opening BAM files over FTP. For example:

samtools tview ftp://ftp.ncbi.nih.gov/1000genomes/f...32.2009_06.bam

**gcrdb** · 07-13-2009, 11:04 AM

The Bio::DB::Sam Perl API needs to start from BAM files (-bam), not SAM files (there is no "-sam" at all). I only have SAM files from bwa; all I need is to convert SAM to BAM.
I am stuck with SAM files.....
samtools import ref .fai in.sam out.bam
got error:
[sam_header_read2] 22 sequences loaded.
[sam_read1] reference '-143963499' is recognized as '*'.
Parse error at line 1: invalid CIGAR operation
Aborted

thanks,

**ohofmann** · 07-16-2009, 01:03 PM

Bit of a newbie question. I've been trying to use the pileup analysis on a BWA dataset. Is there any way to switch off the read bases, read quality and alignment quality information in the output file and get a summarized format instead?

I'm looking at a small number of sequences that have a coverage of 50.000X upwards, and as a result the pileup output becomes almost unmanageable.

Thanks!
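
One way to get a lighter file is to post-process the pileup text and keep only the leading columns. The sketch below assumes the plain pileup layout (without -c), where the first four tab-separated columns are sequence name, position, reference base and read depth, followed by the per-read columns; `summarize_pileup` and the file names are hypothetical, and this is not a samtools option.

```python
def summarize_pileup(lines, keep=4):
    """Yield each pileup line truncated to its first `keep` columns.

    With the assumed plain pileup layout, keep=4 retains sequence name,
    position, reference base and depth, and drops the per-read columns.
    """
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        yield "\t".join(fields[:keep])


if __name__ == "__main__":
    import sys

    # Usage (assumed file names): python summarize.py < test.pileup > summary.txt
    # Reads line by line, so very deep positions are never held in memory together.
    for row in summarize_pileup(sys.stdin):
        print(row)
```

If the pileup was produced with -c, the consensus columns come before the depth, so `keep` would need to be adjusted to match that layout.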
