Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • jkbonfield
    Senior Member
    • Jul 2008
    • 146

    #61
    I'm not sure if it's the case here, but I've noticed the CIGAR string has major issues if you attempt to include gaps in the clipped sequence.

    Or rather CIGAR works fine I assume, but samtools does not. (It's not really a big issue as the only time I've seen this happen is someone manually trimming an alignment back.)

    Comment

    • mikyatope
      Junior Member
      • Jun 2009
      • 3

      #62
      Originally posted by zee View Post
      Is there a way to convert a SAM consensus output (using -c option for pileup) to the old maq-style .cns consensus?

      I have some maq-based pipelines I would like to use on my BWA results.
      maybe it's related.

      Is possible get the consensus sequence in a simple fasta format with SAMtools?

      Comment

      • jess
        Junior Member
        • May 2009
        • 7

        #63
        I tried using the -c option,bt the pileup output is same evn widout this option! I gave d command smfink like dis:


        samtools pileup -f ref.fasta aln_sorted.bam -s -c -v >test.pileup

        Let me know wher I m gng Wrong!

        Comment

        • jess
          Junior Member
          • May 2009
          • 7

          #64
          ok! So i knw where i ws gng wrong...
          the .aln file shud be put in last after all d options.

          Comment

          • lh3
            Senior Member
            • Feb 2008
            • 686

            #65
            samtools.pl now updated at SVN:

            Download SAM tools for free. SAM (Sequence Alignment/Map) is a flexible generic format for storing nucleotide sequence alignment. SAMtools provide efficient utilities on manipulating alignments in the SAM format.


            pileup2fq is implemented, similar to maq's cns2fq. Please note that samtools.pl filters based on the RMS mapping quality (-Q) while maq's cns2fq filters on the maximum mapping quality. Also, pileup2fq masks a small region around an potential indel, but maq's cns2fq does not. The overall accuracy looks similar to maq, though.

            Comment

            • jess
              Junior Member
              • May 2009
              • 7

              #66
              Thanks Heng. I will try and let you know if I get stuck in something.

              Comment

              • corthay
                Member
                • Oct 2008
                • 25

                #67
                Thank you for your speedy response.

                I have one more question. I got following results by using bwa(0.4.9), my favorite.
                seq-name#0 69 * 0 0 * * 0 0 (sequence) (quality)
                seq-name#0 133 * 0 0 * * 0 0 (sequence) (quality)

                Both reads do not be mapped but the flag for "the mate is unnmapped" are 0.
                How should I interpret it?

                Comment

                • lh3
                  Senior Member
                  • Feb 2008
                  • 686

                  #68
                  This is a flaw in bwa when generating SAM. I will fixed it.

                  It is not so easy to generate absolutely correct SAM due to the dependency between fields and between mates. We tried to minimize the dependency in design, but reducing dependency causes inconvenience in other cases. There is always a balance.

                  Comment

                  • corthay
                    Member
                    • Oct 2008
                    • 25

                    #69
                    I appreciate that you immediately replied to my question.
                    I would like to handle the sam format files.

                    Comment

                    • krawitzp
                      Junior Member
                      • Jun 2009
                      • 1

                      #70
                      genome likelihood format

                      Hi,
                      where can I find further documentation on the genome likelihood format 3.0 ?
                      thanks,
                      peter

                      Comment

                      • ElMichael
                        Member
                        • Jun 2009
                        • 31

                        #71
                        Hi,
                        could anybody, please, explain the output format of the wgsim_eval.pl script?
                        I used this script to evaluate aln.sam file after making alignment with BWA.
                        06x 1654169 / 3308330 3308330 5.000e-01
                        05x 31765 / 63530 3371860 5.000e-01
                        04x 4938 / 9872 3381732 5.000e-01
                        03x 163891 / 327252 3708984 5.001e-01
                        02x 65120 / 129918 3838902 5.001e-01
                        01x 2669 / 5090 3843992 5.001e-01
                        00x 113748 / 141416 3985408 5.109e-01
                        BTW, in the BWA-man is written that " These reads are mapped with bowtie, bwa, maq and soap... The resultant alignments were then evaluated with wgsim_eval.pl script. "
                        How could I use this script for alignments from other programs such as bowtie, soap?
                        thanks,
                        Mike.

                        Comment

                        • gcrdb
                          Junior Member
                          • Jan 2009
                          • 9

                          #72
                          hi, I have trouble conveting sam to bam.. I tried both:

                          samtools import ref .fai in.sam out.bam
                          got error:
                          [sam_header_read2] 22 sequences loaded.
                          [sam_read1] reference '-143963499' is recognized as '*'.
                          Parse error at line 1: invalid CIGAR operation
                          Aborted

                          samtools view -bt ref .fai -o in.sam out.bam
                          and got similar error:
                          [sam_header_read2] 22 sequences loaded.
                          [sam_read1] reference '' is recognized as '*'.
                          [main_samview] truncated file.

                          thanks,

                          Comment

                          • lh3
                            Senior Member
                            • Feb 2008
                            • 686

                            #73
                            Lincoln has released SAM/BAM perl APIs a few days (weeks?) ago. It is here:



                            Compiling this module requires samtools C source codes. Bio:B::Sam is known to work with samtools-0.1.4 and 0.1.5 (released today).

                            BTW, the latest samtools supports opening BAM files over FTP. For example:

                            samtools tview ftp://ftp.ncbi.nih.gov/1000genomes/f...32.2009_06.bam

                            Comment

                            • gcrdb
                              Junior Member
                              • Jan 2009
                              • 9

                              #74
                              Bio:B::Sam perl APIs need to start from BAM files (-bam) , not SAM files(no "-sam" at all). I only have SAM files which from bwa, all I need is to convert SAM to BAM.
                              I am stuck with SAM files.....
                              samtools import ref .fai in.sam out.bam
                              got error:
                              [sam_header_read2] 22 sequences loaded.
                              [sam_read1] reference '-143963499' is recognized as '*'.
                              Parse error at line 1: invalid CIGAR operation
                              Aborted

                              thanks,

                              Comment

                              • ohofmann
                                Member
                                • Jan 2009
                                • 37

                                #75
                                Bit of a newbie question. I've been trying to use the pileup analysis on a BWA dataset. Is there any way to switch of the read bases, read quality and alignment quality information in the output file and get a summarized format instead?

                                I'm looking at a small number of sequences that have a coverage of 50.000X upwards, and as a result the pileup output becomes almost unmanageable.

                                Thanks!

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, Today, 08:59 AM
                                0 responses
                                3 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                21 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...