Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to perform alignment between BWA's sam file and contigs.fasta assembly files

    I performed de novo genome assembly on a bacteria sample (two files: reads.1.fq and reads.2.fq), got resulting the "contigs.fasta" file, used BWA to map the two initial fq read files to "contigs.fasta". Now I have the resulting SAM file from BWA.

    How can I perform alignment between those two files and compare my "contigs.fasta" file with the SAM file resulting from BWA?

  • #2
    You have already performed an alignment with your original data files to the contigs.fasta, which resulted in the SAM file. You would want to convert this SAM file to BAM (binary representation) using samtools, sort it and then use a viewer such as Integrated Genome Viewer (IGV) to view the alignments.

    Comment


    • #3
      I used Tablet to view the alignments. However, that is not exactly what I am looking for, what I am trying to achieve here is that; the bacteria comes from soil, and in the process two bacteria genomes were extracted, I have the reads (shotgun sequencing) from only one bacteria, I performed de novo assembly to assemble the genome of that bacteria, then my task was to use the assembled genome of that bacteria to assemble the genome of the second bacteria. So I used BWA to achieve that. Now I am assuming that that SAM file is the genome of the second bacteria. Now I want to align the first bacteria against the second bacteria.

      Am I way off the road?

      By the way, the data is simulated. Bacteria one is simulated with Matlab, and bacteria two is basically bacteria one with some SNPs and deletions and insertions introduced by Matlab.

      Comment


      • #4
        If bacteria two is essentially like bacteria one (with some SNP's, deletions etc) then you are not going to be able to separate the two "genomes" (unless the reads were tagged to identify the two samples before they were sequenced/simulated).

        What exactly are you trying to do? Are these bacteria supposed to be from two strains or two separate species or two separate genera. That order indicates increasing differences that would be easier to resolve by sequencing. But just between the strains it is going to be difficult to separate the two genomes.

        BTW: bwa is an alignment program. An assembly program example would be SPAdes/velvet for bacterial genomes. Unless you are referring to using bwa to do a reference based alignment and then get a consensus from that alignment "assembly" of second genome.
        Last edited by GenoMax; 02-09-2016, 03:59 PM.

        Comment


        • #5
          I am not quite sure myself of the details, since this is only a "genome assembly" practice with several sub-tasks. These two bacteria are both from one soil sample.

          I did the genome assembly for the first bacteria using Velvet, SPAdes, ABySS, and Minia.

          Yes that is probably what I should do: "[...] get a consensus from that alignment "assembly" of second genome."

          In short what I have done so far is:

          (i) Performed genome assembly using the aforementioned assemblers
          (ii) Used MUMmer to compare the results of different assemblers
          (iii) Analysed the assembly results using QUAST, chose the one that was best in terms of quality
          (iv) used that contigs.fasta file and the original reads and BWA in the following way in hope of getting the second bacteria:

          bwa index contigs.fa
          bwa aln contigs.fa reads.1.fq > align_1.sai
          bwa aln contigs.fa reads.2.fq > align_2.sai
          bwa sampe contigs.fa align_1.sai align_2.sai reads.1.fq reads.2.fq > align_bwa.sam


          Details of the task if needed:

          Comment


          • #6
            "Closely related bacteria" is vague and subject to interpretation. The farther you are allowed to go the better your chances will be of "separating" the genomes.

            You actually should be thinking about the reads that don't map because they would potentially represent the "other" bacterium. You would want to separate and then assemble those. This task appears to be not very well defined.
            Last edited by GenoMax; 02-09-2016, 05:08 PM.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Recent Advances in Sequencing Analysis Tools
              by seqadmin


              The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
              Yesterday, 07:48 AM
            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Today, 06:57 AM
            0 responses
            9 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, Yesterday, 07:17 AM
            0 responses
            14 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-02-2024, 08:06 AM
            0 responses
            19 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-30-2024, 12:17 PM
            0 responses
            23 views
            0 likes
            Last Post seqadmin  
            Working...
            X