Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How working with sam format and blast?

    Hi,

    I'm trying to work with the BWA, but I'm having a problem.
    After doing the alignment with BWA, I need to pass the generated file to fasta format, so I can make a blast.
    But when I do this, apparently the fasta file generated comter seems only the reads. Lose alignment. Anyone know what's happening?
    I use te samtools to pass the file in sam format for bam format, and after I use the blast2fastx to convert the alignment in bam format to fasta format.
    I don't know if have some influence, but I'm work with the paired-end alignment.

  • #2
    Converting from SAM or BAM to fasta will result in information loss, unless the converter happens to store all of the alignment information in the description line (and then blast keeps it).

    After using bwa your reads are aligned, why are you then blasting things? If you describe what you're really trying to do we might be able to tell you a more efficient way.

    Comment


    • #3
      I did the alignment using BWA, with a reference genome and my reads. Then I had to make a blast alignment I generated to verify that the generated alignment contained some sequence of chloroplast DNA.

      Comment


      • #4
        Originally posted by Guigra View Post
        I did the alignment using BWA, with a reference genome and my reads. Then I had to make a blast alignment I generated to verify that the generated alignment contained some sequence of chloroplast DNA.
        How about doing it the other way around? Filter your reads using BLAST against a chloroplast DNA database, and then use the remaining reads with BWA to map against the genome

        Unless you want to identify chloroplast insertions in the nuclear genome?

        Comment


        • #5
          Actually I'm trying to check if material extraction for sequencing was done correctly without leftover remnants of chloroplast DNA.
          Your tip solved my problem, thank you! You know how to make this filter in the BWA or bowtie, or some other program? If you know tell me how can I do?

          Comment


          • #6
            Originally posted by Guigra View Post
            Actually I'm trying to check if material extraction for sequencing was done correctly without leftover remnants of chloroplast DNA.
            Your tip solved my problem, thank you! You know how to make this filter in the BWA or bowtie, or some other program? If you know tell me how can I do?
            Well you could align reads to the chloroplast genome of your choice, then extract unmapped reads (something like samtools view -F 4....you'll have to double check the flags) and use that to input into BWA for your actual alignment.

            Comment


            • #7
              Originally posted by jimmybee View Post
              Well you could align reads to the chloroplast genome of your choice, then extract unmapped reads (something like samtools view -F 4....you'll have to double check the flags) and use that to input into BWA for your actual alignment.
              Thanks for the help! I will use your tip.

              Comment


              • #8
                Or just included the chloroplast genome in the reference with the nuclear genome and do the entire alignment in one go.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  04-22-2024, 07:01 AM
                • seqadmin
                  Current Approaches to Protein Sequencing
                  by seqadmin


                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                  04-04-2024, 04:25 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 04-25-2024, 11:49 AM
                0 responses
                20 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-24-2024, 08:47 AM
                0 responses
                20 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-11-2024, 12:08 PM
                0 responses
                62 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 10:19 PM
                0 responses
                61 views
                0 likes
                Last Post seqadmin  
                Working...
                X