Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How working with sam format and blast?

    Hi,

    I'm trying to work with the BWA, but I'm having a problem.
    After doing the alignment with BWA, I need to pass the generated file to fasta format, so I can make a blast.
    But when I do this, apparently the fasta file generated comter seems only the reads. Lose alignment. Anyone know what's happening?
    I use te samtools to pass the file in sam format for bam format, and after I use the blast2fastx to convert the alignment in bam format to fasta format.
    I don't know if have some influence, but I'm work with the paired-end alignment.

  • #2
    Converting from SAM or BAM to fasta will result in information loss, unless the converter happens to store all of the alignment information in the description line (and then blast keeps it).

    After using bwa your reads are aligned, why are you then blasting things? If you describe what you're really trying to do we might be able to tell you a more efficient way.

    Comment


    • #3
      I did the alignment using BWA, with a reference genome and my reads. Then I had to make a blast alignment I generated to verify that the generated alignment contained some sequence of chloroplast DNA.

      Comment


      • #4
        Originally posted by Guigra View Post
        I did the alignment using BWA, with a reference genome and my reads. Then I had to make a blast alignment I generated to verify that the generated alignment contained some sequence of chloroplast DNA.
        How about doing it the other way around? Filter your reads using BLAST against a chloroplast DNA database, and then use the remaining reads with BWA to map against the genome

        Unless you want to identify chloroplast insertions in the nuclear genome?

        Comment


        • #5
          Actually I'm trying to check if material extraction for sequencing was done correctly without leftover remnants of chloroplast DNA.
          Your tip solved my problem, thank you! You know how to make this filter in the BWA or bowtie, or some other program? If you know tell me how can I do?

          Comment


          • #6
            Originally posted by Guigra View Post
            Actually I'm trying to check if material extraction for sequencing was done correctly without leftover remnants of chloroplast DNA.
            Your tip solved my problem, thank you! You know how to make this filter in the BWA or bowtie, or some other program? If you know tell me how can I do?
            Well you could align reads to the chloroplast genome of your choice, then extract unmapped reads (something like samtools view -F 4....you'll have to double check the flags) and use that to input into BWA for your actual alignment.

            Comment


            • #7
              Originally posted by jimmybee View Post
              Well you could align reads to the chloroplast genome of your choice, then extract unmapped reads (something like samtools view -F 4....you'll have to double check the flags) and use that to input into BWA for your actual alignment.
              Thanks for the help! I will use your tip.

              Comment


              • #8
                Or just included the chloroplast genome in the reference with the nuclear genome and do the entire alignment in one go.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Exploring the Dynamics of the Tumor Microenvironment
                  by seqadmin




                  The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                  07-08-2024, 03:19 PM
                • seqadmin
                  Exploring Human Diversity Through Large-Scale Omics
                  by seqadmin


                  In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                  06-25-2024, 06:43 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 07-10-2024, 07:30 AM
                0 responses
                30 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 07-03-2024, 09:45 AM
                0 responses
                201 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 07-03-2024, 08:54 AM
                0 responses
                212 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 07-02-2024, 03:00 PM
                0 responses
                194 views
                0 likes
                Last Post seqadmin  
                Working...
                X