Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • morning latte
    Member
    • Jun 2013
    • 91

    bowtie mapping

    Hello,

    This is my first time to use bowtie and have trouble with running it and interpreting the output.

    Below is the command I used.
    bowtie index -f fastafile > output

    The number of reads with at least one reported alignment is less than 1%. Please help me if there is anyway that I can try to improve it. Thanks.
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    Try blasting a few of the reads that don't map and see what pops up. It's unusual in this day and age to see someone mapping a fasta file, is this old data?

    Comment

    • morning latte
      Member
      • Jun 2013
      • 91

      #3
      Thanks for q quick respond.
      No, it is not old data. It was generated from Illumina Hiseq. I convert fastq files to fasta file to reduce size of the dataset. Sounds like quality information of fastq files affects bowtie mapping? In that case, should I use fastq files? Thanks again.

      Comment

      • dpryan
        Devon Ryan
        • Jul 2011
        • 3478

        #4
        Yes, use the fastq files. Also, use the quality information to perform quality trimming (trim adapters while you're doing that). Have a look at trimmomatic or trim_galore if you're unfamiliar with trimming reads.

        Comment

        • morning latte
          Member
          • Jun 2013
          • 91

          #5
          Thanks.

          I used quality information for quality trimming. Before assembly, I transformed fastq files to fasta files. Could you explain a bit why fastq files is more useful than fasta for bowtie mapping? In the meanwhile let me try mapping with fastq files.

          Comment

          • dpryan
            Devon Ryan
            • Jul 2011
            • 3478

            #6
            Ah, so you're trying to map against a reference that you assembled with these same reads? I'm surprised that that yielded such a poor result.

            Fastq files are generally more useful for mapping as the base-calling quality can be used to moderate mismatch penalties. Suppose you have a read with two equally good alignments, each having a single mismatch, but at different positions. If alignment #1 has the mismatch at a base with a high Phred score while alignment #2 has its mismatch at a base with a low Phred score, then alignment #2 is more likely to be correct (since the mismatch base was more likely to have been miscalled). It's not unusual for there to be slight quality dips near the end of reads, so utilising this information helps deal with that. Having said that, I wouldn't hold my breath that this will help in your situation. Perhaps something went amiss with your assembly. If there's a related organism, you might try mapping against that just for comparison. If the alignments are better, then the assembly is probably just bad.

            Comment

            • swbarnes2
              Senior Member
              • May 2008
              • 910

              #7
              Yeah, everyone keeps the fastqs as fastqs, people don't throw away the quality data. Once upon a time, the SRA actually wanted the four channel quality data, not just the single quality number.

              So stop converting them.

              But notice that most aligners will work with gzipped fastqs. So just keep them compressed. And notice that your .bam contains all the information that your original fastq has, (or with bowite default, the mapped .bam + the unmapped .bam) so once you make those .bams, you can archive the original fastq, if you want.

              Comment

              • morning latte
                Member
                • Jun 2013
                • 91

                #8
                Sorry that I replied late. I really appreciated your advices.

                Comment

                Latest Articles

                Collapse

                • GATTACAT
                  Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                  by GATTACAT
                  Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                  07-01-2026, 11:43 AM
                • SEQadmin2
                  Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                  by SEQadmin2


                  I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                  Here are nine questions we think about, in roughly the order they matter, before...
                  06-18-2026, 07:11 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, Yesterday, 11:08 AM
                0 responses
                6 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-30-2026, 05:37 AM
                0 responses
                11 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-26-2026, 11:10 AM
                0 responses
                19 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-17-2026, 06:09 AM
                0 responses
                53 views
                0 reactions
                Last Post SEQadmin2  
                Working...