  • nivea
    Member
    • Apr 2011
    • 17

    Can BLAST do the RNA-seq reads mapping efficiently?

    Hi all,

    I'm working with low-quality reads from a bacterium; the reads are 100 bases long. Most software cannot map the majority of the reads back to the reference genome when no more than three mismatches are allowed. Can I use BLAST to do the mapping instead? Will it be too slow?

    Thanks
  • Thorondor
    Member
    • Feb 2011
    • 69

    #2
    How many reads? What is the size of your reference genome? Which software have you tried already?

    • Simon Anders
      Senior Member
      • Feb 2010
      • 995

      #3
      If you think you have more than three mismatches, why do you limit your aligner to three? (Your question sounds a bit as if you think that most short-read aligners have a hard-coded limit on the maximum number of mismatches allowed. They don't.)

      • Thorondor
        Member
        • Feb 2011
        • 69

        #4
        If you have Illumina reads, you might want to read this review:
        http://seqanswers.com/forums/showthread.php?t=11045

        • doc.ramses
          Member
          • Jan 2011
          • 26

          #5
          Originally posted by Thorondor
          If you have Illumina reads, you might want to read this review:
          http://seqanswers.com/forums/showthread.php?t=11045
          Can someone send me the paper via PM ?
          Thank you !

          • nivea
            Member
            • Apr 2011
            • 17

            #6
            Originally posted by Thorondor
            How many reads? What is the size of your reference genome? Which software have you tried already?
            Hi Thorondor,

            I used Bowtie to map 32129789 reads (76 bp) to the bacterium E. coli K12, allowing at most 3 mismatches. However, only 0.02% of the reads mapped uniquely to the genome.
            After checking the results with BLAST, I found that for most of the reads only the first 35 bases got a hit in the genome. So it's probably sequencing errors that prevent Bowtie from mapping them back.

            So I want to try BLAST. But do you think it's too slow? And is it accurate enough for read mapping?

            Thanks for the reply~

            • nivea
              Member
              • Apr 2011
              • 17

              #7
              Originally posted by Simon Anders
              If you think you have more than three mismatches, why do you limit your aligner to three? (Your question sounds a bit as if you think that most short-read aligners have a hard-coded limit on the maximum number of mismatches allowed. They don't.)
              Hi Simon,


              The problem is the quality of the reads, as you can see from my post above. Because of sequencing errors, fewer than 35 bp of most reads can be mapped back to the reference genome. If I want to make use of the current reads, do you think BLAST is good enough?

              Thanks~

              • Simon Anders
                Senior Member
                • Feb 2010
                • 995

                #8
                Actually, Bowtie should disregard mismatches on low-quality bases, but this does not work that well.

                However, why don't you simply trim off the bad-quality parts and give the trimmed reads to Bowtie? In a bacterial genome, 35 bp should be more than enough to get good matches.

                regarding BLAST: It is way too slow for millions of reads, and its model of scoring mismatches is based on assuming mutations rather than read errors to be the cause, which is inappropriate.
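The trimming suggested above could look roughly like the following. This is only a minimal sketch, not what Bowtie or any particular trimming tool does internally: the function name, the quality cutoff of 20 and the Phred+33 offset are all assumptions made for illustration (older Illumina pipelines wrote Phred+64, so check your files).

```python
def trim_low_quality_tail(seq, qual, min_q=20, offset=33):
    """Remove trailing bases whose Phred score is below min_q.

    offset=33 assumes Sanger/Phred+33 encoding; older Illumina pipelines
    used Phred+64, so check your FASTQ files before relying on this.
    """
    end = len(seq)
    # Walk back from the 3' end while the base quality is too low.
    while end > 0 and ord(qual[end - 1]) - offset < min_q:
        end -= 1
    return seq[:end], qual[:end]


# Hypothetical read: five good bases ('I' = Q40) followed by three bad ones ('#' = Q2).
seq, qual = trim_low_quality_tail("ACGTACGT", "IIIII###")
print(seq, qual)  # ACGTA IIIII
```

Reads that end up very short after trimming are probably best discarded before mapping, since they are more likely to match in several places.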

                • Thorondor
                  Member
                  • Feb 2011
                  • 69

                  #9
                  so why not trim the reads first and then try to map without quality values? You don't want to assemble bad reads anyway and they mostly won't overlap if you don't trim. Or you could just cut all your fastq reads to a length of 40 bp and then try to map with BWA.

                  Maybe you also want to have a look at Cufflinks for your assembly? http://cufflinks.cbcb.umd.edu/

                  edit: simon was faster :-/
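Cutting every read to a fixed length, as suggested above, might be sketched like this. It assumes plain four-line FASTQ records (no wrapped sequence lines); the function name and the example record are made up for illustration, and 40 is simply the length mentioned in the post.

```python
from itertools import islice


def truncate_fastq(lines, length=40):
    """Yield FASTQ lines with sequence and quality cut to `length` bases.

    Assumes each record is exactly four lines: header, sequence, '+', quality.
    """
    it = iter(lines)
    while True:
        record = list(islice(it, 4))
        if len(record) < 4:
            break  # end of input (or an incomplete trailing record)
        header, seq, plus, qual = (x.rstrip("\n") for x in record)
        yield header
        yield seq[:length]
        yield plus
        yield qual[:length]


# Hypothetical single-record input, cut to 5 bases to keep the example short.
example = ["@read1\n", "ACGTACGT\n", "+\n", "IIIIIIII\n"]
for line in truncate_fastq(example, length=5):
    print(line)
```

Because the function takes any iterable of lines, an open file handle can be passed in directly and the output written line by line to a new FASTQ file for the aligner.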

                  • nivea
                    Member
                    • Apr 2011
                    • 17

                    #10
                    Originally posted by Simon Anders
                    Actually, Bowtie should disregard mismatches on low-quality bases, but this does not work that well.

                    However, why don't you simply trim off the bad-quality parts and give the trimmed reads to Bowtie? In a bacterial genome, 35 bp should be more than enough to get good matches.

                    regarding BLAST: It is way too slow for millions of reads, and its model of scoring mismatches is based on assuming mutations rather than read errors to be the cause, which is inappropriate.
                    Thanks Simon! That's very helpful.

                    • nivea
                      Member
                      • Apr 2011
                      • 17

                      #11
                      Originally posted by Thorondor
                      so why not trim the reads first and then try to map without quality values? You don't want to assemble bad reads anyway and they mostly won't overlap if you don't trim. Or you could just cut all your fastq reads to a length of 40 bp and then try to map with BWA.

                      Maybe you also want to have a look at Cufflinks for your assembly? http://cufflinks.cbcb.umd.edu/

                      edit: simon was faster :-/

                      Thanks Thorondor!
