Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • oiiio
    Senior Member
    • Jan 2011
    • 105

    Is this true?

    I was going to put this in the 'literature watch' section, but decided to place it here because its more a question about alignment tools.

    In the latest online release (April 20) of Genome Research there is "lobSTR: A short tandem repeat profiler for personal genomes". Reading some of the results in this paper are quite interesting...

    I attached a screenshot of one of the tables where it does a comparison to other popular read aligners (100bp Illumina).

    In the column titled 'indel tolerance(bp)', only BLAT is capable of going past 7 bp indels? Is this true? And i'm assuming that the comparison for bowtie was not bowtie2..
    Attached Files
  • oiiio
    Senior Member
    • Jan 2011
    • 105

    #2
    Here is a link to the full paper

    Comment

    • xied75
      Senior Member
      • Feb 2012
      • 129

      #3
      The numbers are really interesting. Can't speak for others, but for BWA, I can do 15.8 million human paired-end 90bp reads in 576 seconds real time with 30 threads (-t 30, total CPU time 12000 seconds). The paper's time is bit slow, does that include BWA SAMSE/SAMPE as well?

      In BWA ALN, for reads between 93-124, the maxdiff by default is 5, that's the gap number you saw in the table.

      Best,

      dong

      Comment

      • zee
        NGS specialist
        • Apr 2008
        • 249

        #4
        This is a fairly old version of novoalign that was used in the comparison. In that older version the gap extension penalty of 15 was much higher than BWA or Bowtie's 5. In our latest versions we have now set it to 6 which is more comparable.
        Novoalign will definitely pick up indels greater than 7bp. I have generated indels with novoalign-dedup-Dindel that can go as high as 40bp.

        Also, on the speed note it looks like they compared novoalign single-threaded version to their parallel version and likewise for BWA. I have not read the whole paper but I would think they should try apples-to-apples wherever possible.

        Regarding that attached table I dont know what "noninformative reads" actually refers to but I think the authors are showing that their tool is best because it finds 0 noninformative reads. On the flip side lobSTR does not report the highest number of "informative" reads.

        Comment

        • dietmar13
          Senior Member
          • Mar 2010
          • 107

          #5
          interesting, but

          they should use a real competitor for their speed test, not the lame ducks



          i've tested RUM, STAR, and Tophat with a RNAseq data-set, followed by DE analyses, and found no major differences between these three aligners concerning DE gene lists, except mapping speed: STAR was by far the fastest...

          Comment

          • adaptivegenome
            Super Moderator
            • Nov 2009
            • 436

            #6
            It is odd that speed is a major point in the paper. It is a new method for genotyping repeats. My question is whether it produces more accurate alignments (which an ROC plot would reveal) and really whether it produces more accurate genotypes. It was not clear to me that they tested either in the manuscript.

            Comment

            • dvanic
              Member
              • Jan 2012
              • 61

              #7
              i've tested RUM, STAR, and Tophat with a RNAseq data-set, followed by DE analyses, and found no major differences between these three aligners concerning DE gene lists, except mapping speed: STAR was by far the fastest...
              How were they with alternative isoform detection?
              And what do you mean by "no major differences"?

              Comment

              • erlichya
                Junior Member
                • Apr 2012
                • 1

                #8
                Confusion

                Hi Guys,

                I think there is some confusion here. The table was generated using the default parameters of different aligners. The run times of all tools (*including lobSTR*) was determined using a single thread. We also said that in the main text.

                So, Zee, for your question, we did compare 'apples to apples'. Next time, please try to make an effort to read the manuscript that you are criticizing.

                Will be happy to answer any other question.

                Yaniv

                Comment

                • arvid
                  Senior Member
                  • Jul 2011
                  • 156

                  #9
                  Originally posted by erlichya View Post
                  Hi Guys,

                  I think there is some confusion here. The table was generated using the default parameters of different aligners. The run times of all tools (*including lobSTR*) was determined using a single thread. We also said that in the main text.

                  So, Zee, for your question, we did compare 'apples to apples'. Next time, please try to make an effort to read the manuscript that you are criticizing.

                  Will be happy to answer any other question.

                  Yaniv
                  Hmm, I don't quite agree that comparing speed, sensitivity and accuracy of aligners using their default settings make much sense, when these default settings differ - typically the default parameters are optimized for slightly different tasks. This is a typical 'apples with oranges' situation, IMHO. It would make sense to set similar sensitivity settings on all aligners before comparing anything...

                  Running in a single thread makes sense for strict algorithm comparison, but doesn't reflect a real usage situation, however. Does lobSTR scale well in parallelization?

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                    Here are nine questions we think about, in roughly the order they matter, before...
                    06-18-2026, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  26 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  44 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  48 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-04-2026, 08:59 AM
                  0 responses
                  49 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...