Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Alex8
    Member
    • Oct 2010
    • 10

    SOLiD denovo: contigs corrupted after conversion from double-encoding to nucleotide

    Hello.

    I use SOLiD de novo pipeline (http://solidsoftwaretools.com/gf/project/denovo/) to assemble bacterial genome.
    I use Mummerplot for proof-mapping the resulting contigs to reference genome. Double-encoded contigs (contigs.de) look OK. However, after they are converted to base-space by ASiD utility, they break apart into small pieces, see figures. Single contig is double-encoded, while multiple fragments are what it turns into in base-space.

    Anybody has a suggestion on how to cure this problem?

    PS: I tried older version of SOLiD pipeline (denovo_preprocessor_solid + velvet + denovo_postprocessor_solid + denovoadp) and the base-space contigs were uncorrupted. However, it is much slower, so I'd still like to fix the latest denovo tool.
    Attached Files
  • pepperoni
    Member
    • Oct 2011
    • 59

    #2
    Dear Alex8,
    did you solve the problem? I want to assemble color space with velvet and don't know how to do it.
    thanks

    Comment

    • Alex8
      Member
      • Oct 2010
      • 10

      #3
      Hi,
      I downloaded denovo2.2 version which seems to have the bug fixed.
      however, solid software tools site is retired now - if you don't find it anywhere else, can send you the distr.

      Comment

      • colindaven
        Senior Member
        • Oct 2008
        • 417

        #4
        I stumbled across SOPRA recently, which might be helpful for you guys.

        It contains the denovo scripts from Solid Software Tools as well.

        Comment

        • cement_head
          Senior Member
          • Mar 2012
          • 264

          #5
          CLC Genomics Workbench has a two week free trial...

          Comment

          • colindaven
            Senior Member
            • Oct 2008
            • 417

            #6
            I'm not convinced yet CLC is the best solution for de novo assembly of colour space reads. Sure, it's easy, fast and it maps reads back to contigs to attempt to eliminate indel/colour errors....
            but most contigs seemed were under 1000bp.

            It appears that the CS Velvet is capable of building longer contigs > 1000bp.

            (Solid 5500xl, 75bp single end reads)

            I can't quantify yet but will do when we get results back for a range of settings.

            Comment

            • pepperoni
              Member
              • Oct 2011
              • 59

              #7
              Are you using the base space fastq data provided by the 5500? or the color space csfasta? otherwise how are you being able to use Columbus? Since my data came from SOLiD 4 I don't have any base space data.
              thanks

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              36 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              99 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              120 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-04-2026, 08:59 AM
              0 responses
              113 views
              0 reactions
              Last Post SEQadmin2  
              Working...