Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • De novo assembly: raw data type & volume

    Hi,
    I'm trying to assemble 454 raw reads (gDNA) with Newbler 2.6. Can anyone tell me what is the maximum volume of raw data (nt) newbler could intake in one-step or incremental form (as it could assemble large genome of up to 3Gb in size) ? Also the proportion of shotgun and mate-paired reads we should use in order to have a better assembly ?

  • #2
    I have done some assemblies with some pretty large data sets, in the 40-50 Gb range. With the large and het options, I can get an assembly, without, they simply never finish. By never, I mean, after 6 weeks of processing, no updates of the status files for several weeks. I did these on machine with 1 TB RAM. I tried various incremental assemblies and different parameters and essentially got to the same place as when I presented Newbler with all the data. I didn't see any improvements with the CIO options.

    Comment


    • #3
      Thanks Bob........did you use '-m' option or others advanced options? And also trim the dataset? I had the trouble when tried to feed the trimmed and split 454 mate-paired reads. Because Newbler couldn't detect them as mate-paired though it is not seen for Illumina paired-end reads after trimming.

      And for a large eukaryotic genome, say 3Gb in size, the 40-50Gb dataset you used covers only 16-17(x) of the whole genome that might not be quite enough whereas, for a 300Mb genome the figure reached upto 167(x)! So, is there any rule that what coverage we should initially use while trying to assembling a large genome?

      Comment


      • #4
        Hi,
        yes, we do quality and contaminant trimming. Newbler looks for the linker, so if you mean you are splitting the reads and removing the linker, that doesn't work. Or at least, didn't last time I did that.
        As for rules of thumb, I always refer to the Broad's guidelines. Which often don't work, but you have to start somewhere.
        And yes, we did try the -m an other options. I probably tried about 20-30 different combinations.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Latest Developments in Precision Medicine
          by seqadmin



          Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

          Somatic Genomics
          “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
          Yesterday, 01:16 PM
        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          05-06-2024, 07:48 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 07:15 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-23-2024, 10:28 AM
        0 responses
        17 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-23-2024, 07:35 AM
        0 responses
        20 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-22-2024, 02:06 PM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Working...
        X