Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • wanadou
    Member
    • Jan 2018
    • 11

    help for NGS and assembly

    Hello

    I think to do the whole genome sequencing of a halophyte plant (350 Mbp) using the Hiseq Pe150 which generate data with 30X coverage
    1- this strategy could it be effective to make an de novo assembly in order to have a draft genome?
    or
    the sequencing using PacBio (1 cell SMRT) to generate sequences with more bigger size is it effective to make de novo assembly of the draft genome?
  • nucacidhunter
    Jafar Jabbari
    • Jan 2013
    • 1250

    #2
    The genome size is relatively small so you will have better assembly if you sequence with PacBio but one SMRT cell will not be enough. You should consider 50x coverage (17.5 Gb) which will require 3-4 Sequel SMRT cells. You can also sequence 2 SMRT cells and then do a preliminary assembly and decide on further sequencing.

    Illumina sequencing will result in lots of short contigs especially if the plant genome is composed of repeat regions.

    I do not think any of these platforms will enable complete assembly but the results will be better if DNA is from a homozygous plant.

    Comment

    • wanadou
      Member
      • Jan 2018
      • 11

      #3
      Hello nucacidhunter,
      Thanks for your reply
      The cost of sequencing using PacBio is relatively high. the sequencing of 1 cell SMRT can give 5Gb. I think to do an preliminary assembly for a draft genome. In second time i will do sequencing using Illumina platform which provide 50X coverage. this second sequencing can be used to do hybrid assembly in ordre to have complete genome.
      you think this strategy is feasible? or 10-15X coverage given by PacBio is insuffusant to do draft genome assembly

      Comment

      • nucacidhunter
        Jafar Jabbari
        • Jan 2013
        • 1250

        #4
        It depends on the genome. You might look at repeat content of a related species if it is available. I have seen Illumina based assembled contigs which were couple of hundred bases larger than the sequencing length.

        I do not think either of these platforms alone or in combination will give complete assembly but I would recommend to do 2 SMRT cells and assemble then depending on results decide for more sequencing on one. For instance, if you are interested in particular genes or region covered by PacBio contigs then Illumina will be good to increase accuracy. Depending on your application an incomplete assembly might be enough.

        PacBio is sensitive to carry over of contaminants from DNA extraction so you should try an extraction method that result in long fragments and is free from carbohydrates and phenolic compounds. Link for PacBio calculator:

        Comment

        • colindaven
          Senior Member
          • Oct 2008
          • 417

          #5
          Its' very tricky to analyse 10X pacbio data and do a de novo assembly. I have had very bad experiences in the past with this setup, getting poor N50s and putting lots of effort in. 30x+ is far, far better since you need a lot of the longest reads to span gaps AND self correct reads for a decent assembly .

          There were a few tools for low coverage pacbio hybrid assembly, DBG2OLC for example

          Comment

          • Markiyan
            Senior Member
            • Sep 2010
            • 126

            #6
            Use Illumina 2x250 or 2x300, not 2x150 in de novo!

            Please use Illumina 2x250 or 2x300 in any de novo appication! (in addition to the pacbio).

            It means 1-2 runs (2x250 or 2x300) using MiSeq or 1/2 of the HiSeq 2500 RR 2x250.

            Usually it gives you significantly better assemblies than 2x150 or 2x100 bp runs.

            PS: Make sure a PCR-free library prep is used.

            Comment

            • gringer
              David Eccles (gringer)
              • May 2011
              • 845

              #7
              If you want cheap, [MinION](https://store.nanoporetech.com/basic.html) is always available (starter discount is $1000 for two flow cells and a rapid kit). The consensus assemblies are less accurate than PacBio, but if you're doing Illumina as well for local assembly correction (which I also recommend) that probably won't matter as much.

              Comment

              • luc
                Senior Member
                • Dec 2010
                • 469

                #8
                The sequencer is certainly cheap, but to my knowledge not the reagents and additional flowcells. PacBio might be more affordable?

                Originally posted by gringer View Post
                If you want cheap, [MinION](https://store.nanoporetech.com/basic.html) is always available (starter discount is $1000 for two flow cells and a rapid kit). The consensus assemblies are less accurate than PacBio, but if you're doing Illumina as well for local assembly correction (which I also recommend) that probably won't matter as much.

                Comment

                • gringer
                  David Eccles (gringer)
                  • May 2011
                  • 845

                  #9
                  As far as I know, ONT is the only sequencing machine supplier that publishes all their prices on their store website; you can have a look and see for yourself!

                  At the moment marginal cost per base for the MinION is approximately equal to PacBio Sequel or Illumina MiSeq, with longer reads, a shorter sample preparation time, and no net capital expense for the sequencer. Flow cells are cheaper if ordered in bulk, and those who want the maximum affordability (and have lots of money to throw around) can purchase a PromethION.

                  Reminder: all those costs are on the ONT store website.

                  Comment

                  • soungalo
                    Junior Member
                    • Apr 2018
                    • 3

                    #10
                    Wrong way, sir

                    You won't be able to get a de novo assembly this way.
                    Regarding the Illumina strategy - to get a decent assembly you'll have to:
                    a. Increase coverage (50-60X should be OK)
                    b. Use longer reads (250bp), preferably use an insert size allowing for merging paired end reads.
                    c. Use various insert size libraries (for example by applying mate-pair technology).

                    As for PacBio - except for the simplest genomes, pure PacBio data cannot produce a decent assembly. Hybrid approaches are nice, but you'll still need lots of Illumina data, which means considerable costs.

                    Comment

                    • gringer
                      David Eccles (gringer)
                      • May 2011
                      • 845

                      #11
                      You won't be able to get a good de-novo assembly from Illumina data alone, and longer SBS reads won't fix that. Consider things like 28S rRNA gene regions, where kilobase-length regions are repeated many times in tandem; those will never be properly assembled from just Illumina data.

                      A hybrid approaches with reasonable long-read coverage (e.g. 40-100X) for generating assembly scaffolds and low short-read coverage (e.g. 40X) for local correction should produce somewhat contiguous sequences (e.g. N50 > 100 kb) that have a high accuracy. Contiguity can be improved with more long-range techniques like Hi-C and optical mapping.

                      Single-technology sequencing is (at the moment) a false economy. The additional money spent on hybrid sequencing approaches is more than made up for by the reduced pain on the bioinformatics end of things.

                      Comment

                      • nucacidhunter
                        Jafar Jabbari
                        • Jan 2013
                        • 1250

                        #12
                        Originally posted by gringer View Post
                        As far as I know, ONT is the only sequencing machine supplier that publishes all their prices on their store website; you can have a look and see for yourself!
                        Reminder: all those costs are on the ONT store website.
                        This is because ONT sells their consumables directly but other companies such as PacBio sell them through a local distributor that have different margin and shipping costs depending on global location.

                        A 50-80x coverage with PacBio alone depending on genome heterogeneity results in very good assembly.

                        Comment

                        • gringer
                          David Eccles (gringer)
                          • May 2011
                          • 845

                          #13
                          other companies such as PacBio sell them through a local distributor that have different margin and shipping costs depending on global location.
                          That's not the only reason. Sigma also has region-specific pricing. Without logging in, I choose the location when I first visit the site (presumably stored as a cookie for subsequent visits), and their website gives me prices for items.

                          Comment

                          • luc
                            Senior Member
                            • Dec 2010
                            • 469

                            #14
                            Final costs depend on reagent costs and yields (and reliability). Results for both systems will depend a lot on DNA sample quality. At current yields the Sequel runs will yield more data/dollar in our hands, if one is not ordering a pack of 48 or 300 nanopore flowcells. The average nanopore read lengths will be longer, but results/flowcells seem much more variable.

                            Originally posted by gringer View Post
                            As far as I know, ONT is the only sequencing machine supplier that publishes all their prices on their store website; you can have a look and see for yourself!

                            At the moment marginal cost per base for the MinION is approximately equal to PacBio Sequel or Illumina MiSeq, with longer reads, a shorter sample preparation time, and no net capital expense for the sequencer. Flow cells are cheaper if ordered in bulk, and those who want the maximum affordability (and have lots of money to throw around) can purchase a PromethION.

                            Reminder: all those costs are on the ONT store website.

                            Comment

                            Latest Articles

                            Collapse

                            • SEQadmin2
                              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                              by SEQadmin2


                              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                              ...
                              Yesterday, 10:05 AM
                            • SEQadmin2
                              Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                              by SEQadmin2


                              With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                              Introduction

                              Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                              05-22-2026, 06:42 AM
                            • SEQadmin2
                              Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                              by SEQadmin2

                              Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                              Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                              05-06-2026, 09:04 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, Yesterday, 12:03 PM
                            0 responses
                            19 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, Yesterday, 11:40 AM
                            0 responses
                            14 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 05-28-2026, 11:40 AM
                            0 responses
                            29 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 05-26-2026, 10:12 AM
                            0 responses
                            31 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...