  • Please help with assembly 2x250 bp with 350 insert

    Hi folks,

    I have got data generated from 2x250 bp with 350 bp insert for fungal genome assembly. I got very good results from SPAdes, but later discovered that SPAdes recommends an insert of 550-700 bp for 2x250 bp sequencing.

    I have tried ABySS, Velvet and CLC so far, but none of them gave results as good as SPAdes. Do you have any opinions or suggestions on what else I should try? Or would it be okay to just use SPAdes?

    Thanks in advance and have a great weekend!

  • #2
    DISCOVAR de novo was designed specifically for 2x250 bp reads, so you could give it a try. Of course, it is recommended to have inserts longer than the combined length of the two reads (2 x 250 = 500 bp in your case), but that goes for all assemblers and doesn't mean your assembly is not valid. The best way to improve the assembly would likely be to add long or linked reads, though.

    • #3
      @Vinn: Have you tried merging your reads (since they must overlap in the middle) and then assembling them as a single-end dataset?
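
The overlap mentioned above can be checked with a little arithmetic. This is a minimal sketch, assuming "insert" means the full fragment length including both reads (the usual Illumina convention, but an assumption here); the function name `expected_overlap` is made up for illustration.

```python
def expected_overlap(read_len: int, insert_size: int) -> int:
    """Bases covered by both mates of a pair, given the fragment length.

    Returns 0 when the fragment is longer than both reads combined, and never
    more than the fragment itself (short fragments are fully covered twice).
    """
    return max(0, min(insert_size, 2 * read_len - insert_size))


read_len = 250     # from the thread: 2x250 bp
insert_size = 350  # from the thread: nominal insert size

overlap = expected_overlap(read_len, insert_size)
print(f"expected overlap: {overlap} bp")            # 2*250 - 350 = 150
print(f"merged read length: ~{insert_size} bp")     # a merged pair spans the fragment
print(f"overlap at 600 bp insert: {expected_overlap(read_len, 600)} bp")
```

Real libraries have a distribution of fragment sizes around the nominal value, so some pairs will overlap more, some less, and some not at all.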

      • #4
        Originally posted by Vinn
        I have got data generated from 2x250 bp with 350 bp insert for fungal genome assembly. I got very good results from SPAdes, but later discovered that SPAdes recommends an insert of 550-700 bp for 2x250 bp sequencing.
        I think that means "If you want to use SPAdes for 2x250bp reads, we recommend you target your libraries for 550-700bp" rather than "If you have a library outside of 550-700bp, don't use SPAdes". Once you have the library, it's too late to change the insert size, but SPAdes is very flexible with insert sizes.

        As Genomax mentioned, you might try merging the reads first; I have found that to improve SPAdes assemblies.

        • #5
          Dear Genomax,

          Thanks for your reply and for the suggestion. I was thinking about that too, but since I just got another PE library (with a different insert size), I am not sure whether SPAdes can handle one single-end and one paired-end library together?

          Have a great Easter holiday!

          • #6
            Dear Ola,

            Thanks for your reply and for your suggestion. I just received another library with another insert size and will try using it to improve the one I have.

            Have a great Easter holiday!

            • #7
              Dear Brian,

              Thanks for your reply and for the suggestion. I will try merging and reassembling again.
              Happy Easter holiday!

              • #8
                Hi Vinn,

                SPAdes can handle one paired and one single-ended set of reads. I recommend that anyway when using a single library and merging reads, because not all the reads will merge.
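
A rough sketch of that workflow, written as Python that only builds and prints the two command lines (it does not run them). The file names and the output directory are hypothetical placeholders, and the helper names `bbmerge_cmd` and `spades_cmd` are made up for this example. The merged reads are passed to SPAdes as unpaired reads via `-s`, matching the "one paired and one single-ended set" description; check the options against the versions of BBMerge and SPAdes you have installed.

```python
import shlex


def bbmerge_cmd(r1, r2, merged, unmerged1, unmerged2):
    """Command to merge overlapping pairs, keeping the pairs that fail to merge."""
    return [
        "bbmerge.sh",
        f"in1={r1}", f"in2={r2}",
        f"out={merged}",                               # merged (single-end) reads
        f"outu1={unmerged1}", f"outu2={unmerged2}",    # pairs that did not merge
    ]


def spades_cmd(unmerged1, unmerged2, merged, outdir):
    """Command to assemble the unmerged pairs plus the merged reads as unpaired."""
    return [
        "spades.py",
        "-1", unmerged1, "-2", unmerged2,  # still-paired reads
        "-s", merged,                      # merged reads, given as unpaired
        "-o", outdir,
    ]


# Hypothetical file names, for illustration only.
merge = bbmerge_cmd("R1.fq.gz", "R2.fq.gz",
                    "merged.fq.gz", "unmerged_R1.fq.gz", "unmerged_R2.fq.gz")
assemble = spades_cmd("unmerged_R1.fq.gz", "unmerged_R2.fq.gz",
                      "merged.fq.gz", "spades_out")

print(shlex.join(merge))
print(shlex.join(assemble))
```

Building the commands as lists makes it easy to hand them to `subprocess.run(cmd, check=True)` later without shell-quoting problems.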

                • #9
                  Hi Brian,

                  Thank you very much; I will try as you suggested. I also can't help wondering: what if I trim both R1 and R2 reads to 150 bp using bbduk (ftr=150) and use them as a 150 bp PE library?

                  • #10
                    Hi Vinn,

                    You could certainly do that, but unless your sequence quality is very bad at the ends, it won't give you a better assembly; it will mainly just reduce your sequence volume. In my testing, SPAdes produces the best assemblies when you merge reads (if they are overlapping) and feed it both the merged and unmerged reads. Remember that SPAdes supports kmers up to 127bp; with 150bp reads, the kmer depth at k=127 will be quite low, whereas with 250bp reads (or merged 350+bp reads) it will be much higher, potentially resulting in a superior 127-mer assembly.
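
To put numbers on the kmer-depth point, here is a small sketch using the standard relation that a read of length L contains L - k + 1 kmers, so kmer depth is roughly base depth x (L - k + 1) / L. The 100x base depth for the untrimmed 2x250 bp data is an assumed figure for illustration only (the thread does not give the real coverage), and sequencing errors are ignored.

```python
def kmers_per_read(read_len: int, k: int) -> int:
    """Number of k-mers contained in a single read of the given length."""
    return max(0, read_len - k + 1)


def kmer_depth(base_depth: float, read_len: int, k: int) -> float:
    """Approximate k-mer depth from base depth, assuming uniform read length."""
    return base_depth * kmers_per_read(read_len, k) / read_len


K = 127                  # largest k supported by SPAdes, per the post above
ASSUMED_DEPTH = 100.0    # assumption: base depth of the untrimmed 250 bp reads

# Trimming 250 bp reads to 150 bp keeps only 150/250 of the bases.
trimmed_base_depth = ASSUMED_DEPTH * 150 / 250

for label, length in [("trimmed 150 bp", 150), ("full 250 bp", 250), ("merged 350 bp", 350)]:
    print(f"{label}: {kmers_per_read(length, K)} {K}-mers per read")

print("127-mer depth, trimmed to 150 bp:", kmer_depth(trimmed_base_depth, 150, K))
print("127-mer depth, full 250 bp reads:", kmer_depth(ASSUMED_DEPTH, 250, K))
```

Under these assumptions, a 150 bp read contributes only 24 of the 127-mers, against 124 for a 250 bp read and 224 for a 350 bp merged read, which is why trimming hurts the large-k assembly so much.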
