  • Velvet 1.2.10: why the big difference in results with -long vs -short w/ 250bp reads?

    Greetings.

    I am doing de novo assembly of a 23 Mb genome using paired-end Illumina MiSeq reads (250bp reads, 400bp insert, SD 130). The reads have been quality-trimmed, so their final lengths vary widely, with most at about 190bp. Assembling with -long/-longPaired vs. -short/-shortPaired gives surprisingly different final results. Any ideas why this is happening, or which results are more reliable?

    Thanks!

    Commands:
    Code:
    velveth Genome1_71 71 -short -fastq reads_R1.trimmed.fastq.se reads_R2.trimmed.fastq.se  -shortPaired -separate -fastq reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe
    velvetg Genome1_71 -exp_cov 43 -ins_length 407 -ins_length_sd 130
    
    velveth Genome1_71 71 -long -fastq reads_R1.trimmed.fastq.se reads_R2.trimmed.fastq.se  -longPaired -separate -fastq reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe
    velvetg Genome1_71 -exp_cov 43 -ins_length_long 407 -ins_length_long_sd 130
    Results, short:
    Code:
    Final graph has 128642 nodes and n50 of 17324, max 332339, total 26267561, using 6042595/7501247 reads
    Results, long:
    Code:
    Final graph has 148426 nodes and n50 of 1610, max 28675, total 26984545, using 6083488/7501247 reads
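
To compare the two runs side by side, the summary lines above can be reduced to a few fields. This is only a sketch: `velvet_summary` is a hypothetical helper (not part of Velvet), and it assumes the summary line is worded exactly as in the results shown above.

```shell
# Hypothetical helper, not part of Velvet.
# Usage: velvet_summary LABEL "Final graph has ... reads"
velvet_summary() {
    # Extract the six numbers from the line (assumes the wording shown above).
    vs_fields=$(printf '%s\n' "$2" | sed -n -E \
        's/.*has ([0-9]+) nodes and n50 of ([0-9]+), max ([0-9]+), total ([0-9]+), using ([0-9]+)\/([0-9]+) reads.*/\1 \2 \3 \4 \5 \6/p')
    if [ -z "$vs_fields" ]; then
        echo "unrecognised summary line: $2" >&2
        return 1
    fi
    # Intentional word splitting: label, nodes, n50, max, total, used, reads.
    set -- "$1" $vs_fields
    # Percentage of input reads that Velvet reports as used.
    vs_pct=$(awk -v u="$6" -v r="$7" 'BEGIN { printf "%.1f", 100 * u / r }')
    printf '%s nodes=%s n50=%s max=%s total=%s used=%s%%\n' \
        "$1" "$2" "$3" "$4" "$5" "$vs_pct"
}
```

Called with a label and each of the two result lines from this post, it should report n50=17324 with 80.6% of reads used for the -short run, and n50=1610 with 81.1% for the -long run.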

  • #2
    Zerbino tells us there shouldn't be any difference, but what you've found is interesting.

    Have you tried this without the singletons and just the paired reads?

    It's interesting that your -long flag slightly increases read utilization yet sharply lowers your n50. It looks like it's breaking up your contigs, since nothing larger than 28675 survives...

    Does that 28675 fragment exist in your -short assembly?
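
The paired-only comparison suggested above could be run along these lines. Treat it as a sketch under assumptions: the helpers `run` and `paired_only` and the output directory names are hypothetical, while the Velvet flags are the ones already used in the original commands. Each run gets its own directory so that neither overwrites the other, and `-exp_cov 43` is carried over unchanged even though it may need re-estimating once the singletons are dropped.

```shell
# Print each command; execute it unless DRY_RUN is set.
run() {
    printf '+ %s\n' "$*"
    [ -n "${DRY_RUN:-}" ] || "$@"
}

# Usage: paired_only short|long
# Assembles the paired reads only, treated as the given read category.
paired_only() {
    case $1 in
        short) po_reads=-shortPaired; po_ins=-ins_length;      po_sd=-ins_length_sd ;;
        long)  po_reads=-longPaired;  po_ins=-ins_length_long; po_sd=-ins_length_long_sd ;;
        *) echo "usage: paired_only short|long" >&2; return 2 ;;
    esac
    po_out=Genome1_71_${1}_pe   # hypothetical directory name, one per run
    run velveth "$po_out" 71 "$po_reads" -separate -fastq \
        reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe &&
    # exp_cov copied from the original post; an assumption for paired-only data.
    run velvetg "$po_out" -exp_cov 43 "$po_ins" 407 "$po_sd" 130
}
```

Running `DRY_RUN=1 paired_only short` and `DRY_RUN=1 paired_only long` only prints the commands, which makes it easy to check them before starting the assemblies.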
