Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Velvet 1.2.10: why the big difference in results with -long vs -short w/ 250bp reads?

    Greetings.

    I am doing some de novo assembly of a 23 Mb genome using MiSeq paired end Illumina reads (250bp reads, 400bp insert (SD 130)). These reads, however, have been trimmed for quality and range widely in their finished size, with most at about 190bp. Assembly using -long/-longPaired vs -short/shortPaired gives surprisingly different final results. Any ideas why this is happening or which results are more reliable?

    Thanks!

    Commands:
    Code:
    velveth Genome1_71 71 -short -fastq reads_R1.trimmed.fastq.se reads_R2.trimmed.fastq.se  -shortPaired -separate -fastq reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe
    velvetg Genome1_71 -exp_cov 43 -ins_length 407 -ins_length_sd 130
    
    velveth Genome1_71 71 -long -fastq reads_R1.trimmed.fastq.se reads_R2.trimmed.fastq.se  -longPaired -separate -fastq reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe
    velvetg Genome1_71 -exp_cov 43 -ins_length_long 407 -ins_length_long_sd 130
    Results, short:
    Code:
    Final graph has 128642 nodes and n50 of 17324, max 332339, total 26267561, using 6042595/7501247 reads
    Results, long:
    Code:
    Final graph has 148426 nodes and n50 of 1610, max 28675, total 26984545, using 6083488/7501247 reads

  • #2
    Zerbino tells us there shouldn't be any difference, but what you've found is interesting.

    Have you tried this without the singletons and just the paired reads?

    It's interesting that your -long flag increases read utilization and subsequently affects your n50. It's breaking up your reads since you've lost fragments larger than 28675...

    Does that 28675 fragment exist in your -short assembly?

    Comment

    Latest Articles

    Collapse

    • seqadmin
      The Impact of AI in Genomic Medicine
      by seqadmin



      Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
      02-26-2024, 02:07 PM
    • seqadmin
      Multiomics Techniques Advancing Disease Research
      by seqadmin


      New and advanced multiomics tools and technologies have opened new avenues of research and markedly enhanced various disciplines such as disease research and precision medicine1. The practice of merging diverse data from various ‘omes increasingly provides a more holistic understanding of biological systems. As Maddison Masaeli, Co-Founder and CEO at Deepcell, aptly noted, “You can't explain biology in its complex form with one modality.”

      A major leap in the field has
      ...
      02-08-2024, 06:33 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 06:12 AM
    0 responses
    13 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-23-2024, 04:11 PM
    0 responses
    67 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-21-2024, 08:52 AM
    0 responses
    70 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-20-2024, 08:57 AM
    0 responses
    61 views
    0 likes
    Last Post seqadmin  
    Working...
    X