Why is predicted best k for de novo assembly so different from actual best k?

NYGen

Member

Join Date: Aug 2014

Posts: 20

06-05-2015, 12:07 PM

Greetings everyone,

I’m doing my first genome assembly for a non-model plant species, and I need some insight into the best k-mer length for DBG-based assembly. My NGS data is a full lane of Illumina HiSeq V4 2x125 from a single library with an insert size of 350 bp. Using k-mer-counting tools such as Jellyfish, my predicted best k has ranged from 93 to 101, due to the large number of reads and their relatively long length for HiSeq reads.
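For readers unfamiliar with what a k-mer counting step produces, here is a minimal toy sketch in plain Python. It is not Jellyfish and does not reproduce the prediction above; the function names (`count_kmers`, `abundance_histogram`) are hypothetical, and holding the reads in memory as a list of strings is an assumption that only makes sense for tiny test data.

```python
from collections import Counter

# Translation table for complementing a DNA string.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def count_kmers(reads, k):
    """Count canonical k-mers across all reads, skipping k-mers that contain N."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if "N" not in kmer:
                counts[canonical(kmer)] += 1
    return counts


def abundance_histogram(counts):
    """Map each abundance value to the number of distinct k-mers seen that many times."""
    return Counter(counts.values())


if __name__ == "__main__":
    toy_reads = ["ACGTACGT"]  # hypothetical toy input, not real data
    hist = abundance_histogram(count_kmers(toy_reads, 4))
    for abundance in sorted(hist):
        print(abundance, hist[abundance])
```

Running the same count at several values of k and comparing the histograms is the general idea behind choosing k from k-mer abundances; real tools do this far more efficiently than this sketch.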

However, all of my highest-quality assemblies have come from k-mer lengths below 40, with my highest N50 at k=29 and the most CEGs mapped at k=33. Does anyone know why I’m seeing such a large difference between predicted best k and actual best k? Given that it’s a discrepancy of more than 50 bp, I feel like there must be a common explanation that googling simply hasn’t turned up. I suspect it’s related to heterogeneity in the reads, but I’ve been unable to find much elaboration on the effect of heterogeneous reads, so thanks for any insights!
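One quantity that may be worth comparing across these k values is the effective k-mer coverage, which for reads of length L and base coverage C is commonly approximated as C_k = C * (L - k + 1) / L. The sketch below just evaluates that formula for 125 bp reads; the base coverage of 50x is a placeholder assumption, since the genome size is not stated here, and the function name `kmer_coverage` is hypothetical. It is offered as a way to frame the question, not as the explanation for the discrepancy.

```python
def kmer_coverage(base_coverage, read_length, k):
    """Approximate k-mer coverage: C_k = C * (L - k + 1) / L."""
    if not 1 <= k <= read_length:
        raise ValueError("k must be between 1 and the read length")
    return base_coverage * (read_length - k + 1) / read_length


if __name__ == "__main__":
    READ_LENGTH = 125      # from the 2x125 run described above
    BASE_COVERAGE = 50.0   # ASSUMPTION: placeholder value, not from the post

    # k values mentioned in the post: best observed (29, 33) and predicted (93, 101).
    for k in (29, 33, 93, 101):
        ck = kmer_coverage(BASE_COVERAGE, READ_LENGTH, k)
        print(f"k={k}: k-mer coverage ~ {ck:.1f}x")
```

Under this approximation each read contributes 97 k-mers at k=29 but only 25 at k=101, so the k-mer coverage at the predicted k is roughly a quarter of what it is at the empirically best k, whatever the true base coverage turns out to be.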
Tags: de novo assembly, genome assembly, kmer
