Seqanswers Leaderboard Ad

**GenoMax** · 04-21-2017, 07:14 AM

What kind of analysis are you trying to do? In general I have never worried about k-mer warnings from FastQC.

**Vinn** · 04-21-2017, 07:17 AM

Originally posted by GenoMax View Post

What kind of analysis are you trying to do? In general I have never worried about k-mer warnings from FastQC.

Hi GenoMax, thanks for your reply. I would like to do de novo assembly.

**GenoMax** · 04-21-2017, 07:43 AM

Take a look at @Brian's suggestions in this thread. I have provided a link for a specific post but take a look at the whole thread. He should be along with more later.

**Vinn** · 04-21-2017, 07:48 AM

Thank you, I will read the thread through.

**Brian Bushnell** · 04-24-2017, 10:16 AM

Kmer-content spikiness at the beginning of the read is normal for many fragmentation methodologies and should not be removed. I'm not sure what's going on at the end, though...

**Vinn** · 04-25-2017, 06:48 AM

Thanks for your reply Brian. Just to be on a safe side, do you think it is better to trim the end off?

**Brian Bushnell** · 04-25-2017, 09:58 AM

Excessive trimming reduces accuracy, and will degrade the results of any experiment. If you want to be confident that bases are genomic rather than artificial, I suggest you follow this methodology:

1) Map the reads to the reference (if you don't have a reference, you can make a quick assembly with Tadpole) with BBMap like this:

Code:

bbmap.sh in=reads.fq ref=ref.fa mhist=mhist.txt qhist=qhist.txt

2) Plot mhist with R or Excel with a log-scale Y-axis to look at the positional error rates.

If there is not an increased error rate in a region of the read, there is no reason to trim it. And conversely, it is prudent to trim if there is a high error rate at one end or the other.

**Vinn** · 04-26-2017, 01:56 PM

Thanks so much Brian for your advice. I will try as you suggested.

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 14 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

K-mer content failed on 5' end - advice needed

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News