Seqanswers Leaderboard Ad

**Brian Bushnell** · 11-05-2014, 09:34 AM

It would help if you run FastQC and post the output, as well as your QC steps, and mapping command line. As it stands, the reason could be anything.

**zinky** · 11-05-2014, 06:59 PM

I use NGS QC Toolkit to do QC, and the result shows that more than 80% of reads are high quality filtered reads. So I do the mapping step. My mapping commond lines are:
bwa aln -t 5 genome.fa file_1.fastq > file_1.fastq.sai
bwa aln -t 5 genome.fa file_2.fastq > file_2.fastq.sai
bwa sampe -A -a 600 -r '@RG\tID:noID\tPL:ILLUMINA\tLB:noLB\tSM:"file"' genome file_1.fastq.sai file_2.fastq.sai file_1.fastq file_2.fastq > file.sam

**Brian Bushnell** · 11-05-2014, 07:07 PM

You may have short inserts and thus high adapter contamination. You can get an insert size distribution with BBMerge, like this:

bbmerge.sh in1=file_1.fastq in2=file_2.fastq ihist=ihist.txt

If a lot of reads have insert sizes shorter than read length, that will indicate adapter contamination which needs to be removed (e.g. with BBDuk).

Also, I don't recommend bwa aln, particularly in recent versions of bwa. You will achieve higher speed and accuracy with bwa mem or BBMap, which can also generate some useful diagnostic plots (such as mhist).

But I still recommend you post FastQC results.

**zinky** · 11-05-2014, 07:39 PM

thanks for your suggestion，I have asked the sequence stuff and got insert size information ： 350bp .so my parameter -a was set 600 to tolerate extra larger insert size aiming improve mapping rate. before that，i used fastQc to estimate reads quality either. the qc report was good，which suggested no index contamination（green kmer distribution and green overrepresent sequence）and high sequencing quality.
ps：i don't know why mypictures can not be uploaded here.

so i doubt whether the sample was mixed with none human-soured DNA as i metioned above（actually，i don't what they are）.
Also， i will try the tools you suggested，thanks Brain .

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, 07-25-2024, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin 07-25-2024, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 160 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad

Announcement

Human whole-genome sequencing data analysis with low mapping rate

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News