Hi, dear all!
I want to perform de novo assembly with four libraries (insert size were 270bp, 500bp, 2K and 5K), and the read are paired and the length is 150bp. After mapping the reads to reference with BWA, there are about 1/3 reads were chimeric for the two mate-pair libraries (2K and 5K). I don't know whether I should filter out these reads? There are little information after google. Considering the short libraries were used for constructing contig, then the reads from long libraries are mapped to contigs to link these contigs, in my opinion, the assembly tool still could use chimeric reads to link the contigs. However, my mate think there were rare chimeric reads in previous experiment since reads were short, and the assemble tool may can't deal with chimeric reads. Furthermore, I think if I filter out these reads, then this wouldn't be a true de novo assemble. So should I filter out the chimeric reads in mate-pair library?
Also, because I want to test ALLPATHs-LG, so I designed the 270bp library, but according to paper titled "Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques", the overlap reads were filter out when using SOAPdenovo, should I filter out these reads to use SOAPdenovo?
Any suggestion would be grateful!
Best wishes!
lamz138138
I want to perform de novo assembly with four libraries (insert size were 270bp, 500bp, 2K and 5K), and the read are paired and the length is 150bp. After mapping the reads to reference with BWA, there are about 1/3 reads were chimeric for the two mate-pair libraries (2K and 5K). I don't know whether I should filter out these reads? There are little information after google. Considering the short libraries were used for constructing contig, then the reads from long libraries are mapped to contigs to link these contigs, in my opinion, the assembly tool still could use chimeric reads to link the contigs. However, my mate think there were rare chimeric reads in previous experiment since reads were short, and the assemble tool may can't deal with chimeric reads. Furthermore, I think if I filter out these reads, then this wouldn't be a true de novo assemble. So should I filter out the chimeric reads in mate-pair library?
Also, because I want to test ALLPATHs-LG, so I designed the 270bp library, but according to paper titled "Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques", the overlap reads were filter out when using SOAPdenovo, should I filter out these reads to use SOAPdenovo?
Any suggestion would be grateful!
Best wishes!
lamz138138
Comment