A former lab member assembled a number of contigs from Illumina reads using SPAdes. I was trying to assess the depth of coverage with Bowtie2 when I noticed something interesting: there are no Bowtie2 alignments (concordant or discordant) for the largest contig. Can anyone explain this?
-
Originally posted by sewellh: I am mapping the reads used to make the contigs back onto the contigs. I get alignments for all contigs except for the largest one. I have BLASTed the contig and it is what I expect.
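One way to confirm which contigs really receive no alignments is to tally mapped reads per reference directly from the SAM output. The following is only a minimal sketch: the function name `reads_per_reference` is made up for illustration, and it assumes a plain-text SAM stream with standard tab-separated fields and `@SQ` header lines. It counts every mapped record, including secondary alignments if the aligner wrote any.

```python
def reads_per_reference(sam_lines):
    """Count mapped alignment records per reference name in SAM text.

    References listed in @SQ header lines are included with a count of 0,
    so contigs that attract no alignments still show up in the result.
    """
    counts = {}
    for line in sam_lines:
        line = line.rstrip("\n")
        if not line:
            continue
        if line.startswith("@"):
            # Header section: collect reference names from @SQ lines.
            if line.startswith("@SQ"):
                for field in line.split("\t")[1:]:
                    if field.startswith("SN:"):
                        counts.setdefault(field[3:], 0)
            continue
        fields = line.split("\t")
        if len(fields) < 3:
            continue  # not a valid alignment record
        flag, rname = int(fields[1]), fields[2]
        # FLAG bit 0x4 marks an unmapped read; RNAME "*" means no reference.
        if flag & 0x4 or rname == "*":
            continue
        counts[rname] = counts.get(rname, 0) + 1
    return counts
```

Usage would look something like `with open("aln.sam") as fh: print(reads_per_reference(fh))`, where `aln.sam` is a placeholder file name. Any contig reported with a count of 0 is one the aligner never placed a read on.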
-
1) Have you verified that all of the contigs have unique, correctly-formatted names?
2) Does the contig look normal to you - high complexity, mainly defined bases, rather than e.g. a homopolymer or mostly-N sequence?
3) Is it possible that this contig is a replicate of other contigs? Even though it's bigger, it could be fully covered by other contigs. So, do any other contigs map to it?
4) Is it highly repetitive such that reads aligning to it might exceed the maximum number of allowed alignments?
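Checks 1 and 2 above can be roughly automated. This is just a sketch, not a definitive tool: `read_fasta`, `longest_homopolymer` and `contig_report` are hypothetical helper names, and contig names are cut at the first whitespace on the assumption that this is the name the aligner will see. It reports duplicate names, the fraction of N bases, and the longest single-base run per contig.

```python
from collections import Counter
import itertools

def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Assumption: only the text before the first whitespace is the name.
            parts = line[1:].split()
            name, chunks = (parts[0] if parts else ""), []
        else:
            chunks.append(line.upper())
    if name is not None:
        yield name, "".join(chunks)

def longest_homopolymer(seq):
    """Length of the longest run of a single repeated character."""
    return max((len(list(run)) for _, run in itertools.groupby(seq)), default=0)

def contig_report(lines):
    """Return one summary dict per contig, in file order."""
    records = list(read_fasta(lines))
    name_counts = Counter(name for name, _ in records)
    report = []
    for name, seq in records:
        length = len(seq)
        report.append({
            "name": name,
            "length": length,
            "duplicate_name": name_counts[name] > 1,
            "n_fraction": seq.count("N") / length if length else 0.0,
            "longest_homopolymer": longest_homopolymer(seq),
        })
    return report
```

Something like `with open("contigs.fasta") as fh: for row in contig_report(fh): print(row)` would print the summary, with `contigs.fasta` standing in for the real assembly file. A contig flagged with `duplicate_name`, a high `n_fraction`, or a very long homopolymer run would be worth a closer look.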
-
Yes, the contigs have unique and correctly formatted names. But even when I try to map the reads to just the single large contig, I get no matches.
It doesn't look like this contig is a replicate of others, but it does have 3-4 copies of a ~500 nt fragment within itself. Does that mean that this contig was assembled incorrectly, or that there is something else I should do? I would expect that if I aligned the raw reads to just this single contig, I would get some alignments.
Update: Using the FASTQ files produced by the Hammer error-correction step, I still get no Bowtie alignments to that contig.
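To put a number on how repetitive the contig is internally, a simple k-mer count can help. This is a rough sketch only: `repeated_kmer_fraction` is an illustrative name, `k=31` is an arbitrary default, and only the forward strand is considered, so inverted repeats are not detected. It does not by itself explain missing alignments; it just quantifies the internal duplication.

```python
from collections import Counter

def repeated_kmer_fraction(seq, k=31):
    """Fraction of k-mer positions whose k-mer occurs more than once in seq.

    Forward strand only; reverse-complement repeats are not counted.
    """
    seq = seq.upper()
    total = len(seq) - k + 1
    if total <= 0:
        return 0.0  # sequence is shorter than k
    counts = Counter(seq[i:i + k] for i in range(total))
    # Each occurrence of a k-mer seen more than once counts as repeated.
    repeated = sum(c for c in counts.values() if c > 1)
    return repeated / total
```

For a contig with 3-4 copies of a ~500 nt fragment, the fraction should come out roughly in proportion to the share of the contig those copies occupy; a value near 1.0 would suggest the whole contig is repeat-dominated.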
-
Thanks so much for your help. I'll try out BBMap. If you are curious at all to look at the contigs, they're on JGI. The largest is:
>gi|589096183|gb|JARN01000011.1| Dehalococcoidia bacterium DscP2 WGS:JARN01:comHGAPfinal_Contig11_1.11, whole genome shotgun sequence