Thanks for the reply! That was my impression; that I wouldn't be able to resolve it with 300bp insert library but only with mate pairs or long read technology.
Cheers,
@ecastron
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by ecastron View PostThanks Brian, I'll give it a try. I anticipate that I'm going to get one cluster because the reads are seemingly identical. It's suggestive that the coverage for the rRNA operon is about 3 times the coverage of the neighboring genes so at a minimum I'll report that in the submission.
I guess the alternative would be going back to the wet lab to check how many copies there are.
Eduardo
Leave a comment:
-
Thanks Brian, I'll give it a try. I anticipate that I'm going to get one cluster because the reads are seemingly identical. It's suggestive that the coverage for the rRNA operon is about 3 times the coverage of the neighboring genes so at a minimum I'll report that in the submission.
I guess the alternative would be going back to the wet lab to check how many copies there are.
Cheers,
Eduardo
Leave a comment:
-
You can try mapping reads to a 16S copy, then clustering the reads that mapped, then assembling the clusters. This will work if the reads are sufficiently long (for Illumina, merging them may be useful) and the 16S are sufficiently different. If not, you'll just get one cluster. You probably need overlapping 2x250bp reads at a minimum (insert size around 400bp+) to have a good chance.
You can cluster like this with Dedupe (packaged with BBMap):
dedupe.sh in=merged.fq -Xmx30g am=f ac=f fo c rnc=f mcs=50 mo=350 pto pattern=cluster_%.fq
The "mo=350" specifies a min overlap of 350bp. This should be around 80%-90% of your read length. If you have single-ended 250bp reads, set it to 200; if you have merged reads with an insert size of around 400bp, try 350. If you have 100bp non-overlapping reads, don't bother, they're too short.
For this kind of situation, which is very sensitive to chimeras, I recommend merging reads with BBMerge using the "vstrict" flag.
Leave a comment:
-
Hi Cyanoevo,
I have the exact same problem. Did you ever find an answer?
Cheers,
Eduardo
Leave a comment:
-
Genes with multiple copies assembling as single contig
Hi all,
I'm doing a denovo assembly of a cyanobacterial genome with SPades, all is working well but when there are multiple copies of a gene (e.g. 16srRNA gene), it appears that all reads associated with that gene are being mapped to a single contig.
Coverage of these contigs appears to correspond quite well to number of expected copies in the genome (i.e. normal coverage ~50x, for a contig with a gene with four copies, coverage ~200x).
Does anyone know of a method to prevent this from happening so that each of the copies assemble separately in different contigs?
Cheers
NTags: None
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-25-2024, 11:49 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
04-25-2024, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
62 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Leave a comment: