How many contigs can one get after metagenome assembly?
-
Originally posted by BIOin: I want to assemble 25 million reads. I am getting varying results with different assemblers.
For a metagenome, the complexity can vary depending on your sample. If you had a very complex sample, 25M reads (platform? paired end? read length?) is probably barely scratching the surface -- 25M 2x100 Illumina reads is only 5Gb, which isn't gigantic if you have a diverse sample.
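The 5 Gb figure is simple arithmetic, and a rough sketch makes it easy to redo for other runs. The function name and the example community below (1,000 genomes of 3 Mb each at even abundance) are assumptions made purely for illustration, not numbers from this thread or from any real rumen sample.

```python
def total_bases(n_fragments: int, read_length: int, paired: bool = True) -> int:
    """Total sequenced bases.

    Assumes n_fragments counts read pairs when paired=True,
    so each fragment contributes two reads.
    """
    reads_per_fragment = 2 if paired else 1
    return n_fragments * reads_per_fragment * read_length


# 25M pairs of 2x100 reads, as discussed in the post above.
yield_bp = total_bases(25_000_000, 100, paired=True)
print(f"Total yield: {yield_bp / 1e9:.1f} Gb")  # Total yield: 5.0 Gb

# Hypothetical community, for illustration only:
# 1,000 genomes of 3 Mb each, all equally abundant.
hypothetical_community_bp = 1_000 * 3_000_000
mean_coverage = yield_bp / hypothetical_community_bp
print(f"Mean coverage under that assumption: {mean_coverage:.1f}x")  # 1.7x
```

Real communities are far from evenly abundant, so rare members would see much less coverage than this average suggests.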
-
Thanks for the reply.
Yes, my data is complex (animal rumen). My data set is Illumina HiSeq 2000, 25M reads, 2x100.
I just started using MetaVelvet to assemble high-quality metagenome data. I ran MetaVelvet with a k-mer of 45. After the assembly finished, the output file "meta-velvetg.contigs.fa" contained 1,128,469 contigs, with a maximum contig length of 31,758 bp and an N50 of 190.
Should I accept this assembly, or do I need to try more k-mer values?
Please suggest which assemblers to use.
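For anyone comparing runs at different k-mer values, it can help to compute N50 yourself so the number is calculated the same way for every assembler. This is a minimal sketch, not the method any particular assembler uses. The helper names `n50` and `read_contig_lengths` are made up for this example, and it assumes a plain FASTA file.

```python
def n50(lengths):
    """Return the contig length L such that contigs of length >= L
    together cover at least half of the total assembly size."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0  # empty input


def read_contig_lengths(fasta_path):
    """Collect sequence lengths from a plain-text FASTA file."""
    lengths = []
    current = 0
    seen_header = False
    with open(fasta_path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if seen_header:
                    lengths.append(current)
                seen_header = True
                current = 0
            elif seen_header:
                current += len(line)
    if seen_header:
        lengths.append(current)
    return lengths


# Toy example: total is 1500 bp, so half is 750 bp.
# 500 + 400 = 900 >= 750, so N50 is 400.
toy = [100, 200, 300, 400, 500]
print(n50(toy))  # 400

# On a real assembly (file name taken from the post above):
# lengths = read_contig_lengths("meta-velvetg.contigs.fa")
# print(len(lengths), max(lengths), n50(lengths))
```

An N50 close to the read length, like the 190 reported here, usually means most of the assembly is very short contigs, so it is worth also looking at how many contigs pass a minimum length you care about.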
-
I also tried to assemble a metagenome (plant endophyte; the plant genome is not available yet) using Illumina HiSeq 2000 2x100 reads. My data has 69 M paired-end reads, 9.9 billion bases. I assembled these reads using CLC Genomics Workbench and got 770 thousand contigs. I am working on these contigs now. How do you deal with so many contigs? Could we share our ideas?
Originally posted by BIOin: Thanks for the reply.
Yes, my data is complex (animal rumen). My data set is Illumina HiSeq 2000, 25M reads, 2x100.
I just started using MetaVelvet to assemble high-quality metagenome data. I ran MetaVelvet with a k-mer of 45. After the assembly finished, the output file "meta-velvetg.contigs.fa" contained 1,128,469 contigs, with a maximum contig length of 31,758 bp and an N50 of 190.
Should I accept this assembly, or do I need to try more k-mer values?
Please suggest which assemblers to use.
-
For the assembly of paired-end-only Illumina data, I like to use the ABySS assembler. But if the metagenome is too complex, I agree with the previous post that both 25 M and 69 M reads just scratch the surface. Using different assemblers won't make a significant difference in the number of contigs or the N50.
If the purpose is just to recover genes from the metagenome, paired-end-only Illumina data is useful for uncovering genes, except for those that suffer from strain variation. But to improve the integrity of the assembly dramatically (increase N50), mate-pair data with long inserts can significantly improve scaffolding. With programs that close gaps within scaffolds, the assembly can be improved further.