How many contigs one can get after metagenome assembly?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by BIOin View Posti want to assemble 25 million reads. i am getting varying results with different assemblers.
For a metagenome, the complexity can vary depending on your sample. If you had a very complex sample, 25M reads (platform? paired end? read length?) is probably barely scratching the surface -- 25M 2x100 Illumina reads is only 5Gb, which isn't gigantic if you have a diverse sample.
Comment
-
thanks for the reply.
yes my data is complex(animal rumen), my data set Illumina 25M HiSeq 2000 2x100,
I just started using meta-velvet to assemble high quality metagenome data. I tried running meta-velvet with a k-mer of 45, after the assembly is finished and I look at the output file "meta-velvetg.contigs.fa" got 1128469 contigs with max contig length 31758 bp and N50 190.
Should i have to consider this assembly or need to run more Kmers...
Please give me suggestions on assemblers to be use
Comment
-
I tried to assembly a metagenome (plant endophyte, the plant genome is not avaiable now) uing ILLUMINA hiseq 2000 2*100 reads too, my data has 69 M paired end reads, 9.9 Billion bases. I assemblied these reads using CLC genomic workbench, and got 770 thousands contigs. I am working on these contigs now. How do you deal with your so many contigs? Could we share our idears>
Originally posted by BIOin View Postthanks for the reply.
yes my data is complex(animal rumen), my data set Illumina 25M HiSeq 2000 2x100,
I just started using meta-velvet to assemble high quality metagenome data. I tried running meta-velvet with a k-mer of 45, after the assembly is finished and I look at the output file "meta-velvetg.contigs.fa" got 1128469 contigs with max contig length 31758 bp and N50 190.
Should i have to consider this assembly or need to run more Kmers...
Please give me suggestions on assemblers to be use
Originally posted by BIOin View Postthanks for the reply.
yes my data is complex(animal rumen), my data set Illumina 25M HiSeq 2000 2x100,
I just started using meta-velvet to assemble high quality metagenome data. I tried running meta-velvet with a k-mer of 45, after the assembly is finished and I look at the output file "meta-velvetg.contigs.fa" got 1128469 contigs with max contig length 31758 bp and N50 190.
Should i have to consider this assembly or need to run more Kmers...
Please give me suggestions on assemblers to be use
Comment
-
For the assembly of paired-end only Illumina data, I like to use ABySS assembler. But if the metagenome is too complicated, I agree with the previous post that both 25 M and 69 M reads are just to scratch the surface. Using different assemblers won't make signficant difference in terms of the number of contigs or n50.
If the purpose is just to recover genes from the metagenome, paired-end only Illumina data is useful to uncover genes except for those that suffer from strain variations. But to increase the integraty of the assembly dramatically (increase n50), mate-pair data with long inserts can significantly increase scaffolding performance. With some programs to resolve some gaps within scaffolds, the assembly can be improved further.
Comment
-
Originally posted by Shuiquan View PostFor the assembly of paired-end only Illumina data, I like to use ABySS assembler. But if the metagenome is too complicated, I agree with the previous post that both 25 M and 69 M reads are just to scratch the surface. Using different assemblers won't make signficant difference in terms of the number of contigs or n50.
If the purpose is just to recover genes from the metagenome, paired-end only Illumina data is useful to uncover genes except for those that suffer from strain variations. But to increase the integraty of the assembly dramatically (increase n50), mate-pair data with long inserts can significantly increase scaffolding performance. With some programs to resolve some gaps within scaffolds, the assembly can be improved further.
Comment
Latest Articles
Collapse
-
by seqadmin
This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.
The Headliner
The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...-
Channel: Articles
03-03-2025, 01:39 PM -
-
by seqadmin
The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...-
Channel: Articles
02-24-2025, 06:31 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 03-03-2025, 01:15 PM
|
0 responses
179 views
0 likes
|
Last Post
by seqadmin
03-03-2025, 01:15 PM
|
||
Started by seqadmin, 02-28-2025, 12:58 PM
|
0 responses
272 views
0 likes
|
Last Post
by seqadmin
02-28-2025, 12:58 PM
|
||
Started by seqadmin, 02-24-2025, 02:48 PM
|
0 responses
656 views
0 likes
|
Last Post
by seqadmin
02-24-2025, 02:48 PM
|
||
Started by seqadmin, 02-21-2025, 02:46 PM
|
0 responses
267 views
0 likes
|
Last Post
by seqadmin
02-21-2025, 02:46 PM
|
Comment