Hi,
yes, we do quality and contaminant trimming. Newbler looks for the linker, so if you mean you are splitting the reads and removing the linker, that doesn't work. Or at least, didn't last time I did that.
As for rules of thumb, I always refer to the Broad's guidelines. Which often don't work, but you have to start somewhere.
And yes, we did try the -m an other options. I probably tried about 20-30 different combinations.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Thanks Bob........did you use '-m' option or others advanced options? And also trim the dataset? I had the trouble when tried to feed the trimmed and split 454 mate-paired reads. Because Newbler couldn't detect them as mate-paired though it is not seen for Illumina paired-end reads after trimming.
And for a large eukaryotic genome, say 3Gb in size, the 40-50Gb dataset you used covers only 16-17(x) of the whole genome that might not be quite enough whereas, for a 300Mb genome the figure reached upto 167(x)! So, is there any rule that what coverage we should initially use while trying to assembling a large genome?
Leave a comment:
-
I have done some assemblies with some pretty large data sets, in the 40-50 Gb range. With the large and het options, I can get an assembly, without, they simply never finish. By never, I mean, after 6 weeks of processing, no updates of the status files for several weeks. I did these on machine with 1 TB RAM. I tried various incremental assemblies and different parameters and essentially got to the same place as when I presented Newbler with all the data. I didn't see any improvements with the CIO options.
Leave a comment:
-
De novo assembly: raw data type & volume
Hi,
I'm trying to assemble 454 raw reads (gDNA) with Newbler 2.6. Can anyone tell me what is the maximum volume of raw data (nt) newbler could intake in one-step or incremental form (as it could assemble large genome of up to 3Gb in size) ? Also the proportion of shotgun and mate-paired reads we should use in order to have a better assembly ?Tags: None
Latest Articles
Collapse
-
by seqadmin
The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...-
Channel: Articles
07-08-2024, 03:19 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 06:46 AM
|
0 responses
9 views
0 likes
|
Last Post
by seqadmin
Yesterday, 06:46 AM
|
||
Started by seqadmin, 07-24-2024, 11:09 AM
|
0 responses
26 views
0 likes
|
Last Post
by seqadmin
07-24-2024, 11:09 AM
|
||
Started by seqadmin, 07-19-2024, 07:20 AM
|
0 responses
159 views
0 likes
|
Last Post
by seqadmin
07-19-2024, 07:20 AM
|
||
Started by seqadmin, 07-16-2024, 05:49 AM
|
0 responses
127 views
0 likes
|
Last Post
by seqadmin
07-16-2024, 05:49 AM
|
Leave a comment: