Guys thanks for the replies.
@kmcarr I can not give a straight answer to your question for I can not tell from the numbers only wich transcriptome assembly was "better". The number, the n50 and the distribution of the lengths of the *isotigs* was marginaly "better' without the -large option, however the -large option gave me a better resolution for an individual multi copy gene family that we are after. I need to wait for the PCR aplicons from the wet lab guys to coroborate that, but the indications so far was that for a particular family (which BTW contains sevelar repeats) the -large option might give us better resolution.
@flxlex both with and without -large the assemblies run relative fine (about a couple of hours each in a 4core 32gb RAM machine) so finishing of the assembly is not an issue for us. However I take seriously into account your comment that -large "shortcuts some steps" and marks some reads as repeats and I ll have a manual look at the .ace files of the protein family we are after. The contigs number and lenght distributions as I mentioned are not significantly different. So with the lack of any other formal way, I ll go with the empirical assesment here and I manualy (and together with some wet lab confirmation) check which option give us better resolution for the family we are after.
Thanks again for your replies.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
-large is supposed to be used for large genome assemblies, which won't finish 'ever' without the -large option set. On occasion, I needed it for transcriptome assemblies, otherwise they would take way too long.
Generally, one wants to avoid -large, as it shortcuts some steps and thereby can lead to worse results (shorter contigs, more reads mared as repeat, for instance).
Leave a comment:
-
Originally posted by cbouyio View PostThe question has occured when I (for curiosity purposes) tried the -large flag for a transcriptome assembly (together with the -cdna flag of course) and then I observed a significant difference on the size and the constitution of the isotigs generated. No something significant in the number but significant difference in the lengths of the isotigs and how they have been put together.
Leave a comment:
-
GS De Novo Assembler (Newbler) -large option for transcriptomes
Hi all,
Has anyone ever tried the -large option for de-novo assembly of 454 transcriptome data.
The issue for the question is that the -large flag (flag for large of complex genomes) has no more documentation apart form that phrase I just wrote in the parentheses.
I understand that this is an option for genome assemblies (mostly.. only...???) but what is the influnce of this flag if one use it for transcriptomes.
The question has occured when I (for curiosity purposes) tried the -large flag for a transcriptome assembly (together with the -cdna flag of course) and then I observed a significant difference on the size and the constitution of the isotigs generated. No something significant in the number but significant difference in the lengths of the isotigs and how they have been put together.
Has anybody gone to the bottom of how this flag works?
Many thanks
Latest Articles
Collapse
-
by seqadmin
Spatial biology is an exciting field that encompasses a wide range of techniques and technologies aimed at mapping the organization and interactions of various biomolecules in their native environments. As this area of research progresses, new tools and methodologies are being introduced, accompanied by efforts to establish benchmarking standards and drive technological innovation.
3D Genomics
While spatial biology often involves studying proteins and RNAs in their...-
Channel: Articles
01-01-2025, 07:30 PM -
-
by seqadmin
Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...-
Channel: Articles
12-16-2024, 07:57 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 01-09-2025, 04:04 PM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
01-09-2025, 04:04 PM
|
||
Started by seqadmin, 01-09-2025, 09:42 AM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
01-09-2025, 09:42 AM
|
||
Started by seqadmin, 01-08-2025, 03:17 PM
|
0 responses
29 views
0 likes
|
Last Post
by seqadmin
01-08-2025, 03:17 PM
|
||
Started by seqadmin, 01-03-2025, 11:18 AM
|
1 response
47 views
1 like
|
Last Post
by Tonia
01-05-2025, 12:15 PM
|
Leave a comment: