Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • dan
    replied
    Originally posted by sklages View Post
    @Dan,

    my suggestion was to either use the bambus mate pair output or a TraceArchive formatted XML file for use with "toAmos", like

    $ toAmos -s x.fasta -q x.qual -m test.mates -o test.a.afg

    or

    $ toAmos -s x.fasta -q x.qual -x test.xml -o test.a.afg

    but, of course, you have to create these files before.

    It is probably the easiest way to write your own "your_data2TraceArchive.pl" and then convert it to whatever you want, e.g. CA format using "tracedb-to-frg.pl".
    Yeah, that's what I'm struggling with... Any reference for the bambus format on the off chance that its simpler? I can't help thinking there is a canned solution in the AMOS docs if only I could find it.

    Leave a comment:


  • new300
    replied
    Originally posted by sklages View Post
    Celera Assembler has been optimised for at least 454FLX length reads; I have used it for shorter GS20 as well, but it may fail. All these assemblies were hybrid assemblies, not 454-only ones.
    That's interesting, I'm surprised it copes with the elevated indel/homopolymer run error rate.

    Leave a comment:


  • sklages
    replied
    @Dan,

    my suggestion was to either use the bambus mate pair output or a TraceArchive formatted XML file for use with "toAmos", like

    $ toAmos -s x.fasta -q x.qual -m test.mates -o test.a.afg

    or

    $ toAmos -s x.fasta -q x.qual -x test.xml -o test.a.afg

    but, of course, you have to create these files before.

    It is probably the easiest way to write your own "your_data2TraceArchive.pl" and then convert it to whatever you want, e.g. CA format using "tracedb-to-frg.pl".

    @new300,

    Celera Assembler has been optimised for at least 454FLX length reads; I have used it for shorter GS20 as well, but it may fail. All these assemblies were hybrid assemblies, not 454-only ones.

    hth,
    Sven

    Leave a comment:


  • new300
    replied
    Are you planning to use the Celera assembler with short reads? Because I don't think it would work very well.

    Leave a comment:


  • dan
    started a topic Input files for the Celera Assembler?

    Input files for the Celera Assembler?

    Hi,

    I have sequence reads as a fasta file with a fasta quality file. I am converting these into .frg input for the Celera Assembler (CA) using AMOS,

    Code:
    toAmos -s test.fasta -q test.qual -o test.afg
    and then:

    Code:
    amos2frg -i test.afg

    However, the above strategy does not recognize the 'mate pairs' in my data (paired end reads) that are 'linked' using the St Louis naming convention (basically read0012.b is pared with read0012.g, the .b and the .g denoting the forward and the backward read, respectively).

    On the AMOS mailing list I got a reply to a similar question from Sven Klages, which told me that CA is usually passed a separate "linkage information" file in addition to the .frg input created above. His suggestion was to create linkage data in 'Trace Archive' format*, and then convert that to input for CA using AMOS again.

    Has anyone tackled this problem before? Is there any code 'off the shelf' to create the mate pair data?


    * http://www.ncbi.nlm.nih.gov/Traces/t...#header-global


    Cheers,
    Dan.


    P.S. Any other good mailing lists for sequence assembly?

    If you know any, I'll add them here:

Latest Articles

Collapse

  • seqadmin
    Advanced Methods for the Detection of Infectious Disease
    by seqadmin




    The recent pandemic caused worldwide health, economic, and social disruptions with its reverberations still felt today. A key takeaway from this event is the need for accurate and accessible tools for detecting and tracking infectious diseases. Timely identification is essential for early intervention, managing outbreaks, and preventing their spread. This article reviews several valuable tools employed in the detection and surveillance of infectious diseases.
    ...
    11-27-2023, 01:15 PM
  • seqadmin
    Strategies for Investigating the Microbiome
    by seqadmin




    Microbiome research has led to the discovery of important connections to human and environmental health. Sequencing has become a core investigational tool in microbiome research, a subject that we covered during a recent webinar. Our expert speakers shared a number of advancements including improved experimental workflows, research involving transmission dynamics, and invaluable analysis resources. This article recaps their informative presentations, offering insights...
    11-09-2023, 07:02 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 12-05-2023, 02:24 PM
0 responses
17 views
0 likes
Last Post seqadmin  
Started by seqadmin, 12-05-2023, 07:37 AM
0 responses
27 views
0 likes
Last Post seqadmin  
Started by seqadmin, 12-04-2023, 08:23 AM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 12-01-2023, 09:55 AM
0 responses
26 views
0 likes
Last Post seqadmin  
Working...
X