454ContigGraph.txt

  • dsenalik
    replied
    Originally posted by henry.gibbons View Post
    I would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!
    For anyone finding this old thread, I wrote a program to do just this.

    I posted it at this address:

    http://www.vcru.wisc.edu/simonlab/sd...html#contignet


  • henry.gibbons
    replied
    Visualizing 454contiggraph.txt files

    Originally posted by linikujp View Post
    You may develop a program to visualize this.
    I am thinking of doing this... who else is interested?
    I would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!


  • lh3
    replied
    Thanks. Quite a nice blog on technical details.


  • linikujp
    replied
    Originally posted by flxlex View Post
    I just posted an entry on the 454ContigGraph.txt file on my blog about newbler:

    http://contig.wordpress.com/2010/04/...raph-txt-file/

    Cheers,

    flxlex
    Hi, that's a good one. Thank you!


  • flxlex
    replied
    I just posted an entry on the 454ContigGraph.txt file on my blog about newbler:

    http://contig.wordpress.com/2010/04/...raph-txt-file/

    Cheers,

    flxlex


  • flxlex
    replied
    I got the following explanation:

    I: reads 'flowing through' the contig, i.e. reads that start in a neighboring contig, flow through the contig, and end in another neighbor again. In the example:

    I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'

    301 reads start in contig 875, flow out of its 3' end, into the 5' end of the contig you are looking at (502), out of it again at the 3' end, and into the 3' end of contig 855. The '..' represents the contig in question from 5' to 3'.

    In this case, the large number of reads flowing through (301, 6, 2867 and 5, respectively) and the short sequence ('GC') point to a tandem repeat/microsat. All reads go into contig 855, which perhaps is high depth and short also?
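
    A minimal sketch of how such an I line could be parsed, assuming the layout described above: whitespace-separated fields, then entries separated by ';', each entry being a read count, a colon, and the two neighbors joined by '..'. The names FlowThrough, parse_i_line and flows_to_dot are made up for illustration and are not part of newbler. The DOT output is only one possible way to hand the result to Graphviz, and it drops the 5'/3' orientation.

```python
import re
from typing import List, NamedTuple, Tuple


class FlowThrough(NamedTuple):
    reads: int        # number of reads flowing through the contig
    from_contig: int  # neighbor the reads start in
    from_end: str     # end of that neighbor they leave ("5'" or "3'")
    to_contig: int    # neighbor the reads continue into
    to_end: str       # end of that neighbor they enter


# One entry looks like: 301:875-3'..855-3'
ENTRY_RE = re.compile(r"^(\d+):(\d+)-([35]')\.\.(\d+)-([35]')$")


def parse_i_line(line: str) -> Tuple[int, str, List[FlowThrough]]:
    """Return (contig_id, sequence, flow_throughs) for one 'I' line.

    The last field is treated as optional, because some I lines
    in the thread carry only the contig id and the sequence.
    """
    fields = line.split()
    if len(fields) < 3 or fields[0] != "I":
        raise ValueError(f"not an I line: {line!r}")
    contig_id, sequence = int(fields[1]), fields[2]
    flows: List[FlowThrough] = []
    if len(fields) > 3:
        for entry in fields[3].split(";"):
            m = ENTRY_RE.match(entry)
            if m is None:
                raise ValueError(f"unexpected entry: {entry!r}")
            reads, a, a_end, b, b_end = m.groups()
            flows.append(FlowThrough(int(reads), int(a), a_end, int(b), b_end))
    return contig_id, sequence, flows


def flows_to_dot(contig_id: int, flows: List[FlowThrough]) -> str:
    """Render neighbor -> contig -> neighbor edges as Graphviz DOT text."""
    lines = ["digraph flow {"]
    for f in flows:
        lines.append(f'  "{f.from_contig}" -> "{contig_id}" [label="{f.reads}"];')
        lines.append(f'  "{contig_id}" -> "{f.to_contig}" [label="{f.reads}"];')
    lines.append("}")
    return "\n".join(lines)


example = "I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'"
cid, seq, flows = parse_i_line(example)
print(flows_to_dot(cid, flows))
```

    For the example line this gives four flow-through entries for contig 502, all ending in contig 855, with read counts 301, 6, 2867 and 5.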


  • dschika
    replied
    I don't know if it is correct, but this is what I suppose from looking at my 454ContigGraph file:

    S isotig-id isotig-length contigs_in_isotig(with direction +/-)

    I contig_id consensus_sequence coverage-depth:contigs_surrounding_the_contig (???) (leading zeros in the contig_id are discarded: 1 = contig00001)

    As I'm very unsure about the last entry in the "I-rows" I still don't have a clue why this entry is missing sometimes...
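
    If the guess about the S rows above is right (isotig id, isotig length, then a ';'-separated list of contig:direction pairs), a line could be read roughly like this. This is only a sketch under that assumption; parse_s_line and contig_name are hypothetical helper names, and the five-digit padding is taken from the "1 = contig00001" remark.

```python
from typing import List, Tuple


def parse_s_line(line: str) -> Tuple[int, int, List[Tuple[int, str]]]:
    """Return (isotig_id, isotig_length, path) for one 'S' line.

    path is a list of (contig_id, direction) with direction '+' or '-'.
    Assumes exactly four whitespace-separated fields.
    """
    fields = line.split()
    if len(fields) != 4 or fields[0] != "S":
        raise ValueError(f"not an S line: {line!r}")
    path: List[Tuple[int, str]] = []
    for step in fields[3].split(";"):
        contig, _, direction = step.partition(":")
        if direction not in ("+", "-"):
            raise ValueError(f"unexpected path step: {step!r}")
        path.append((int(contig), direction))
    return int(fields[1]), int(fields[2]), path


def contig_name(contig_id: int, width: int = 5) -> str:
    """Restore the leading zeros dropped in the graph file (1 -> contig00001)."""
    return f"contig{contig_id:0{width}d}"


iso_id, length, path = parse_s_line("S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+")
print(iso_id, length, [(contig_name(c), d) for c, d in path])
```

    The example S line is the one posted elsewhere in this thread; it would come out as isotig 1, length 3694, built from six contigs starting with contig00618 in the + direction.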
    Last edited by dschika; 03-29-2010, 07:13 AM.


  • linikujp
    replied
    Originally posted by seqseq View Post
    And does anyone know a way to visualize (parts of) this graph structure?

    Thanks
    You may develop a program to visualize this.
    I am thinking of doing this... who else is interested?


  • linikujp
    replied
    Yes. It is that manual.

    In my file, the lines start with "S" and "I" instead of "F" and "I".

    Here is an example:
    S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+

    I 164 AGAGGCTTAgggtttttCATCCAATCaaacAGTGCCAGACCACGGTTACACAACAGAACCGATGGCCTCAGCCGGTAACAGGATGGATACCAacagggtgg
    I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'

    I couldn't find the explanation in the manual. I think a programmer from 454 could explain this.


  • seqseq
    replied
    You are referring to this manual file, right?
    http://xyala.cap.ed.ac.uk/Gene_Pool/...ls_Oct2009.pdf

    There is a description of the "first section" of 454ContigGraph.txt, which contains the average depth of the alignment per contig. But for the "second section", which contains the connections between contigs, they only describe the lines starting with "C", not the ones starting with "I" or "F". Does anyone know where I can find a description of these parts of the "second section"?

    And does anyone know a way to visualize (parts of) this graph structure?

    Thanks


  • linikujp
    replied
    There is a detailed introduction in the version 2.3 manual:
    Genome Sequencer FLX System Software Manual Part C: GS De Novo Assembler – GS Reference Mapper – SFF Tools
    1.15.1.12 454ContigGraph.txt

    After you run a cDNA assembly project, look at the actual file and read the manual. It is clear and easy to understand.


  • btully
    started a topic 454ContigGraph.txt

    454ContigGraph.txt

    The 454ContigGraph.txt file output by the GS assembler is exactly the kind of information I am looking for. However, I could not tell from the Roche manual how to visualize the results. Any suggestions?
