Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • 454ContigGraph.txt

    The 454ContigGraph.txt file output by the GS assembler is exactly the kind of information I am looking for. However, I could not tell from the Roche manual how to visualize the results. Any suggestions?

  • #2
    There is detailed introduction in the Version2.3's manual:
    Genome Sequencer FLX System Software Manual Part C: GS De Novo Assembler – GS Reference Mapper – SFF Tools
    1.15.1.12 454ContigGraph.txt

    After you run a cDNA aseembly project, look at the actual file and read the manual. It is clear and easy to understand.

    Comment


    • #3
      You are referring to this manual file, right?


      There is a decription of the "first section" of 454ContigGraph.txt containing the average depth of the alignment per contig. But for the "second section" containing connections between contigs they only describe the lines starting with "C" not the ones starting with "I" or with "F". Does anyone know where I can find a description for these parts of the "second section"?

      And does anyone know a way to visualize (parts of) this graph structure?

      Thanks

      Comment


      • #4
        Yes. It is that manual.

        In my file, there are "S" and "I" starting instead of "F" and "I".

        Here is an example:
        S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+

        I 164 AGAGGCTTAgggtttttCATCCAATCaaacAGTGCCAGACCACGGTTACACAACAGAACCGATGGCCTCAGCCGGTAACAGGATGGATACCAacagggtgg
        I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'

        Couldn't find the explantation from the manual. I think programmer from 454 can explain this.

        Comment


        • #5
          Originally posted by seqseq View Post
          And does anyone know a way to visualize (parts of) this graph structure?

          Thanks
          You may develop a program to visualize this.
          I am thinking of doing this... who else is interested?

          Comment


          • #6
            I don't know if it is correct but what i suppose looking at my 454ContigGraph-file:

            S isotig-id isotig-length contigs_in_isotig(with direction +/-)

            I contig_id consenus_sequence coverage-depth:contigs_surrounding_the contig (???) (leading zeros in the contig_id are discarded: 1 = contig00001)

            As I'm very unsure about the last entry in the "I-rows" I still don't have a clue why this entry is missing sometimes...
            Last edited by dschika; 03-29-2010, 07:13 AM.

            Comment


            • #7
              I got the following explanation:

              I: reads 'flowing through' the contig, i.e. reads that start in a neighboring contig, flow through the contig, and end in another neighbor again. In the example:

              I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'

              310 reads start in contig 875, flow out it's 3' end, into the 5' of the contig you are looking at (502), out of it again at the 3', and into the 3' of contig 855. The '..' kind of represents the contig in question from 5' to 3'.

              In this case, the large number of reads flowing through (301, 6, 2867 and 5, respectively) and the short sequence ('GC') point to a tandem repeat/microsat. All reads go into contig 855, which perhaps is high depth and short also?

              Comment


              • #8
                I just posted an entry on the 454ContigGraph.txt file on my blog about newbler:

                The single file I’ll discuss today has in fact almost the entire assembly in it, besides the actual sequences (although even some of these are also included, see below). As explained in my first po…


                Cheers,

                flxlex

                Comment


                • #9
                  Originally posted by flxlex View Post
                  I just posted an entry on the 454ContigGraph.txt file on my blog about newbler:

                  The single file I’ll discuss today has in fact almost the entire assembly in it, besides the actual sequences (although even some of these are also included, see below). As explained in my first po…


                  Cheers,

                  flxlex
                  Hi, that's a good one. Thank you!

                  Comment


                  • #10
                    Thanks. Quite a nice blog on technical details.

                    Comment


                    • #11
                      Visualizing 454contiggraph.txt files

                      Originally posted by linikujp View Post
                      You may develop a program to visualize this.
                      I am thinking of doing this... who else is interested?
                      I would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!

                      Comment


                      • #12
                        Originally posted by henry.gibbons View Post
                        I would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!
                        For anyone finding this old thread, I wrote a program to do just this.

                        I posted it at this address:

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Exploring the Dynamics of the Tumor Microenvironment
                          by seqadmin




                          The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                          07-08-2024, 03:19 PM
                        • seqadmin
                          Exploring Human Diversity Through Large-Scale Omics
                          by seqadmin


                          In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                          06-25-2024, 06:43 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 07-16-2024, 05:49 AM
                        0 responses
                        26 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 07-15-2024, 06:53 AM
                        0 responses
                        32 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 07-10-2024, 07:30 AM
                        0 responses
                        40 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 07-03-2024, 09:45 AM
                        0 responses
                        205 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X