The 454ContigGraph.txt file output by the GS assembler is exactly the kind of information I am looking for. However, I could not tell from the Roche manual how to visualize the results. Any suggestions?
Header Leaderboard Ad
Collapse
454ContigGraph.txt
Collapse
Announcement
Collapse
SEQanswers June Challenge Has Begun!
The competition has begun! We're giving away a $50 Amazon gift card to the member who answers the most questions on our site during the month. We want to encourage our community members to share their knowledge and help each other out by answering questions related to sequencing technologies, genomics, and bioinformatics. The competition is open to all members of the site, and the winner will be announced at the beginning of July. Best of luck!
For a list of the official rules, visit (https://www.seqanswers.com/forum/sit...wledge-and-win)
For a list of the official rules, visit (https://www.seqanswers.com/forum/sit...wledge-and-win)
See more
See less
X
-
There is detailed introduction in the Version2.3's manual:
Genome Sequencer FLX System Software Manual Part C: GS De Novo Assembler – GS Reference Mapper – SFF Tools
1.15.1.12 454ContigGraph.txt
After you run a cDNA aseembly project, look at the actual file and read the manual. It is clear and easy to understand.
-
You are referring to this manual file, right?
There is a decription of the "first section" of 454ContigGraph.txt containing the average depth of the alignment per contig. But for the "second section" containing connections between contigs they only describe the lines starting with "C" not the ones starting with "I" or with "F". Does anyone know where I can find a description for these parts of the "second section"?
And does anyone know a way to visualize (parts of) this graph structure?
Thanks
Comment
-
Yes. It is that manual.
In my file, there are "S" and "I" starting instead of "F" and "I".
Here is an example:
S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+
I 164 AGAGGCTTAgggtttttCATCCAATCaaacAGTGCCAGACCACGGTTACACAACAGAACCGATGGCCTCAGCCGGTAACAGGATGGATACCAacagggtgg
I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'
Couldn't find the explantation from the manual. I think programmer from 454 can explain this.
Comment
-
I don't know if it is correct but what i suppose looking at my 454ContigGraph-file:
S isotig-id isotig-length contigs_in_isotig(with direction +/-)
I contig_id consenus_sequence coverage-depth:contigs_surrounding_the contig (???) (leading zeros in the contig_id are discarded: 1 = contig00001)
As I'm very unsure about the last entry in the "I-rows" I still don't have a clue why this entry is missing sometimes...Last edited by dschika; 03-29-2010, 07:13 AM.
Comment
-
I got the following explanation:
I: reads 'flowing through' the contig, i.e. reads that start in a neighboring contig, flow through the contig, and end in another neighbor again. In the example:
I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'
310 reads start in contig 875, flow out it's 3' end, into the 5' of the contig you are looking at (502), out of it again at the 3', and into the 3' of contig 855. The '..' kind of represents the contig in question from 5' to 3'.
In this case, the large number of reads flowing through (301, 6, 2867 and 5, respectively) and the short sequence ('GC') point to a tandem repeat/microsat. All reads go into contig 855, which perhaps is high depth and short also?
Comment
-
Comment
-
Originally posted by flxlex View Post
Comment
-
Visualizing 454contiggraph.txt files
Originally posted by linikujp View PostYou may develop a program to visualize this.
I am thinking of doing this... who else is interested?
Comment
-
Originally posted by henry.gibbons View PostI would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!
I posted it at this address:
Comment
Latest Articles
Collapse
-
by seqadmin
Developments in sequencing technologies and methodologies have transformed the field of epigenetics, giving researchers a better way to understand the complex world of gene regulation and heritable modifications. This article explores some of the diverse sequencing methods employed in the study of epigenetics, ranging from classic techniques to cutting-edge innovations while providing a brief overview of their processes, applications, and advances.
Methylation Detect...-
Channel: Articles
05-31-2023, 10:46 AM -
-
Differential Expression and Data Visualization: Recommended Tools for Next-Level Sequencing Analysisby seqadmin
After covering QC and alignment tools in the first segment and variant analysis and genome assembly in the second segment, we’re wrapping up with a discussion about tools for differential gene expression analysis and data visualization. In this article, we include recommendations from the following experts: Dr. Mark Ziemann, Senior Lecturer in Biotechnology and Bioinformatics, Deakin University; Dr. Medhat Mahmoud Postdoctoral Research Fellow at Baylor College of Medicine;...-
Channel: Articles
05-23-2023, 12:26 PM -
-
by seqadmin
Continuing from our previous article, we share variant analysis and genome assembly tools recommended by our experts Dr. Medhat Mahmoud, Postdoctoral Research Fellow at Baylor College of Medicine, and Dr. Ming "Tommy" Tang, Director of Computational Biology at Immunitas and author of From Cell Line to Command Line.
Variant detection and analysis tools
Mahmoud classifies variant detection work into two main groups: short variants (<50...-
Channel: Articles
05-19-2023, 10:03 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 08:56 PM
|
0 responses
8 views
0 likes
|
Last Post
by seqadmin
Yesterday, 08:56 PM
|
||
Deep Sequencing Unearths Novel Genetic Variants: Enhancing Precision Medicine for Vascular Anomalies
by seqadmin
Started by seqadmin, Yesterday, 07:33 AM
|
0 responses
8 views
0 likes
|
Last Post
by seqadmin
Yesterday, 07:33 AM
|
||
Unveiling Genetic Associations Through Transcription Factor Binding Quantitative Trait Loci
by seqadmin
Started by seqadmin, 05-31-2023, 07:50 AM
|
0 responses
4 views
0 likes
|
Last Post
by seqadmin
05-31-2023, 07:50 AM
|
||
Exploring French-Canadian Ancestry: Insights into Migration, Settlement Patterns, and Genetic Structure
by seqadmin
Started by seqadmin, 05-26-2023, 09:22 AM
|
0 responses
8 views
0 likes
|
Last Post
by seqadmin
05-26-2023, 09:22 AM
|
Comment