Unconfigured Ad

**maubp** · 06-12-2012, 01:41 PM

It is being discussed for the next Assemblathon competition, and the mailing list was recently made public. See:

Mabosplay: Daftar, Login & Link Alternatif Mabosplay Terbaru!

http://assemblathon.org/pages/mailing-list

Mabosplay: Temukan cara daftar, login mudah, dan link alternatif terbaru Mabosplay. Ikuti panduan praktis ini agar pengalaman bermain online semakin aman & nyaman!

https://lists.ucdavis.edu/sympa/arc/assemblathon-file-format

**adaptivegenome** · 06-12-2012, 02:38 PM

Thanks, this is great. I think the format might be also useful in mapping. However I do realize at some point we won't be mapping to a reference anymore.

**nilshomer** · 06-12-2012, 04:10 PM

Originally posted by genericforms View Post

Thanks, this is great. I think the format might be also useful in mapping. However I do realize at some point we won't be mapping to a reference anymore.

It will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.

**adaptivegenome** · 06-12-2012, 04:28 PM

Originally posted by nilshomer View Post

It will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.

You would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.

**nilshomer** · 06-12-2012, 07:37 PM

Originally posted by genericforms View Post

You would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.

I am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.

**lh3** · 06-12-2012, 08:59 PM

The reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.

We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.

As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.

**adaptivegenome** · 06-12-2012, 09:37 PM

Originally posted by lh3 View Post

The reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.

We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.

As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.

I would be the first to admit that I am probably underestimating the complexity here, but a graph approach would be really nice.

I suppose the final specs are not released yet, however from the conference it seems that the format is very easy to parse and represents an obvious advance from an IUPAC coded reference (you can explicitly define indels, repeats, etc.).

**vmakinen** · 02-25-2013, 05:43 AM

Originally posted by nilshomer View Post

I am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.

Actually they are already suitable. A slight modification to BWT is enough:

Generalized Compressed Suffix Array

http://www.cs.helsinki.fi/group/suds/gcsa/

**kbradnam** · 04-24-2013, 11:08 PM

FASTG v1.0 spec is now available from here:

Best Open Source System Software 2026

http://fastg.sourceforge.net

Compare the best free open source System Software at SourceForge. Free, secure and fast System Software downloads from the largest Open Source applications and software directory

Topics	Statistics	Last Post
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, Yesterday, 10:09 AM	0 responses 10 views 0 reactions	Last Post by SEQadmin2 Yesterday, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM
Long-Read RNA Sequencing Uncovers a Hidden Layer of Immune Cell Regulation by SEQadmin2 Started by SEQadmin2, 06-02-2026, 12:03 PM	0 responses 27 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 12:03 PM
DNA Methylation Study Reveals How Epigenetic Changes Pass Between Generations by SEQadmin2 Started by SEQadmin2, 06-02-2026, 11:40 AM	0 responses 21 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 11:40 AM

Unconfigured Ad

FastG format?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News