Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • kbradnam
    replied
    FASTG v1.0 spec is now available from here:

    Compare the best free open source System Software at SourceForge. Free, secure and fast System Software downloads from the largest Open Source applications and software directory

    Leave a comment:


  • vmakinen
    replied
    Originally posted by nilshomer View Post
    I am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.
    Actually they are already suitable. A slight modification to BWT is enough:

    Leave a comment:


  • adaptivegenome
    replied
    Originally posted by lh3 View Post
    The reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.

    We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.

    As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.
    I would be the first to admit that I am probably underestimating the complexity here, but a graph approach would be really nice.

    I suppose the final specs are not released yet, however from the conference it seems that the format is very easy to parse and represents an obvious advance from an IUPAC coded reference (you can explicitly define indels, repeats, etc.).

    Leave a comment:


  • lh3
    replied
    The reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.

    We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.

    As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.

    Leave a comment:


  • nilshomer
    replied
    Originally posted by genericforms View Post
    You would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.
    I am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.

    Leave a comment:


  • adaptivegenome
    replied
    Originally posted by nilshomer View Post
    It will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.
    You would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.

    Leave a comment:


  • nilshomer
    replied
    Originally posted by genericforms View Post
    Thanks, this is great. I think the format might be also useful in mapping. However I do realize at some point we won't be mapping to a reference anymore.
    It will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.

    Leave a comment:


  • adaptivegenome
    replied
    Thanks, this is great. I think the format might be also useful in mapping. However I do realize at some point we won't be mapping to a reference anymore.

    Leave a comment:


  • maubp
    replied
    It is being discussed for the next Assemblathon competition, and the mailing list was recently made public. See:
    The URL you requested could not be found.

    Leave a comment:


  • adaptivegenome
    started a topic FastG format?

    FastG format?

    I saw some posters at Biology of Genomes this year mentioning the new FastG format for assemblers. I was wondering if anyone has heard about this and if a spec was available yet.
    Last edited by adaptivegenome; 06-12-2012, 12:38 PM. Reason: typo

Latest Articles

Collapse

  • seqadmin
    Understanding Genetic Influence on Infectious Disease
    by seqadmin




    During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

    Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
    09-09-2024, 10:59 AM
  • seqadmin
    Addressing Off-Target Effects in CRISPR Technologies
    by seqadmin






    The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
    08-27-2024, 04:44 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 02:44 PM
0 responses
7 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-06-2024, 08:02 AM
0 responses
143 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-03-2024, 08:30 AM
0 responses
150 views
0 likes
Last Post seqadmin  
Started by seqadmin, 08-27-2024, 04:40 AM
0 responses
158 views
0 likes
Last Post seqadmin  
Working...
X