I saw some posters at Biology of Genomes this year mentioning the new FastG format for assemblers. I was wondering if anyone has heard about this and if a spec was available yet.
Unconfigured Ad
Collapse
X
-
It will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.Originally posted by genericforms View PostThanks, this is great. I think the format might be also useful in mapping. However I do realize at some point we won't be mapping to a reference anymore.
Comment
-
-
You would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.Originally posted by nilshomer View PostIt will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.
Comment
-
-
I am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.Originally posted by genericforms View PostYou would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.
Comment
-
-
The reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.
We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.
As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.
Comment
-
-
I would be the first to admit that I am probably underestimating the complexity here, but a graph approach would be really nice.Originally posted by lh3 View PostThe reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.
We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.
As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.
I suppose the final specs are not released yet, however from the conference it seems that the format is very easy to parse and represents an obvious advance from an IUPAC coded reference (you can explicitly define indels, repeats, etc.).
Comment
-
-
Actually they are already suitable. A slight modification to BWT is enough:Originally posted by nilshomer View PostI am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Yesterday, 10:09 AM
|
0 responses
10 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
20 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
27 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
21 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
Comment