I saw some posters at Biology of Genomes this year mentioning the new FastG format for assemblers. I was wondering if anyone has heard about this and if a spec was available yet.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by genericforms View PostThanks, this is great. I think the format might be also useful in mapping. However I do realize at some point we won't be mapping to a reference anymore.
Comment
-
Originally posted by nilshomer View PostIt will be quite difficult to adapt the FM-index (BWT) based aligners. My prediction would be full-on assembly being the norm in about 2 years.
Comment
-
Originally posted by genericforms View PostYou would know better than me how fast the technology is progressing, so I can't say for sure that it would be worth it, but I think FastG could be useful in specifying alternate reference sequences during mapping. I am not sure it would require significant alteration to existing methods.
Comment
-
The reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.
We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.
As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.
Comment
-
Originally posted by lh3 View PostThe reference genome will still be relevant even if we could get a very good assembly. After all, the annotations are in the reference coordinate. To annotate a new assembly, we need to map the assembly to the reference genome.
We all wish to map data to a graph, but few have a clear definition of the problem, let alone the solution. Adopting graph alignment is likely to take longer than we hope. For now, my vague vision is a graph alone is not enough. We also need the alignment between the graph and the reference.
As to fastg, you can read from the archive that I a little worry about its scope (final scaffold only or generic sequence graph?), technical complexity (simpler and easier to parse format?) and mathematical clarity (more straightforward graph interpretation?), but probably it is me who has the wrong opinions.
I suppose the final specs are not released yet, however from the conference it seems that the format is very easy to parse and represents an obvious advance from an IUPAC coded reference (you can explicitly define indels, repeats, etc.).
Comment
-
Originally posted by nilshomer View PostI am not saying mapping to FastG is not possible, I am asserting that FM-indexes are not suitable (yet) for multiple hapolotypes in the same reference sequence.
Comment
Latest Articles
Collapse
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
Nobel Prize for MicroRNA Discovery
This week,...-
Channel: Articles
10-07-2024, 08:07 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 06:58 AM
|
0 responses
8 views
0 likes
|
Last Post
by seqadmin
Today, 06:58 AM
|
||
New AI Model Designs Synthetic DNA Switches for Targeted Gene Expression in Specific Cell Types
by seqadmin
Started by seqadmin, Yesterday, 08:43 AM
|
0 responses
18 views
0 likes
|
Last Post
by seqadmin
Yesterday, 08:43 AM
|
||
Started by seqadmin, 10-17-2024, 07:29 AM
|
0 responses
52 views
0 likes
|
Last Post
by seqadmin
10-17-2024, 07:29 AM
|
||
Genetic Barcodes and Single-Cell Sequencing Illuminate Tumor Initiation and Chemoresistance in Breast Cancer
by seqadmin
Started by seqadmin, 10-15-2024, 06:35 AM
|
0 responses
40 views
0 likes
|
Last Post
by seqadmin
10-15-2024, 06:35 AM
|
Comment