Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • d17
    Member
    • Sep 2008
    • 27

    VCF file from aligned fasta?

    Is it possible to take a file of aligned fasta reads (say 5kb of fasta format sequence or, even better, multifasta format, including one "reference" sequence) and produce a VCF file showing the variants? I'd like to treat each base in the fasta sequences as known (i.e. perfect sequence quality).

    I can easily look at a fasta alignment, say in clustal. It is relatively easy to generate a VCF listing all SNPs. But it gets more complicated when you add in indels and complex variants, and when there are lots of samples it can get messy quickly.

    One solution would be to use an aligner to map these 5kb sequences to the 5kb "reference" sequence, then use any number of tools to generate the VCF from the BAM. However, this would be redoing the alignment step - I already have an alignment! Moreover, it might be difficult to calibrate the SNP calling tools which are not used to working with perfect sequence.

    A similar question is asked in the following thread, but this no satisfactory solution for FASTA --> VCF is found: http://seqanswers.com/forums/showthread.php?t=30461

    Any ideas?

Latest Articles

Collapse

  • SEQadmin2
    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
    by SEQadmin2


    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


    Here are nine questions we think about, in roughly the order they matter, before...
    06-18-2026, 07:11 AM
  • SEQadmin2
    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
    by SEQadmin2


    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
    ...
    06-02-2026, 10:05 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by SEQadmin2, 06-17-2026, 06:09 AM
0 responses
30 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-09-2026, 11:58 AM
0 responses
44 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-05-2026, 10:09 AM
0 responses
51 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-04-2026, 08:59 AM
0 responses
51 views
0 reactions
Last Post SEQadmin2  
Working...