Header Leaderboard Ad

Collapse

Extract reference and aligned sequences from BAM file basing on VCF file

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • floem7
    replied
    Thanks, I've followed example found at vcftools page and it works :-)

    Great thanks!

    Edit: however, I realized that aligned fasta format would be better. The aim is to quickly generate
    friendly msa view for a given region. For example for primer design.

    Certainly, ordinary MSA programs don't create sufficiently similar alignment as this in bam file.
    So it require manual inspection.
    Last edited by floem7; 01-19-2015, 02:45 PM. Reason: adding info.

    Leave a comment:


  • sarvidsson
    replied
    Forgot to mention, if you just want to extract FASTA sequences for GFF features (i.e. without any called variants applied), you can use BEDTools getfasta (http://bedtools.readthedocs.org/en/l.../getfasta.html).

    Leave a comment:


  • sarvidsson
    replied
    You can try

    - "vcf-consensus" from VCFtools: http://vcftools.sourceforge.net/perl...#vcf-consensus. Click on "Read more" to get an example how to get the consensus for a given region within the reference sequence (you need to extract this information from your GFF).

    or

    - FastaAlternateReferenceMaker within GATK (https://www.broadinstitute.org/gatk/...renceMaker.php)

    Read the documentation thoroughly - there are several caveats!

    Leave a comment:


  • Extract reference and aligned sequences from BAM file basing on VCF file

    I couldn't find relevant information. I hope it's not duplicate.

    I have resequencing data for two maize lines (BAM and VCF files).
    I want to extract sequences (fasta) for several genes (I have also GFF3 file with annotation data) from reference genome and corresponding sequences from resequencing data. I probably could use sequence identifiers, as they are in my files

    Which tool allows to extract such data, and more generally to extract sequences for
    a given variant type (SNP, indel, etc) and location (exon, intron, etc)?
    Last edited by floem7; 01-17-2015, 03:24 PM. Reason: Adding info.

Latest Articles

Collapse

  • seqadmin
    How RNA-Seq is Transforming Cancer Studies
    by seqadmin



    Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
    09-07-2023, 11:15 PM
  • seqadmin
    Methods for Investigating the Transcriptome
    by seqadmin




    Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

    Whole Transcriptome RNA-seq
    Whole transcriptome sequencing...
    08-31-2023, 11:07 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 09-22-2023, 09:05 AM
0 responses
21 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-21-2023, 06:18 AM
0 responses
14 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-20-2023, 09:17 AM
0 responses
14 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-19-2023, 09:23 AM
0 responses
29 views
0 likes
Last Post seqadmin  
Working...
X