Header Leaderboard Ad

Collapse

Extract reference and aligned sequences from BAM file basing on VCF file

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Extract reference and aligned sequences from BAM file basing on VCF file

    I couldn't find relevant information. I hope it's not duplicate.

    I have resequencing data for two maize lines (BAM and VCF files).
    I want to extract sequences (fasta) for several genes (I have also GFF3 file with annotation data) from reference genome and corresponding sequences from resequencing data. I probably could use sequence identifiers, as they are in my files

    Which tool allows to extract such data, and more generally to extract sequences for
    a given variant type (SNP, indel, etc) and location (exon, intron, etc)?
    Last edited by floem7; 01-17-2015, 03:24 PM. Reason: Adding info.

  • #2
    You can try

    - "vcf-consensus" from VCFtools: http://vcftools.sourceforge.net/perl...#vcf-consensus. Click on "Read more" to get an example how to get the consensus for a given region within the reference sequence (you need to extract this information from your GFF).

    or

    - FastaAlternateReferenceMaker within GATK (https://www.broadinstitute.org/gatk/...renceMaker.php)

    Read the documentation thoroughly - there are several caveats!

    Comment


    • #3
      Forgot to mention, if you just want to extract FASTA sequences for GFF features (i.e. without any called variants applied), you can use BEDTools getfasta (http://bedtools.readthedocs.org/en/l.../getfasta.html).

      Comment


      • #4
        Thanks, I've followed example found at vcftools page and it works :-)

        Great thanks!

        Edit: however, I realized that aligned fasta format would be better. The aim is to quickly generate
        friendly msa view for a given region. For example for primer design.

        Certainly, ordinary MSA programs don't create sufficiently similar alignment as this in bam file.
        So it require manual inspection.
        Last edited by floem7; 01-19-2015, 02:45 PM. Reason: adding info.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          How RNA-Seq is Transforming Cancer Studies
          by seqadmin



          Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
          09-07-2023, 11:15 PM
        • seqadmin
          Methods for Investigating the Transcriptome
          by seqadmin




          Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

          Whole Transcriptome RNA-seq
          Whole transcriptome sequencing...
          08-31-2023, 11:07 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 09-22-2023, 09:05 AM
        0 responses
        21 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-21-2023, 06:18 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-20-2023, 09:17 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-19-2023, 09:23 AM
        0 responses
        29 views
        0 likes
        Last Post seqadmin  
        Working...
        X