Samtools/vcftools related question


  • Samtools/vcftools related question

    Hello all,

    Can anybody help me with a script to parse a vcftools output file? I have a BED file of start and end coordinates, and I need the depth of each region from a BAM file. To get it, I ran samtools mpileup, bcftools and vcftools. The vcftools output reports depth at single-nucleotide resolution. Per-base depth is useful, but for my analysis it would be more appropriate to know the total, maximum and average number of reads aligned to each BED region.

    My BED file:
    chr_1 5325 5500
    chr_1 10909 11000
    chr_1 12300 12598
    .......
    For the first BED region, the vcftools output gives the depth per base as shown below, and it produces the same for the other BED regions (it is a very large file):
    chr_1 5325 1
    chr_1 5326 1
    chr_1 5327 2
    chr_1 5328 2
    chr_1 5329 3
    .....
    chr_1 5499 1
    chr_1 5500 1

    I would appreciate it if anybody could help me with a script that outputs the total, maximum and average number of reads aligned to each BED region.

    Thanks!!

  • #2
    Don't reinvent the wheel: try something like GATK https://www.broadinstitute.org/gatk/...OfCoverage.php
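    If a small standalone script is still wanted, the per-region summary asked for in the first post can be sketched in Python. This is only a minimal sketch under several assumptions, not a tested pipeline: the function names `read_regions` and `summarize_depth` are made up for illustration; the depth file is assumed to be whitespace-separated `chrom pos depth` lines as in the example; regions are assumed not to overlap; and both region ends are treated as inclusive because the example lists positions 5325 through 5500 (standard BED is 0-based and half-open, so adjust if your file follows that convention). Note that "total" here is the sum of per-base depths, which is not the same as a count of distinct reads, and the average counts positions missing from the depth file as zero.

```python
import bisect
from collections import defaultdict


def read_regions(lines):
    """Parse 'chrom start end' lines into sorted (start, end) lists per chromosome."""
    regions = defaultdict(list)
    for line in lines:
        fields = line.split()
        # Skip blank, truncated, or header lines.
        if len(fields) < 3 or not (fields[1].isdigit() and fields[2].isdigit()):
            continue
        regions[fields[0]].append((int(fields[1]), int(fields[2])))
    for chrom in regions:
        regions[chrom].sort()
    return regions


def summarize_depth(region_lines, depth_lines):
    """Return (chrom, start, end, total, maximum, average) for each region.

    Assumes non-overlapping regions with inclusive start and end positions.
    """
    regions = read_regions(region_lines)
    starts = {chrom: [s for s, _ in ivs] for chrom, ivs in regions.items()}
    # (chrom, start, end) -> [sum of depths, maximum depth]
    stats = {(chrom, s, e): [0, 0] for chrom, ivs in regions.items() for s, e in ivs}

    # Stream the depth file once, so a very large file is never held in memory.
    for line in depth_lines:
        fields = line.split()
        if len(fields) < 3 or not (fields[1].isdigit() and fields[2].isdigit()):
            continue
        chrom, pos, depth = fields[0], int(fields[1]), int(fields[2])
        if chrom not in regions:
            continue
        # Find the last region starting at or before this position.
        i = bisect.bisect_right(starts[chrom], pos) - 1
        if i < 0:
            continue
        start, end = regions[chrom][i]
        if pos <= end:
            entry = stats[(chrom, start, end)]
            entry[0] += depth
            entry[1] = max(entry[1], depth)

    results = []
    for (chrom, start, end), (total, peak) in stats.items():
        length = end - start + 1  # inclusive ends
        results.append((chrom, start, end, total, peak, total / length))
    return results
```

    Both arguments can be open file handles, for example `summarize_depth(open("regions.bed"), open("depth.txt"))` with placeholder file names, and each returned tuple can be printed as one tab-separated line per region.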

    • #3
      Thanks lindenb
