Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Dindel is seeing too many reads, potential bug?

    Hey all,

    I have just figured out how to download and use dindel and I am trying to compare it to samtools mpileup. Most of the calls are the same. However, when looking at a dindel call in IGV, I noticed that there is only 1 read covering that position. Yet, dindel gave the following output line in the VCF file:

    Code:
    chr19   11243209        .       c       cG      128     PASS    DP=12;NF=0;NR=4;NRS=3;NFS=1;HP=1        GT:GQ   1/1:12
    When looking at the depthofcoverage file, it states that there is only 1 read. Yet, dindel sees 4 reads. Does anybody have any idea why this could happen? I've attached a screenshot of this position in IGV.
    Attached Files

  • #2
    I had the same doubt. After checking the intermediate results of Dindel, I realized that DP value in the VCF is the number of reads that cover the window that is processed by Dindel.

    For example, in my case the VCF contains an indel in the position 6680:

    Code:
    #CHROM	POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	SAMPLE
    chromosome_II	6680	.	TATA	T	118	PASS	DP=230;NF=0;NR=3;NRS=13;NFS=23;HP=1	GT:GQ	0/1:118
    The "depth" is:
    • DP=230 according to Dindel (INFO column in the VCF file), but
    • depth=45 according to IGV (in fact, according to the pileup file).


    The information displayed by the "step 3" of Dindel showed this:

    Code:
    (...)
    ****
     tid: chromosome_II [B]pos: 6681 leftPos: 6620  rightPos: 6742[/B]
    Fetching reads....
    [B]Number of reads: 230[/B] out of 77463 # unmapped reads: 0 numReadsUnknownLib: 0 numChrMismatch: 0 numMappedWithoutMate: 2 numUnmappedWithoutMate: 0
    candidate_var@pos: 6681 6680,-ATA
    aligned_var@pos 6681 6656 A=>G
    aligned_var@pos 6681 6657 T=>A
    aligned_var@pos 6681 6680 -ATA
    [empiricalDistributionMethod] Number of haplotypes: 8
    Filtered 0 haplotypes.
    ll_ref: -1085.49 max_ll_indel: -1058.3 qual: 118.099
    (...)
    My interpretation is that DP is the number of reads covering the positions (window) 6620-6742, and not only those covering the "starting" point of the indel (6680).

    Hope it is useful.

    Comment


    • #3
      More simple!

      My suspicion was correct, but there is a more straightforward way to confirm it.

      From the header of the VCF file produced by Dindel:

      Code:
      ##INFO=<ID=[B]DP[/B],Number=1,Type=Integer,Description="[B]Total number of reads in haplotype window[/B]">

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Latest Developments in Precision Medicine
        by seqadmin



        Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

        Somatic Genomics
        “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
        Yesterday, 01:16 PM
      • seqadmin
        Recent Advances in Sequencing Analysis Tools
        by seqadmin


        The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
        05-06-2024, 07:48 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 07:15 AM
      0 responses
      13 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 05-23-2024, 10:28 AM
      0 responses
      17 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 05-23-2024, 07:35 AM
      0 responses
      20 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 05-22-2024, 02:06 PM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Working...
      X