Header Leaderboard Ad

Collapse

Spurious dosage coefficient of determination in imputed VCF file

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Spurious dosage coefficient of determination in imputed VCF file

    I have a VCF file that has dosage r^2 in the info field. The problem is that while the r^2 value should be in the 0 to 1 range, it has both negative values and values above 1.

    Is there a fundamental problem with my data? I might add that this is whole-exome data where the off-target regions have been imputed using Beagle.

    I have pasted some data collected from VCFtools, just to give an example. As you can see there are huge numbers (positive and negative), and a lot of zeros.

    Dosage r^2 example:
    CHROM POS REF ALT DR2
    1 10177 A AC 0
    1 10235 T TA 0
    1 10352 T TA 0
    1 10616 CCGCCGTTGCAAAGGCGCGCCG C 0.01
    1 10642 G A 0
    1 11008 C G 0.01
    1 11012 C G 0.01
    1 11063 T G 0


    More dosage r^2 examples:
    One allele with dr2=0, one with a huge number:
    1 66381 TATATA AATATA,T 0,5.10663e+28
    One with high correlation, another with a huge (negative) number:
    1 769829 C A,G 0.82,-7.97911e+26
    Also really tiny numbers, which is plausible, but suspicious:
    1 15274 A G,T 0,3.66383e-14

Latest Articles

Collapse

  • seqadmin
    How RNA-Seq is Transforming Cancer Studies
    by seqadmin



    Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
    09-07-2023, 11:15 PM
  • seqadmin
    Methods for Investigating the Transcriptome
    by seqadmin




    Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

    Whole Transcriptome RNA-seq
    Whole transcriptome sequencing...
    08-31-2023, 11:07 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 09:05 AM
0 responses
12 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-21-2023, 06:18 AM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-20-2023, 09:17 AM
0 responses
12 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-19-2023, 09:23 AM
0 responses
26 views
0 likes
Last Post seqadmin  
Working...
X