Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nk
    Member
    • Apr 2012
    • 11

    Calculating r^2 in vcftools for only one SNP?

    I would like to check for a set of SNPs in a given region how strongly they correlate with one particular SNP.

    I know that vcftools has the --geno-r2 option which will give me this information, however this takes very long to compute because it actually calculates all possible r^2s between all SNPs. I only need it to compute it for one of them against all others. For example plink has the `--ld-snp` option which allows you to specify a single SNP to calculate against.

    Is there any way I can do this with vcftools?
  • TiborNagy
    Senior Member
    • Mar 2010
    • 329

    #2
    Yes, you can. Using --chr, --from-bp, --to_bp you can specify the region of interest.

    Comment

    • nk
      Member
      • Apr 2012
      • 11

      #3
      I know that, but this will still calculate the pairwise r^2 for all SNPs in this area. What I want is just the r^2 from one SNP in this region to all others.

      I just ended up making a tped file and running this though PLINK now, although this seems unnecessarily complicated.

      Comment

      • gringer
        David Eccles (gringer)
        • May 2011
        • 845

        #4
        I would expect that most of the ways to do this would be complicated because your use case strays a bit from the beaten path. I find it a little odd to talk about r^2 within a "given region", and be concerned about the time it takes to do a full pairwise calculation, because if it takes less than 2h to calculate it's probably a waste of time to look for other solutions. What is the region size? How many SNPs? How many individuals?

        Comment

        • nk
          Member
          • Apr 2012
          • 11

          #5
          Well, the idea is to use this in local Manhattan plots from a GWAS, looking at the r^2 between the most significantly associated SNP and other SNPs around it. As far as I am aware this is not too exotic of an application, but maybe I am wrong.

          And since I want to do this for many cases and more or less on demand 2h is quite a long time. For comparison, the PLINK method takes maybe a minute or so now.

          Comment

          • gringer
            David Eccles (gringer)
            • May 2011
            • 845

            #6
            And since I want to do this for many cases and more or less on demand 2h is quite a long time. For comparison, the PLINK method takes maybe a minute or so now.
            Ah, okay. In that case, write a script to do the conversion to TPED and run PLINK. Then while you're collecting money (or saving time) due to the use of your script, hunt around for more efficient solutions.

            I vaguely recall diagrams with the most significant SNP identified and r^2 values for surrounding SNPs, but unfortunately can't remember how it was done.

            Comment

            • chrchang
              Member
              • Jun 2013
              • 15

              #7
              Do you need to account for phase in your r^2 calculation? If not, you can just use PLINK 1.9's VCF import function:

              plink --vcf [vcf filename] --out [new fileset prefix]

              Then you can use --r2 as usual.

              r^2 with the entire rest of the genome:
              plink --bfile [plink fileset prefix] --r2 --inter-chr --ld-snp [snp id]

              r^2 with the rest of the chromosome:
              plink --bfile [plink fileset prefix] --r2 --inter-chr --ld-snp [snp id] --chr [chromosome number]

              r^2 with a limited window:
              plink --bfile [plink fileset prefix] --r2 --ld-snp [snp id] --ld-window [max snps + 1] --ld-window-kb [max kbs]

              Comment

              • gringer
                David Eccles (gringer)
                • May 2011
                • 845

                #8
                Oh wow, PLINK continues on at BGI. I'll have to spread the word to my colleagues (who use PLINK a lot)....

                Comment

                • jimineep
                  Member
                  • Sep 2011
                  • 10

                  #9
                  Did anyone find a solution for this using vcftools? I ask because vcftools allows you to use the hap-r2 option which takes phase into account when calculating R2, as well as the geno-r2 - so with vcftools this would be more flexible.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Pathogen Surveillance with Advanced Genomic Tools
                    by seqadmin




                    The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                    03-24-2025, 11:48 AM
                  • seqadmin
                    New Genomics Tools and Methods Shared at AGBT 2025
                    by seqadmin


                    This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                    The Headliner
                    The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                    03-03-2025, 01:39 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 03-20-2025, 05:03 AM
                  0 responses
                  49 views
                  0 reactions
                  Last Post seqadmin  
                  Started by seqadmin, 03-19-2025, 07:27 AM
                  0 responses
                  57 views
                  0 reactions
                  Last Post seqadmin  
                  Started by seqadmin, 03-18-2025, 12:50 PM
                  0 responses
                  49 views
                  0 reactions
                  Last Post seqadmin  
                  Started by seqadmin, 03-03-2025, 01:15 PM
                  0 responses
                  200 views
                  0 reactions
                  Last Post seqadmin  
                  Working...