  • Filtering SNPs on identical genotype and position

    Hello everyone,

    I'm looking for software that is able to filter out SNPs which:
    - have an identical genotype across all samples
    - and are within a certain distance of each other (e.g. 100 bp)
    Preferably it should work on VCF files, but other suggestions are fine too.

    It shouldn’t be too hard to script this myself, but it would be great if someone knows an available tool for this.
    Thanks!

  • #2
    Have you looked at a tool like bedtools intersect (formerly intersectBed)? Or are you trying to filter out only a particular allele for a given variant? For example, if there's a SNP that can be either A, G, or T, are you trying to remove that variant from your VCF altogether, or do you just want to remove anything that's been called as an A?


    • #3
      Hi Jessica,

      Thanks for the reply.
      Bedtools intersect is not what I'm looking for. Let me clarify.

      I have a list of SNP positions and genotype data, e.g:

      Scaffold | position | sample 1 | sample 2 | sample 3
      1 | 100 | 0 | 1 | 2
      1 | 150 | 0 | 1 | 2
      1 | 175 | 0 | 1 | 1
      1 | 300 | 0 | 1 | 2

      The entire line at position 150 has to be removed, because it has the same genotypes as position 100 and lies within 100 bp of it.
      Position 175 should be kept, because it has a different genotype in sample 3 compared to position 100.
      Position 300 should also be kept. Although it has the same genotypes as position 100, it is more than 100 bp downstream of position 100.
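
The rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not an existing tool: the function name `filter_nearby_duplicates` and the tuple layout are made up for the example, and each SNP is compared against the most recent *kept* SNP with the same genotypes on the same scaffold (the post does not say whether removed SNPs should also count as reference points).

```python
from typing import Dict, List, Tuple

# Hypothetical record layout: (scaffold, position, genotype per sample)
Snp = Tuple[str, int, Tuple[int, ...]]


def filter_nearby_duplicates(snps: List[Snp], max_dist: int = 100) -> List[Snp]:
    """Drop a SNP when a kept SNP on the same scaffold, at most
    max_dist bp upstream, has identical genotypes in every sample."""
    kept: List[Snp] = []
    # (scaffold, genotypes) -> position of the most recent kept SNP
    last_kept_pos: Dict[Tuple[str, Tuple[int, ...]], int] = {}
    for scaffold, pos, genotypes in sorted(snps, key=lambda s: (s[0], s[1])):
        key = (scaffold, genotypes)
        previous = last_kept_pos.get(key)
        if previous is not None and pos - previous <= max_dist:
            continue  # same genotypes within range: remove this line
        kept.append((scaffold, pos, genotypes))
        last_kept_pos[key] = pos
    return kept


# The four example lines from the table above
example = [
    ("1", 100, (0, 1, 2)),
    ("1", 150, (0, 1, 2)),
    ("1", 175, (0, 1, 1)),
    ("1", 300, (0, 1, 2)),
]
kept_positions = [pos for _, pos, _ in filter_nearby_duplicates(example)]
print(kept_positions)  # [100, 175, 300]
```

With this choice, a run of identical SNPs at 100, 150 and 250 would keep 100 and 250, because 250 is measured against the last kept position (100) rather than the removed one (150). If removed SNPs should also act as reference points, the dictionary would need updating on every line instead of only on kept ones.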


      • #4
        Hello,

        Did you ever find a good tool to filter out SNPs with identical genotypes across your samples?

        I am looking to do the same: filter out sites that are invariant across the samples in a single VCF file. It sounds easy, but I haven't found a good tool to do so.

        Are there any suggestions?

        Thank you
        Last edited by htetre; 07-03-2017, 05:22 AM.
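
For the invariant-site case, one possible approach is a short Python filter over the VCF text. This is a minimal sketch under assumptions, not a recommendation of a specific tool: the function names are hypothetical, GT is taken to be the first FORMAT sub-field (as the VCF specification requires when GT is present), phased and unphased calls are treated alike, and missing genotypes are ignored when deciding whether a site varies.

```python
from typing import Iterable, Iterator


def genotype_of(sample_field: str) -> str:
    """Return the GT sub-field with phasing normalised, e.g. '0|1' -> '0/1'."""
    return sample_field.split(":")[0].replace("|", "/")


def drop_invariant_sites(lines: Iterable[str]) -> Iterator[str]:
    """Yield header lines plus records whose samples show more than one genotype."""
    for line in lines:
        if line.startswith("#"):
            yield line  # keep meta-information and the column header
            continue
        fields = line.rstrip("\n").split("\t")
        # Sample columns start at the tenth column of a VCF record
        genotypes = {genotype_of(f) for f in fields[9:]}
        genotypes.discard("./.")  # assumption: missing calls are ignored
        genotypes.discard(".")
        if len(genotypes) > 1:
            yield line


# Small made-up VCF: the site at 150 is invariant (all samples 1/1)
demo = [
    "##fileformat=VCFv4.2\n",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\ts1\ts2\ts3\n",
    "1\t100\t.\tA\tG\t.\t.\t.\tGT\t0/0\t0/1\t1/1\n",
    "1\t150\t.\tC\tT\t.\t.\t.\tGT\t1/1\t1/1\t1|1\n",
]
result = list(drop_invariant_sites(demo))
print("".join(result), end="")
```

Because the function works on any iterable of lines, an open file handle can be passed in directly. Note that `0/1` and `1/0` are compared as different strings here; sorting the alleles inside `genotype_of` would be needed if both spellings occur in the file.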
