Header Leaderboard Ad

Collapse

filtering variants with R

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • filtering variants with R

    Hi,

    I'm learning data analysis with R, using RStudio.
    I have my data in one file. I have variants from 3 tissue types, from 130 individuals. I would like to filter the variants within each individual, so that I can get: 1. unique variants for all 3 tissues, 2. variants common to all tissues, 3. variants common between 2 tissue types.
    The file that I'm working on has 118 columns (vcf annotated file), I would like to keep all columns.
    I've tried VennDiagram package, but it filters out variants that are present in any other individual, so in the overlapping files I loose some variants.
    I understand that I would need to use loop filtering, so that 1. I pick individuals by "ID" column, then sort variants by "variant" column and compare them between "tissue" column to output into all filtered types of variants (unique and common).

    Does anyone know how to do that???

    Cheers,
    A.

  • #2
    To find overlapping variants between two files, you can use bcftools. If you want to generate a simple venn diagram, you can provide lists to http://bioinformatics.psb.ugent.be/webtools/Venn

    Comment

    Latest Articles

    Collapse

    • seqadmin
      A Brief Overview and Common Challenges in Single-cell Sequencing Analysis
      by seqadmin


      ​​​​​​The introduction of single-cell sequencing has advanced the ability to study cell-to-cell heterogeneity. Its use has improved our understanding of somatic mutations1, cell lineages2, cellular diversity and regulation3, and development in multicellular organisms4. Single-cell sequencing encompasses hundreds of techniques with different approaches to studying the genomes, transcriptomes, epigenomes, and other omics of individual cells. The analysis of single-cell sequencing data i...

      01-24-2023, 01:19 PM
    • seqadmin
      Introduction to Single-Cell Sequencing
      by seqadmin
      Single-cell sequencing is a technique used to investigate the genome, transcriptome, epigenome, and other omics of individual cells using high-throughput sequencing. This technology has provided many scientific breakthroughs and continues to be applied across many fields, including microbiology, oncology, immunology, neurobiology, precision medicine, and stem cell research.

      The advancement of single-cell sequencing began in 2009 when Tang et al. investigated the single-cell transcriptomes
      ...
      01-09-2023, 03:10 PM

    ad_right_rmr

    Collapse
    Working...
    X