Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • slice Bam

    I have a bam file and i wish to exclude all the reads that are part of known genes. i got the exon list + Coordinates from ucsc as a .gtf file, but i'm not sure what's next... should I use samtools? if so, what options?

    Thank you very much in advance,
    Lilach

  • #2
    You might use htseq-count and use its option to write to SAM format. It then adds a flag indicating if there was a gene assignment or not. You can then simply use grep to extract the "no_feature" (or whatever that's called) reads. There are faster ways, but this is likely the simplest.

    Comment


    • #3
      Originally posted by LilachNoy View Post
      I have a bam file and i wish to exclude all the reads that are part of known genes. i got the exon list + Coordinates from ucsc as a .gtf file, but i'm not sure what's next... should I use samtools? if so, what options?

      Thank you very much in advance,
      Lilach
      This might be a solution (not tested) using bedtools and samtools.

      First, create a bed file of intergenic regions, then use samtools to extract reads in the bed intervals.

      Code:
      bedtools complement -i genes.gtf -g genome.txt > intergenic_regions.bed
      samtools view aln.bam -L intergenic_regions.bed > intergenic_reads.sam
      genome.txt is a tab-separated file of chromosome sizes which you can get from UCSC. See help in bedtools complement.

      Hope this helps
      Dario

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Recent Developments in Metagenomics
        by seqadmin





        Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
        09-23-2024, 06:35 AM
      • seqadmin
        Understanding Genetic Influence on Infectious Disease
        by seqadmin




        During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

        Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
        09-09-2024, 10:59 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 10-02-2024, 04:51 AM
      0 responses
      13 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 10-01-2024, 07:10 AM
      0 responses
      22 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-30-2024, 08:33 AM
      0 responses
      26 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-26-2024, 12:57 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Working...
      X