Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Counting read depth using bedtools

    I have sequenced a cell line exposed to UV and would like to know if any genes have been deleted compared to the ancestor. I extracted the CDS regions from the annotation file to annotation.bed and ran coverageBed in order to find the read depth at any given exon

    coverageBed -abam DXB11.bam -b annotation.bed > depth.txt

    the output for a particular domain was
    NW_003614442.1 464809 465646 158 837 837 1.0000000
    So 100% of the region 464809-465646 had a depth of 158 and that entire region of 837 bp was that depth, correct?

    That that is very high as the theoretical depth should be 35. So i looked into the depth at every position of the genome
    /BEDTools-Version-2.16.2/genomeCoverageBed -d -ibam DXB11.bam > DXB11.coverage

    and looked into the same region (464809-465646) and done this way it had a median depth of 18 = much more realistic.
    Are you able to see what i did wrong or maybe advice me another way of more easily getting to a median depth of each exon in the genome from a bam file?

  • #2
    Hi Kaas,
    I think 158 is the number of features that overlapped the interval, not necessarily fold coverage. I would recommend that you read The bedtools manual.

    Default Output:
    After each entry in B, reports:
    1) The number of features in A that overlapped the B interval.
    2) The number of bases in B that had non-zero coverage.
    3) The length of the entry in B.
    4) The fraction of bases in B that had non-zero coverage.
    You may want to try -d or -hist options.

    Hope this helps.
    Last edited by rnaeye; 02-21-2014, 08:00 AM. Reason: additional information

    Comment


    • #3
      Hi rnaeye

      Thank you for you answer. I tried going through the description for genomeCoverageBed (http://bedtools.readthedocs.org/en/l.../coverage.html) and for genomecov (http://bedtools.readthedocs.org/en/l...genomecov.html) but had a hard time translating their bioinformatic terms into what conclusion i can make from my own data based on the results.

      The number of features in A that overlapped the B interval = number of reads that are identified in the exon region i specify. But then you would expect at least some kind of correlation between the length of a given region and the depth coverageBed gives, right? because i do not see any correlation. That is the reason why I find this a bit fishy.

      ok, i will use -d and extract the median from there

      Comment


      • #4
        Hi,
        Try to google following search them "The BEDTools manual PDF"
        You can download a PDF version of user manual. I think it explains better. I guess you should calculate coverage per base and conclude it from there. have fun, best.
        Last edited by rnaeye; 02-21-2014, 10:30 AM.

        Comment


        • #5
          read the help of bedtools coverage
          i think you can use -hist or -d option

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Best Practices for Single-Cell Sequencing Analysis
            by seqadmin



            While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
            06-06-2024, 07:15 AM
          • seqadmin
            Latest Developments in Precision Medicine
            by seqadmin



            Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

            Somatic Genomics
            “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
            05-24-2024, 01:16 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Today, 07:23 AM
          0 responses
          8 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 06-17-2024, 06:54 AM
          0 responses
          11 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 06-14-2024, 07:24 AM
          0 responses
          24 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 06-13-2024, 08:58 AM
          0 responses
          17 views
          0 likes
          Last Post seqadmin  
          Working...
          X