Header Leaderboard Ad

Collapse

Tool(s) for calculating depth and genome coverage

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tool(s) for calculating depth and genome coverage

    I am looking for a tool that can take my BAM files and spit out the following information:

    1. Mean depth across the entire genome
    2. Percent of genome covered at a specified depth, or preferably multiple specified depths (i.e. 10X, 20X, 30X etc).

    I'm not a great command line jockey, so a tool that can be installed in Galaxy would be great, but if not, then I can muddle my way through it on the command line.

    Thanks!

  • #2
    http://seqanswers.com/forums/showthread.php?t=17253

    Comment


    • #3
      I've got Bedtools Genome Coverage working in Galaxy now, but the output is confusing. It seems to pick randomly-sized intervals and report a coverage value across that interval. Example output below:

      chr 3 6 2
      chr 6 13 3
      chr 13 14 5
      chr 14 16 6
      chr 16 25 8
      chr 25 27 10
      chr 27 29 12
      chr 29 32 15
      chr 32 47 16
      chr 47 60 18
      chr 60 72 19
      chr 72 84 21

      Really, all I want is a single number: mean depth across the entire genome. Is there a simple way to get this number using this tool?

      Comment


      • #4
        There are multiple solutions here but you will need to leave the confines of galaxy and work on the command line: https://www.biostars.org/p/5165/

        Comment


        • #5
          You can calculate it yourself quite easily, if you have fixed-length reads - (#mapped reads)*(read length)/(genome size).

          Comment


          • #6
            Originally posted by Brian Bushnell View Post
            You can calculate it yourself quite easily, if you have fixed-length reads - (#mapped reads)*(read length)/(genome size).
            Not fixed length, unfortunately. They've been adapter and quality trimmed, although your method would still give me an estimate.

            I actually found from reading another thread on here that the easiest way for me was to just load the BAMs into Qualimap and look at the summary page, as Qualimap calculates that value as part of its analysis.

            Comment


            • #7
              Originally posted by GenoMax View Post
              There are multiple solutions here but you will need to leave the confines of galaxy and work on the command line: https://www.biostars.org/p/5165/
              I found this same thread on Biostars. I was going to try out some of the suggestions there when I read the very last entry about Qualimap. Since I needed these numbers fairly quickly, I just used that, but I'm going to try out come of the command line options for future use, because Qualimap does do batches very well.

              Comment


              • #8
                GATK DepthOfCoverage is command line but very easy to use and well documented on their site. Beyond that I normally use custom scripts based on a pile up. All of these options will require a lot of RAM for a whole genome, so if you can use an interval list/bed file that'll makes things much easier/faster.
                LM

                Comment


                • #9
                  Bedtools Genomecoveragebed Usage : How To do per base coverage analysis for only chr2

                  Hey for a specific chromosome number how to do coverage analysis for input bam file using bedtools and GATK tools

                  Thanks

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
                    by seqadmin



                    Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
                    03-21-2023, 01:49 PM
                  • seqadmin
                    Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
                    by seqadmin




                    Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
                    03-10-2023, 05:31 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Today, 11:44 AM
                  0 responses
                  8 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 03-24-2023, 02:45 PM
                  0 responses
                  18 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 03-22-2023, 12:26 PM
                  0 responses
                  18 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 03-17-2023, 12:32 PM
                  0 responses
                  18 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X