Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ron128
    Member
    • Sep 2011
    • 38

    Coverage Calculation for Whole Genome Sequencing on GA II X

    Hello Everybody. How does one calculate the coverage on a paired end Illumina GA II X run? My fast qc statistics for the reads are as follows:

    Foward reads: 1,95,71,148. No of Bases: 1,05,68,41,992 file size: 3.9 GB

    Reverse reads: 1,95,71,148 No of Bases: 1,05,68,41,992 3.9 GB


    This is a mouse genome we have sequenced. The size of the mouse genome in bases is: 2,7,1,69,65,481 which corresponds to 2.5 GB.

    Do i follow the simple calculation of C = LN / G
    • C stands for coverage
    • G is the haploid genome length
    • L is the read length
    • N is the number of reads

    Doing this results in a measly 0.77 X coverage which is too low , in which case it is unacceptable for the experiment. am i calculating this correctly?

    I cannot map my reads to a reference mouse genome to use coveragebed or any other function/tool, because of the nature of my experiment which wont allow a large no. of reads to map. So finding the uniquely mapped reads is out of question. I also cannot do a de novo assembly because of computer memory constraints. Any thoughts on this are appreciated. Thanks in advance!
  • vivek_
    PhD Student
    • Jul 2012
    • 164

    #2
    That file size sounds about right for exome sequencing but it is likely very low for WGS but you did mention that your experiment is different to regular sequencing projects that are done for variant calling.

    It might be inaccurate to calculate coverage on the whole genome size if your project is something like variant validation.

    If you could specify more about the goal of the project someone can make a better guess.

    Comment

    • aggp11
      Member
      • Jun 2011
      • 87

      #3
      Like Vivek said, these numbers are definitely comparable to an Exome seq experiment and thus very small for a regular WGS experiment for variant calling which expects around 30X coverage of the 3GB human genome. It will be a good idea for you to provide some more details about what is that you are expecting to get out of this experiment.

      Comment

      • ron128
        Member
        • Sep 2011
        • 38

        #4
        Thanks a ton guys for a quick reply!

        One thing i would like to point out like i posted in my earlier thread, which everybody seems to have missed out is that this is a WGS of a MOUSE GENOME and not a HUMAN GENOME.

        This is basically a transfection experiment wherein we have transfected human oncogenes into a NIH 3T3 cell line, which is established from Murien fibroblast cells. The whole genome for this NIH 3t3 cell line was sequenced. I was not involved in the actual sequencing and joined in at a much later stage. I have a feeling that The sequencing company where we got the cell lines sequenced has taken us for a ride, and not given us quality data, seeing as to how you seem to agree with me that this is characteristic of an whole exome and not a whole genome. I do not think that the nature of my experiment will actually affect the sequencing, as in effect we have done a WGS of a cell line.
        Last edited by ron128; 01-10-2013, 12:07 AM.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Pathogen Surveillance with Advanced Genomic Tools
          by seqadmin




          The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
          03-24-2025, 11:48 AM
        • seqadmin
          New Genomics Tools and Methods Shared at AGBT 2025
          by seqadmin


          This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

          The Headliner
          The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
          03-03-2025, 01:39 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 03-20-2025, 05:03 AM
        0 responses
        49 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-19-2025, 07:27 AM
        0 responses
        57 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-18-2025, 12:50 PM
        0 responses
        50 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-03-2025, 01:15 PM
        0 responses
        200 views
        0 reactions
        Last Post seqadmin  
        Working...