Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • raman91
    Junior Member
    • May 2016
    • 6

    No coverage or very low coverage in the Complete Genomics data

    Hi,

    Has anyone worked on Complete Genomics data? Most of the exons in the data have no coverage or very low coverage (5-10 reads only) even though the reads are sequenced at 40X coverage. Can anyone explain why is it so?

    Thanks

    Regards
  • Gorgon_VZ
    Member
    • Oct 2016
    • 10

    #2
    Hi! Is it really the exons only, that show reduced coverage and the rest of the genome shows 40x? Have you checked the mean of all exons or for one particular gene only? What kind of genome and data are you working with? In general I would say it's kind of normal to see differences in coverage over the genome, but I am not aware of a exon specific bias. However, if you look on a particular gene or even a gene-family, the coverage might be reduced because of homology regions and multimapped reads with low mapping quality that are filtered out. Maybe it is worth to have a look on the percentage of reads you are able to map against your reference. A high number of discarded sequences could be a hind of such an effect.

    Comment

    • raman91
      Junior Member
      • May 2016
      • 6

      #3
      I’m sorry, I didn’t explain the issue properly. I’m working on CG WGS data with avg read depth of 40x. The QC metrics looks good for all parameters and the alignment rate is 97.43%. However, I had used cgaTools to convert tsv files provided by CG to BAM. When I visualize these BAM files on IGV, I see minimal coverage at all exons and major parts of introns for all genes (See example image attached). The trend is normally scant coverage hills at junction of intron and exon. I believe something went wrong in my conversion step. Can anyone please suggest a solution for this?
      Or is there any other visualization tool specific for CG data that I should be using?”

      The IGV snapshots is attached herewith.

      Thanks in advance
      Attached Files
      Last edited by raman91; 01-09-2018, 01:48 AM.

      Comment

      • Gorgon_VZ
        Member
        • Oct 2016
        • 10

        #4
        Hi! It could be an issue due to differing genome builds. Ich guess igv is using hg19 and reads are mapped to hg38.

        Comment

        • Gorgon_VZ
          Member
          • Oct 2016
          • 10

          #5
          You should be able to Check this by zoom in to nucleotide Level in igv. If the reads Do Not Match the reference this would be a Hind. I am Not a 100% Sure but i believe igv only uses the coordinates and cigar of the bam and does Not care about matching nucleotides. So maybe it is just a slippage of coordinates between hg19 and 38.

          Comment

          • raman91
            Junior Member
            • May 2016
            • 6

            #6
            Thanks for your reply. I am pretty sure that the reads are mapped to hg19 build only. I checked the IGV browser too and the read sequence match to the reference sequence.

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              06-02-2026, 10:05 AM
            • SEQadmin2
              Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
              by SEQadmin2


              With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


              Introduction

              Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
              05-22-2026, 06:42 AM
            • SEQadmin2
              Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
              by SEQadmin2

              Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


              Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
              05-06-2026, 09:04 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, 06-02-2026, 12:03 PM
            0 responses
            19 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 11:40 AM
            0 responses
            14 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-28-2026, 11:40 AM
            0 responses
            29 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-26-2026, 10:12 AM
            0 responses
            31 views
            0 reactions
            Last Post SEQadmin2  
            Working...