Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to fix fastq files from a bacteria sample?

    I have a pair of PE read files generated from a Truseq library. It has 33M pairs of 150bp reads. The bacteria is S auerus which has a genome size of 2.8Gbp. My goal is to call protein coding variants.

    I am getting Per Base Sequence Quality, Sequence Duplication Level and Kmer Content errors from fastqc.

    Does this look like it is only because of over-sequencing such that I don't need to do anything and let picard remove duplicates for me after mapping? If not, can you tell me what else I need to do to fix up the fastq.

    Thanks a lot in advance!
    Attached Files

  • #2
    I would say go ahead with the analysis and then see if you have a real problem once you have alignment results.

    Comment


    • #3
      It looks like you have over 3000x coverage, so it's not surprising that some reads appear to be duplicates. You should not deduplicate them unless the library was amplified. But even then, at such a high coverage level, I don't think deduplication is wise because there will be so many true duplicates that occur just by random chance.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        New Genomics Tools and Methods Shared at AGBT 2025
        by seqadmin


        This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

        The Headliner
        The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
        03-03-2025, 01:39 PM
      • seqadmin
        Investigating the Gut Microbiome Through Diet and Spatial Biology
        by seqadmin




        The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
        02-24-2025, 06:31 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 12:50 PM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-03-2025, 01:15 PM
      0 responses
      181 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-28-2025, 12:58 PM
      0 responses
      276 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-24-2025, 02:48 PM
      0 responses
      663 views
      0 likes
      Last Post seqadmin  
      Working...
      X