  • interpretation of FASTQC Overrepresented Kmers

    I am analyzing the results of our 91-base single-end Illumina sequencing with FASTQC and have attached two .png images: one of the graph, the other of the graph plus the list of kmers. The sample, part of a ChIP-Seq experiment, is 'input' DNA.

    I have a couple of questions with regard to interpretation.

    First, a large portion of the listed kmers are part of the adaptor:
    GATCGGAAGAGCTCGTATG. All kmers at positions 1-9 are part of the adaptor, but they cover only its first 13 bases. Kmers covering bases 14-19 of the adaptor are also listed, but their positions are given as 85-86! It looks as if sequence was inserted inside the adaptor. Does anyone have an explanation for this?

    Second, in the graph of relative enrichment vs. position in read, there is a gradual rise in the 6 listed kmers from positions 30-34 to about position 70. Five of these 6 kmers are part of the adaptor, as can be discerned from the leftmost part of the graph at positions 1-5. What does it mean that the relative enrichment of these kmers rises from position 30 to position 70?

  • #2
    Hey,

    we've seen a similar rise of adaptor kmers towards the ends of the sequences. We haven't done any formal analysis, but since we had paired-end sequences, we were able to align the paired reads against each other, and it seems that many reads containing adaptor kmers originate from DNA fragments that are shorter than the read length. When this happens, sequencing first proceeds through the original DNA fragment and then continues into the adaptor sequence located immediately after it.
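
    The short-fragment explanation above can be illustrated with a toy sketch. This is not how FASTQC computes its kmer statistics; it only shows where adaptor kmers would land in a read if the fragment is shorter than the read length. The adaptor and read length are taken from the first post; the function names and the poly-A placeholder fragments are made up for illustration.

```python
ADAPTER = "GATCGGAAGAGCTCGTATG"  # adaptor sequence quoted in the first post
READ_LEN = 91                    # read length quoted in the first post


def simulate_read(fragment, adapter=ADAPTER, read_len=READ_LEN):
    """Toy model: the read is the fragment followed by adaptor read-through,
    cut off at read_len. Real reads would continue past the adaptor."""
    return (fragment + adapter)[:read_len]


def kmer_positions(read, kmer):
    """Return the 1-based start positions of every occurrence of kmer in read."""
    positions = []
    start = read.find(kmer)
    while start != -1:
        positions.append(start + 1)
        start = read.find(kmer, start + 1)
    return positions


# Placeholder fragments (poly-A) of different lengths, all shorter than the read.
for frag_len in (40, 60, 80):
    read = simulate_read("A" * frag_len)
    # The first adaptor 5-mer starts right after the fragment ends.
    print(frag_len, kmer_positions(read, ADAPTER[:5]))
```

    In this model the adaptor kmers start one base after the fragment ends, so a population of short fragments with varying lengths would spread adaptor kmers across the later read positions. With an 80-base fragment the last bases of the adaptor fall beyond position 91 and never appear in the read.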

    We have also seen odd patterns in the nucleotide distribution over the first few bases (fewer than 8 bp) at the 5' end of the reads, but have no idea where they come from. If you find out, let me know.
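
    For anyone who wants to look at the 5'-end composition outside of FASTQC, here is a minimal sketch that tabulates base fractions at the first few read positions. It assumes the reads are already loaded as a list of plain strings; the function name and the default of 8 positions are illustrative choices, not anything FASTQC defines.

```python
from collections import Counter


def per_position_composition(reads, n_positions=8):
    """Return, for each of the first n_positions read positions,
    a dict mapping each base (A, C, G, T, N) to its fraction at that position."""
    counts = [Counter() for _ in range(n_positions)]
    for read in reads:
        for i, base in enumerate(read[:n_positions]):
            counts[i][base] += 1

    composition = []
    for position_counts in counts:
        total = sum(position_counts.values())
        composition.append(
            {b: (position_counts[b] / total if total else 0.0) for b in "ACGTN"}
        )
    return composition


# Tiny made-up example: two 4-base reads.
for pos, fractions in enumerate(per_position_composition(["ACGT", "AAGT"], 4), start=1):
    print(pos, fractions)
```

    Comparing these fractions for the first 8 positions against positions further into the read would show whether the bias is confined to the 5' end.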
