Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TruSeq Adaptors reported by FastQC are true adaptors?

    I asked a question about using fastx_clipper to get rid of the adaptor sequence last time and it did not work.

    I googled TruSeq adaptor sequences (http://www.omicsoft.com/downloads/ng...on_list/v1.txt) and compared them with our reported TruSeq Adaptors. Because our sequences have length between 49-52, all reported adaptor have length 50. I choose the first 50 nt from TruSeq adaptor to compare with ours, the results are as follows:
    4 reported TrueSeq Adaptors are exactly the same as the first 50nt of TruSeq adaptor;
    3 reported TrueSeq Adaptors have almost the same sequence as the first 50 nt except the 42nd nt, they all replace A by C;
    2 reported TrueSeq Adaptors, their 2nd-50 nt are the same as TruSeq Adaptor 1-49nt

    I decided to try to align my reads without get rid of adaptor sequence by novoalign (using default setting) to see whether those sequences reported as TruSeq Adaptors are in the result of alignment. Unfortunately, they are. But if I specify the reported adaptor sequence as adaptor in novoalign, these sequence will be removed.

    So my questions now is whether those sequences reported as TruSeq Adaptor by FastQC are true Adaptor sequences or not?

  • #2
    If Ii remind correctly fastQC allows for some mismatches , you may also wish to compare identified adapters to fastQC database contaminants within fastQC folder
    Pbseq

    Comment


    • #3
      FastQC allows some flexibility in its matches, it also doesn't require a match to exist over the whole length of the sequence. The summary of the match will tell you how good a match it actually found.

      Many of the illumina adapters are very similar to each other, differing by only a few bases so FastQC often finds a multitude of possible hits, so it just picks the first of the best set of hits to report.

      Given that the program only does these searches for sequences which occur at very high levels in a library it's pretty unusual to get a complete false positive for the presence of an adapter sequence, although the identification of the exact adapter used may well not be correct.

      Comment


      • #4
        Thank you very much, Simon. That makes more sense to me.

        I want to correct the observation I mentioned earlier. I double checked manuscript of novoalign, the reported adaptor sequences are not aligned to any region, I am assuming they are true adaptors.


        Originally posted by simonandrews View Post
        FastQC allows some flexibility in its matches, it also doesn't require a match to exist over the whole length of the sequence. The summary of the match will tell you how good a match it actually found.

        Many of the illumina adapters are very similar to each other, differing by only a few bases so FastQC often finds a multitude of possible hits, so it just picks the first of the best set of hits to report.

        Given that the program only does these searches for sequences which occur at very high levels in a library it's pretty unusual to get a complete false positive for the presence of an adapter sequence, although the identification of the exact adapter used may well not be correct.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Genetic Variation in Immunogenetics and Antibody Diversity
          by seqadmin



          The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
          11-06-2024, 07:24 PM
        • seqadmin
          Choosing Between NGS and qPCR
          by seqadmin



          Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
          10-18-2024, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 11:09 AM
        0 responses
        22 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Today, 06:13 AM
        0 responses
        20 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 11-01-2024, 06:09 AM
        0 responses
        30 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-30-2024, 05:31 AM
        0 responses
        21 views
        0 likes
        Last Post seqadmin  
        Working...
        X