Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TruSeq Adaptors reported by FastQC are true adaptors?

    I asked a question about using fastx_clipper to get rid of the adaptor sequence last time and it did not work.

    I googled TruSeq adaptor sequences (http://www.omicsoft.com/downloads/ng...on_list/v1.txt) and compared them with our reported TruSeq Adaptors. Because our sequences have length between 49-52, all reported adaptor have length 50. I choose the first 50 nt from TruSeq adaptor to compare with ours, the results are as follows:
    4 reported TrueSeq Adaptors are exactly the same as the first 50nt of TruSeq adaptor;
    3 reported TrueSeq Adaptors have almost the same sequence as the first 50 nt except the 42nd nt, they all replace A by C;
    2 reported TrueSeq Adaptors, their 2nd-50 nt are the same as TruSeq Adaptor 1-49nt

    I decided to try to align my reads without get rid of adaptor sequence by novoalign (using default setting) to see whether those sequences reported as TruSeq Adaptors are in the result of alignment. Unfortunately, they are. But if I specify the reported adaptor sequence as adaptor in novoalign, these sequence will be removed.

    So my questions now is whether those sequences reported as TruSeq Adaptor by FastQC are true Adaptor sequences or not?

  • #2
    If Ii remind correctly fastQC allows for some mismatches , you may also wish to compare identified adapters to fastQC database contaminants within fastQC folder
    Pbseq

    Comment


    • #3
      FastQC allows some flexibility in its matches, it also doesn't require a match to exist over the whole length of the sequence. The summary of the match will tell you how good a match it actually found.

      Many of the illumina adapters are very similar to each other, differing by only a few bases so FastQC often finds a multitude of possible hits, so it just picks the first of the best set of hits to report.

      Given that the program only does these searches for sequences which occur at very high levels in a library it's pretty unusual to get a complete false positive for the presence of an adapter sequence, although the identification of the exact adapter used may well not be correct.

      Comment


      • #4
        Thank you very much, Simon. That makes more sense to me.

        I want to correct the observation I mentioned earlier. I double checked manuscript of novoalign, the reported adaptor sequences are not aligned to any region, I am assuming they are true adaptors.


        Originally posted by simonandrews View Post
        FastQC allows some flexibility in its matches, it also doesn't require a match to exist over the whole length of the sequence. The summary of the match will tell you how good a match it actually found.

        Many of the illumina adapters are very similar to each other, differing by only a few bases so FastQC often finds a multitude of possible hits, so it just picks the first of the best set of hits to report.

        Given that the program only does these searches for sequences which occur at very high levels in a library it's pretty unusual to get a complete false positive for the presence of an adapter sequence, although the identification of the exact adapter used may well not be correct.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          05-06-2024, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 05-14-2024, 07:03 AM
        0 responses
        17 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-10-2024, 06:35 AM
        0 responses
        40 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-09-2024, 02:46 PM
        0 responses
        50 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-07-2024, 06:57 AM
        0 responses
        41 views
        0 likes
        Last Post seqadmin  
        Working...
        X