  • Adapter trimming with BBduk

    Hi, I am working with some MiSeq 16S and ITS2 amplicon sequence data generated by JGI. Previously I utilized the data after it was quality controlled and merged but now I am going back to the original raw interleaved files to learn how to do the initial steps myself. The first question I have is about adapter trimming. I am using BBduk and its adapters.fa reference file to trim adapters. This is a relatively simple (possibly silly) question, but in the example in the BBduk manual it trims the right (3' adapter) by specifying "ktrim=r", but no left trimming. Is there a reason trimming on the 5' end is not necessary or should it be done also (with "ktrim=l")?

    Additionally, it seems like most of the discussion is about trimming adapters and PCR primers. However, is there any need to look for artifacts associated with the forward and reverse primer pads?

    Thanks

  • #2
    Originally posted by PeatMaster
    This is a relatively simple (possibly silly) question, but in the example in the BBduk manual it trims the right (3' adapter) by specifying "ktrim=r", but no left trimming. Is there a reason trimming on the 5' end is not necessary or should it be done also (with "ktrim=l")?
    For good libraries (ones that use standard protocols, without inline barcodes at the beginning of reads, etc.), one expects contaminants (e.g. adapters) to show up at the end of a read. This is especially true if the insert turns out to be shorter than expected (i.e. you are sequencing more cycles than the fragment is long). Once the insert has been completely sequenced, the read continues into the adapter at the 3'-end (and even beyond it into the void; you will see AAAAA etc. if that happens).

    If you expect to have contaminants present on the left (5'-end) of the reads you can certainly run ktrim=l.
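To make the read-through effect concrete, here is a toy Python sketch (not BBDuk's algorithm). It assumes a fragment is simply insert + 3' adapter, uses the common Illumina adapter prefix AGATCGGAAGAGC purely as an illustration, and pads with A to mimic reading past the end of the fragment as described above. The function names and the inserts are made up for the example.

```python
# Toy model of adapter read-through. Assumption: fragment = insert + 3' adapter.
ADAPTER = "AGATCGGAAGAGC"  # illustrative adapter prefix; your kit may differ


def simulate_read(insert: str, read_length: int, adapter: str = ADAPTER) -> str:
    """Return the bases reported for a fixed number of sequencing cycles."""
    template = insert + adapter
    read = template[:read_length]
    # Pad with 'A' to mimic cycles that run past the end of the fragment.
    return read.ljust(read_length, "A")


def adapter_start(read: str, adapter: str = ADAPTER, min_overlap: int = 5):
    """Return the index where the adapter begins in the read, or None."""
    idx = read.find(adapter)
    if idx != -1:
        return idx
    # Otherwise look for a partial adapter hanging off the 3' end of the read.
    for overlap in range(len(adapter) - 1, min_overlap - 1, -1):
        if read.endswith(adapter[:overlap]):
            return len(read) - overlap
    return None


if __name__ == "__main__":
    for insert in ("ACGTTGCA" * 5, "ACGTTGCAACGTTGCAACGT", "ACGTTGCAAC"):
        read = simulate_read(insert, read_length=30)
        print(len(insert), read, adapter_start(read))
```

With a 30-cycle read, only the inserts shorter than 30 bases pick up adapter sequence, and always on the 3' side, which is why the manual's example uses ktrim=r.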

    Additionally, it seems like most of the discussion is about trimming adapters and PCR primers. However, is there any need to look for artifacts associated with the forward and reverse primer pads?

    Thanks
    I am not sure what you are referring to by "primer pads". BBDuk will scan for and trim any sequence you provide: you can add it as a FASTA record to the adapters file, or provide it on the command line with the literal= option (e.g. literal=ACTGGT,TTTGGTG).
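As a rough sketch of how the two options above translate into a command line, the Python helper below only assembles and prints a bbduk.sh invocation; it does not run it. The file names (raw.fq, trimmed.fq, left_trimmed.fq) and the helper name are hypothetical, and the literal sequences are just the placeholders from this post. It uses only the in=, out=, ref=, literal=, ktrim= and k= options, and it enforces the BBDuk guide's note that k must be at most the length of the sequences being searched for.

```python
import shlex


def bbduk_trim_command(reads_in, reads_out, side="r", k=23,
                       ref="adapters.fa", literals=()):
    """Build (but do not run) a bbduk.sh k-mer trimming command."""
    if side not in ("r", "l"):
        raise ValueError("side must be 'r' (3' end) or 'l' (5' end)")
    shortest = min((len(seq) for seq in literals), default=None)
    if shortest is not None and k > shortest:
        # A k-mer longer than the literal sequence can never match it.
        raise ValueError(f"k={k} exceeds shortest literal ({shortest} bases)")
    args = ["bbduk.sh", f"in={reads_in}", f"out={reads_out}",
            f"ktrim={side}", f"k={k}"]
    if ref is not None:
        args.append(f"ref={ref}")
    if literals:
        args.append("literal=" + ",".join(literals))
    return args


if __name__ == "__main__":
    # Right (3') trimming against the bundled adapters file.
    print(shlex.join(bbduk_trim_command("raw.fq", "trimmed.fq")))
    # Left (5') trimming of placeholder literal sequences; k this short
    # would match by chance very often, so treat it as syntax only.
    print(shlex.join(bbduk_trim_command(
        "raw.fq", "left_trimmed.fq", side="l", k=6, ref=None,
        literals=("ACTGGT", "TTTGGTG"))))
```

In practice you would probably add further options (for example mink= and hdist=, as in the BBDuk guide's adapter-trimming example), and use the real pad or primer sequences rather than these short placeholders.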



    • #3
      GenoMax, thanks for the reply. The adapter trimming on the 3' end makes sense to me now. Thanks.

      What I mean by the primer pads is best shown in the supplement from Tremblay et al. 2015 (see link below). They are attached to the primers (between the adapter and the primer on the 5' end, and between the primer and barcode on the 3' end). I admit that I am not totally sure what their role is, but I assume they are part of the primer construct for all JGI MiSeq 16S and ITS2 runs. Since they are located closer to the sequenced fragment than the adapter is, they would be even more likely to show up as an artifact. Is this true, or do I have something incorrect?




      Thanks for the help



      • #4
        In the example you attached, the resulting sequence files don't have any extra sequence at the 5'-end.

        Some libraries (e.g. amplicons) contain the same sequence across many samples. As a result, an effort is made to increase the nucleotide diversity at each cycle (on Illumina instruments it is not good for every cluster to have the same base at a given cycle) by adding base padding of varying length. If some of the extra sequence does appear at the 5'-end, it can be trimmed with BBDuk.
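A small Python illustration of the padding idea, under made-up assumptions: the primer and the 0 to 3 base pads below are hypothetical, not the JGI or Tremblay et al. sequences. It counts how many distinct bases appear at each cycle across a set of reads, with and without variable-length pads in front of the same primer.

```python
PRIMER = "GTGCCAGCAGCC"          # hypothetical primer, for illustration only
PADS = ["", "A", "CT", "TCA"]    # hypothetical pads of length 0 to 3


def distinct_bases_per_cycle(reads, cycles):
    """Count the distinct bases observed at each of the first `cycles` cycles."""
    return [len({read[i] for read in reads}) for i in range(cycles)]


if __name__ == "__main__":
    cycles = 8
    unpadded = [PRIMER for _ in PADS]
    padded = [pad + PRIMER for pad in PADS]
    # Every cluster shows the same base at each cycle without pads.
    print("no pads:  ", distinct_bases_per_cycle(unpadded, cycles))
    # Staggering the primer start mixes the bases seen at each cycle.
    print("with pads:", distinct_bases_per_cycle(padded, cycles))
```

The pads shift where the shared primer starts in each read, so no single cycle is dominated by one base. The same shift is why any pad bases left in the reads would sit at the 5' end.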



        • #5
          Thanks

          OK, I see. Thanks for the help.
