Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Thanka a bunch.


    Originally posted by Tengfei Liu View Post
    You can use cutadapt to trim both 5' and 3' bps. The fastx_clipper can only trim 3' end. When you use cutadapt, you must use cutadapt -g firstly, and use the processed sequence to do cutadapt -a. If you use -g and -a at the same time, it will only cut one end.

    Comment


    • #17
      I did the same way.

      Thanks for feedback.

      Originally posted by Michael.Ante View Post
      I always use the fastx_trimmer; you can use the -f and -l options to set the first and the last base to be kept.

      Comment


      • #18
        So, just following up on this topic. It has been incredibly helpful. We shouldn't trim the first bases at the 5' end and try to perform the de novo assembly that way correct?

        Thanks!

        Comment


        • #19
          It depends on the library prep. Illumina fragment libraries typically have adapters on the right (3') end, so if you trimmed to the left from the adapter you'd lose all of your genomic sequence. For long mate pair libraries, the answer depends on the protocol.

          Comment


          • #20
            Thanks for your reply, Brian.
            I have mRNA Illumina 100bp paired end reads. I have already removed the adapters, but still have that same the high variation on GC% at the 5' end. For the library prep, TruSeq mRNA prep was used, that's why I am guessing I have the same 5' end bias described before on my dataset. Any thoughts?

            Comment


            • #21
              BBDuk can trim a set number of bases on the left or right side of a read. However, there are some library-prep protocols that are biased, especially near the read start, and thus have suspicious base-frequency histograms, even though they are correct. So, before you trim, I suggest you map the reads to a reference (even the lowest-quality assembly is OK) to determine whether there is actually a higher error rate in the first X bases of the read. If not, then you should not trim them.

              With an assembly, you can determine it like this:

              bbmap.sh in=reads.fq mhist=mhist.txt qhist=qhist.txt

              This will give you histograms of the average qualities by read position, and match/substitution/insertion/deletion/N rates by read position. That will allow you to determine whether the stated read quality is accurate, and thus whether you need to trim the ends of reads.

              If you want to trim a set number of bases on each side, you can use BBDuk's "ftl" (force-trim left) and "ftr" (force-trim right) flags to set the limits of where to trim.

              Comment


              • #22
                The fragmentation sites may be biased, depending on how fragmentation was done. Try mapping with BBMap and using the 'mhist' output, which shows the error rate by read position. If the error rate on the 5' end is not much higher than anywhere else, there's no need to trim it.

                Comment


                • #23
                  Use Trimmomatic

                  Comment


                  • #24
                    I use TrimGalore to trim adapter and the fastqc result also show that there are biases in 5' end, several kmers occurring.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Recent Advances in Sequencing Analysis Tools
                      by seqadmin


                      The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                      Today, 07:48 AM
                    • seqadmin
                      Essential Discoveries and Tools in Epitranscriptomics
                      by seqadmin




                      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                      04-22-2024, 07:01 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Today, 07:17 AM
                    0 responses
                    7 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 05-02-2024, 08:06 AM
                    0 responses
                    19 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-30-2024, 12:17 PM
                    0 responses
                    20 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-29-2024, 10:49 AM
                    0 responses
                    28 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X