Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • yifangt
    replied
    Yes, for Illumina Mate-pair reads, there are multiple scenarios that need be handled at the same time, at least the junction sequence and its reverse complement. Maybe Trim_galore can do it with multiple steps, is that right?

    Leave a comment:


  • fkrueger
    replied
    No, if you wanted to use many adapter sequences you would have to use Cutadapt itself. For standard (Illumina) sequencing libraries this is probably not needed though. Do you have a reason to try many adapters as the first attempt instead of just going with the default (which is Illumina adapters)?

    Leave a comment:


  • yifangt
    replied
    Thanks Felix, I will give it a try again. Is it poosible to provide multiple adaptor sequences in a single line like cutadapt?
    cutadapt -a PRIMER1 -b ADAPTOR1 -b ADAPTOR2
    It seems there is no description about this in the Trim_galore guide. Thanks a lot!

    Leave a comment:


  • fkrueger
    replied
    Apologies for that, this was a bug in the code because the default length cutoff had not been defined when the check against the length of unpaired reads was performed. I have now fixed that and put up a new version. Just as side note, the retain length for unpaired singleton reads has to be greater than the paired-end length cutoff (which is 20bp by default).

    Please download the hot-fixed version 0.3.1 from the Trim Galore project page.
    Best,
    Felix

    Leave a comment:


  • yifangt
    replied
    trim galore error

    Hello,
    When I tried trim-galore (v0.3.0) for mate_pair reads (actually my first time to use it) I always had an error:
    Code:
    Use of uninitialized value $length_cutoff in numeric gt (>) at /usr/bin/trim_galore line 1068.
    Use of uninitialized value $length_cutoff in numeric gt (>) at /usr/bin/trim_galore line 1077.
    Failed to write to file: No such file or directory
    And this is my command line:
    Code:
    trim_galore --paired -phred33 -q 20 -a CTGTCTCTTATACACATCT -a2 AGATGTGTATAAGAGACAG -e 0.15 --clip_R1 14 --clip_R2 15 --retain_unpaired  -t -r1 20 -r2 20  -o MP03-05k_R1+R2trim_galored.fq test_R1.fq test_R2.fq
    I checked the script lines it seems related to my reads length. What did I miss? Thank you!

    Leave a comment:


  • fkrueger
    replied
    A new version of Trim Galore (v0.3.0) is now available for download from https://www.bioinformatics.babraham....s/trim_galore/.

    This new release adds the options '--clip_R1' and '--clip_R2' to trim off a fixed number of bases from the 5' end of reads. This can be useful if the quality is unusually low at the start, or whenever there is an undesired bias at the start of reads. An example for this could be PBAT-Seq in general (where the first 4bp show a very strong sequence (and methylation) bias), or the start of Read 2 of every shotgun Bisulfite-Seq paired-end library where the end repair procedure introduces unmethylated cytosines. For more information on this please refer to the M-bias section of the Bismark User Guide.

    Together with the '--ignore' and '--ignore_r2' options in the Bismark methylation extractor one may now choose whether to remove potential biased positions at the start of reads before mapping or ignoring them later on using the Bismark methylation extractor.

    Leave a comment:


  • fkrueger
    replied
    Originally posted by Vasilisa View Post
    Hi!
    I have a question about the usage of Trim Galore (v0.3.0) for paired-end data filtering.

    I used FastQC to generate the quality assessment report. Based on it, I want to trim my sequences both at the 3'- and 5'-ends (from "Per base sequence quality") and also filter them based on quality (from "Per sequence quality scores"). Is there a way to somehow integrate FastQC with Trim Galore for paired-end data?

    Thanks a lot!
    Vasilisa
    Hi Vasilisa,

    FastQC and Trim Galore are independent pieces of software, so they can't really be 'integrated'. Trim Galore will by default use a Phred score cutoff of 20 for quality trimming (the implementation is the same as the one used by BWA, please see the Cutadapt documentation for more information). As of the latest version of Trim Galore you can also trim sequences on their 5' ends if intended. If you are simply experiencing somwhat lower qualities at the first couple of positions it is probably simply due to the quality score calibration that takes place during the first cycles on the HiSeq/MiSeq. Normally it is sufficient to run Trim Galore in its default setting.

    Leave a comment:


  • Vasilisa
    replied
    paired-end data filtering with FastQC

    Hi!
    I have a question about the usage of Trim Galore (v0.3.0) for paired-end data filtering.

    I used FastQC to generate the quality assessment report. Based on it, I want to trim my sequences both at the 3'- and 5'-ends (from "Per base sequence quality") and also filter them based on quality (from "Per sequence quality scores"). Is there a way to somehow integrate FastQC with Trim Galore for paired-end data?

    Thanks a lot!
    Vasilisa

    Leave a comment:


  • fkrueger
    replied
    We have just released Trim Galore v0.2.8, which is more of a cosmetic update. The changes in detail are:

    • Trim Galore will now compress output files with GZIP on the fly instead of compressing the trimmed file once trimming has completed. In the interest of time temporary files are not being compressed
    • Added a small sanity check to exit if no files were supplied for trimming. Thanks to P. for 'bringing this to my attention'
    • The download also includes the updated RRBS guide

    The download can be found here: http://www.bioinformatics.babraham.a...s/trim_galore/

    Leave a comment:


  • fkrueger
    replied
    We have just released Trim Galore v0.2.7 that adds an option '--dont_gzip' which takes precedence over .gz file endings.

    The download can be found here: http://www.bioinformatics.babraham.a...s/trim_galore/.

    Leave a comment:


  • fkrueger
    replied
    Originally posted by blanco View Post
    Thanks again for your quick reply fkrueger.

    What, however, if I do not want to use the default 13bp adapter but instead want to use the whole 63 bp index-adapter (index = red):
    (5')-adapter-(3')=GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTG

    Would I still be safe to just use the -a option? I am assuming yes.
    Yes that should work. However if you look closely at the sequence you posted you'll find that it starts with GATCGGAAGAGC.... A-tailing of the library prior to adapter ligation will add another A to the 5-prime end, like so:
    AGATCGGAAGAGC, which happens to be pretty much what Trim Galore is using by default:
    AGATCGGAAGAGC.

    If you still want to use the entire sequence then you would need tp specify different adapters for both sides since their sequence starts diverging after the first 13bp. Unless you want to trim only a subset of indexed adapters it is probably best to just run it in default mode.

    Leave a comment:


  • blanco
    replied
    Thanks again for your quick reply fkrueger.

    What, however, if I do not want to use the default 13bp adapter but instead want to use the whole 63 bp index-adapter (index = red):
    (5')-adapter-(3')=GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTG

    Would I still be safe to just use the -a option? I am assuming yes.

    Leave a comment:


  • gwilkie
    replied
    I have also found that when using Nextera sample prep, you should trim at CTGTCTCTTATACACATCT instead of the usual AGATCGGAAGAGC.

    Best wishes, Gavin

    Leave a comment:


  • fkrueger
    replied
    Originally posted by blanco View Post
    Hi, I am wondering about the -a and -a2 parameters for paired end reads

    For example my libraries were made using the Illumina TruSeq adapters, one of which is:
    (5')-adapter-(3')=GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTG

    Should -a2 be the reverse of this or should I not use the a2 option at all?

    Thanks,
    blanco
    You don't need to use -a2 at all because Trim Galore will by default search for the first 13 bp of the Illumina adapter. This portion is shared between both read 1 and read 2 adapter, and they start to diverge only after this position. Normally it should thus be suffifcient to run it with the default parameters.

    Leave a comment:


  • blanco
    replied
    Hi, I am wondering about the -a and -a2 parameters for paired end reads

    For example my libraries were made using the Illumina TruSeq adapters, one of which is:
    (5')-adapter-(3')=GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTG

    Should -a2 be the reverse of this or should I not use the a2 option at all?

    Thanks,
    blanco

    Leave a comment:

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 08:47 AM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
57 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
53 views
0 likes
Last Post seqadmin  
Working...
X