Header Leaderboard Ad

Collapse

Quality-, adapter- and RRBS-trimming with Trim Galore!

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #61
    Thanks for helping Felix. A simple typo on my end unfortunately. Problem solved!

    Comment


    • #62
      For those with weaker heart (those that cannot use complex scripts in Linux and need a graphic interface) here is another (free) program for trimming qualities:

      An efficient SFF/FastQ viewer and editor (GUI)






      The gray/green curves in the second graphic shows the average quality before and after trimming the low quality ends.


      Only works on Fasta, FastQ, SFF for the moment.
      Sorry.
      Last edited by create.share; 05-30-2015, 12:19 AM.

      Comment


      • #63
        Hi everyone,

        not sure if this is the right thread but I'll give it a go:

        I'm trying to get rid of adapters in my sequences. There aren't many with adapters but I'd like to save what I can.

        When I run my command I get the following error:

        ==================================================================
        Your job looked like:

        ------------------------------------------------------------
        # LSBATCH: User input
        /gpfs/scratch/cbh12wsu/trim/trim_galore -phred33 --illumina --paired -phred33 --path_to_cutadapt /gpfs/scratch/cbh12wsu/trim/cutadapt.py -o adaptrim.fastq -length 50 1_L.fastq 2_R.fastq
        ------------------------------------------------------------

        Exited with exit code 2.

        The output (if any) follows:

        Path to Cutadapt set as: '/gpfs/scratch/cbh12wsu/trim/cutadapt.py' (user defined)
        File "/gpfs/scratch/cbh12wsu/trim/cutadapt.py", line 99
        print(rest, match.read.name, file=self.file)
        ^
        SyntaxError: invalid syntax
        Cutadapt seems to be working fine (tested command '/gpfs/scratch/cbh12wsu/trim/cutadapt.py --version')
        File "/gpfs/scratch/cbh12wsu/trim/cutadapt.py", line 99
        print(rest, match.read.name, file=self.file)
        ^
        SyntaxError: invalid syntax
        Failed to write to file '1_L.fastq_trimming_report.txt': No such file or directory
        ==================================================================

        Does indicate a problem with the cutadapt.py?

        Thanks!

        Comment


        • #64
          Hmm, when you just type "/gpfs/scratch/cbh12wsu/trim/cutadapt.py" on the command line do you see something like:

          Code:
          cutadapt version 1.8.1
          Copyright (C) 2010-2015 Marcel Martin <[email protected]>
          
          cutadapt removes adapter sequences from high-throughput sequencing reads.
          
          Usage:
              cutadapt -a ADAPTER [options] [-o output.fastq] input.fastq
          ?

          Comment


          • #65
            Hi Felix,

            thanks for replying so quick!

            I get the following error when I try your command:
            File "/gpfs/scratch/cbh12wsu/trim/cutadapt.py", line 99
            print(rest, match.read.name, file=self.file)
            ^
            SyntaxError: invalid syntax

            I think I should mention that I made cutadapt.py an executable. And copied it from it's original folder into the trim folder.

            Comment


            • #66
              Cutadapt probably needs its internal structure to be intact, can't you just do a normal installation and then point to the cutadapt executable (without renaming it to .py?)

              Comment


              • #67
                I put it back where it belongs and tried again. Same error as before.

                With normal installation you mean the pip command? I tried that before but I'm not allowed to do that on the cluster. I can load a cutadapt module but I was hoping making it an executable would solve the problem.

                Comment


                • #68
                  Hmm, maybe you need to contact your sys-admins then, you obviously need to get Cutadapt to be working before you can start using it. You should even be able to install it in your home directory and then point there?

                  Comment


                  • #69
                    Just install it as a user (pip install --user cutadapt) or use virtualenv.

                    Comment


                    • #70
                      I've installed it and when I run felix's command I get:
                      cutadapt version 1.8.1
                      ...

                      But I still get this error:
                      Path to Cutadapt set as: '/gpfs/scratch/cbh12wsu/trim/cutadapt/cutadapt/scripts/cutadapt' (user defined)
                      1.8.1
                      Cutadapt seems to be working fine (tested command '/gpfs/scratch/cbh12wsu/trim/cutadapt/cutadapt/scripts/cutadapt --version')
                      Failed to write to file '1_L.fastq_trimming_report.txt': No such file or directory

                      Used this command:
                      bsub -N -q short -o cut3.txt "/gpfs/scratch/cbh12wsu/trim/trim_galore -phred33 --illumina --paired -phred33 --path_to_cutadapt /gpfs/scratch/cbh12wsu/trim/cutadapt/cutadapt/scripts/cutadapt -o adaptrim.fastq -length 50 1_L.fastq 2_R.fastq"

                      More ideas?

                      Comment


                      • #71
                        Ok, Cutadapt seems to be working now. Do you have write permission in the folder where you keep your FastQ files?

                        Comment


                        • #72
                          these are the folder permissions: drwxr-xr-x
                          I also just did a fastqc run and that adds the files to the folder, so it should be fine.

                          could it be a problem that the to write file is called .fastqxx.txt?

                          Comment


                          • #73
                            no that shouldn't matter... Could it be that your queuing system does not take you to the same directory (option like -cwd)? What happens if you just run the command without using the queue?

                            By the way you can shorten the command like this:

                            Code:
                            /gpfs/scratch/cbh12wsu/trim/trim_galore --paired --path_to_cutadapt /gpfs/scratch/cbh12wsu/trim/cutadapt/cutadapt/scripts/cutadapt [COLOR="Red"]-o adaptrim.fastq[/COLOR] -length 50 1_L.fastq 2_R.fastq
                            By the way -o is meant to be an output directory, not a file name. This might be the reason why it can't create the output files in the directory called adaptrim.fastq

                            Comment


                            • #74
                              It is still running!!!!! What a stupid mistake! Thanks for finding it and shortening my command!

                              Comment


                              • #75
                                Great, enjoy!

                                Comment

                                Latest Articles

                                Collapse

                                • seqadmin
                                  How RNA-Seq is Transforming Cancer Studies
                                  by seqadmin



                                  Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                                  09-07-2023, 11:15 PM
                                • seqadmin
                                  Methods for Investigating the Transcriptome
                                  by seqadmin




                                  Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                                  Whole Transcriptome RNA-seq
                                  Whole transcriptome sequencing...
                                  08-31-2023, 11:07 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by seqadmin, Yesterday, 09:05 AM
                                0 responses
                                13 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 09-21-2023, 06:18 AM
                                0 responses
                                10 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 09-20-2023, 09:17 AM
                                0 responses
                                12 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 09-19-2023, 09:23 AM
                                0 responses
                                27 views
                                0 likes
                                Last Post seqadmin  
                                Working...
                                X