Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Trimmomatic but no trimming

    Hii dear friends

    I have a question about Trimmomatic. I wrote this command:

    java -jar /usr/bin/trimmomatic-0.33.jar PE -phred64 72_L3_1.fq.gz 72_L3_2.fq.gz 71P_1.fq.gz 71UP_1.fq.gz 71P_2.fq.gz 71UP_2.fq.gz ILLUMINACLIP:Over.fasta:2:40:15


    72_L3_1.fq.gz
    and
    72_L3_2.fq.gz
    are my input files

    71P_1.fq.gz
    71UP_1.fq.gz
    71P_2.fq.gz
    and
    71UP_2.fq.gz
    are my output files. In my case P means paired and UP means unpaired.

    after I did fastqc I got overrepresented sequence (it is not adapter) and I stored it in Over.fasta and I want to trim it from my data.

    but when I run the command I see this:

    ILLUMINACLIP: Using 0 prefix pairs, 0 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences
    Input Read Pairs: 19759652 Both Surviving: 19759652 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)

    It does not trim the sequence.

    Why it is like that?

    In addition, for Palindrome and Single mode U just selected 40 and 15 because the manual selected them. Actually I do not know which score I should select. How can I choose the correct score?

    Would you please help me in these two questions.

    -------------------------------------------------------------------
    I'm sorry for asking so many questions each day. I' working with RNA-seq data in these days. In each level, I should stop due to errors, warnings, blocks and confusions and this story continues.

    and thank you for helping me in solving all these problems.

  • #2
    Are you sure nothing was done the your sequences? 100% reads surviving does not necessarily mean no trimming was performed, especially when you don't specify MINLEN.

    Comment


    • #3
      Yes, I'm almost sure because when I do fastqc, the number of reads and nucleotides in them have not changed (I mean "Total sequences" and "Sequence length").

      Comment


      • #4
        Originally posted by Saeideh View Post
        ILLUMINACLIP: Using 0 prefix pairs, 0 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences
        The explanation is here (highlighted in red). Trimmomatic is not finding any valid sequences to use for trimming in the supplied clipping file 'Over.fasta' meaning you must have formatted it incorrectly.

        Comment


        • #5
          I put following sequences in Over.fasta

          CTCCCATTTCGCTCGCCGCTACTACGGGAATCGCTTTTGCTTTCTTTTCC
          GGCTTGCGGTGGATACCTAGGTACCCAGAGACGAGGAAGGGCGTAGCAAG

          Isn't it correct?

          Comment


          • #6
            That is not a valid fasta file.

            https://en.wikipedia.org/wiki/FASTA_format

            Comment


            • #7
              Alriiiiiight...

              I got it

              Thanks all

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              51 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              45 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X