Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • FastQC, Kmer count, Trimmomatic: no success in trimming, still fail Kmer

    I'm using RNAseq for DGE and failed in trying to map paired end reads in TopHat2. Samples passed the first two parts of fastqc, but failed per base and Kmer count. Some samples did pass per base.

    I tried trimming them to remove adaptors, but nothing improved the Kmer plot. http://imgur.com/Fhbx534 All of the kmer plots from this data set (18) look similar. I used the adaptor settings built into trimmomatic, but none worked. I then emailed genewhiz and asked for their (truseq) adaptors, but trimming for those sequences didn't improve the plots.

    Ex per seq quality score http://imgur.com/5L5PaJU,Hg2mHk0#0
    per base http://imgur.com/5L5PaJU,Hg2mHk0#1

    Am I doing something wrong? Should I just trim everything across certain bases?

    Edit: reads are paired

    adaptor input file:
    >PrefixPE
    GATCGGAAGAGCACACGTCTGAACTCCAGTCAC
    >Prefix PE
    AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
    Last edited by skmotay; 10-09-2014, 06:22 AM. Reason: links for images

  • #2
    Are your reads paired? If so, you can trim them with BBDuk via overlap, without specifying the adapter sequence, like this:

    bbduk.sh in1=r1.fq in2=r2.fq out1=trimmed1.fq out2=trimmed2.fq tpe tbo

    That package comes with both TruSeq and Nextera adapters, which are another possibility that you can try.

    Comment


    • #3
      Yes, reads are paired.

      I think I should have mentioned that I'm working in Galaxy, which I think limits what tools I can use. I don't see BBDuk in my tools.

      Comment


      • #4
        Ah, too bad. I'll see if I can get BBTools put into Galaxy. Though you can still download it and try out the Nextera adapter file. I would recommend, though, that you contact the source of the data to find out what kind of adapters were actually used - that will save a lot of time.

        Comment


        • #5
          I emailed genewhiz and they sent me the adaptor sequences (listed I original post). I'm not quite sure I wrote the adaptor sequence file correctly and I couldn't find a clear example.

          Comment


          • #6
            Does Fastqc give you a warning for overrepresented sequences?

            I don't know the default -k setting for fastqc but maybe it helps to increase it.
            My impression is that the adapter trimming worked. But the overrepresented k-mers test fails due to other reasons

            Comment


            • #7
              They all pass overrepresented sequences.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Best Practices for Single-Cell Sequencing Analysis
                by seqadmin



                While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
                Today, 07:15 AM
              • seqadmin
                Latest Developments in Precision Medicine
                by seqadmin



                Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                Somatic Genomics
                “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                05-24-2024, 01:16 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 08:18 AM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Today, 08:04 AM
              0 responses
              12 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 06-03-2024, 06:55 AM
              0 responses
              13 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-30-2024, 03:16 PM
              0 responses
              27 views
              0 likes
              Last Post seqadmin  
              Working...
              X