Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to use Scythe in order to cut adapter sequences?

    I read the manual and it says how you need a fasta file as an input?

    I am looking at the overrepresented sequences in the FASTQC files of my paired end reads, but I don't see an option in Scythe in order to remove them this way.

    How can I do this?

  • #2
    You need a fasta file of your adapter sequences, which will depend on the library preps you did, as well as any barcodes you added. What chemistry did you use and what was the sequencing platform?

    Also, you will most likely have to change the quality scoring to match your fastq files (for instance, with '-q 33' if you used the newest Illumina pipeline).

    Comment


    • #3
      I'm a biologist/computer scientist, not a chemist.

      The sequencing platform was Sanger.

      The minimum quality score should be 20, but i don't know how to specify this from the program. The minimum base pair length should be 50 and i dont know how to specify this either.

      Comment


      • #4
        There are no adapters in Sanger sequencing for you to trim (they're used in next-gen sequencing).

        BTW, have a look at Lucy, which I recall being commonly used to quality trim Sanger sequencing (It's been a few years since I've done Sanger sequencing).
        Last edited by dpryan; 10-08-2013, 12:38 PM. Reason: Add link to Lucy

        Comment


        • #5
          I agree with Devon that Lucy would work for what you want (I think it uses Phred scores, so you'd need those).

          Like Devon said, you don't have adapters in Sanger sequencing. You might have primer sequences on the very ends of your sequences that you can trim, but those would be specific to the region you're amplifying.

          You also wouldn't use scythe to do hard quality-score trimming even on next-gen data, it's just for identifying adapter contamination on the ends of reads where base-calling quality is typically low. You'd instead use something like sickle or Trimmomatic. If you have phred scores on your Sanger sequences then I guess sickle might work on those too.

          Comment


          • #6
            How to put together an adapter file for Scythe?

            I still would like some information on this if possible. I'm new on sequence analysis and I'm having trouble to figure out how to put together the adapters file.

            I have a barcode file, and this is what I have found online:

            adaptors.fasta: Provide contaminant sequences as a fasta-formatted file.
            See ´/usr/share/doc/scythe/illumina_adaptors.fa´.
            N.B.: Index/Barcode sequences should be substituted for Ns in the example adaptor file.

            And this is what they say in the READ.me:

            In the case of the original Solexa/Illumina adapter sequences, we've seen barcodes "upstream" of forward reads (in which case the reverse complement of the barcode will appear before the adapter sequence at the 3'-end of reverse reads - replacing the [NNNNNN]). We've also seen barcodes upstream of reverse reads (in which case the reverse complement of the barcode will appear before the adapter sequence at the 3'-end of forward reads - replacing the [MMMMMM]). Your definition of the barcode may be someone else's reverse-complemented barcode, and the barcode may or may not be 6 bases.

            But where do the NNNs and MMMs go in the sequences? They don't show an example. They only give you a illumina.adapters.fa file like this:

            >multiplexing-forward
            GATCGGAAGAGCACACGTCT
            >solexa-forward
            AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
            >truseq-forward-contam
            AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC
            >truseq-reverse-contam
            AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTA
            >nextera-forward-read-contam
            CTGTCTCTTATACACATCTCCGAGCCCACGAGAC
            >nextera-reverse-read-contam
            CTGTCTCTTATACACATCTGACGCTGCCGACGA
            >solexa-reverse
            AGATCGGAAGAGCGGTTCAGCAGGAATGCCGAG

            Thank you for the time to reply to this.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Today, 08:47 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            60 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            59 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            54 views
            0 likes
            Last Post seqadmin  
            Working...
            X