  • paolo.kunder
    Member
    • Aug 2011
    • 93

    small RNA Seq adapter trimming

    Dear All,
    I am new to small RNA-Seq data.
    I have just received my data, and before I map my samples I would like to remove the 5' and 3' adapters.
    Here is my FastQC output; it is similar in all samples:

    Code:
    sequence	count	percentage	possible source
    GGCTGGTCCGATGGTAGTGGGTTATCAGAACTAGATCGGAAGAGCACACG	4842248	56.89688396826158	No Hit
    GGCTGGTCCGATGGTAGTGGGTTATCAGAACCAGATCGGAAGAGCACACG	496150	5.829810654236004	No Hit
    GGCTGGTCCGATGGTAGTGGGTTATCAGAACTTAGATCGGAAGAGCACAC	250074	2.9383937711325494	No Hit
    GGCTGGTCCGATGGTAGTGGGTTATCAGAACAAGATCGGAAGAGCACACG	226967	2.6668842784641402	No Hit
    GGCTGGTCCGATGGTAGTGGGTTATCAGAACAGATCGGAAGAGCACACGT	204732	2.4056208704283897	No Hit
    TGAGGTAGTAGTTTGTGCTGTTAGATCGGAAGAGCACACGTCTGAACTCC	82348	0.9675969923511569	Illumina Multiplexing PCR Primer 2.01 (100% over 28bp)
    TTCAAGTAATCCAGGATAGGCTAGATCGGAAGAGCACACGTCTGAACTCC	65591	0.7707006159870881	Illumina Multiplexing PCR Primer 2.01 (100% over 28bp)
    TAGCTTATCAGACTGATGTTGACAGATCGGAAGAGCACACGTCTGAACTC	56948	0.6691445271337941	Illumina Multiplexing PCR Primer 2.01 (100% over 27bp)
    TGAGGTAGTAGATTGTATAGTTAGATCGGAAGAGCACACGTCTGAACTCC	45012	0.5288953686757453	Illumina Multiplexing PCR Primer 2.01 (100% over 28bp)
    Is there something wrong?
    The data are 50 bp single-end reads.
    What are all these over-represented sequences? The first one accounts for more than half of all reads!
    Any suggestions?

    Many Thanks

    paolo
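For readers following along, here is a minimal sketch of 3' adapter trimming applied to the reads in the FastQC table above. The adapter string is an assumption: it is simply the suffix shared by the listed sequences (the part FastQC matches to the Illumina Multiplexing PCR Primer). The function name and the `min_overlap` parameter are hypothetical, and only exact matches are handled; a dedicated trimmer such as cutadapt also tolerates mismatches.

```python
# Assumption: adapter read off the shared suffix of the FastQC sequences above.
ADAPTER = "AGATCGGAAGAGCACACGTCTGAACTCC"


def trim_3prime_adapter(read: str, adapter: str = ADAPTER, min_overlap: int = 8) -> str:
    """Remove an exact 3' adapter match, including one truncated by the read end.

    min_overlap (hypothetical parameter) is the shortest adapter prefix at the
    end of the read that is still treated as adapter.
    """
    for start in range(len(read) - min_overlap + 1):
        tail = read[start:]
        # Either the full adapter begins here, or the rest of the read
        # is a prefix of the adapter (adapter runs off the read end).
        if tail.startswith(adapter) or adapter.startswith(tail):
            return read[:start]
    return read  # no adapter found, leave the read untouched


top_read = "GGCTGGTCCGATGGTAGTGGGTTATCAGAACTAGATCGGAAGAGCACACG"
print(trim_3prime_adapter(top_read))
```

With this adapter, the top over-represented read is cut down to a 32 bp insert, which is the sequence worth blasting.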
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Have you blasted them to see what shows up? Your RNA would be expected to be small so scan/trim as you normally would.
    Last edited by GenoMax; 02-06-2017, 08:21 AM.

    • paolo.kunder
      Member
      • Aug 2011
      • 93

      #3
      Exactly. It matches a fairly well-conserved 32 bp intronic region in multiple species.
      I assume this is a spike-in.

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Can you scan/trim first and then check what remains?

        • paolo.kunder
          Member
          • Aug 2011
          • 93

          #5
          I did it: I trimmed the adapter from my reads and blasted them.
          90% of my reads hit an intronic region conserved in multiple species.
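A rough way to check a figure like "90% of my reads" before blasting is to tally the trimmed inserts and look at the fraction each one contributes. The sketch below assumes the reads have already been adapter-trimmed and are held as plain strings; the function name `top_inserts` and the toy input are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def top_inserts(trimmed_reads: Iterable[str], n: int = 5) -> List[Tuple[str, int, float]]:
    """Return the n most common inserts as (sequence, count, fraction of non-empty reads)."""
    # Skip reads that were adapter-only and are empty after trimming.
    counts = Counter(read for read in trimmed_reads if read)
    total = sum(counts.values())
    return [(seq, count, count / total) for seq, count in counts.most_common(n)]


# Toy input for illustration only, not real data from this thread.
reads = ["AAA", "AAA", "AAA", "CCC", ""]
for seq, count, fraction in top_inserts(reads):
    print(seq, count, f"{fraction:.0%}")
```

Only the handful of sequences at the top of this list need to be blasted to see what dominates the library.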

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            Is this data from a monkey? That top sequence looks to be from one.

            • paolo.kunder
              Member
              • Aug 2011
              • 93

              #7
              No, it is human, unfortunately...

              • GenoMax
                Senior Member
                • Feb 2008
                • 7142

                #8
                Could this be some kind of contamination (if it is in all samples)? Perhaps talk with the owner of the data to see if there is any other logical explanation.

                • paolo.kunder
                  Member
                  • Aug 2011
                  • 93

                  #9
                  Don't you think this is the Exiqon spike-in that was added to the samples?

                  • GenoMax
                    Senior Member
                    • Feb 2008
                    • 7142

                    #10
                    I am not familiar with Exiqon, but if it was indeed a spike-in, would it be > 50% of total reads?

                    • paolo.kunder
                      Member
                      • Aug 2011
                      • 93

                      #11
                      Oops, I was wrong: my data is full of Y RNA. No idea why, or what they are.

                      • GenoMax
                        Senior Member
                        • Feb 2008
                        • 7142

                        #12
                        Y RNA as in this entry?
