Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • paolo.kunder
    Member
    • Aug 2011
    • 93

    reads of different size - RNA SEQ

    Dear All,
    I have just received some RNA Seq data and I noticed that not all reads are at the size I expected: (75bp)
    only 64 % are at the correct size,
    what should I do? Should I remove all reads below 75?
    It is a library selection problem?

    Many thanks,
    Paolo
    Attached Files
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    If this is illumina data then it looks like they must have been pre-trimmed. If that is the case go ahead and use them.

    If not, then is this ion data?

    Comment

    • mastal
      Senior Member
      • Mar 2009
      • 666

      #3
      Is it Illumina data? It looks like your reads have already been trimmed to remove adapters, and they should be fine as they are. Ask your service provider what trimming has been done to the data.

      Comment

      • paolo.kunder
        Member
        • Aug 2011
        • 93

        #4
        Yes this is Illumina NextSeq500 data,
        I am a bit confused, why they should be of different length after trimming?
        I analyzed many RNA Seq data of Hiseq and they are all (100%) the same length, I also downloaded 26 Human Tissued form Encode with Hiseq and they are all (100%) the same length.

        Comment

        • paolo.kunder
          Member
          • Aug 2011
          • 93

          #5
          and moreover, why should I have more than 300'000 reads 32bp long,

          I have demultiplexed the data, with the following adapters

          index
          TCCGGAGA
          CGCTCATT
          ATTACTCG
          GAGATTCC
          ATTACTCG
          TCCGGAGA
          CGCTCATT
          GAGATTCC
          ATTACTCG
          TCCGGAGA
          CGCTCATT
          GAGATTCC
          ATTACTCG
          TCCGGAGA
          CGCTCATT
          GAGATTCC
          TCCGGAGA
          CGCTCATT
          ATTACTCG
          TCCGGAGA
          CGCTCATT
          ATTACTCG

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            These may be bad libraries that had short inserts which results in read-through into adapter on the other end. Those would generally need to be trimmed.

            Comment

            • GenoMax
              Senior Member
              • Feb 2008
              • 7142

              #7
              Originally posted by paolo.kunder View Post
              and moreover, why should I have more than 300'000 reads 32bp long,

              I have demultiplexed the data, with the following adapters

              index
              TCCGGAGA
              CGCTCATT
              ATTACTCG
              GAGATTCC
              ATTACTCG
              TCCGGAGA
              CGCTCATT
              GAGATTCC
              ATTACTCG
              TCCGGAGA
              CGCTCATT
              GAGATTCC
              ATTACTCG
              TCCGGAGA
              CGCTCATT
              GAGATTCC
              TCCGGAGA
              CGCTCATT
              ATTACTCG
              TCCGGAGA
              CGCTCATT
              ATTACTCG
              Demultiplexing with these barcodes should have nothing to do with the length of the reads.

              Comment

              • paolo.kunder
                Member
                • Aug 2011
                • 93

                #8
                so is definitely a bad library preparation?
                I have to be sure before going to complain to the service!

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  If these reads were pre-trimmed then they likely represent bad libraries. This may not necessarily indicate bad library preparation since the original samples themselves may be bad. One would need to look at the QC for samples and then libraries before concluding either one (or both) are bad.

                  Comment

                  • paolo.kunder
                    Member
                    • Aug 2011
                    • 93

                    #10
                    these may help you?
                    Attached Files

                    Comment

                    • GenoMax
                      Senior Member
                      • Feb 2008
                      • 7142

                      #11
                      This is outside my sphere of expertise so someone from experimental side of things will need to verify it authoritatively but it looks like these libraries have short inserts. Remember the size of the fragments include illumina adapters (~80 bp). Shorter fragments tend to cluster more efficiently.
                      Last edited by GenoMax; 11-23-2015, 07:15 AM.

                      Comment

                      Latest Articles

                      Collapse

                      • SEQadmin2
                        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                        by SEQadmin2


                        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                        Here are nine questions we think about, in roughly the order they matter, before...
                        06-18-2026, 07:11 AM
                      • SEQadmin2
                        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                        by SEQadmin2


                        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                        ...
                        06-02-2026, 10:05 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by SEQadmin2, Yesterday, 05:37 AM
                      0 responses
                      6 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-26-2026, 11:10 AM
                      0 responses
                      16 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-17-2026, 06:09 AM
                      0 responses
                      51 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-09-2026, 11:58 AM
                      0 responses
                      110 views
                      0 reactions
                      Last Post SEQadmin2  
                      Working...