Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • tirohia
    Member
    • Nov 2011
    • 47

    Treating paired end data as single end

    I have a colleague who is recommend that we treat paired end data as single end data so that we get two technical replicates - it's an exploratory experiment, there's one replicate of 4 different cultivars of apple in two conditions.

    Treating the PE data as SE seems ... wrong to me, but I can't immediately bring any objections to mind. Is there a reasonable reason that this should or shouldn't be done?

    Cheers
    Ben.
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    This is wrong because you would be deluding yourself into believing that you have (approximately) twice as much evidence supporting coverage of a gene/region/whatever-you're-looking-at than you actually do. Remember that the point of technical replicates is to increase the read count, which wouldn't actually be done by treating mates separately since they are not, in actuality, separate...the mapping position of one will restrict that of the other (or at least it should). Also, if you don't have a well characterized genome (at least the last apple genome I looked at was far from well characterized) then dropping the mate from alignments might also tank you alignment rate/reliability.

    Comment

    • tirohia
      Member
      • Nov 2011
      • 47

      #3
      Thanks for that.

      The best I had been able to come up was that it might impact on how well the reads are aligned to the reference genome. Which is similar to you last point I believe.

      Cheers
      Ben

      Comment

      • Brian Bushnell
        Super Moderator
        • Jan 2014
        • 2709

        #4
        Originally posted by tirohia View Post
        Thanks for that.

        The best I had been able to come up was that it might impact on how well the reads are aligned to the reference genome. Which is similar to you last point I believe.

        Cheers
        Ben
        No, it's much more than that. It's more like treating the passengers on the left and right side of a cruise ship as technical replicates. Obviously, both had the exact same journey.

        Using the first and last half of the file from a single library as technical replicates is absurd, but considering the left and right reads from a single run as independent is infinitely worse - it's a travesty, since they're physically linked.

        Comment

        • dpryan
          Devon Ryan
          • Jul 2011
          • 3478

          #5
          Originally posted by Brian Bushnell View Post
          No, it's much more than that. It's more like treating the passengers on the left and right side of a cruise ship as technical replicates. Obviously, both had the exact same journey.
          I'm totally stealing that analogy!

          Comment

          • Chipper
            Senior Member
            • Mar 2008
            • 323

            #6
            Tell him that you then also will split each read in two to get four replicates...

            Comment

            • woodydon
              Member
              • Jan 2010
              • 52

              #7
              Guess your colleague is tell you to split the read pairs into single reads as replicates. I think it depends on your questions for technical replicates.

              Comment

              • Brian Bushnell
                Super Moderator
                • Jan 2014
                • 2709

                #8
                Originally posted by woodydon View Post
                I think it depends on your questions for technical replicates.
                Not really. There are no useful questions that could be answered correctly by considering those as technical replicates. Also, read2 has lower quality than read1, so they will have 100% correlation bias plus different expression levels, which is perfect. Everything will be differentially expressed by the same amount.

                Comment

                • woodydon
                  Member
                  • Jan 2010
                  • 52

                  #9
                  Originally posted by Brian Bushnell View Post
                  Not really. There are no useful questions that could be answered correctly by considering those as technical replicates.
                  I agree with you. I am just wondering what his/her colleague is thinking.

                  Woody

                  Comment

                  • dpryan
                    Devon Ryan
                    • Jul 2011
                    • 3478

                    #10
                    Originally posted by Chipper View Post
                    Tell him that you then also will split each read in two to get four replicates...
                    Careful, someone will read that and think it's a good idea!

                    Comment

                    • tirohia
                      Member
                      • Nov 2011
                      • 47

                      #11
                      Colleague is new to bioinformatics following the lead of a non-bioinformatician they trust. (I'm the lone systems biology PhD student in the midst of molecular biologists and plant pathologists).

                      The experiment is a very loose (everyone acknowledges that) sneak peek to try and give some insight into what a properly designed experiment might be looking at. Still. Didn't seem right. As others seem to be doing, I'm am now officially stealing the cruise ship analogy.

                      Ta muchly.

                      Comment

                      • westerman
                        Rick Westerman
                        • Jun 2008
                        • 1104

                        #12
                        If you are doing a 'sneak peek' then you really don't need technical replicates. Your statistics will not be correct and your statisticians will be horrified but the stats shouldn't be that far off. Besides, tech reps are over-rated. :-)

                        Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                          by SEQadmin2


                          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                          Here are nine questions we think about, in roughly the order they matter, before...
                          06-18-2026, 07:11 AM
                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          06-02-2026, 10:05 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, Yesterday, 11:10 AM
                        0 responses
                        7 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-17-2026, 06:09 AM
                        0 responses
                        42 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-09-2026, 11:58 AM
                        0 responses
                        103 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-05-2026, 10:09 AM
                        0 responses
                        125 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...