Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • tirohia
    Member
    • Nov 2011
    • 47

    Treating paired end data as single end

    I have a colleague who is recommend that we treat paired end data as single end data so that we get two technical replicates - it's an exploratory experiment, there's one replicate of 4 different cultivars of apple in two conditions.

    Treating the PE data as SE seems ... wrong to me, but I can't immediately bring any objections to mind. Is there a reasonable reason that this should or shouldn't be done?

    Cheers
    Ben.
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    This is wrong because you would be deluding yourself into believing that you have (approximately) twice as much evidence supporting coverage of a gene/region/whatever-you're-looking-at than you actually do. Remember that the point of technical replicates is to increase the read count, which wouldn't actually be done by treating mates separately since they are not, in actuality, separate...the mapping position of one will restrict that of the other (or at least it should). Also, if you don't have a well characterized genome (at least the last apple genome I looked at was far from well characterized) then dropping the mate from alignments might also tank you alignment rate/reliability.

    Comment

    • tirohia
      Member
      • Nov 2011
      • 47

      #3
      Thanks for that.

      The best I had been able to come up was that it might impact on how well the reads are aligned to the reference genome. Which is similar to you last point I believe.

      Cheers
      Ben

      Comment

      • Brian Bushnell
        Super Moderator
        • Jan 2014
        • 2709

        #4
        Originally posted by tirohia View Post
        Thanks for that.

        The best I had been able to come up was that it might impact on how well the reads are aligned to the reference genome. Which is similar to you last point I believe.

        Cheers
        Ben
        No, it's much more than that. It's more like treating the passengers on the left and right side of a cruise ship as technical replicates. Obviously, both had the exact same journey.

        Using the first and last half of the file from a single library as technical replicates is absurd, but considering the left and right reads from a single run as independent is infinitely worse - it's a travesty, since they're physically linked.

        Comment

        • dpryan
          Devon Ryan
          • Jul 2011
          • 3478

          #5
          Originally posted by Brian Bushnell View Post
          No, it's much more than that. It's more like treating the passengers on the left and right side of a cruise ship as technical replicates. Obviously, both had the exact same journey.
          I'm totally stealing that analogy!

          Comment

          • Chipper
            Senior Member
            • Mar 2008
            • 323

            #6
            Tell him that you then also will split each read in two to get four replicates...

            Comment

            • woodydon
              Member
              • Jan 2010
              • 52

              #7
              Guess your colleague is tell you to split the read pairs into single reads as replicates. I think it depends on your questions for technical replicates.

              Comment

              • Brian Bushnell
                Super Moderator
                • Jan 2014
                • 2709

                #8
                Originally posted by woodydon View Post
                I think it depends on your questions for technical replicates.
                Not really. There are no useful questions that could be answered correctly by considering those as technical replicates. Also, read2 has lower quality than read1, so they will have 100% correlation bias plus different expression levels, which is perfect. Everything will be differentially expressed by the same amount.

                Comment

                • woodydon
                  Member
                  • Jan 2010
                  • 52

                  #9
                  Originally posted by Brian Bushnell View Post
                  Not really. There are no useful questions that could be answered correctly by considering those as technical replicates.
                  I agree with you. I am just wondering what his/her colleague is thinking.

                  Woody

                  Comment

                  • dpryan
                    Devon Ryan
                    • Jul 2011
                    • 3478

                    #10
                    Originally posted by Chipper View Post
                    Tell him that you then also will split each read in two to get four replicates...
                    Careful, someone will read that and think it's a good idea!

                    Comment

                    • tirohia
                      Member
                      • Nov 2011
                      • 47

                      #11
                      Colleague is new to bioinformatics following the lead of a non-bioinformatician they trust. (I'm the lone systems biology PhD student in the midst of molecular biologists and plant pathologists).

                      The experiment is a very loose (everyone acknowledges that) sneak peek to try and give some insight into what a properly designed experiment might be looking at. Still. Didn't seem right. As others seem to be doing, I'm am now officially stealing the cruise ship analogy.

                      Ta muchly.

                      Comment

                      • westerman
                        Rick Westerman
                        • Jun 2008
                        • 1104

                        #12
                        If you are doing a 'sneak peek' then you really don't need technical replicates. Your statistics will not be correct and your statisticians will be horrified but the stats shouldn't be that far off. Besides, tech reps are over-rated. :-)

                        Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          06-02-2026, 10:05 AM
                        • SEQadmin2
                          Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                          by SEQadmin2


                          With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                          Introduction

                          Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                          05-22-2026, 06:42 AM
                        • SEQadmin2
                          Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                          by SEQadmin2

                          Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                          Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                          05-06-2026, 09:04 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, 06-02-2026, 12:03 PM
                        0 responses
                        19 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-02-2026, 11:40 AM
                        0 responses
                        14 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-28-2026, 11:40 AM
                        0 responses
                        29 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-26-2026, 10:12 AM
                        0 responses
                        31 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...