Unconfigured Ad

Collapse
This is a sticky topic.
X
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • juan
    Member
    • Aug 2009
    • 14

    #31
    When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

    Example:
    Colorspace T000.00
    Is basespace TTTTN followed by AA, CC, GG or TT

    Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?

    Comment

    • nilshomer
      Nils Homer
      • Nov 2008
      • 1283

      #32
      Originally posted by juan View Post
      When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

      Example:
      Colorspace T000.00
      Is basespace TTTTN followed by AA, CC, GG or TT

      Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?
      Depends on the alignment tool. Missing colors are not a problem for BFAST since even if there are four possibilities, one typically has a higher likelihood (see the Viterbi algorithm for HMMs). Remember that sequence alignment can be thought of as a path finding problem, or HMM, etc.

      Comment

      • snetmcom
        Senior Member
        • Oct 2008
        • 159

        #33
        Originally posted by juan View Post
        When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

        Example:
        Colorspace T000.00
        Is basespace TTTTN followed by AA, CC, GG or TT

        Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?
        You map your reads in colorspace. You do not decode the strand and then map. This gives you the benefits of 2base encoding while detecting errors throughout the tag.

        Comment

        • pmiguel
          Senior Member
          • Aug 2008
          • 2328

          #34
          Originally posted by juan View Post
          When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

          Example:
          Colorspace T000.00
          Is basespace TTTTN followed by AA, CC, GG or TT

          Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?
          Just to reiterate what is said elsewhere in the thread: do not convert out of colorspace prior to alignment! (Instead convert your reference to colorspace.) It is not merely that you lose the benefits of dual base encoding by converting raw reads out of colorspace. Worse, any error in colorspace changes the "color frame--for lack of a better term" for the conversion. This means that any single base sequencing error will propagate through the rest of the read, ensuring that most of the rest of the base space bases are wrong also.

          Even if you must use software that is not colorspace-aware, there are tricks you can use to avoid converting out of colorspace.

          --
          Phillip

          Comment

          • deepak_bala
            Junior Member
            • Dec 2009
            • 7

            #35
            Hey guys, I need a clarification about the two-base encoding (should be called 'decoding'!). Maybe I haven't understood correctly, but we need to know the first base to be able to decode correctly, right? I am not clear how one would decide/know what base comes first, 'cos I think it is critical to decoding the sequence.

            Thanks for any help!

            Comment

            • ECO
              --Site Admin--
              • Oct 2007
              • 1360

              #36
              Originally posted by deepak_bala View Post
              Hey guys, I need a clarification about the two-base encoding (should be called 'decoding'!). Maybe I haven't understood correctly, but we need to know the first base to be able to decode correctly, right? I am not clear how one would decide/know what base comes first, 'cos I think it is critical to decoding the sequence.

              Thanks for any help!
              You're right, and you do know the first base based on the adapter/primer you use. I'm sure someone will chime in soon with a more detailed answer....

              Comment

              • deepak_bala
                Junior Member
                • Dec 2009
                • 7

                #37
                Originally posted by ECO View Post
                You're right, and you do know the first base based on the adapter/primer you use. I'm sure someone will chime in soon with a more detailed answer....
                Thanks for the reply. I was looking at it and maybe we can determine what the first base is with data from the primer 1 reads from primers n and (n-1).

                Correct me if I am wrong, anyone.

                Comment

                • snetmcom
                  Senior Member
                  • Oct 2008
                  • 159

                  #38
                  Originally posted by deepak_bala View Post
                  Thanks for the reply. I was looking at it and maybe we can determine what the first base is with data from the primer 1 reads from primers n and (n-1).

                  Correct me if I am wrong, anyone.
                  the first base is given to you in your files, but AGAIN, you do not want to decode color space to read your sequence. Align your sequence in color space first.

                  Comment

                  • deepak_bala
                    Junior Member
                    • Dec 2009
                    • 7

                    #39
                    Originally posted by snetmcom View Post
                    the first base is given to you in your files, but AGAIN, you do not want to decode color space to read your sequence. Align your sequence in color space first.
                    Thanks for the pointer. I will.

                    Comment

                    • anago
                      Junior Member
                      • Nov 2009
                      • 1

                      #40
                      Hi All,

                      back to the chemistry of SOLiD. With mate-paired libraries after the 'first mate' comes an internal adaptor. Do I think right that the 'second mate' is sequenced the same way but with primers matching with the internal adaptor?

                      Anago

                      Comment

                      • pmiguel
                        Senior Member
                        • Aug 2008
                        • 2328

                        #41
                        Originally posted by anago View Post
                        Hi All,

                        back to the chemistry of SOLiD. With mate-paired libraries after the 'first mate' comes an internal adaptor. Do I think right that the 'second mate' is sequenced the same way but with primers matching with the internal adaptor?

                        Anago
                        Yes, the "R3" mate-pair reads are primed out of the internal adaptor.

                        --
                        Phillip

                        Comment

                        • samanta
                          Senior Member
                          • Feb 2010
                          • 108

                          #42
                          Color space

                          Hello all,

                          I wrote this up on color space - nucleotide space conversion and added few Perl scripts to help you proceed.



                          Please feel free to comment.

                          Manoj
                          http://homolog.us

                          Comment

                          • lily Michelle
                            Junior Member
                            • Jul 2010
                            • 1

                            #43
                            Hi all,

                            Is the flurosence attached to the base or the phosphate group?

                            thanks

                            lily

                            Comment

                            • drambald
                              Junior Member
                              • Aug 2010
                              • 2

                              #44
                              May be is a stupid question but...

                              Hello, I a the following question, regarding the use of AB Solid for ChIP-seq analysis.

                              To my understanding:

                              we tag a protein with an antibody, dismantle the cells, denaturate and sonicate the DNA, collect the fragments that were attached to the protein which was attached to the antibody and sequence them.

                              Problem is: the fragments from sonication should be 100-500 bp long, we only sequence 50 bases at each read: when and how does this further "reduction" take place ?

                              What should happen is:
                              1. The fragments are attached to beads
                              2. Amplification takes place on attached fragments
                              3. The bead ends up looking like an octopus with N copies of the same fragment
                              4. Those copies are actually sequenced, but due to technical limitations only the first 50 bases can be read

                              is this correct?

                              best regards

                              Comment

                              • cmebai
                                Junior Member
                                • Aug 2010
                                • 1

                                #45
                                Firstly ,thanks for your time of explaining the wonderful tech.And i have a problem ,that is :
                                At the step of primer reset is it the same strand to be sequenced ?

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                20 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-26-2026, 10:12 AM
                                0 responses
                                31 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...