Unconfigured Ad

Collapse
This is a sticky topic.
X
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • juan
    Member
    • Aug 2009
    • 14

    #31
    When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

    Example:
    Colorspace T000.00
    Is basespace TTTTN followed by AA, CC, GG or TT

    Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?

    Comment

    • nilshomer
      Nils Homer
      • Nov 2008
      • 1283

      #32
      Originally posted by juan View Post
      When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

      Example:
      Colorspace T000.00
      Is basespace TTTTN followed by AA, CC, GG or TT

      Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?
      Depends on the alignment tool. Missing colors are not a problem for BFAST since even if there are four possibilities, one typically has a higher likelihood (see the Viterbi algorithm for HMMs). Remember that sequence alignment can be thought of as a path finding problem, or HMM, etc.

      Comment

      • snetmcom
        Senior Member
        • Oct 2008
        • 159

        #33
        Originally posted by juan View Post
        When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

        Example:
        Colorspace T000.00
        Is basespace TTTTN followed by AA, CC, GG or TT

        Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?
        You map your reads in colorspace. You do not decode the strand and then map. This gives you the benefits of 2base encoding while detecting errors throughout the tag.

        Comment

        • pmiguel
          Senior Member
          • Aug 2008
          • 2328

          #34
          Originally posted by juan View Post
          When converting from colorspace to basespace, if you reach an error (no color, aka "4" or "."), you have 4 possibilities for the rest of the read. How do you determine the correct one?

          Example:
          Colorspace T000.00
          Is basespace TTTTN followed by AA, CC, GG or TT

          Which is correct? If you have a sequencing error early in the read, such as position 1 or 2 it goes in the garbage?
          Just to reiterate what is said elsewhere in the thread: do not convert out of colorspace prior to alignment! (Instead convert your reference to colorspace.) It is not merely that you lose the benefits of dual base encoding by converting raw reads out of colorspace. Worse, any error in colorspace changes the "color frame--for lack of a better term" for the conversion. This means that any single base sequencing error will propagate through the rest of the read, ensuring that most of the rest of the base space bases are wrong also.

          Even if you must use software that is not colorspace-aware, there are tricks you can use to avoid converting out of colorspace.

          --
          Phillip

          Comment

          • deepak_bala
            Junior Member
            • Dec 2009
            • 7

            #35
            Hey guys, I need a clarification about the two-base encoding (should be called 'decoding'!). Maybe I haven't understood correctly, but we need to know the first base to be able to decode correctly, right? I am not clear how one would decide/know what base comes first, 'cos I think it is critical to decoding the sequence.

            Thanks for any help!

            Comment

            • ECO
              --Site Admin--
              • Oct 2007
              • 1360

              #36
              Originally posted by deepak_bala View Post
              Hey guys, I need a clarification about the two-base encoding (should be called 'decoding'!). Maybe I haven't understood correctly, but we need to know the first base to be able to decode correctly, right? I am not clear how one would decide/know what base comes first, 'cos I think it is critical to decoding the sequence.

              Thanks for any help!
              You're right, and you do know the first base based on the adapter/primer you use. I'm sure someone will chime in soon with a more detailed answer....

              Comment

              • deepak_bala
                Junior Member
                • Dec 2009
                • 7

                #37
                Originally posted by ECO View Post
                You're right, and you do know the first base based on the adapter/primer you use. I'm sure someone will chime in soon with a more detailed answer....
                Thanks for the reply. I was looking at it and maybe we can determine what the first base is with data from the primer 1 reads from primers n and (n-1).

                Correct me if I am wrong, anyone.

                Comment

                • snetmcom
                  Senior Member
                  • Oct 2008
                  • 159

                  #38
                  Originally posted by deepak_bala View Post
                  Thanks for the reply. I was looking at it and maybe we can determine what the first base is with data from the primer 1 reads from primers n and (n-1).

                  Correct me if I am wrong, anyone.
                  the first base is given to you in your files, but AGAIN, you do not want to decode color space to read your sequence. Align your sequence in color space first.

                  Comment

                  • deepak_bala
                    Junior Member
                    • Dec 2009
                    • 7

                    #39
                    Originally posted by snetmcom View Post
                    the first base is given to you in your files, but AGAIN, you do not want to decode color space to read your sequence. Align your sequence in color space first.
                    Thanks for the pointer. I will.

                    Comment

                    • anago
                      Junior Member
                      • Nov 2009
                      • 1

                      #40
                      Hi All,

                      back to the chemistry of SOLiD. With mate-paired libraries after the 'first mate' comes an internal adaptor. Do I think right that the 'second mate' is sequenced the same way but with primers matching with the internal adaptor?

                      Anago

                      Comment

                      • pmiguel
                        Senior Member
                        • Aug 2008
                        • 2328

                        #41
                        Originally posted by anago View Post
                        Hi All,

                        back to the chemistry of SOLiD. With mate-paired libraries after the 'first mate' comes an internal adaptor. Do I think right that the 'second mate' is sequenced the same way but with primers matching with the internal adaptor?

                        Anago
                        Yes, the "R3" mate-pair reads are primed out of the internal adaptor.

                        --
                        Phillip

                        Comment

                        • samanta
                          Senior Member
                          • Feb 2010
                          • 108

                          #42
                          Color space

                          Hello all,

                          I wrote this up on color space - nucleotide space conversion and added few Perl scripts to help you proceed.



                          Please feel free to comment.

                          Manoj
                          http://homolog.us

                          Comment

                          • lily Michelle
                            Junior Member
                            • Jul 2010
                            • 1

                            #43
                            Hi all,

                            Is the flurosence attached to the base or the phosphate group?

                            thanks

                            lily

                            Comment

                            • drambald
                              Junior Member
                              • Aug 2010
                              • 2

                              #44
                              May be is a stupid question but...

                              Hello, I a the following question, regarding the use of AB Solid for ChIP-seq analysis.

                              To my understanding:

                              we tag a protein with an antibody, dismantle the cells, denaturate and sonicate the DNA, collect the fragments that were attached to the protein which was attached to the antibody and sequence them.

                              Problem is: the fragments from sonication should be 100-500 bp long, we only sequence 50 bases at each read: when and how does this further "reduction" take place ?

                              What should happen is:
                              1. The fragments are attached to beads
                              2. Amplification takes place on attached fragments
                              3. The bead ends up looking like an octopus with N copies of the same fragment
                              4. Those copies are actually sequenced, but due to technical limitations only the first 50 bases can be read

                              is this correct?

                              best regards

                              Comment

                              • cmebai
                                Junior Member
                                • Aug 2010
                                • 1

                                #45
                                Firstly ,thanks for your time of explaining the wonderful tech.And i have a problem ,that is :
                                At the step of primer reset is it the same strand to be sequenced ?

                                Comment

                                Latest Articles

                                Collapse

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, 06-05-2026, 10:09 AM
                                0 responses
                                11 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-04-2026, 08:59 AM
                                0 responses
                                23 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                28 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                22 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...