Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • extari
    Member
    • Mar 2011
    • 10

    CLC and SNP discovery

    Hi there,

    I am using CLC 4.5 software for doing a de novo assembly and SNP discovery.
    I have already perform my de novo assembly, and I have mapped my reads to contigs with the main objective of SNP discovery.
    This software (CLC) has the option "SNP Detection", I have done it and I have discovered SNPs (after filtering).
    Does anyone of all know how can I get my consensus sequences with the ambiguities in one fasta file, or txt format?

    I mean, imagine this sequence with this SNP (*), and these ambiguities (-)
    5` ... A G G G A T A G C C C G A T A G G G T A C A G G T T A ... 3`
    - * - -
    When I try to save my consensus sequence as a txt file, I can only save as one of these:
    a) if in the assembly selected VOTE (for ambiguities):
    5` ... A G G G A T A G C C C G A T A G G G T A C A G G T T A ... 3`

    b) if in the assembly selected Unknown (for ambiguities):
    5` ... A G G G N T A G C C N G A T N G G G T A N A G G T T A ... 3`

    c) if in the assembly selected Ambiguity (for ambiguities):
    5` ... A G G G A R A G C C C R A T M G G G T A Y A G G T T A ... 3`

    But the one I want to save is this one: 5` ... A G G G A T A G C C C R A T A G G G T A C A G G T T A ... 3` (with the only annotated SNP).

    Could you help me, please???


    Thank you!
  • flipwell
    Member
    • Feb 2011
    • 14

    #2
    I haven't tried it, but during SNP detection you can select to annotate the consensus with SNPs or not - if you choose vote for ambiguities during alignment then annotate with SNPs during detection I'm guessing it should give you the result you are after

    Comment

    • extari
      Member
      • Mar 2011
      • 10

      #3
      Hi ya,

      I have tried to do what you have told me, but that one is not the solution, look what happens when I do that (see the attachment).
      The SNP is annotated, but I cannt have the fasta file as ambiguities (R) or as [A/G], what is the thing I would like, there must be a way to do that, but, how?
      Attached Files

      Comment

      • NextGenSeq
        Senior Member
        • Apr 2009
        • 482

        #4
        That's a terrible alignment. You should increase the gap penalty.

        Comment

        • extari
          Member
          • Mar 2011
          • 10

          #5
          no, don't worry about the aligment, that was a bad example.
          the question is how can i have the fasta file with the polymorphic site?

          Comment

          • szilva
            Member
            • Aug 2009
            • 16

            #6
            If you point to the sequence name on the left side of the panel and right click on the name, you should get a menu like "Open Copy of Sequence" . Once you have got a copy, you can export that into FASTA.

            Comment

            • extari
              Member
              • Mar 2011
              • 10

              #7
              Yes, that is true, but the problem persists, in this way I cannot have the polymorphism annotation in the sequence (as "R" or "[A/G]") in the FASTA file.
              I don't know how to do this, I am desperate!!!

              Comment

              • sklages
                Senior Member
                • May 2008
                • 628

                #8
                what does CLC recommend? AFAIK their support is quite helpful.

                Comment

                • extari
                  Member
                  • Mar 2011
                  • 10

                  #9
                  In the user guide there is nothing about this issue.
                  I have sent them an e-mail, I am waiting for response. If I can solve this issue, I will tell you, but I think there is no way to do this...

                  Comment

                  • extari
                    Member
                    • Mar 2011
                    • 10

                    #10
                    CLC support has answered

                    Hello all,

                    CLC support has answered to my question in a e-mail. They have told me this: "I'm afraid there is no way to export fasta sequence with ambiguities or IUPAC letters from the CLC software. However, if you have chosen to annotate your Consensus, and/or your Reference sequence, you could then export to GFF or Genbank format. The SNP variants that have been called will appear in the annotations. At that point though, you would have to script something to generate files of the format that suits you needs."

                    So, there is no way to do this...

                    If anyone knows about these type of script, please, let me know.

                    Thank you!

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    16 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    49 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    108 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-05-2026, 10:09 AM
                    0 responses
                    125 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...