Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • CLC and SNP discovery

    Hi there,

    I am using CLC 4.5 software for doing a de novo assembly and SNP discovery.
    I have already perform my de novo assembly, and I have mapped my reads to contigs with the main objective of SNP discovery.
    This software (CLC) has the option "SNP Detection", I have done it and I have discovered SNPs (after filtering).
    Does anyone of all know how can I get my consensus sequences with the ambiguities in one fasta file, or txt format?

    I mean, imagine this sequence with this SNP (*), and these ambiguities (-)
    5` ... A G G G A T A G C C C G A T A G G G T A C A G G T T A ... 3`
    - * - -
    When I try to save my consensus sequence as a txt file, I can only save as one of these:
    a) if in the assembly selected VOTE (for ambiguities):
    5` ... A G G G A T A G C C C G A T A G G G T A C A G G T T A ... 3`

    b) if in the assembly selected Unknown (for ambiguities):
    5` ... A G G G N T A G C C N G A T N G G G T A N A G G T T A ... 3`

    c) if in the assembly selected Ambiguity (for ambiguities):
    5` ... A G G G A R A G C C C R A T M G G G T A Y A G G T T A ... 3`

    But the one I want to save is this one: 5` ... A G G G A T A G C C C R A T A G G G T A C A G G T T A ... 3` (with the only annotated SNP).

    Could you help me, please???


    Thank you!

  • #2
    I haven't tried it, but during SNP detection you can select to annotate the consensus with SNPs or not - if you choose vote for ambiguities during alignment then annotate with SNPs during detection I'm guessing it should give you the result you are after

    Comment


    • #3
      Hi ya,

      I have tried to do what you have told me, but that one is not the solution, look what happens when I do that (see the attachment).
      The SNP is annotated, but I cannt have the fasta file as ambiguities (R) or as [A/G], what is the thing I would like, there must be a way to do that, but, how?
      Attached Files

      Comment


      • #4
        That's a terrible alignment. You should increase the gap penalty.

        Comment


        • #5
          no, don't worry about the aligment, that was a bad example.
          the question is how can i have the fasta file with the polymorphic site?

          Comment


          • #6
            If you point to the sequence name on the left side of the panel and right click on the name, you should get a menu like "Open Copy of Sequence" . Once you have got a copy, you can export that into FASTA.

            Comment


            • #7
              Yes, that is true, but the problem persists, in this way I cannot have the polymorphism annotation in the sequence (as "R" or "[A/G]") in the FASTA file.
              I don't know how to do this, I am desperate!!!

              Comment


              • #8
                what does CLC recommend? AFAIK their support is quite helpful.

                Comment


                • #9
                  In the user guide there is nothing about this issue.
                  I have sent them an e-mail, I am waiting for response. If I can solve this issue, I will tell you, but I think there is no way to do this...

                  Comment


                  • #10
                    CLC support has answered

                    Hello all,

                    CLC support has answered to my question in a e-mail. They have told me this: "I'm afraid there is no way to export fasta sequence with ambiguities or IUPAC letters from the CLC software. However, if you have chosen to annotate your Consensus, and/or your Reference sequence, you could then export to GFF or Genbank format. The SNP variants that have been called will appear in the annotations. At that point though, you would have to script something to generate files of the format that suits you needs."

                    So, there is no way to do this...

                    If anyone knows about these type of script, please, let me know.

                    Thank you!

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Advancing Precision Medicine for Rare Diseases in Children
                      by seqadmin




                      Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                      12-16-2024, 07:57 AM
                    • seqadmin
                      Recent Advances in Sequencing Technologies
                      by seqadmin



                      Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                      Long-Read Sequencing
                      Long-read sequencing has seen remarkable advancements,...
                      12-02-2024, 01:49 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 12-17-2024, 10:28 AM
                    0 responses
                    26 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-13-2024, 08:24 AM
                    0 responses
                    42 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-12-2024, 07:41 AM
                    0 responses
                    28 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-11-2024, 07:45 AM
                    0 responses
                    42 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X