Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #31
    There is a setting in SOAPdenovo that I thought had some influence on this, used when you run 'SOAPdenovo contig' separately.

    -M mergeLevel(default 1,min 0, max 3): the strength of merging similar sequences during contiging

    However, when I experimented with different values it made no difference on the contig assembly results....not sure if it did anything with the 'consensus' base, probably not.

    If you search for 'bubbles' in the Abyss, Velvet and CLC documentation you will find a lot more detail on how they deal with SNPs.

    Comment


    • #32
      A 454 - SSAHA approach

      Just to throw in on the conversation, I pooled genomic DNA from 18 individuals, cut with a 4 base cutter, and sequenced a 15bp size fraction with two full runs of 454 reads (250bp). I assembled them gsAssembler which produced an average 20 reads per contig. Then I mapped the individual reads back to the contig consensus sequences using SSAHA2 and used the SSAHA_pipeline to call SNPs. It worked pretty well - wound up with about 8000 SNPs I could believe in, and the validation rate was about 95%. The predicted allele frequency was strongly correlated (>0.8) with the real allele frequency in the donors. My goal was just basic SNP discovery in a novel species and it fit the bill.

      Caveats - Beware of minor allele freqs near 0.5 which could arise from alignment of reads from duplicated loci; Screen out short tandem repeats because STR allelic differences in the alignment can cause false positive SNPs; Loci with only 4 mapped reads (minimum 2 reads per allele) may be useful but don't count on them.

      Comment


      • #33
        A piece of information,

        We developed a tool, called kisSnp that takes two sets of non assembled raw short reads and compare them for finding SNPs between these two sets.
        It outputs the SNPs with small flanking regions.
        It uses light memory and run in short time.

        All info and download can be found on the dedicated website: http://alcovna.genouest.org/

        Enjoy ! (remarks and comments are welcome)

        Comment


        • #34
          I checked your site quickly, it's very interesting.

          I do have a question though, without a reference won't you be missing all the homozygous variations?

          Also you need long enough reads to generate flanks no, anything smaller dans 50 even 75 wouldn't ne long enough.

          Or am I missing something.

          Comment


          • #35
            With the current version we detect only SNPs between individuals. One compares two set of reads, focusing on small substitutions that may be those SNPs.

            We are currently working on a version intra-individual, that will enable to detect heterozygous SNP of one individual.

            This may be done avoiding the use of a reference genome, if the coverage is sufficient.
            Reads of length 50 to 75 are indeed long enougth.

            Pierre

            Comment


            • #36
              DIAL by Dr. Ratan for SNP without reference genome

              Hi, Everyone:

              I am trying to use DIAL without success for unknown reason, even following exact instructions. So I am wondering if anyone in our community is using the DIAL to get SNP and sharing some experience. I contacted Dr. Ratan at Penn State, but got no response. Any comments on DIAL?

              I have a 454 sequencing run of 8 samples with barcodes each and got individual .sff file. When I perform DIAL by adding each .sff file, it worked sometime, and some time not working. I tested it with the supplied data and it worked for Adding but not working with Update (it returns with $ without error, but I check ps showing no such task).

              Comment


              • #37
                What version of newbler are you using? I tried DIAL and it would very specifically only work with v2.0 and nothing later.

                Comment


                • #38
                  I did give it a trial at 2.0 version by changing the newbler path in my .profile. What I got when I performed DIAL add is: Errors: unable to open sff file. SRR000375.sff (which is one of the test sff file).
                  Last edited by ybfu; 12-06-2010, 01:53 PM.

                  Comment


                  • #39
                    Hi everybody, I wondering to introduce and share the GBS-SNP-CROP:a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by- sequencing data.
                    Recently published on BMC Bioinformatics, this methodology could be useful for population genomic studies in model and non model organism when or not a reference genome is available.

                    Please see the GBS-SNP-CROP GitHub page for more details and UserManual:
                    GBS SNP Calling Reference Optional Pipeline. Contribute to halelab/GBS-SNP-CROP development by creating an account on GitHub.


                    Best regards,
                    Arthur Melo

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Strategies for Sequencing Challenging Samples
                      by seqadmin


                      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                      03-22-2024, 06:39 AM
                    • seqadmin
                      Techniques and Challenges in Conservation Genomics
                      by seqadmin



                      The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                      Avian Conservation
                      Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                      03-08-2024, 10:41 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 06:37 PM
                    0 responses
                    7 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 06:07 PM
                    0 responses
                    7 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-22-2024, 10:03 AM
                    0 responses
                    49 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-21-2024, 07:32 AM
                    0 responses
                    66 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X