Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • rbagnall
    Member
    • Jun 2010
    • 34

    #31
    and ANNOVAR



    and SeattleSeq Annotation



    both for annotation of SNP variants. ie. Nonsense, synonymous, splice, intronic, found in 1000 genomes, or HapMap frequency, which gene, amino acid change (from and too, with position) etc.

    Comment

    • Balat
      Member
      • May 2010
      • 36

      #32
      Hi,
      Most of the tools related to variant annotations seem to be related to analysing human data. I work on a plant species where we only have the draft genome and not annotated yet. I have done an rna-seq experiment and I have found several snps using samtools. I got a gtf file by aligning reads against the genome sequence with bowtie and tophat. The gtf file only has transcirpt information and no orf or CDS information. Does any one have a script which takes the positions of the snps, annotations from a gtf file and the genome sequence in fasta format and predict if the snps are synonymous or non-synonymous?

      Comment

      • qqcandy
        Member
        • Sep 2008
        • 15

        #33
        Does Dindel also do anchor-split mapping, as Pindel? Or the indels discovered by Dindel has to be supported by at least one mappable reads by the aligners, such as bwa/novoalign?

        Comment

        • krobison
          Senior Member
          • Nov 2007
          • 734

          #34
          GAMES (according to the paper) uses MySQL queries to go against UCSC; presumably you could easily reroute that to your own data.

          A quick look at ANNOVAR suggests it is all flat file based; create your own flat files in the right format and it should work.

          Comment

          • lh3
            Senior Member
            • Feb 2008
            • 686

            #35
            Originally posted by qqcandy View Post
            Does Dindel also do anchor-split mapping, as Pindel? Or the indels discovered by Dindel has to be supported by at least one mappable reads by the aligners, such as bwa/novoalign?
            You'd better run Pindel first and then Dindel.

            Comment

            • Michael.James.Clark
              Senior Member
              • Apr 2009
              • 207

              #36
              Hi Heng,

              Originally posted by lh3 View Post
              3. Cap base quality BAQ (with samtools).
              What do you mean by "cap base quality"?

              Can you give more detailed suggestions about how to effectively use BAQ?

              We would like to include it in our pipeline but are unsure about how best to utilize it.

              Thanks.
              Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
              Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
              Projects: U87MG whole genome sequence [Website] [Paper]

              Comment

              • drio
                Senior Member
                • Oct 2008
                • 323

                #37
                Originally posted by Michael.James.Clark View Post
                Hi Heng,
                What do you mean by "cap base quality"?

                Can you give more detailed suggestions about how to effectively use BAQ?

                We would like to include it in our pipeline but are unsure about how best to utilize it.
                Thanks.
                Here.
                -drd

                Comment

                • Michael.James.Clark
                  Senior Member
                  • Apr 2009
                  • 207

                  #38
                  Originally posted by drio View Post
                  Woosh, thanks. For some reason I was thinking there was more to it (probably because I've been doing half a dozen different approaches each with ten times as many parameters than the BAQ step).
                  Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
                  Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
                  Projects: U87MG whole genome sequence [Website] [Paper]

                  Comment

                  • epigen
                    Senior Member
                    • May 2010
                    • 101

                    #39
                    usage of samtools calmd for calculating BAQ

                    Originally posted by drio View Post
                    I just did what was recommended "here" and got the SVN version of samtools that should contain the calmd that calculates BAQ. But now I'm very confused about the parameters when I call this new samtools_svn_816 calmd [Version: 0.1.9-10 (r816)]:

                    Usage: samtools fillmd [-eubrS] <aln.bam> <ref.fasta>

                    Options: -e change identical bases to '='
                    -u uncompressed BAM output (for piping)
                    -b compressed BAM output
                    -S the input is SAM with header
                    -r read-independent local realignment

                    i.e. the same as in Version 0.1.9 (r783)

                    Acoording to the manual, there is a difference (highlighted in red):

                    calmd samtools calmd [-eubSr] [-C capQcoef] <aln.bam> <ref.fasta>

                    OPTIONS:
                    -e Convert a the read base to = if it is identical to the aligned reference base. Indel caller does not support the = bases at the moment.
                    -u Output uncompressed BAM
                    -b Output compressed BAM
                    -S The input is SAM with header lines
                    -C" INT" Coefficient to cap mapping quality of poorly mapped reads. See the pileup command for details. [0]
                    -r Perform probabilistic realignment to compute BAQ, which will be used to cap base quality.

                    Can anyone enlighten me whether just the usage message is not up to date or whether there is another version in the SVN?

                    Thank you in advance

                    Barbara

                    Comment

                    • Bruins
                      Member
                      • Feb 2010
                      • 78

                      #40
                      Originally posted by Michael.James.Clark View Post
                      What tools do people use for coding consequence determination after all of this?
                      This morning Dongliang Ge demonstrated his SequenceVariantAnalyzer at our institute. Sounds promising. It has very nice viewing functionality, but uses lots of memory.

                      I am not the person doing these analyses. People here use PolyPhen, SeattleSeq, Sift, I don't know what else.

                      Comment

                      • Michael.James.Clark
                        Senior Member
                        • Apr 2009
                        • 207

                        #41
                        Originally posted by Bruins View Post
                        This morning Dongliang Ge demonstrated his SequenceVariantAnalyzer at our institute. Sounds promising. It has very nice viewing functionality, but uses lots of memory.

                        I am not the person doing these analyses. People here use PolyPhen, SeattleSeq, Sift, I don't know what else.
                        We have used it and I liked it, but unfortunately when we tried it a couple months back, it only took hg18 and annotated SNPs using dbSNP127, which is a little outdated by now. Still a very nice program, though.
                        Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
                        Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
                        Projects: U87MG whole genome sequence [Website] [Paper]

                        Comment

                        Latest Articles

                        Collapse

                        • GATTACAT
                          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                          by GATTACAT
                          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                          07-01-2026, 11:43 AM
                        • SEQadmin2
                          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                          by SEQadmin2


                          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                          Here are nine questions we think about, in roughly the order they matter, before...
                          06-18-2026, 07:11 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, 07-02-2026, 11:08 AM
                        0 responses
                        17 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-30-2026, 05:37 AM
                        0 responses
                        18 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-26-2026, 11:10 AM
                        0 responses
                        21 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-17-2026, 06:09 AM
                        0 responses
                        54 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...