Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Software for variant frequency

    Hello all,

    I used to be able to use CLC for my SNP calling, which generates the reference base and the frequency of the variant allele in separate columns. I no longer have access to this program and have been using bwa-samtools. I have all of my files in bam format and have been generating SAM pileup tables using the -vcf option. The problem is it gives you the variant allele and consensus allele (not reference allele) and the phred scaled quality of the SNP, not its frequency. Is there a better software to use in order to get this information? Or is there a way to get this information from samtools?
    sasignor is online now Report Post Edit/Delete Message

  • #2
    Samtools pileup gives you the reference allele, consensus call, and all bases (with strandedness) among other things.

    I suggest you take a look at these pages:

    Comment


    • #3
      Ah, yes, the part about the reference allele was a typo. What I mean is I was looking for them all to be generated in separate columns - the variant, and the reference, not as part of a the read bases, as well as the frequency of the alleles as opposed to the SNP quality scores. I think something like VarScan is what I was looking for, although I am still interested as to what other people use to generate this type of output.

      Comment


      • #4
        Originally posted by sasignor View Post
        Ah, yes, the part about the reference allele was a typo. What I mean is I was looking for them all to be generated in separate columns - the variant, and the reference, not as part of a the read bases, as well as the frequency of the alleles as opposed to the SNP quality scores. I think something like VarScan is what I was looking for, although I am still interested as to what other people use to generate this type of output.

        http://varscan.sourceforge.net/using-varscan.html
        A few lines of code using the read bases column conditioned on the variant call and reference allele would do it. You have all the information.

        Comment


        • #5
          Here is a script I wrote for doing this. Pass the script the name of your pileup file (created with -vcf and any other filtering) and it will output the following (as tab delimited columns):

          1. Chromosome
          2. Position
          3. SNP (Ref > Alt base(s))
          4. Consensus quality
          5. Read depth
          6. Alt base fraction (Alt base/Read depth)
          7. Alt base %
          Attached Files

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          59 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          57 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          53 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          56 views
          0 likes
          Last Post seqadmin  
          Working...
          X