Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • annotation of local sequence "uniqueness"

    Hi all-- I am attempting to filter out false "SNPs" in my bowtie assemblies (yeast genomic DNA). One major predictor of SNP falseness is being located in a regions that shows sequence similarity with somewhere else in the genome (e.g., paralogs, telomeres).

    I've attempted to limit the read mismapping that underlies these SNPs by tweaking the bowtie parameters, but I still get quite a few. So an alternative would be to ask, for each region containing a putative SNP, what is the local uniqueness?

    Does anyone know of a previously generated track that catalogs this property? Or better still, a tool that would perform a sliding window analysis (probably by parsing BLAST or BLAT) to record it for any given sequence?

    Thanks!

  • #2
    Wouldn't you be able to infer true positives based on the depth and nucleotide ratios at that position? '

    Since you have the SAM/BAM output you could try varscan (http://varscan.sourceforge.net/) or the samtools pipeline (http://samtools.sourceforge.net/mpileup.shtml). You can filter these SNPs with these tools and they both provide nucleotide resolution for each putative SNP. More so the samtools pipeline, but I prefer varscan sine it is a bit cleaner.

    Comment


    • #3
      Hi Twaddlac--

      Thanks for the reply. I don't think depth can get me all the way. Sometimes the depth is lower than "regular" sequence, but not always. It can also look normal or be higher (which would make sense if reads from two locations in the genome are mapping to the same place). I must tolerate the possibility of real heterozygotes so allele frequency can only do so much, also.

      I am already using SAMtools, but I'll check out varscan, Thanks!

      Comment


      • #4
        The term "mappability" has emerged to refer to this - using this as a search term is likely to yield results. There are a few tools that compute mappability measures, for example GEM: http://sourceforge.net/apps/mediawik...ility_man_page.

        Comment


        • #5
          thanks gaffa, the right search term is exactly what I needed and I didn't know 'mappability'... something definitely wasn't right about "uniqueness". Thanks!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM
          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          25 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          28 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          24 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          52 views
          0 likes
          Last Post seqadmin  
          Working...
          X