Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tool to deconvolve chromatogram?

    I posted the following at another forum and have not yet had a response. Is this what mpileup and/or GATK are for? Or do those analyze aligned data for indels and SNPs, rather than look at the raw data?
    ====
    I don't know if I'm using the proper terminology, but I have ab1 (sanger) sequencing chromatograms and I was wondering if there was any software out there that you were aware of to deconvolve two overlapped chimeric reads (e.g. overlaid indel-type mutations) and output two+ sequences as (fasta) output.

    For example, if I have a read that is unambiguously ACTGGCGA but then I have 60% population where I have an A and 40% where that A is deleted, followed by GCGTGA, phred will likely give me ACTGGCGAAGCGTGA, but is there a way for a basecaller to give me ACTGGCGAGCGTGA as a secondary call? Or, if I have 50/50 and I know that one version is ACTGGCGAAGCGTGA, supplying that as a comparison file for subtraction, leaving me with a residual signal of either the full ACTGGCGAGCGTGA or of GCGTGAx?

    Obviously I would have to be able to set a threshhold where I consider the result "noise" - either a fixed value of signal strength or e.g. 5% of the main peak strength in order to filter out illegitimate base calls of non-chimeric sequence.

    As a separate, but related, issue, is there a way to get phred (or any other program) to "filter" noise spikes in chromatograms? For example, sometimes I see spikes in pyrimidine signal strength that is way out of proportion to legitimate regions of call (the peak height using an ab1 viewer like bioedit or consed - I'm not sure if linear or log scale - is well over double of any nearby base or even any other place in the file. These typically have "width" of about 5 base calls)

  • #2
    There are numerous programs in the Sanger world to solve this; Mutation Surveyor is one I have used but there is also polyphred. Most commercial packages for Sanger data should attempt to do this.

    To answer your original question, GATK, mpileup and the like don't work directly with chromatograms, though if you convert them to FASTQ format they can -- but then you have lost the information in this case you are looking for.

    This underscores a key difference between Sanger and the newer technologies -- Sanger can generate a signal from a mixed population which you can attempt to deconvolve. The newer technologies are all looking at individual molecules (sometimes by amplifying them into a population to do so), and so this form of deconvolution just isn't relevant.

    Comment


    • #3
      Thanks!
      I'll try out polyphred, given that I have the package that it interfaces with.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Advancing Precision Medicine for Rare Diseases in Children
        by seqadmin




        Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
        12-16-2024, 07:57 AM
      • seqadmin
        Recent Advances in Sequencing Technologies
        by seqadmin



        Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

        Long-Read Sequencing
        Long-read sequencing has seen remarkable advancements,...
        12-02-2024, 01:49 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 12-17-2024, 10:28 AM
      0 responses
      33 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-13-2024, 08:24 AM
      0 responses
      49 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-12-2024, 07:41 AM
      0 responses
      34 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-11-2024, 07:45 AM
      0 responses
      46 views
      0 likes
      Last Post seqadmin  
      Working...
      X