  • thickrick99
    Member
    • Jul 2014
    • 21

    Is SAMtools the right software for this project?

    Hi Everyone,

    Here is a brief summary of what I am trying to do with my project: essentially, I want to find out how a mutation in the mRNA sequence affects the amino acid sequence of a protein. I have whole exome sequencing data as a .sam file, and I am interested in extracting the flanking sequence, X nucleotides upstream and X nucleotides downstream of a specific mutation site. From there, I want to determine the amino acid sequence encoded by that flanking sequence, and it has to be in the same reading frame as the original sequence.

    Here are a few questions that I had in terms of using SAMtools and accomplishing these tasks:

    1) I assume I need to find the consensus sequence for the reads in my whole exome sequencing data. How would I do this with SAMtools? I found the mpileup command, but what would the reference FASTA file be in my case? Is finding the consensus even needed?

    2) My main issue is going from the .sam file reads to being able to pinpoint the location of interest and get the flanking sequence. What do I need to do to process the .sam exome sequencing file to be able to determine the flanking sequence?

    3) Once I find the flanking sequence, how do I figure out the amino acid sequence and adjust it to make sure it is in frame?

    4) How do I account for the multiple transcripts that may exist for a particular gene because of alternative splicing?

    Sorry for all the questions; it is my first time working in this area. I appreciate any help! Thanks in advance!
  • colindaven
    Senior Member
    • Oct 2008
    • 417

    #2
    There are tools for this - snpEff and Annovar are popular.

    snpEff is quite simple too. Your SNP will need to be in an _annotated_ mRNA seq of course.

    Input is VCF format.


    • thickrick99
      Member
      • Jul 2014
      • 21

      #3
      Thanks for the response! Both of these tools seem helpful, but do they also output the sequence of the flanking region and the altered amino acid sequence? I quickly looked through snpEff and Annovar, and it seems the tools only tell you what the impact of a SNP is or what the amino acid change is. I'm interested not only in determining the amino acid change, but also in using the mutated amino acid sequence around the SNP for further analysis.

      Do you know if these tools/other tools are able to accomplish this?
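
For the in-frame translation step asked about in this thread, here is a minimal sketch rather than a definitive method. It assumes you already have the spliced coding sequence (CDS) of one chosen transcript on the coding strand, that the mutation is a single-base substitution, and that its position is given as a 0-based offset within that CDS. The function names (`translate`, `in_frame_window`, `mutant_peptides`) and the example sequence are hypothetical, made up for illustration; nothing here comes from SAMtools, snpEff or Annovar.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate a codon-aligned DNA string; a trailing partial codon is ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def in_frame_window(cds, pos, flank):
    """Return (start, end) covering pos +/- flank, widened to codon boundaries."""
    start = max(0, pos - flank)
    start -= start % 3                       # move left to the start of a codon
    end = min(len(cds), pos + flank + 1)
    end += (-end) % 3                        # move right to the end of a codon
    end = min(end, len(cds) - len(cds) % 3)  # never run past the last full codon
    return start, end


def mutant_peptides(cds, pos, alt, flank):
    """Return (reference_peptide, mutant_peptide) for the in-frame window."""
    cds = cds.upper()
    mutated = cds[:pos] + alt.upper() + cds[pos + 1:]
    start, end = in_frame_window(cds, pos, flank)
    return translate(cds[start:end]), translate(mutated[start:end])


# Hypothetical toy CDS: ATG GCC GAA GTT CTG AAA TAA
ref_pep, mut_pep = mutant_peptides("ATGGCCGAAGTTCTGAAATAA", pos=7, alt="T", flank=4)
print(ref_pep, mut_pep)
```

Because the window is snapped outward to codon boundaries, the returned peptide can cover slightly more than the requested flank. Handling several transcripts would mean repeating this per transcript CDS, since the offset of the mutation within the CDS differs between isoforms.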
