Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • output samtools.pl

    Dear users,
    I used BFAST program to generate a file .sam (using microRNA reads from SOLiD ABI).
    I run SAMTOOLS to analyse SNPs and mutations, and I generated at the end two output files:
    a pileup file and a file .snps (by samtools.pl varFilter).

    This is an example of my file .snps:

    g hsa-let-7a-3 61 * */+A 73 73 40 81 * +A 79 2 0 0 0
    g hsa-let-7a-3 63 * */-C 133 133 45 79 * -C 76 1 2 3 1
    g hsa-let-7a-3 64 * */-T 165 165 44 78 * -T 77 1 0 0 0
    g hsa-let-7a-3 68 * */-T 385 385 40 53 * -T 52 1 0 0 0
    g hsa-let-7a-3 70 * */+A 78 78 35 43 * +A 42 1 0 0 0
    D hsa-let-7b 22 * */+A 895 895 22 1717 * +A 1652 36 29 7 37
    D hsa-let-7b 24 * */+TC 14761 14761 23 1728 * +TC 1573 51 104 7 4
    D hsa-let-7b 25 * +TA/+CTA 17955 37444 23 1730 +TA +CTA 32 17 1681 35 18
    D hsa-let-7b 26 * +A/+TA 21497 46739 23 1732 +A +TA 393 162 1177 40 29
    D hsa-let-7b 27 * +A/* 31077 31077 23 1733 +A * 6 1651 76 4 5
    D hsa-let-7b 28 T Y 228 228 23 1736 ,,.+1C,,$C.C...+4ATTC.+4ATTC*.+4ATTCC,.+1C.,...+4ATTCC..+1CCC.+1CC.+4ATTC
    .+1C.+4ATTC.+1C..+4ATTC.+4ATTC.+4ATTCC.+1C*.+4ATTCC.+1CC.+4ATTCC.+1C...+1CCC.+1C.+4ATTCC..+1CC.+1C.+1C.+1C.+1CC.+1C.+4ATTCC.+1C.+1C...+4ATTC.+1C.C..+1C.C
    ..+1C..+1C..+1C.....+4ATTCC.+4ATTCC..+4ATTC..C.+4ATTC.C.+1C.+4ATTCC.+1C.+4ATTCC...+4ATTC.C.+4ATTC.+1C......C.+4ATTC..+1C.+4ATTC.+1C.C.+4ATTC.+1CCCC.+1C.+
    1C.+4ATTC.+1C.+4ATTC.+1C.+4ATTC.+4ATTC.+4ATTC.+1C.+1CC.+4ATTC.+1CCC.+1CC.+1C.+1CC.C.+1CC.+1C.+1C.+4ATTC.+4ATTC..+1C.+1C.+1CCC.+4ATTCC.+1CC.+1C...C.+1C.+1
    CC.+4ATTC..+1C.+1C.+1C.+1C*.+4ATTC..+4ATTC.+1C.+1C.+1C.+1C.+1CC.CC.+4ATTC.+1C.+1CC..+1C..+1C.+1C..+1C.+4ATTC.+1C.+1C.+1C...+1C...+1C.A.+4ATTC.+4ATTC.-1C.
    +3CGC.+2TC.+1C.+1CC.+1C.+4ATTC..+4ATTC.+4ATTCC..+1C.+1C.C.CC.+4ATTCC.+4ATTC.+1C.+1C.+4ATTC.+1CC.C.+1C.+4ATTC.+4ATTC.+4ATTC.+1CC.+4ATTCC.+4ATTC.+4ATTC.+1C
    .+1CCC.+1C.+1C.+1C.+1C.+1CC.+4ATTC.+1CCC.+1CCCC.+1C.+1C.+4ATTC.+1CC.+1C..+4ATTC.CC.+1C.+4ATTC.+1C.CC.+1C.C.+1C.+1C.+1CC.+1CC.+1C.+1C.+1C.+4ATTC.+4ATTC.+1
    C.+4ATTCC.+1C.+1C.+1C.+1C.+1CC.+5CATTC.+4ATTC.+4ATTCCC.+1C.+1C.+1CC..C.+4ATTC.+1C..+1C..+1C.+1CC.+1C*.+1CCC.+4ATTC.+4ATTC..+1C.+1C..+1C..C.+4ATTC.+4ATTC.
    +4ATTC.C.+4ATTC.+1CC..+1C.+1CC.+4ATTC.+1C.+1C.CCC.+1C.+1C.+4ATTC.+4ATTC.+1C.+1CC..-1CC.+1C.+1C.+1C..+1C.+1C..+1C*...+1CC.+1C...+4ATTC.+1C.+1CC..+4ATTC..+
    1C...+1CC.+1C.+1C..C.+1C.-2CACC..C..+4ATTCA.+1C..+1C.+4ATTC..+1C.+1C..+1CC.+4ATTC.+4ATTC.+1C.+1C..+6GTATTCC.C.+4ATTC.+1C..+1C.+4ATTC.+4ATTC.+4ATTCC.


    But I don't know how I can read this file. What correspond to each column?
    How or where can I find the number of reads (the number of count of miRNA in the alignment)?

    Thanx a lot!
    Bye
    M.Elena

  • #2
    Originally posted by m_elena_bioinfo View Post
    Dear users,
    I used BFAST program to generate a file .sam (using microRNA reads from SOLiD ABI).
    I run SAMTOOLS to analyse SNPs and mutations, and I generated at the end two output files:
    a pileup file and a file .snps (by samtools.pl varFilter).

    This is an example of my file .snps:

    g hsa-let-7a-3 61 * */+A 73 73 40 81 * +A 79 2 0 0 0
    g hsa-let-7a-3 63 * */-C 133 133 45 79 * -C 76 1 2 3 1
    g hsa-let-7a-3 64 * */-T 165 165 44 78 * -T 77 1 0 0 0
    g hsa-let-7a-3 68 * */-T 385 385 40 53 * -T 52 1 0 0 0
    g hsa-let-7a-3 70 * */+A 78 78 35 43 * +A 42 1 0 0 0
    D hsa-let-7b 22 * */+A 895 895 22 1717 * +A 1652 36 29 7 37
    D hsa-let-7b 24 * */+TC 14761 14761 23 1728 * +TC 1573 51 104 7 4
    D hsa-let-7b 25 * +TA/+CTA 17955 37444 23 1730 +TA +CTA 32 17 1681 35 18
    D hsa-let-7b 26 * +A/+TA 21497 46739 23 1732 +A +TA 393 162 1177 40 29
    D hsa-let-7b 27 * +A/* 31077 31077 23 1733 +A * 6 1651 76 4 5
    D hsa-let-7b 28 T Y 228 228 23 1736 ,,.+1C,,$C.C...+4ATTC.+4ATTC*.+4ATTCC,.+1C.,...+4ATTCC..+1CCC.+1CC.+4ATTC
    .+1C.+4ATTC.+1C..+4ATTC.+4ATTC.+4ATTCC.+1C*.+4ATTCC.+1CC.+4ATTCC.+1C...+1CCC.+1C.+4ATTCC..+1CC.+1C.+1C.+1C.+1CC.+1C.+4ATTCC.+1C.+1C...+4ATTC.+1C.C..+1C.C
    ..+1C..+1C..+1C.....+4ATTCC.+4ATTCC..+4ATTC..C.+4ATTC.C.+1C.+4ATTCC.+1C.+4ATTCC...+4ATTC.C.+4ATTC.+1C......C.+4ATTC..+1C.+4ATTC.+1C.C.+4ATTC.+1CCCC.+1C.+
    1C.+4ATTC.+1C.+4ATTC.+1C.+4ATTC.+4ATTC.+4ATTC.+1C.+1CC.+4ATTC.+1CCC.+1CC.+1C.+1CC.C.+1CC.+1C.+1C.+4ATTC.+4ATTC..+1C.+1C.+1CCC.+4ATTCC.+1CC.+1C...C.+1C.+1
    CC.+4ATTC..+1C.+1C.+1C.+1C*.+4ATTC..+4ATTC.+1C.+1C.+1C.+1C.+1CC.CC.+4ATTC.+1C.+1CC..+1C..+1C.+1C..+1C.+4ATTC.+1C.+1C.+1C...+1C...+1C.A.+4ATTC.+4ATTC.-1C.
    +3CGC.+2TC.+1C.+1CC.+1C.+4ATTC..+4ATTC.+4ATTCC..+1C.+1C.C.CC.+4ATTCC.+4ATTC.+1C.+1C.+4ATTC.+1CC.C.+1C.+4ATTC.+4ATTC.+4ATTC.+1CC.+4ATTCC.+4ATTC.+4ATTC.+1C
    .+1CCC.+1C.+1C.+1C.+1C.+1CC.+4ATTC.+1CCC.+1CCCC.+1C.+1C.+4ATTC.+1CC.+1C..+4ATTC.CC.+1C.+4ATTC.+1C.CC.+1C.C.+1C.+1C.+1CC.+1CC.+1C.+1C.+1C.+4ATTC.+4ATTC.+1
    C.+4ATTCC.+1C.+1C.+1C.+1C.+1CC.+5CATTC.+4ATTC.+4ATTCCC.+1C.+1C.+1CC..C.+4ATTC.+1C..+1C..+1C.+1CC.+1C*.+1CCC.+4ATTC.+4ATTC..+1C.+1C..+1C..C.+4ATTC.+4ATTC.
    +4ATTC.C.+4ATTC.+1CC..+1C.+1CC.+4ATTC.+1C.+1C.CCC.+1C.+1C.+4ATTC.+4ATTC.+1C.+1CC..-1CC.+1C.+1C.+1C..+1C.+1C..+1C*...+1CC.+1C...+4ATTC.+1C.+1CC..+4ATTC..+
    1C...+1CC.+1C.+1C..C.+1C.-2CACC..C..+4ATTCA.+1C..+1C.+4ATTC..+1C.+1C..+1CC.+4ATTC.+4ATTC.+1C.+1C..+6GTATTCC.C.+4ATTC.+1C..+1C.+4ATTC.+4ATTC.+4ATTCC.


    But I don't know how I can read this file. What correspond to each column?
    How or where can I find the number of reads (the number of count of miRNA in the alignment)?

    Thanx a lot!
    Bye
    M.Elena
    I assume you are having problems understanding the samtools pileup output and BFAST ran fine. Take a look at these two pages on the samtools website that explain the output format:


    Comment


    • #3
      I guess you are using "samtools.pl -p" which explains why your variants get filtered out. The first letter gives the reason:

      # d low depth
      # D high depth
      # W too many SNPs in a window (SNP only)
      # G close to a high-quality indel (SNP only)
      # Q low RMS mapping quality (SNP only)
      # g close to another indel with higher quality (indel only)

      Note that if a SNP is filtered out due to depth, it will not be tested with "G".

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Advancing Precision Medicine for Rare Diseases in Children
        by seqadmin




        Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
        12-16-2024, 07:57 AM
      • seqadmin
        Recent Advances in Sequencing Technologies
        by seqadmin



        Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

        Long-Read Sequencing
        Long-read sequencing has seen remarkable advancements,...
        12-02-2024, 01:49 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 12-17-2024, 10:28 AM
      0 responses
      33 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-13-2024, 08:24 AM
      0 responses
      48 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-12-2024, 07:41 AM
      0 responses
      34 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-11-2024, 07:45 AM
      0 responses
      46 views
      0 likes
      Last Post seqadmin  
      Working...
      X