  • low percentage of reads mapped

    Hello,

    I am analyzing human SOLiD SAGE data with the SOLiD SAGE Analysis tool. I performed the mapping using a length of 27 bp with 1 mismatch allowed. The reference was the complete set of human mRNA sequences from the RefSeq database. I then calculated the percentage of reads that mapped to the reference, using the results file that lists the tags and their corresponding read files. I ran the analysis on four SAGE data sets and got the following percentages:
    SAGE A: approx. 15%; SAGE B: approx. 16%; SAGE C: 17%; SAGE D: 20%

    What could explain such a low percentage of reads mapping to the human mRNA reference? Is this a typical result for most SOLiD SAGE experiments?

    Thanks
    Last edited by rahilsethi; 09-09-2010, 10:46 AM.

  • #2
    BLAT some of the unmapped reads to see what they are. If they are actually real mRNA, it's your analysis. If they are genomic DNA or something else, it's your library.
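
One way to pick the reads for such a spot check is sketched below in Python. It is only an illustration under assumptions: the function name `sample_unmapped` and the toy read IDs are hypothetical, and loading the IDs from the actual read and result files is left out. SOLiD reads are typically stored in color space, so converting the sampled reads to base space before running BLAT would be a separate step that is not shown here.

```python
import random


def sample_unmapped(all_read_ids, mapped_read_ids, n, seed=0):
    """Return up to n read IDs that are in all_read_ids but not in mapped_read_ids.

    Hypothetical helper: the IDs are assumed to be plain strings that have
    already been loaded from the read file and the mapping results.
    """
    # Reads that never show up among the mapped IDs; sorted so that the
    # sample is reproducible for a given seed.
    unmapped = sorted(set(all_read_ids) - set(mapped_read_ids))
    rng = random.Random(seed)
    return rng.sample(unmapped, min(n, len(unmapped)))


# Toy IDs, made up for illustration only.
all_ids = ["r1", "r2", "r3", "r4", "r5"]
mapped_ids = ["r1", "r4"]
print(sample_unmapped(all_ids, mapped_ids, 2))
```

A few dozen randomly chosen unmapped reads are usually enough to see whether they look like mRNA, genomic DNA, or something unexpected.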

    • #3
      What does it have to do with my analysis? The result comes straight from the SOLiD SAGE Analysis tool. The result.tab file it produces lists the read IDs for each tag. I counted the unique read IDs in that file and divided by the number of unique read IDs in the read file to get the percentage of reads mapped to the human mRNA reference obtained from the RefSeq database.
      I will still BLAT some of them and see where they map.
      If they do not map to mRNA, then something must be wrong with the reads generated by the SOLiD SAGE run.
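
The calculation described in this post can be sketched in Python as follows. This is a minimal sketch under assumptions: the function name `mapped_percentage` and the toy IDs are hypothetical, and parsing result.tab and the read file into lists of IDs is not shown because their exact layout is not given in the thread. The intersection with the full read set is an added guard, not something the post states, so that an ID appearing only in the results cannot inflate the percentage.

```python
def mapped_percentage(mapped_read_ids, all_read_ids):
    """Percentage of unique reads whose ID appears among the mapped read IDs."""
    all_unique = set(all_read_ids)
    if not all_unique:
        raise ValueError("no reads supplied")
    # Count each mapped read once, and only if it exists in the read file.
    mapped_unique = set(mapped_read_ids) & all_unique
    return 100.0 * len(mapped_unique) / len(all_unique)


# Toy IDs, made up for illustration only.
all_ids = ["r1", "r2", "r3", "r4", "r5"]
mapped_ids = ["r1", "r1", "r4"]  # the same read can be listed more than once
print(mapped_percentage(mapped_ids, all_ids))  # 2 of 5 unique reads -> 40.0
```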

      • #4
        Hi rahilsethi,

        NextGenSeq is giving you a "sanity check" to help with your troubleshooting. The SAGE might be working fine, but your RNA might, for example, contain lots of transcripts from repetitive elements, whose positions are not uniquely mappable in the genome. Or you might get hits to E. coli or some other unexpected species, which would point to contamination from some source.

        --
        Phillip
