  • coverage file

    Hi all,

    I used TopHat to analyze a dataset of 27 bp reads, and I am confused about the coverage.wig file it generated.

    Here is one example:

    chr1 11116146 11116147 47
    chr1 11116147 11116149 45
    chr1 11116149 11116150 42
    chr1 11116150 11116151 40
    chr1 11116151 11116152 5
    chr1 11116152 11116154 4
    chr1 11116154 11116657 0
    chr1 11116657 11116658 3
    chr1 11116658 11116659 4

    Does the fourth column (the last number on each line) stand for the number of reads that cover the site?
    So does that mean 47 reads cover each of positions 11116146 and 11116147?
    But my reads are only 27 bp long, so I expected the maximum coverage at any one site to be < 27. I am confused about why some sites have coverage > 27.

    Does anyone have any ideas?

  • #2
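    Coverage at a position is the number of reads that overlap it. If you have a million reads, your coverage values can range from 0 (for regions with no reads) up to a possible maximum of one million (if you are unlucky and all your reads map to the same place).

    The fact that your reads are just 27 bp long doesn't matter here: read length only determines how many consecutive positions each read contributes to, not how many reads can stack up on one position.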
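
    To make this concrete, here is a minimal sketch in Python. It is not how TopHat computes coverage; the function names and the read start positions are made up for illustration, and it assumes half-open intervals (start included, end excluded) as in bedGraph-style files. It stacks 47 reads of 27 bp on the same start and reports the resulting depth.

```python
from collections import Counter

READ_LENGTH = 27  # read length mentioned in the original post


def per_base_depth(read_starts, read_length=READ_LENGTH):
    """Count how many reads overlap each position (hypothetical helper)."""
    depth = Counter()
    for start in read_starts:
        # Each read adds 1 to every position it spans.
        for pos in range(start, start + read_length):
            depth[pos] += 1
    return depth


def to_intervals(depth):
    """Collapse per-base depth into (start, end, depth) runs, end exclusive.

    Positions with zero depth are simply skipped here, whereas the file in
    the post lists them explicitly with a value of 0.
    """
    intervals = []
    for pos in sorted(depth):
        last = intervals[-1] if intervals else None
        if last and last[1] == pos and last[2] == depth[pos]:
            # Same depth as the adjacent run: extend it by one base.
            intervals[-1] = (last[0], pos + 1, last[2])
        else:
            intervals.append((pos, pos + 1, depth[pos]))
    return intervals


# Made-up example: 47 reads that all start at position 100.
depth = per_base_depth([100] * 47)
print(max(depth.values()))   # 47
print(to_intervals(depth))   # [(100, 127, 47)]
```

    The depth is 47 even though every read is only 27 bp long; the read length shows up as the width of the covered interval (27 positions), not as a cap on the count.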
