Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • extra high coverage regions in exome sequencing data

    Hi everyone,

    What reasons are the extra high coverage regions in exome sequencing data caused by? I want to know if can affect variant calling. What effect can it bring, false positive or false negative?

    Thanks

  • #2
    There may be some variation in coverage due to changes in the capture efficiency of different regions - often attributed to GC content, but most capture system vendors are getting on top of this.

    Larger variations in coverage are more likely to be the result of mismapping of repetitive regions. Depending on how you've done your mapping you may find that reads which can map to multiple positions are allowed to map more than once in the genome, which can produce towers of reads in your results.

    Even if you only choose to see uniquely mapping reads you can still see overrepresented mismapped regions where you get reads from parts of the genome which are not present in the genome assembly, and are therefore invisible to the read mapping program.

    In general you should be very suspicious of results coming from regions with very high read coverage compared to the rest of the sample since mismapped sequences will lead to false SNP predictions and would dilute any real SNPs in the region.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    58 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    54 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    45 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X