  • PCR duplicate removal for whole genome sequencing vs. whole exome sequencing

    Hi

    I did whole-genome sequencing and whole-exome sequencing on a whole-genome amplified (WGA’d) sample. In the whole-genome data, 2% of reads were removed as duplicates, but in the whole-exome data 80% were removed.

    I then did whole-exome sequencing on an unamplified HapMap control and a WGA’d HapMap control: 25% of reads were removed from the unamplified control and 50% from the WGA’d control.

    For whole-genome sequencing I used the standard Illumina PE101 whole-genome protocol. For exome sequencing I used NimbleGen exome capture (version 2) followed by Illumina sequencing.

    Can anyone share some thoughts on why the duplicate removal rate differs so much between whole-genome and whole-exome sequencing of my WGA’d sample? All comments will be greatly appreciated!

  • #2
    We had the same issue at the beginning, hitting >60% duplicates. We had to start with more DNA and use somewhat longer fragments to lower the values. We now typically get 20-30% duplicates.
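
A rough sketch of why more input DNA tends to lower the duplicate rate, assuming reads are drawn uniformly at random (with replacement) from a library of distinct molecules. This is a simplification of real library behaviour; the function name and the example numbers are illustrative and do not come from this thread.

```python
import math


def expected_duplicate_fraction(n_reads, n_unique_molecules):
    """Expected fraction of reads that are duplicates under uniform sampling.

    Assumption: each read is an independent, uniform draw (with replacement)
    from n_unique_molecules distinct library molecules.
    """
    if n_reads <= 0 or n_unique_molecules <= 0:
        raise ValueError("both arguments must be positive")
    # Expected number of distinct molecules seen at least once:
    # C * (1 - exp(-N / C)); expm1 keeps precision when N / C is small.
    expected_distinct = n_unique_molecules * -math.expm1(-n_reads / n_unique_molecules)
    return 1.0 - expected_distinct / n_reads


if __name__ == "__main__":
    n_reads = 50_000_000  # hypothetical sequencing depth
    for library_size in (10_000_000, 50_000_000, 500_000_000):
        frac = expected_duplicate_fraction(n_reads, library_size)
        print(f"library of {library_size:>11,} molecules: {frac:.1%} duplicates")
```

Under this model the duplicate fraction depends only on the ratio of reads to distinct molecules, so a more complex library (more input DNA, less loss during capture) lowers it for the same sequencing depth.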

    Let's not forget that if you have 100x coverage with 100 bp reads, the chance of two reads sharing the same 5' position, and/or of two fragments having the same sequence, is pretty high. So at high coverage, many reads flagged as duplicates aren't true PCR duplicates.

    How much coverage are you getting?
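
The point in reply #2 about coincidental duplicates at high coverage can be put into rough numbers. The sketch below assumes fragment start sites fall uniformly at random on the target and that a duplicate is any fragment whose "signature" (start position, strand and, for paired-end data, insert size) was already seen. The function name, parameters, and the figure of 100 distinct insert sizes are illustrative assumptions, not values from this thread.

```python
import math


def coincidental_duplicate_fraction(coverage, read_length,
                                    reads_per_fragment=1,
                                    signatures_per_position=2):
    """Expected fraction of fragments that look like duplicates by chance alone.

    Assumptions: uniform random fragment starts; signatures_per_position is
    the number of distinguishable fragments per start site (2 strands for
    single-end reads, times the number of distinct insert sizes for
    paired-end reads).
    """
    # Fragments per target base; the target size itself cancels out.
    fragments_per_base = coverage / (read_length * reads_per_fragment)
    load = fragments_per_base / signatures_per_position
    if load <= 0:
        return 0.0
    # Fraction of fragments that are the first with their signature:
    # (1 - exp(-load)) / load.
    return 1.0 - (-math.expm1(-load)) / load


if __name__ == "__main__":
    single_end = coincidental_duplicate_fraction(100, 100)
    paired_end = coincidental_duplicate_fraction(
        100, 100, reads_per_fragment=2, signatures_per_position=2 * 100)
    print(f"single-end, 100x, 100 bp: {single_end:.1%}")
    print(f"paired-end, 100x, 2 x 100 bp: {paired_end:.2%}")
```

Under these assumptions, single-end reads at 100x give roughly 21% chance duplicates, while paired-end reads with a spread of insert sizes give well under 1%. Exome capture coverage is far from uniform, so the effective local coverage on targets can be much higher than the average, which pushes the chance-duplicate rate up.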
