Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • mard
    Member
    • Jan 2010
    • 21

    threshold for duplicate removal?

    I was trying out Picard's MarkDuplicates to remove duplicate reads before SNP identification in our targeted resequencing studies but I discovered that Picard classes non-identical reads that map to the same genomic location (same start and stop) as duplicates and only one read is kept. So this means that only a read from one haplotype can be kept for each location.

    We're thinking of testing if it might be better to apply a filter/threshold to keep a certain number of reads that map to the same location instead of discarding all except one. Just wondering if anyone has tried something like this?
  • bioinfosm
    Senior Member
    • Jan 2008
    • 483

    #2
    Thats an interesting point. I agree with you that only one haplo will be kept in such a filtering. I have been only filtering reads that map to multiple locations; but keep using the duplicates, and guess that brings the PCR-bias in SNP identification (Hets look 30-40% variant, not 50%)
    --
    bioinfosm

    Comment

    • mard
      Member
      • Jan 2010
      • 21

      #3
      Thanks for the information. I'm pretty new to next-gen analysis so am wondering if it's recommended to remove reads that map to multiple locations before SNP calling?

      Comment

      Latest Articles

      Collapse

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, Yesterday, 11:58 AM
      0 responses
      13 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-05-2026, 10:09 AM
      0 responses
      25 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-04-2026, 08:59 AM
      0 responses
      35 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 12:03 PM
      0 responses
      60 views
      0 reactions
      Last Post SEQadmin2  
      Working...