  • How to calculate FPR from dwgsim_eval output?

    Hello,

    I am using dwgsim_eval to evaluate the performance of different mappers on my simulated reads. I want to plot a ROC curve from the results, so I need to calculate the True Positive Rate (TPR) and False Positive Rate (FPR) at each mapping quality threshold. I guess TPR should be column 17 ("# (mc' / (mc' + mi' + mu')) | sensitivity: the fraction of reads that should be mapped that are mapped correctly at or greater than the threshold"), but how should I calculate FPR? Thanks!

    BTW, I am also quite confused by the following three definitions:
    # mu | the number of reads unmapped that should be mapped be mapped at the threshold
    # um | the number of reads mapped that should be unmapped be mapped at the threshold
    # uu | the number of reads unmapped that should be unmapped be mapped at the threshold

  • #2
    You can find your answer here: http://en.wikipedia.org/wiki/Sensiti...nd_specificity

    • #3
      Originally posted by RockChalkJayhawk
      Thanks RockChalkJayhawk! I've checked the definition of FPR (= FP / (FP + TN)), but my problem is that I am not sure which quantities in the dwgsim_eval output correspond to FP and TN.

      The dwgsim_eval output columns are described as follows:

      # thr | the minimum mapping quality threshold
      # mc | the number of reads mapped correctly that should be mapped at the threshold
      # mi | the number of reads mapped incorrectly that should be mapped be mapped at the threshold
      # mu | the number of reads unmapped that should be mapped be mapped at the threshold
      # um | the number of reads mapped that should be unmapped be mapped at the threshold
      # uu | the number of reads unmapped that should be unmapped be mapped at the threshold
      # mc' + mi' + mu' + um' + uu' | the total number of reads mapped at the threshold
      # mc' | the number of reads mapped correctly that should be mapped at or greater than that threshold
      # mi' | the number of reads mapped incorrectly that should be mapped be mapped at or greater than that threshold
      # mu' | the number of reads unmapped that should be mapped be mapped at or greater than that threshold
      # um' | the number of reads mapped that should be unmapped be mapped at or greater than that threshold
      # uu' | the number of reads unmapped that should be unmapped be mapped at or greater than that threshold
      # mc' + mi' + mu' + um' + uu' | the total number of reads mapped at or greater than the threshold
      # (mc / (mc' + mi' + mu')) | sensitivity: the fraction of reads that should be mapped that are mapped correctly at the threshold
      # (mc / mc' + mi') | positive predictive value: the fraction of mapped reads that are mapped correctly at the threshold
      # (um / (um' + uu')) | false discovery rate: the fraction of random reads that are mapped at the threshold
      # (mc' / (mc' + mi' + mu')) | sensitivity: the fraction of reads that should be mapped that are mapped correctly at or greater than the threshold
      # (mc' / mc' + mi') | positive predictive value: the fraction of mapped reads that are mapped correctly at or greater than the threshold
      # (um' / (um' + uu')) | false discovery rate: the fraction of random reads that are mapped at or greater than the threshold
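The ratios in the list above can be recomputed directly from the five counts. The following is a minimal sketch, not part of dwgsim_eval: the function name `mapping_metrics` and the dictionary keys are hypothetical, and the numbers in the usage line are made up. One possible reading, stated here as an assumption, is that if only the reads that should be unmapped count as negatives, then `um` plays the role of FP and `uu` the role of TN, so `um / (um + uu)` has the same form as FP / (FP + TN).

```python
import math


def mapping_metrics(mc: int, mi: int, mu: int, um: int, uu: int) -> dict:
    """Recompute the three ratios from the counts described in the list.

    Pass either the per-threshold counts or the cumulative (primed) counts.
    Function and key names are illustrative, not part of dwgsim_eval.
    """
    def ratio(numerator: int, denominator: int) -> float:
        # Avoid ZeroDivisionError when a category is empty.
        return numerator / denominator if denominator else math.nan

    return {
        # Fraction of reads that should be mapped that are mapped correctly.
        "sensitivity": ratio(mc, mc + mi + mu),
        # Fraction of mapped (non-random) reads that are mapped correctly.
        "ppv": ratio(mc, mc + mi),
        # Fraction of reads that should be unmapped but were mapped.
        # Assumption: this is FP / (FP + TN) if um = FP and uu = TN.
        "random_mapped_fraction": ratio(um, um + uu),
    }


# Made-up counts, for illustration only.
example = mapping_metrics(mc=80, mi=10, mu=10, um=2, uu=8)
```

Note that under this reading, incorrectly mapped reads (`mi`) do not enter the FP / (FP + TN) ratio at all, so it should be interpreted with care.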

      • #4
        See the discussion here:


        What is a false positive here, though? A read that should be mapped can be "wrong" in two ways:
        #1 it is mapped to the wrong position
        #2 it is not mapped

        #2 would seem to be a FP, but #1 doesn't fit into the FP/TP/FN/TN scheme. Hence, use positive predictive value: a read that does not map only hurts sensitivity, so PPV only needs to consider reads that do map. Plot a ROC-style curve of sensitivity vs. PPV.
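The sensitivity vs. PPV curve suggested above could be assembled roughly as follows. This is only a sketch under stated assumptions: it assumes each data row is whitespace-separated, that lines starting with `#` are header comments, and that the columns appear in the order listed earlier in the thread (so the cumulative counts mc', mi', mu' are the 8th to 10th fields). The function name `sensitivity_ppv_curve` is hypothetical and the rows are made-up numbers, not real dwgsim_eval output.

```python
from typing import Iterable, List, Tuple


def sensitivity_ppv_curve(lines: Iterable[str]) -> List[Tuple[int, float, float]]:
    """Return one (threshold, sensitivity, PPV) point per data row."""
    points = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and header comments
        fields = line.split()
        threshold = int(fields[0])
        # Assumed 0-based positions 7, 8, 9 hold the cumulative mc', mi', mu'.
        mc_cum, mi_cum, mu_cum = (int(value) for value in fields[7:10])
        should_map = mc_cum + mi_cum + mu_cum
        mapped = mc_cum + mi_cum
        sensitivity = mc_cum / should_map if should_map else 0.0
        ppv = mc_cum / mapped if mapped else 0.0
        points.append((threshold, sensitivity, ppv))
    return points


# Made-up rows with 13 count columns each, for illustration only.
rows = [
    "# thr mc mi mu um uu total mc' mi' mu' um' uu' total'",
    "10 5 1 0 0 0 6 80 10 10 2 8 110",
    "0 5 5 0 1 0 11 85 15 0 3 7 110",
]
points = sensitivity_ppv_curve(rows)
```

Each tuple can then be passed to any plotting tool, with one point per mapping quality threshold.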
