  • Significance of Overlapping Regions

    I have two BED files of genomic coordinates (say 12000 entries in one, 1000 in the other). I know how many unique bases in the genome are covered by each file (say the 12000 cover 15% of the genome and the 1000 cover 3%). I overlap them and get a result, say 50% of the 1000 overlap entries in the 12000. Does anyone have suggestions on how to test the significance of this, given that any overlap (not just complete overlap) is counted as overlap? I've thought of expanding the reference (12000) entries by 50% of the average length of the 1000 entries, but that seems a bit too crude.

  • #2
    You can do a Monte Carlo test against randomly distributed regions, or even against regions that are randomly distributed but keep the same distance to the TSS or similar. It will basically always call your result significant though, because the null model is bad.
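A minimal sketch of the Monte Carlo idea above, assuming a single toy chromosome, half-open BED-style `(start, end)` intervals, and uniform random placement. The function names (`count_overlapping`, `shuffle_intervals`, `null_overlap_counts`) and the `genome_length` parameter are hypothetical, introduced only for illustration. Uniform placement is exactly the kind of simple null model the caveat above is about, so treat the output with that in mind.

```python
import bisect
import random


def count_overlapping(query, reference):
    """Count query intervals that overlap any reference interval by >= 1 bp.

    Intervals are half-open (start, end) tuples on one chromosome.
    """
    ref = sorted(reference)
    starts = [s for s, _ in ref]
    # Running maximum of end coordinates, so nested or overlapping
    # reference intervals are handled correctly.
    max_end = []
    current = 0
    for _, end in ref:
        current = max(current, end)
        max_end.append(current)

    hits = 0
    for q_start, q_end in query:
        # Reference intervals at indices < i start before the query ends.
        i = bisect.bisect_left(starts, q_end)
        # One of them overlaps if it also ends after the query starts.
        if i > 0 and max_end[i - 1] > q_start:
            hits += 1
    return hits


def shuffle_intervals(intervals, genome_length, rng):
    """Move each interval to a uniformly random position, keeping its length."""
    shuffled = []
    for start, end in intervals:
        length = end - start
        new_start = rng.randrange(0, genome_length - length + 1)
        shuffled.append((new_start, new_start + length))
    return shuffled


def null_overlap_counts(query, reference, genome_length, n_iter=1000, seed=0):
    """Overlap counts for n_iter random placements of the query intervals."""
    rng = random.Random(seed)
    return [
        count_overlapping(shuffle_intervals(query, genome_length, rng), reference)
        for _ in range(n_iter)
    ]
```

A more conservative null would restrict where the shuffled intervals may land (for example, keeping the same distance to the TSS), which this sketch does not attempt.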



    • #3
      Sorry, what does your last sentence mean? My plan is to run shuffleBed followed by intersectBed 1000 times, then plot the distribution of intersection counts and see where the actual data falls on the plot. Is that not sound?
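"Seeing where the actual data falls" in the shuffled distribution can be turned into an empirical p-value. The sketch below assumes the observed overlap count and the per-shuffle counts have already been collected; `empirical_p_value` is a hypothetical helper name, and the numbers in the usage lines are made up for illustration.

```python
def empirical_p_value(observed, null_counts):
    """One-sided empirical p-value for enrichment of overlap.

    Fraction of shuffles whose overlap count is at least the observed
    count, with a +1 correction so the estimate is never exactly zero.
    """
    n_extreme = sum(1 for count in null_counts if count >= observed)
    return (n_extreme + 1) / (len(null_counts) + 1)


# Illustrative values only: 500 of 1000 regions overlap in the real data,
# and none of 999 shuffles reached that count.
p = empirical_p_value(500, [100] * 999)
print(p)
```

With N shuffles the smallest reportable value is 1/(N+1), so the number of iterations sets the resolution of the test. The p-value is only as meaningful as the null model that generated the shuffles.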
