Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • summary bigWig rtracklayer

    Hi Guys,
    The below function in R crashes if no score exists for one of the intervals in the bigwig file. This is annoying as the whole run is stopped instead of issuing a warning.
    For Het chromosomes, I tried to escape them, as you can see below. But for other cases, this cannot be generalized

    Do you know a way to get around ?


    "x" is a data frame of three columns

    getBigWigScore<- function(x, bwf, type="mean"){
    require(rtracklayer)
    bwf <- BigWigFile(bwf)
    features <- GRanges(x[,1], IRanges(x[,2], x[,3]))
    ## the summary below dies if a feature has no score found in the bw file instead of throwing a warning or return NA ...
    features <- features[!seqnames(features) %in% c("chr2LHet", "chr2RHet", "chr3LHet", "chr3RHet", "chrXHet", "chrYHet", "chrUextra")]
    return(unlist(runValue(summary(bwf, features, type=type))))
    }

    Best and many thanks for any help,
    Last edited by klebsiella; 03-20-2014, 09:35 AM.

  • #2
    I'm not even sure how you could have a valid bigWig (or even Wig) file without all of the positions having scores. It would seem to make sense to just remove those regions by adding multiple "definition" lines.

    Comment


    • #3
      Thanks for the reply,
      Well I validated the bigwig files and they contain no errors.
      For instance, the problem occurred for a feature starting with "0" (0 based coordinates file).
      I agree with you, a wig file is supposed to contain continuous data.
      Also, could you please elaborate on what you mean by adding a few definition lines ?

      A work around for me now was to use either Java Genomics Toolkit or bwtool (very very fast) via the system call from R.
      Thanks
      Last edited by klebsiella; 03-20-2014, 11:42 AM.

      Comment


      • #4
        Sorry for the delay, I just realized that I never replied to this!

        You can have multiple definition lines like the following the remove the need for having lines with no values:

        Code:
        fixedStep chrom=chr1 start=200601 step=100
        11
        22
        33
        fixedStep chrom=chr1 start=401601 step=100
        111
        122
        133
        That way, you can just leave out whatever portions aren't covered.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        59 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        57 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        51 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        56 views
        0 likes
        Last Post seqadmin  
        Working...
        X