Error in CountOverlaps

biofreak

Member

Join Date: Jun 2011

Posts: 44
- Share
- Tweet
#1

Error in CountOverlaps

08-09-2013, 01:04 PM

hi
I get the following error when I run:
countOverlaps(exonRangesListtx[[hif1Ind[r]]], align)

In .Seqinfo.mergexy(x, y) :
Each of the 2 combined objects has sequence levels not in the other:
- in 'x': 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, X, Y, MT, MG153_PATCH, MG3835_PATCH, MG4136_PATCH, MG4151_PATCH, MG4209_PATCH, MG4211_PATCH, MG4212_PATCH, MG4213_PATCH, MG4214_PATCH, GL456210.1, GL456211.1, GL456212.1, GL456213.1, GL456216.1, GL456219.1, GL456221.1, GL456233.1, GL456239.1, GL456350.1, GL456354.1, GL456359.1, GL456360.1, GL456366.1, GL456367.1, GL456368.1, GL456370.1, GL456372.1, GL456378.1, GL456379.1, GL456381.1, GL456382.1, GL456383.1, GL456385.1, GL456387.1, GL456389.1, GL456390.1, GL456392.1, GL456393.1, GL456394.1, GL456396.1, JH584292.1, JH584293.1, JH584294.1, JH584295.1, JH584296.1, JH584297.1, JH584298.1, JH584299.1, JH584300.1, JH584301.1, JH584302.1, JH584303.1, JH584304.1
- in 'y': chr1, chr10, chr11, chr12, chr13, chr14, chr15, chr16, chr17, chr18, chr19, chr2, chr3, chr4, chr5, chr6, chr7, chr8, chr9, chrM, chrX, chrY
Make sure to always combine/compare [... truncated] "

The two objects are as follows:
1) exonRangesListtx[[hif1Ind[r]]]
GRanges with 15 ranges and 3 elementMetadata cols:
seqnames ranges strand | exon_id exon_name exon_rank
<Rle> <IRanges> <Rle> | <integer> <character> <integer>
[1] 12 [73907904, 73908324] + | 76987 ENSMUSE00000405171 1
[2] 12 [73926557, 73926747] + | 310532 ENSMUSE00001223008 2
[3] 12 [73927986, 73928131] + | 4438 ENSMUSE00000114679 3
[4] 12 [73928214, 73928298] + | 4437 ENSMUSE00000114678 4
[5] 12 [73930657, 73930769] + | 4442 ENSMUSE00000114690 5
[6] 12 [73932274, 73932476] + | 4440 ENSMUSE00000114682 6
[7] 12 [73936190, 73936296] + | 4441 ENSMUSE00000114687 7
[8] 12 [73937653, 73937800] + | 4443 ENSMUSE00000114695 8
[9] 12 [73939529, 73939749] + | 4444 ENSMUSE00000114697 9
[10] 12 [73940348, 73940631] + | 4439 ENSMUSE00000114680 10
[11] 12 [73941688, 73941852] + | 52364 ENSMUSE00000297829 11
[12] 12 [73941961, 73942385] + | 4445 ENSMUSE00000114699 12
[13] 12 [73944079, 73944187] + | 4435 ENSMUSE00000114675 13
[14] 12 [73944911, 73945037] + | 4436 ENSMUSE00000114677 14
[15] 12 [73945552, 73947530] + | 68877 ENSMUSE00000369537 15
---
seqlengths:
1 2 3 4 5 6 7 8 ... JH584298.1 JH584299.1 JH584300.1 JH584301.1 JH584302.1 JH584303.1 JH584304.1
195471971 182113224 160039680 156508116 151834684 149736546 145441459 129401213 ... 184189 953012 182347 259875 155838 158099 114452
>

2)aligns:
GappedAlignments with 1020 alignments and 0 elementMetadata cols:
seqnames strand cigar qwidth start end width ngap
<Rle> <Rle> <character> <integer> <integer> <integer> <integer> <integer>
[1] chr12 * 50M 50 73907984 73908033 50 0
[2] chr12 * 50M 50 73907985 73908034 50 0
[3] chr12 * 50M 50 73908016 73908065 50 0
[4] chr12 * 50M 50 73908032 73908081 50 0
[5] chr12 * 50M 50 73908039 73908088 50 0
[6] chr12 * 50M 50 73908043 73908092 50 0
[7] chr12 * 50M 50 73908081 73908130 50 0
[8] chr12 * 50M 50 73908084 73908133 50 0
[9] chr12 * 50M 50 73908084 73908133 50 0
... ... ... ... ... ... ... ... ...
[1012] chr12 * 50M 50 73947246 73947295 50 0
[1013] chr12 * 50M 50 73947248 73947297 50 0
[1014] chr12 * 50M 50 73947268 73947317 50 0
[1015] chr12 * 50M 50 73947284 73947333 50 0
[1016] chr12 * 50M 50 73947325 73947374 50 0
[1017] chr12 * 50M 50 73947358 73947407 50 0
[1018] chr12 * 50M 50 73949633 73949682 50 0
[1019] chr12 * 50M 50 73949656 73949705 50 0
[1020] chr12 * 50M 50 73949680 73949729 50 0
---
seqlengths:
chr1 chr10 chr11 chr12 chr13 chr14 chr15 chr16 chr17 chr18 ... chr4 chr5 chr6 chr7 chr8 chr9 chrM chrX chrY
195471971 130694993 122082543 120129022 120421639 124902244 104043685 98207768 94987271 90702639 ... 156508116 151834684 149736546 145441459 129401213 124595110 16299 171031299 91744698

Can someone please help?
thanks a lot.
Tags: None
Wolfgang Huber

Senior Member

Join Date: Aug 2009

Posts: 109
- Share
- Tweet
#2

08-09-2013, 11:53 PM

Dear Biofreak

did you notice that the naming of the chromosomes ('sequence levels' in Bioconductor jargon) is different in the two objects that you are trying to overlap? The first uses '1, 2, 3, 4, 5, ...', the second 'chr1, chr10, chr11, chr12,....'. The software is not yet clever enough to repair that for you. You will need to unify the naming before calling 'countOverlaps'.

This exact question has been extensively discussed on the Bioconductor mailing list, and search engines will readily show up these threads for you. If questions remain, consider posting project-specific, technical questions to the Bioconductor mailing list, as tje response there may be wider and faster, and please also follow their posting guide, in particular, provide a reproducible example (that someone else besides you can run) and the version numbers (output of 'sessionInfo()').

Best wishes
Wolfgang

Last edited by Wolfgang Huber; 08-10-2013, 12:03 AM.

Wolfgang Huber
EMBL
Comment

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
04-22-2024, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Error in CountOverlaps

Comment

Latest Articles

ad_right_rmr

News