Seqanswers Leaderboard Ad

**pecanton** · 07-08-2013, 09:16 AM

Also, I saw an error in my code to run HTseq. the -i ID option I was using only works if I also put -t gene, since that's the type of feature that has that ID. However, I'm not sure if I should count by exon or by gene. I suppose it depends on the downstream analysis I want to do, but I'm not very clear on the benefits of using either couting procedure.

**chadn737** · 07-08-2013, 09:19 AM

For some reason htseq-count does not like everything in line 15 after the second semicolon.

Code:

supercont1.1	VectorBase	contig	1	5856339	.	.	.	ID=supercont1.1;molecule_type=dsDNA;[COLOR="Red"]GenBank:supercontig:AaegL1:supercont1.1:1:5856339:1;translation_table=1;topology=linear;localization=chromosomal;[/COLOR]

If I delete everything highlighted in red, it proceeds normally.

**chadn737** · 07-08-2013, 09:20 AM

Originally posted by pecanton View Post

Also, I saw an error in my code to run HTseq. the -i ID option I was using only works if I also put -t gene, since that's the type of feature that has that ID. However, I'm not sure if I should count by exon or by gene. I suppose it depends on the downstream analysis I want to do, but I'm not very clear on the benefits of using either couting procedure.

Thats because your ID's for your exons are all different and would need to be modified to match the ID for the gene. That would be a pain in the ***.

**pecanton** · 07-09-2013, 07:00 AM

Well, since the Aedes aegypti genome is not fully assembled, and exists as supercotings (more than 2000), it would still be a pain to go and edit every one of the "contig" lines from the gff3. I'll report back if I find an easier way.

**dpryan** · 07-09-2013, 07:36 AM

If it's just choking on the "contig" entries, which will be later ignored anyway, then just write a small script to edit those and leave the others unchanged. Alternatively, you should just be able to use grep to remove them (something like "grep -v -w 'contig' ...").

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 14 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

HTseq count parsing headache

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News