SEQanswers

Old 03-19-2012, 12:51 PM   #1
LukaszKielpinski
Junior Member
 
Location: Denmark

Join Date: Jan 2012
Posts: 3
Default CAGE tags

Hi Everybody!

My problem concerns a mapping where only the location of the 5' end of the read is meaningful; the rest of the sequence is needed only to place that end at the right location. Let's call this location a CAGE tag.
I mapped the reads with TopHat. Extracting CAGE tags is very easy for the plus strand (it is the read start position in the SAM file), but it is not the same story for the minus strand. If this were a Bowtie mapping I would just add the read length to the start position, but since this mapping allows for indels and spliced alignments, the number that has to be added differs from read to read (it can be deduced from the CIGAR string).
Does any of you know a straightforward method of obtaining these CAGE tags, or do I have to figure out how to use the CIGAR string?
Best,
Lukasz

Edit: After a more extensive search I found that the R package GenomicRanges has a function, cigarToWidth(), which is what I need! Still, maybe there are even easier solutions.
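
For illustration, the CIGAR route can be sketched in a few lines of Python. This is only a minimal sketch under stated assumptions, not a tested pipeline: it assumes the reads are mapped and single-end, that the SAM POS field is the 1-based leftmost aligned position, and that the CIGAR operations M, D, N, = and X are the ones that consume reference bases. The function names are made up for this example. Soft-clipped bases are ignored, so the reported position is the 5' end of the aligned part of the read.

```python
import re

# One CIGAR operation: a length followed by an operation letter.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

# Operations that advance the position on the reference.
REF_CONSUMING = set("MDN=X")


def reference_span(cigar):
    """Number of reference bases covered by the alignment."""
    return sum(int(n) for n, op in CIGAR_OP.findall(cigar) if op in REF_CONSUMING)


def five_prime_position(pos, flag, cigar):
    """1-based reference position of the read's 5' end.

    pos   -- SAM POS field (1-based leftmost aligned position)
    flag  -- SAM FLAG field; bit 0x10 marks a minus-strand alignment
    cigar -- SAM CIGAR string
    """
    if flag & 0x10:
        # Minus strand: the 5' end is the rightmost aligned reference base.
        return pos + reference_span(cigar) - 1
    # Plus strand: the 5' end is simply the start position.
    return pos


def five_prime_from_sam_line(line):
    """Return (reference name, 5' end position, strand) for one SAM record."""
    fields = line.rstrip("\n").split("\t")
    flag, chrom, pos, cigar = int(fields[1]), fields[2], int(fields[3]), fields[5]
    strand = "-" if flag & 0x10 else "+"
    return chrom, five_prime_position(pos, flag, cigar), strand
```

For a minus-strand read starting at position 100 with CIGAR 40M1000N50M2D4M, the alignment spans 40 + 1000 + 50 + 2 + 4 = 1096 reference bases, so the 5' end lies at 100 + 1096 - 1 = 1195. Unmapped reads (FLAG bit 0x4, CIGAR "*") should be filtered out before calling these functions.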

Last edited by LukaszKielpinski; 03-19-2012 at 01:11 PM.
Old 03-19-2012, 04:00 PM   #2
Dario1984
Senior Member
 
Location: Sydney, Australia

Join Date: Jun 2011
Posts: 165
Default

Why did you use TopHat? CAGE tags are short, and TopHat doesn't work well with reads shorter than about 50 bases.
Old 03-20-2012, 12:34 AM   #3
LukaszKielpinski
Junior Member
 
Location: Denmark

Join Date: Jan 2012
Posts: 3
Default

It is actually not exactly the CAGE protocol; it is only a "CAGE-like" mapping philosophy, where the only interesting information is the location of the 5' end. My reads are 94 nt long.
Old 04-01-2012, 11:55 PM   #4
SEQond
Member
 
Location: Italy

Join Date: Jul 2010
Posts: 27
Default CAGE pipeline software

Quote:
Originally Posted by Dario1984
Why did you use TopHat? CAGE tags are short. It doesn't work well with reads less than about 50 bases long.

Dario1984, do you use as your pipeline exactly the software mentioned in this article?

Takahashi, H., et al., 5' end-centered expression profiling using cap-analysis gene expression and next-generation sequencing. Nat Protoc, 2012. 7(3): p. 542-61.

Since I haven't found anything else, this is what I use.
Old 04-03-2012, 07:00 PM   #5
Dario1984
Senior Member
 
Location: Sydney, Australia

Join Date: Jun 2011
Posts: 165
Default

That's a good reference; I need to read it in detail. They use BWA for the mapping, which I would agree with. We use Bowtie, which is similar. TopHat is not relevant for typical CAGE tags because they are only 27 nt long. Lukasz has an unusual protocol, though, so with the long reads that he has, TopHat is relevant to him.

Tags
cage, cigar, mapping, tophat
