SEQanswers

Old 03-21-2008, 12:19 PM   #1
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236
FindPeaks 3.1.3 (alpha) - ChIP-Seq and UCSC-compatible short-read wig track creator

Hi All,

Just thought I'd take a second to let people know that FindPeaks 3.1.3 has been released. It's currently alpha code, so there are probably still a few bugs remaining, but I've been running the code myself and it seems to do what I expect it to do.

In any case, if you'd like to see some of the improvements over FindPeaks 2.1.3 (the last version available on the web), I've put an abbreviated version of the change log on the download page.

http://www.bcgsc.ca/platform/bioinfo...releases/3.1.3

Happy ChIP-Seq-ing!

Anthony Fejes
__________________
The more you know, the more you know you don't know. —Aristotle
Old 04-02-2008, 08:55 AM   #2
bioinfosm
Senior Member
 
Location: USA

Join Date: Jan 2008
Posts: 482

Any comparisons with Solexa's BeadStudio ChIP-Seq module?

I am looking to search for peaks on the whole human genome reference for a TF.
Old 04-02-2008, 09:07 AM   #3
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236

I have not tried out the Illumina ChIP-Seq module. I don't use the Windows platform, so I probably won't get around to trying it out, either.

We tend to move very quickly to develop new features and fix bugs with our in-house version, which makes it an excellent tool for trying to squeeze out new information from ChIP-Seq experiments. It's also developed to be useful for both pipeline processing and for desktop use, whereas the Illumina version appears to be very GUI-centric. They're likely developed for different audiences.

As for your goal of searching for peaks on the whole human genome reference with a transcription factor: all ChIP-Seq programs can do that. FindPeaks and the Wold Lab's version (and others as well) can, and I'd be very surprised if the Illumina tool couldn't do it.

The underlying issue is what methods are available for finding peaks in the package you select. Not all peaks contain the same amount of information, and how you extract and process that information is important.

Last edited by apfejes; 04-02-2008 at 09:10 AM.
Old 05-27-2008, 11:34 AM   #4
bioinfosm
Senior Member
 
Location: USA

Join Date: Jan 2008
Posts: 482

I see that FindPeaks depends on the unique hits from Eland, but one can run Eland with any seed length (default 32). Have you done tests to see what could be a better option, for example using a lower threshold for Eland, like 25, to get more input data for finding peaks?

Thanks!
Old 05-27-2008, 11:44 AM   #5
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236

Hey Bioinfosm,

You're right, you can get better or worse results depending on the length used with Eland. What we normally do is run it at several lengths (e.g., 32, 29, 26, 23) and then use the ParseMultipleElands.jar application to pick the best hit from the longest Eland run. This way, you get the most specific hit without losing too much information. The best of all possible worlds.
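
As a rough illustration of the "best hit from the longest run" idea, here is a minimal Python sketch. It is not the ParseMultipleElands.jar implementation, and it does not read real Eland files: the `runs` dictionary (seed length mapped to read ID mapped to a unique hit) and the function name `pick_longest_unique_hits` are assumptions made up for this example.

```python
from typing import Dict, Tuple

# Hypothetical in-memory form of parsed alignment output: for each seed
# length, a map from read ID to its unique hit (chromosome, position, strand).
# Reads without a unique hit at that length are simply absent from the map.
Hit = Tuple[str, int, str]
RunsByLength = Dict[int, Dict[str, Hit]]


def pick_longest_unique_hits(runs: RunsByLength) -> Dict[str, Tuple[int, Hit]]:
    """For each read, keep the hit from the longest seed length that placed it."""
    best: Dict[str, Tuple[int, Hit]] = {}
    # Visit the longest runs first, so shorter runs only fill in reads
    # that no longer run could place uniquely.
    for length in sorted(runs, reverse=True):
        for read_id, hit in runs[length].items():
            best.setdefault(read_id, (length, hit))
    return best


if __name__ == "__main__":
    # Toy data, invented for the example.
    runs = {
        32: {"read1": ("chr1", 1000, "+")},
        26: {"read1": ("chr1", 1003, "+"), "read2": ("chr2", 500, "-")},
        23: {"read3": ("chrX", 42, "+")},
    }
    for read_id, (length, hit) in sorted(pick_longest_unique_hits(runs).items()):
        print(read_id, length, hit)
```

In this toy input, read1 keeps its 32-mer hit even though the 26-mer run also placed it, while read2 and read3 are only recovered by the shorter runs.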

I once had the opportunity to take a look at 51-mers from a (non-ChIP-Seq) Solexa run (last June, so the quality wasn't great), and found that the vast majority of hits came from 32-mers or greater, but there were still a significant number of 21- and 23-mers that were recovered by aligning at the shorter lengths: about 3M and 3.25M respectively, out of a total of 30M reads. (Interestingly enough, there were about 3 million 51-mers that aligned for the full sequence length, but that's another subject.)

Cheers,

Anthony
All times are GMT -8.