SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
BFAST - Help cutcopy11 SOLiD 17 10-17-2013 08:12 AM
BFAST indexing phatjoe Bioinformatics 1 09-08-2011 08:39 PM
BFAST using GPUs nikhil.stephen Bioinformatics 7 02-20-2011 11:14 PM
BFAST thanks you for your help! (was: ... needs your help) nilshomer Bioinformatics 5 04-21-2010 08:29 PM
Bfast jsun529 Bioinformatics 19 11-12-2009 09:32 AM

Reply
 
Thread Tools
Old 07-12-2010, 04:37 AM   #1
epigen
Senior Member
 
Location: Germany

Join Date: May 2010
Posts: 101
Default BFAST bfast.submit.pl configuration

Could anyone please provide a working config file for bfast.submit.pl to use as an example? We tried to create a config file, but it's sometimes hard to guess which parameter name in the schema stands for which in the BFAST programs. Also, Eclipse reports that the provided xml schema is incorrect at some places. Despite our efforts, the bfast.submit.pl exited without producing output. (We're using the data from the latest BFAST version.) bfast.submit.pl looks like a valuable tool and it would be very useful to get it to run for our cluster.

In that context, I'd like to know what the most efficient way of running BFAST is. I can use a node with 16 CPUs, up to 128 GB RAM. The 10 indexes for the human genome are 12 GB each so it's probably impossible to load them all into memory and keep enough space for the rest, especially when using pipes. As I noted, reading the indexes (one at a time as done by default) is the most time-consuming part in our case. Instead of splitting up the reads much and call multiple instances of bfast match with all indexes, I think it would be better to process all reads with one of the indexes in parallel.

Thanks in advance for the help

Barbara
epigen is offline   Reply With Quote
Old 03-18-2011, 07:51 AM   #2
Vincenzo
Junior Member
 
Location: Barcellona

Join Date: Feb 2011
Posts: 4
Default

Dears,
this is my config file.
I have not a sam file in the output, but only some sh script: match.sh, localign.sh, postprocess.sh, sam.sh.
I don't know what is the next step because, as i write you, it's sometimes hard to guess which parameter name in the schema stands for which in the BFAST programs.
A suggestion:
if you have any validation errors with eclipse don't worry, they're not preclude the running of bfast.
I am a beginner so i don't know if this can help you. I hope..
Anyway, this is the BASIC xml version to run it:
<?xml version="1.0" encoding="UTF-8"?>
<bfastConfig xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="BfastConfig_5.xsd">
<globalOptions>
<bfastBin>/cbbl/users/vincenzo/bin/</bfastBin>
<samtoolsBin>/cbbl/users/vincenzo/bin/</samtoolsBin>
<qsubBin>/cbbl/users/vincenzo/bin/run_parallel/</qsubBin>
<fastaFileName>/cbbl/users/vincenzo/bin/run_parallel/DH10B.fa</fastaFileName>
<runDirectory>/cbbl/users/vincenzo/bin/run_parallel/</runDirectory>
<readsDirectory>/cbbl/users/vincenzo/bin/run_parallel/</readsDirectory>
<outputDirectory>/cbbl/users/vincenzo/bin/run_parallel/</outputDirectory>
<tmpDirectory>/cbbl/users/vincenzo/bin/run_parallel/</tmpDirectory>
<outputID>result</outputID>
<cleanUsedIntermediateFiles>0</cleanUsedIntermediateFiles>
<numReadsPerFASTQ localalignSplit="20" matchSplit="20">20</numReadsPerFASTQ>
<timing>ON</timing>
<queueType>SGE</queueType>
<space>NT</space>
</globalOptions>
<matchOptions>
<readCompression>gz</readCompression>
<threads>1</threads>
</matchOptions>
<localalignOptions>
<threads>1</threads>
</localalignOptions>
<postprocessOptions>
<threads>1</threads>
</postprocessOptions>
<samOptions>
<samtools>1</samtools>
</samOptions>
</bfastConfig>

To add any parameter, you can inspire from XSD and from bfast.submit.pl.
If you solved it, please share your code.
Thanks a lot.
Vincenzo
Vincenzo is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 05:56 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO