Hi all,
I'm working on aligning bisulfite-converted sequence reads using Bismark and Bowtie (v1). The samples are being aligned to the hg19 genome from UCSC. I believe the FASTQ files I'm working with (i.e. submitting to Bismark) contain about 140-150 million sequence pairs in total.
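For reference, my invocation is essentially the following (the index path and read file names here are placeholders, not my actual ones):

```shell
# Sketch of the Bismark call -- paths/file names are placeholders.
# Bowtie 1 is the aligner used by these older Bismark releases;
# the genome folder must already contain the Bismark-prepared index.
bismark /path/to/hg19_bismark_index \
    -1 sample_R1.fastq.gz \
    -2 sample_R2.fastq.gz \
    -o bismark_output/
```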
These alignments are being run on a central cluster at my research institute, but I'm getting odd errors that I've been unable to interpret. Most of them seem to be related to running out of memory, but the cluster has a large amount of RAM (as far as I can tell, about 18 GB per node).
Most alignments are failing before they ever start processing any sequences. Typical errors are:
Failed to open zcat pipe to [$filename] Cannot allocate memory
Error while flushing and closing output
terminate called after throwing an instance of 'int'
Also:
Out of memory!
Error while flushing and closing output
terminate called after throwing an instance of 'int'
And:
Out of memory allocating the ebwt[] array for the Bowtie index. Please try
again on a computer with more memory.
Others have failed during the sequence processing:
gzip: stdout: Broken pipe
Error while flushing and closing output
The odd thing is that a lot of these alignments failed on the first or second attempt, but then ran perfectly well on the next attempt without any change in parameters.
From what I've read, Bismark and/or Bowtie hold the entire reference genome in memory (i.e. RAM) during the alignment process, so there needs to be enough space to hold that information (for hg19, I've read this is about 8–10 GB). So I'm wondering if I need to request a specific amount of memory for each job when I submit it to the cluster, but I have no idea how much I should ask for. The cluster uses PBS as its resource manager, and my understanding is that pmem is memory per processor, so I tried pmem=4gb across 4 processors (16 GB total), but that failed. I then tried pmem=9gb, but that job never ran — I'm guessing because 4 × 9 GB = 36 GB exceeds what any single node has, so it could never be scheduled.
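To make the setup concrete, here is a sketch of the kind of PBS submission script I've been using (the resource numbers are the ones I tried; the index path and read files are placeholders):

```shell
#!/bin/bash
#PBS -l nodes=1:ppn=4        # one node, four processors
#PBS -l pmem=4gb             # per-processor memory: 4 x 4gb = 16gb total
#PBS -l walltime=48:00:00

# Run from the directory the job was submitted from
cd $PBS_O_WORKDIR

# Placeholders below -- not my actual paths/file names
bismark /path/to/hg19_bismark_index \
    -1 sample_R1.fastq.gz \
    -2 sample_R2.fastq.gz \
    -o bismark_output/
```

Would requesting total job memory (e.g. mem=16gb) instead of per-processor pmem be the more appropriate way to do this?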
Does anyone have experience running Bismark (or even just Bowtie) on a scientific cluster, or have suggestions for what I could try next? Any help at all would be appreciated.
Thanks,
Daniel