Seqanswers Leaderboard Ad

**Tom Bair** · 02-24-2009, 01:28 PM

I know there have been some changes I saw alot of differences in speed between the two versions (using MULTI also)

I think though that the MULTI uses mpi it is just to indicate that it is multicore mpi vs multi cpu. I have no idea what they do differently though I could think of a ton of stuff that would make sense to do differently based on core to core communication vs cpu to cpu over gigabit.

I think I would diagnose and get mpi running and that might solve it.

**engencore** · 03-09-2009, 10:18 AM

failure to load CWF files/runAnalysisPipe

I've been successful with runAnalysisPipe on some 1/4 Ti signal processing and now I'm trying a full Ti (333 Mb) but end up with the following errors in gsRUnProcessor with both the on and off rig (note: below is --verbose output):

[Wed Feb 25 09:02:21 2009][Debug][] Logging configured.
[Wed Feb 25 09:02:21 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting
[Wed Feb 25 09:02:21 2009][Debug][] Parsing pipeline: /etc/gsRunProcessor/signalProcessing.xml
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step NukeSignalStrengthBalancer (pass 1)
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step BlowByCorrector
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step CafieCorrector
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step NukeSignalStrengthBalancer (pass 2)
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step IndividualWellScaler
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step MostLikelyErrorSubtractor
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step WellScreener (pass 1)
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step MetricsGenerator
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step QualityFilter
[Wed Feb 25 09:02:21 2009][Debug][ProcessingEngine] Adding step BaseCaller
[Wed Feb 25 09:02:29 2009][Information][] Detected processor speed: 2129 MHz.
[Wed Feb 25 09:02:39 2009][Notice][ProcessingEngine] Starting job eb6dd82e-0344-11de-ad6c-0010182f8ff4.
[Wed Feb 25 09:02:39 2009][Debug][ProcessingEngine] Creating 1 processing group(s).
[Wed Feb 25 09:02:39 2009][Information][ProcessingEngine] Using memory-only storage for flowgrams.
[Wed Feb 25 09:02:39 2009][Notice][ProcessingEngine] Processing Group 0 : Loading data.
[Wed Feb 25 09:02:39 2009][Information][ProcessingEngine] Opening file /home/kc/Desktop/D_2009_02_20_12_47_04_FLX02070135_imageProcessingOnly/regions/2.cwf
[Wed Feb 25 09:02:39 2009][Debug][ProcessingEngine] Region 2 : Process 0 is loading 2049,1 4095,4095
-------
Any ideas? Roche as recommended a total reinstall of OS and software.

**Tom Bair** · 03-09-2009, 10:54 AM

It looks like pretty normal output. The only thing I see is that it looks like only one cpu is being used I get more like:
[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting
[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting
[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting
[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting
[Thu Mar 05 23:04:08 2009][Information][] gsRunProcessor 2.0.00.22 (Build 184) Starting
[Thu Mar 05 23:04:11 2009][Information][] Detected processor speed: 2826 MHz.
[Thu Mar 05 23:04:21 2009][Notice][ProcessingEngine] Starting job 3945dede-0a0c-11de-8556-001d0933401b.
[Thu Mar 05 23:04:21 2009][Information][ProcessingEngine] Using memory-only storage for flowgrams.

You might show us your env |grep GS
and give a little bit on what hardware

**westerman** · 03-10-2009, 05:43 AM

I agree with Tom. Looks normal to me.

BTW: As a follow-up to the original message (by myself) in this thread, I did get MPI to work on my computers. This enables MULTI to also work. I'm still a bit irritated that MULTI requires MPI but as long as I can get it to work, hey, that is good enough.

**engencore** · 03-13-2009, 06:13 AM

many thanks everyone. i heard from GSSupport and this is the solution:
"edit the ~/.bash_profile so the following environmental variables are set:

export GS_LAUNCH_MODE=GSRPM
export GS_CACHEDIR=/data

On the data rig it should read:

export GS_LAUNCH_MODE=MULTI
export GS_CACHEDIR=/data

The rig state I have of your sequencing machine does not show that these environment variables are set. "
With re: to hardware, the log is from our FLX instrument (circa Feb 07)

**westerman** · 03-13-2009, 09:02 AM

Originally posted by engencore View Post

export GS_CACHEDIR=/data

If it is not obvious, the GS_CACHEDIR should set to some place where you have lots of temporary (or scratch) space. On my off-data rig this is not "/data" but rather "/scratch/westerm" so I have 'export GS_CACHERDIR=/scratch/westerm'

**engencore** · 03-18-2009, 09:28 AM

runAnalysisPipe crash

I've resintalled the 2.0.00.20 SW as root since runAnalysisPipe was not running properly. I have MULTI in the bash_profile and got the following:
-----------------
[root@engencorelinux R_2009_02_20_12_45_31_FLX02070135_adminrig_Project6-Sample1 018]# runAnalysisPipe --verbose D_2009_02_20_12_47_04_FLX02070135_imageProcessin gOnly/
Output files will appear in /root/Desktop/2009_02_20/R_2009_02_20_12_45_31_FLX02 070135_adminrig_Project6-Sample1018/D_2009_03_18_12_26_27_localhost_signalProces sing
[Debug] Root interfaces: 127.0.0.1:4540|172.20.73.210:4540|
[Debug] Logging configured.
[Information] gsRunProcessor 2.0.00.20 (Build 91) Starting
[Debug] Parsing pipeline: /etc/gsRunProcessor/signalProcessing.xml
peer[Debug] Trying UDP log connection to root.
peer[Debug] Confirming UDP log connection to root.
[Debug] Logging configured.
[Information] gsRunProcessor 2.0.00.20 (Build 91) Starting
[Debug] Parsing pipeline: /etc/gsRunProcessor/signalProcessing.xml
ProcessingEngine[Debug] Adding step NukeSignalStrengthBalancer
ProcessingEngine[Debug] Adding step BlowByCorrector
ProcessingEngine[Debug] Adding step CafieCorrector
ProcessingEngine[Debug] Adding step NukeSignalStrengthBalancer
ProcessingEngine[Debug] Adding step IndividualWellScaler
ProcessingEngine[Debug] Adding step MostLikelyErrorSubtractor
ProcessingEngine[Debug] Adding step WellScreener
ProcessingEngine[Debug] Adding step MetricsGenerator
ProcessingEngine[Debug] Adding step QualityFilter
ProcessingEngine[Debug] Adding step BaseCaller
ProcessingEngine[Debug] Adding step NukeSignalStrengthBalancer
ProcessingEngine[Debug] Adding step BlowByCorrector
ProcessingEngine[Debug] Adding step CafieCorrector
ProcessingEngine[Debug] Adding step NukeSignalStrengthBalancer
ProcessingEngine[Debug] Adding step IndividualWellScaler
ProcessingEngine[Debug] Adding step MostLikelyErrorSubtractor
ProcessingEngine[Debug] Adding step WellScreener
ProcessingEngine[Debug] Adding step MetricsGenerator
ProcessingEngine[Debug] Adding step QualityFilter
ProcessingEngine[Debug] Adding step BaseCaller
[Information] Detected processor speed: 2128 MHz.
ProcessingEngine[Notice] Starting job 880c6af2-13d9-11de-ac7b-0010182f8ff4.
ProcessingEngine[Debug] Creating 2 processing group(s).
ProcessingEngine[Debug] Creating 2 processing group(s).
ProcessingEngine[Information] Using memory-only storage for flowgrams.
ProcessingEngine[Debug] Rank 0 is member 0 of group 0.
ProcessingEngine[Notice] Processing Group 0 : Loading data.
ProcessingEngine[Debug] Rank 1 is member 0 of group 1.
ProcessingEngine[Notice] Processing Group 1 : Loading data.
ProcessingEngine[Debug] Opening file /root/Desktop/2009_02_20/R_2009_02_20_12_45 _31_FLX02070135_adminrig_Project6-Sample1018/D_2009_02_20_12_47_04_FLX02070135_i mageProcessingOnly/regions/2.cwf
ProcessingEngine[Debug] Opening file /root/Desktop/2009_02_20/R_2009_02_20_12_45 _31_FLX02070135_adminrig_Project6-Sample1018/D_2009_02_20_12_47_04_FLX02070135_i mageProcessingOnly/regions/1.cwf
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 1[cli_1]: aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 1
[Fatal] Processing aborted via SIGINT.
[Information] Deleting partial results files.
[0]0:Return code = 0, signaled with Interrupt
[0]1:Return code = 1

--------------
I changed MULTI to SINGLE and it appears to run fine now. Is the problem the eth0 on the datarig?
thanks,
joe

**cdwan** · 04-02-2009, 07:48 AM

If I'm running multi-host MPI (8 threads, 4 on each of 2 machines): Should GC_CACHEDIR be a shared directory visible to both of them, or could it be space that is truly local to the machine (i.e: on a local disk).

The latter would be a great way to reduce both network traffic and load on the shared fileserver, provided that it worked right.

**westerman** · 04-06-2009, 07:14 AM

In regards to GC_CACHEDIR being local or shared, I am not sure and the manual does not seem to address the question. I would suspect that any cache directory could be local. Perhaps the best bet is to try it both ways and see what happens. Unfortunately I do not have a good way to this test out on my machines. Please report back.

**cdwan** · 04-06-2009, 07:23 AM

If I specify GC_CACHEDIR then I see one 8GB file appear in that directory for each thread that is running. It doesn't appear to affect runtime or results whether the cache files are local to the nodes or on a shared filesystem. However, I haven't seen the systems lock up since I started using GC_CACHEDIR, so perhaps it has other benefits in terms of memory usage or something.

I find this sort of systems archeology pretty frustrating. "Hey look! this button does this other thing!" It's also a bit worrisome that we don't know whether these options affect scientific results. Has anyone received more detailed assistance from Roche on best practices for running these tools on a cluster? I feel like these questions might best be answered in an advanced version of the user manual.

**sklages** · 04-07-2009, 10:14 AM

The "Genome Sequencer System Site Preparation Guide, October 2008"
says (p.39ff):

"GS_CACHEDIR
This should be set to the location of a fast local disk.
Up to 8GB of temporary files per process could be generated.
The default is ‘/data’ if a ‘/data’ directory exists,
otherwise /tmp is used."

The Roche manuals are not too bad, at least if there are no problems.
From my experience many vendors do not supply much background
information (in terms of software implementation) about their machines.

We are running signal processing on a 32 core system using
GS_LAUNCH_MODE=MULTI. I'd say a cache should always
be local to the machines the jobs are run on.

just my 2p,
Sven

**cdwan** · 04-07-2009, 10:22 AM

That's quite interesting. Especially since in some cases overflowing /tmp will bring the system to a screeching halt (our hard lock-up observed above). If the processes are writing up to 8GB apiece there by default they could easily run out of space. Also, since most systems clear /tmp at boot time, the evidence would be gone by the time the system came up clean.

Thank you for the tip. I'll dig around and see if one of those manuals got left at the site.

**sklages** · 04-07-2009, 10:29 AM

I was not aware of GS_CACHEDIR until I filled up /tmp and our server
refused continue working ;-)

The Roche support pointed me to the Site Preparation Guide, which I think
is the wrong place to put this essential information in.

**Tom Bair** · 04-07-2009, 01:10 PM

engencore,

That error looks like an mpi problem, I think, you may want to make sure openMPI is installed and happy

Tom

Topics	Statistics	Last Post
TIGR Systems Offer a Compact Alternative to CRISPR for Gene Editing by seqadmin Started by seqadmin, 03-03-2025, 01:15 PM	0 responses 160 views 0 likes	Last Post by seqadmin 03-03-2025, 01:15 PM
Highlights from AGBT 2025 – Part II by seqadmin Started by seqadmin, 02-28-2025, 12:58 PM	0 responses 248 views 0 likes	Last Post by seqadmin 02-28-2025, 12:58 PM
Highlights from AGBT 2025 – Part I by seqadmin Started by seqadmin, 02-24-2025, 02:48 PM	0 responses 622 views 0 likes	Last Post by seqadmin 02-24-2025, 02:48 PM
Selecting the Right AI Model for Bioinformatics Research by seqadmin Started by seqadmin, 02-21-2025, 02:46 PM	0 responses 265 views 0 likes	Last Post by seqadmin 02-21-2025, 02:46 PM

Seqanswers Leaderboard Ad

Announcement

gsRunProcessor: error with v.2.0.00.22

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News