![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Installing SplitRead | pepsimax | Bioinformatics | 0 | 06-14-2012 03:26 AM |
installing goseq | chknbio | Bioinformatics | 2 | 05-29-2012 02:00 PM |
problems installing package goseq | PFS | Bioinformatics | 2 | 05-29-2012 01:18 PM |
installing birdsuite | rworthi | Bioinformatics | 3 | 12-03-2011 01:04 AM |
Installing CisGenome | Mich_Scientist | Bioinformatics | 3 | 12-18-2010 05:17 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]()
Hi,
I am trying to install SMRTanalysis package from pacbio on SUSE Linux server.
Then run the command '/smartanalysis/analysis/bin/smrtpipe.py' and got an error message as below: Traceback (most recent call last): File "/data1/smartanalysis/analysis/bin/smrtpipe.py", line 4, in <module> import pkg_resources File "/usr/local/python2.7/lib/python2.7/site-packages/distribute-0.6.34-py2.7.egg/pkg_resources.py", line 2803, in <module> working_set.require(__requires__) File "/usr/local/python2.7/lib/python2.7/site-packages/distribute-0.6.34-py2.7.egg/pkg_resources.py", line 696, in require needed = self.resolve(parse_requirements(requirements)) File "/usr/local/python2.7/lib/python2.7/site-packages/distribute-0.6.34-py2.7.egg/pkg_resources.py", line 594, in resolve raise DistributionNotFound(req) pkg_resources.DistributionNotFound: pbpy==0.1 Its some errors related to python. The current python path its searching for is (/usr/local/python2.7/) which is our local python install. I know SMRTanalysis have python bundled under folder (smartanalysis/analysis/lib/python2.7). But it always search in local python directory and throws error. I have setup the path for "SEYMOUR_HOME" in setup.sh. Any suggestions regarding this? |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
The bundled python is required for things to work. Try:
Code:
source /opt/smrtanalysis-1.4.0/etc/setup.sh Code:
which python Code:
/opt/smrtanalysis-1.4.0/redist/python2.7/bin/python |
![]() |
![]() |
![]() |
#3 |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]()
Hi rhall,
Thanks for your reply. After I source setup.sh, it displayed me correct python version. However, when I tried to run smrtpipe.py, I got following errors Code:
Traceback (most recent call last): File "/data1/smrtanalysis-1.4.0/analysis/bin/smrtpipe.py", line 5, in <module> pkg_resources.run_script('pbpy==0.1', 'smrtpipe.py') File "build/bdist.linux-i686/egg/pkg_resources.py", line 489, in run_script keys.append(dist.key) File "build/bdist.linux-i686/egg/pkg_resources.py", line 1207, in run_script def __init__(self,module): File "/data1/smrtanalysis-1.4.0/analysis/lib/python2.7/pbpy-0.1-py2.7.egg/EGG-INFO/scripts/smrtpipe.py", line 11, in <module> from pbpy.smrtpipe.SmrtPipeMain import SmrtPipeMain, _sanityCheck File "/data1/smartanalysis/analysis/lib/python2.7/pbpy-0.1-py2.7.egg/pbpy/smrtpipe/SmrtPipeMain.py", line 22, in <module> from pbpy.smrtpipe.engine.SmrtCloud import SmrtCloudWorkflow File "/data1/smartanalysis/analysis/lib/python2.7/pbpy-0.1-py2.7.egg/pbpy/smrtpipe/engine/SmrtCloud.py", line 9, in <module> from pbpy.smrtpipe.engine.SmrtPipeWorkflow import SmrtPipeWorkflow File "/data1/smartanalysis/analysis/lib/python2.7/pbpy-0.1-py2.7.egg/pbpy/smrtpipe/engine/SmrtPipeWorkflow.py", line 35, in <module> from pbpy.smrtpipe.engine.SmrtDAG import SMRTDAG File "/data1/smartanalysis/analysis/lib/python2.7/pbpy-0.1-py2.7.egg/pbpy/smrtpipe/engine/SmrtDAG.py", line 25, in <module> from pbpy.plot.PlotHelpers import makeHBarPlotPng File "/data1/smartanalysis/analysis/lib/python2.7/pbpy-0.1-py2.7.egg/pbpy/plot/PlotHelpers.py", line 10, in <module> import matplotlib.pyplot as plt File "/data1/smartanalysis/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/pyplot.py", line 23, in <module> from matplotlib.figure import Figure, figaspect File "/data1/smartanalysis/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/figure.py", line 18, in <module> from axes import Axes, SubplotBase, subplot_class_factory File "/data1/smartanalysis/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/axes.py", line 14, in <module> import matplotlib.axis as maxis File "/data1/smartanalysis/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/axis.py", line 10, in <module> import matplotlib.font_manager as font_manager File "/data1/smartanalysis/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/font_manager.py", line 52, in <module> from matplotlib import ft2font ImportError: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.11' not found (required by /data1/smartanalysis/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/ft2font.so) Thanks |
![]() |
![]() |
![]() |
#4 | |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,087
|
![]() Quote:
On a local cluster with a working install of SMRTanalysis 1.4.0 there is only one "libstdc++.so.6" and the latest I see is GLIBCXX_3.4.8, if I do Code:
strings /usr/lib/libstdc++.so.6 | grep GLIBC Code:
strings /opt/smrtanalysis-1.4.0/analysis/lib/python2.7/matplotlib-1.0.1-py2.7-linux-x86_64.egg/matplotlib/ft2font.so | grep GLIBC |
|
![]() |
![]() |
![]() |
#5 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
Sorry, I notice that you are using SUSE. SMRTanalysis is only distributed for Ubuntu 10.04 and Centos 5.6. While Ubuntu is SUSE based getting SMRTanalysis to work will likely prove futile given the version differences for things like glibc. Unfortunately, given the complexity of the system, building from source on different Linux distributions is not an option. Depending on what you have planned there are three options:
1. Install Ubuntu 10.04 - Not generally practical, but the best option if you intend using SMRTanalysis a lot, or installing the web server for other people to use. 2. Install Ubuntu 10.04 (the server version has the smallest footprint) in a virtual machine (VM) that runs on your SUSE system. This is probably the best option, it will give good performance, but the VM will likely take up a lot of disk space. For setting up SUSE as a VM host: http://doc.opensuse.org/documentatio.../book.kvm.html Or. http://www.oracle.com/technetwork/se...ads/index.html I would highly recommend virtualbox, it is very easy to use. 3. If you only want to try SMRTanalysis and are not going to do any heavy computation, or simply as a test to see if you want to go to the effort of installing it in a VM then the Amazon AMI route is very easy, but has a cost associated with it. |
![]() |
![]() |
![]() |
#6 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]() Last edited by rhall; 04-10-2013 at 01:59 PM. |
![]() |
![]() |
![]() |
#7 |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]()
Dear rhall and GenoMax,
Thanks you very much for your help and comments regarding installation. We built new ubuntu server and installed SMRTanalysis correctly ![]() I want to try AHA pipeline to improve existing assembly with pacbio data. ![]() Thanks Sagar |
![]() |
![]() |
![]() |
#8 |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]()
Hello,
Again some problems. I was able to run the smrtpipe.py command without any errors. However when I tried to run the SMRTpipe example as given in http://pacb.com/devnet/files/softwar...ce%20Guide.pdf The data is located at: Code:
/opt/smrtanalysis/common/test/smrtpipe/lambda_resequencing/* Code:
fofnToSmrtpipeInput.py lambda_resequencing.fofn > input.xml Code:
/opt/smrtanalysis/smartanalysis/common/protocols/lambda_RS_Resequencing.1.xml Few error lines I see in master.log file are: Code:
[DEBUG] 2013-02-19 11:21:48,983 [pbpy.io.MetaAnalysisXml load 116] No header found in input.xml. Unable to load jobId [DEBUG] 2013-02-19 11:21:48,984 [pbpy.smrtpipe.InputData loadXml 214] Skipping assignment of JobId. Unable to find header in input.xml [INFO] 2013-02-19 11:45:30,281 [pbpy.smrtpipe.SmrtPipeContext movieFiles 365] Found /data2/smart/smartanalysis/common/test/primary/lambda/Analysis_Results/m120404_104101_00114_c100318002550000001523015908241265_s1_p0.bas.h5 (81059282 bytes) [WARNING] 2013-02-19 11:45:30,282 [pbpy.smrtpipe.SmrtPipeMain _getBasVersions 456] Unable to correctly Parse the basH5 versions. Allowing job to proceed, but please fix the compatibility matrix under $SEYMOUR_HOME/common/etc. Error unable to create file (File accessability: Unable to open file) Can anybody post the xml files that worked. Instructions for creating appropriate xml files are way beyond understanding of biologist. More clarification is needed from pacbio ![]() ![]() Thanks Sagar |
![]() |
![]() |
![]() |
#9 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
The protocols in /opt/smartanalysis/common/protocols/ are not for use with smrtpipe.py, they are templates used with the SMRT portal web interface. The settings.xml file used with smrtpipe.py can be much simpler, but could also include lots of parameters. The simplest settings.xml for filtering, mapping, and calling consensus on lambda data (using all default parameters)
Code:
<?xml version="1.0" encoding="UTF-8" standalone="yes"?> <smrtpipeSettings> <protocol id="lambda_resequencing.1"> <param name="reference"> <value>common/references/lambda</value> </param> </protocol> <moduleStage name="fetch" editable="true"> <module label="Fetch v1" id="P_Fetch"> </module> </moduleStage> <moduleStage name="filtering"> <module label="Filter" id="P_Filter"> </module> </moduleStage> <moduleStage name="mapping" editable="true"> <module label="BLASR v1" id="P_Mapping"> </module> </moduleStage> <moduleStage name="consensus" editable="true"> <module label="Quiver v1" id="P_GenomicConsensus"> </module> </moduleStage> </smrtpipeSettings> P.S. the [WARNING] in the log output is really just a warning, real errors will be tagged [ERROR], and an exit will be tagged [CRITICAL]. The reason that you got zero output but no [ERROR] or [CRITICAL] is that the xml file you used is a template so includes some of the necessary input, without including the tags that do the computation. |
![]() |
![]() |
![]() |
#10 |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,087
|
![]()
I verified that "Rhall's" minimal settings.xml file does indeed work with SMRTpipe v.1.4 on the command line with lambda test data . Thanks Rhall!
PacBio command line is not for the faint of heart. One problem with previous versions of the web interface was that the "role/account based" restrictions for SMRTcell data did not work (not good in a core environment where everyone could see/access all data). They have supposedly been fixed in SMRTanalysis v.1.4. Rhall: By chance is that something you have looked at? Last edited by GenoMax; 02-19-2013 at 10:45 AM. |
![]() |
![]() |
![]() |
#11 | |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]()
Dear rhall and GenoMax,
Thanks for the reply. I was able to run the analysis with rhalls minimal settings.xml. Meanwhile our system admin setup the SMRTportal (pretty quick ![]() Quote:
In SMRTportal, I selected the "RS_AHA_scaffolding" protocol and "Soap_scaffolds.fasta" as reference. Now how can I input the pacbio data and run the AHA algorithm? I guess using error corrected data would be best. Any suggestions? Thanks Sagar |
|
![]() |
![]() |
![]() |
#12 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
Yes in 1.4 jobs and raw data can be assigned to a group, then only users in that group can access and see the data.
|
![]() |
![]() |
![]() |
#13 | |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]() Quote:
my raw data looks lie this: Code:
long_m130123_002504_42153_c100461682550000001523059505101395_s1_p0.bas.h5 m130123_002504_42153_c100461682550000001523059505101395_s1_p0-02.log m130123_002504_42153_c100461682550000001523059505101395_s1_p0-03.log m130123_002504_42153_c100461682550000001523059505101395_s1_p0-04.log m130123_002504_42153_c100461682550000001523059505101395_s1_p0.bas.h5 m130123_002504_42153_c100461682550000001523059505101395_s1_p0.ccs.fasta m130123_002504_42153_c100461682550000001523059505101395_s1_p0.ccs.fastq m130123_002504_42153_c100461682550000001523059505101395_s1_p0.fasta m130123_002504_42153_c100461682550000001523059505101395_s1_p0.fastq m130123_002504_42153_c100461682550000001523059505101395_s1_p0.sts.csv m130123_002504_42153_c100461682550000001523059505101395_s1_p0.sts.xml strobe_m130123_002504_42153_c100461682550000001523059505101395_s1_p0.bas.h5 Code:
filtered_subreads_CF080.fastq Corrected_pacbio.fasta Code:
[INFO] 2013-02-19 16:21:52,513 [pbpy.smrtpipe.SmrtPipeScope fitRefLengthToScope 95] Total length of reference is 7024.52 kbp [INFO] 2013-02-19 16:21:52,514 [pbpy.smrtpipe.SmrtPipeScope fitRefLengthToScope 99] Reference scope is huge [INFO] 2013-02-19 16:21:52,514 [pbpy.smrtpipe.modules.HybridAssembly run 452] Genome scope is large enough to potentially slow down nucmer repeat detection, so refusing to run. Not running nucmer can increase false positive scaffolds links induced by repeats. To allow nucmer execution increase DENOVO_GENOME_SCOPES in smrtpipe.rcor on command line with e.g. -DDENOVO_GENOME_SCOPES=small:1,large:1,huge:1. ValueError: invalid literal for int() with base 10: '' [ERROR] 2013-02-19 16:21:52,550 [pbpy.smrtpipe.SmrtPipeMain exit 760] invalid literal for int() with base 10: '' ![]() Thanks Sagar |
|
![]() |
![]() |
![]() |
#14 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
sagarutturkar,
The basic input type into SMRT portal is PacBio raw data. You should be able to Import SMRT Cells from the 'import and manage' tab, pointing it to the directory structure of the data that comes off the machine. The filtered_subreads.fastq is useful for using non PacBio software, but you should use raw data for SMRT portal workflows. Once you have the data imported you should be able to run the RS_AHA_scaffolding workflow, with the reference you have imported, and the raw data. I'm not aware of a method for using the corrected reads (pacbioToCA) to scaffold an assembly without going to the command line, and outside of the SMRT portal / pipe system. |
![]() |
![]() |
![]() |
#15 | |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
Sorry,
Quote:
|
|
![]() |
![]() |
![]() |
#16 | |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
The raw data is not complete, it should look like this:
Quote:
|
|
![]() |
![]() |
![]() |
#17 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
Command line AHA including howto to go from filtered subreads and corrected reads:
https://github.com/PacificBioscience...ining/wiki/AHA |
![]() |
![]() |
![]() |
#18 | |
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]() Quote:
Code:
[ERROR] 2013-02-19 16:21:52,547 [pbpy.smrtpipe.SmrtPipeMain run 648] invalid literal for int() with base 10: ![]() About SMRT cells importing - kind of weired things happening. I downloaded the complete data and imported it again. It showed a message "1 SMRT cell imported". However, in design jobs I do not see any SMRT cells available. I tried to use search option but no luck. Does import of SMRT cells need to be approved by administrator? I might be doing some silly mistake or missing some step ![]() When I tried to re-import from same location, I got message "No New SMRT cells found". Have you noticed something like this before. Thanks Sagar |
|
![]() |
![]() |
![]() |
#19 |
Senior Member
Location: San Francisco Join Date: Aug 2012
Posts: 322
|
![]()
The [ERROR] is a python exception, could you post the entire log so we can get some more context?
The SMRT cell should show up, it does not need to be approved, my only thought is that maybe you do not have a group assigned to you. If it is just a personal copy of SMRT portal, log in as administrator and check under 'admin' that you are in group 'all' (note you can place a user in multiple groups by 'ctrl' clicking), also check that your role is 'scientist' (although I'm not sure what the different roles actually mean). SMRT Cells will only import once, any further attempt will give 'No New SMRT cells found', so it sounds like they are there somewhere. |
![]() |
![]() |
![]() |
#20 | ||
Member
Location: Tennessee, USA Join Date: Sep 2010
Posts: 61
|
![]() Quote:
I have attached log file for the job that failed through SMRTportal (CF80_AHA.zip). Quote:
Many thanks for your help. -Sagar |
||
![]() |
![]() |
![]() |
Thread Tools | |
|
|