![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
parallel de novo assembler | tmy1018 | Bioinformatics | 3 | 10-22-2012 09:31 AM |
PubMed: A Comparison of Parallel Pyrosequencing and Sanger Clone-Based Sequencing and | Newsbot! | Literature Watch | 0 | 11-01-2011 03:00 AM |
Contrail - a hadoop-based de novo sequence assembler | samanta | General | 0 | 09-08-2011 12:16 PM |
looking for reference genome based assembler for short-reads | zchou | Bioinformatics | 3 | 12-16-2009 09:13 PM |
PubMed: ABySS: A parallel assembler for short read sequence data. | Newsbot! | Literature Watch | 0 | 03-03-2009 06:00 AM |
![]() |
|
Thread Tools |
![]() |
#261 | ||||
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]() Quote:
http://seqanswers.com/forums/showthread.php?t=26278 Paper: http://genomebiology.com/2012/13/12/R122 Quote:
Quote:
Quote:
Séb |
||||
![]() |
![]() |
![]() |
#262 |
Junior Member
Location: United States Join Date: Mar 2013
Posts: 9
|
![]()
I'm trying to use Ray on some environmental metagenomes, but every time I set up a run, the output contig and scaffold files are empty. I haven't seen any indication during the assembly run, and all sequences are recognized by Ray. One possibility that comes to mind is from some errors that came up while compiling. Specifically, when trying to use the make install command, an error pops up saying there is no README file in ray-build/RayPlatform. If I place the README file in RayPlatform and try again, I get an error saying there is already a file named RayPlatform.
Edit: After reinstalling Ray with the developer package, there are no installation errors. As of now, I'm stumped why there are no assembled sequences in either of the output Contigs.fasta or Scaffolds.fasta files. Last edited by bossanova352; 01-12-2014 at 12:25 AM. |
![]() |
![]() |
![]() |
#263 |
Junior Member
Location: The Netherlands Join Date: Apr 2010
Posts: 7
|
![]()
I'm trying to use Ray with a mix of Illumina SE and 454 data using the -amos
mpiexec -n 10 Ray -s illumina.fasta -s 454.fasta -k 31 -amos -o mix 454 reads have an avg length of 396bp and the illumina reads are 60bp The resulting amos file AMOS.afg can be browsed using Tablet, but I noticed that the original read names are converted to just a number and thus making it impossible to easily trace which reads end up in a particular contig. Is it possible to save the original readnames in the AMOS output? |
![]() |
![]() |
![]() |
#264 | |
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]() Quote:
Do you have any error in your standard output file ? ("grep Error log.stdout") |
|
![]() |
![]() |
![]() |
#265 | |
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
#266 | |||
Junior Member
Location: United States Join Date: Mar 2013
Posts: 9
|
![]() Quote:
Quote:
Quote:
mpiexec -n 10 Ray -k 57 -i ../SFBloom_paired_trimmed_1.fa -o RayOutputTest Last edited by bossanova352; 01-13-2014 at 12:03 PM. |
|||
![]() |
![]() |
![]() |
#267 | |
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
#268 | ||
Junior Member
Location: United States Join Date: Mar 2013
Posts: 9
|
![]() Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#269 |
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]() |
![]() |
![]() |
![]() |
#270 | |||
Junior Member
Location: United States Join Date: Mar 2013
Posts: 9
|
![]() Quote:
Quote:
Quote:
Last edited by bossanova352; 01-14-2014 at 11:34 AM. |
|||
![]() |
![]() |
![]() |
#271 | |
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]() Quote:
Can you provide a couple of lines from CoverageDistribution.txt (head) ? |
|
![]() |
![]() |
![]() |
#272 | ||
Junior Member
Location: United States Join Date: Mar 2013
Posts: 9
|
![]() Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#273 |
Senior Member
Location: Québec, Canada Join Date: Jul 2008
Posts: 260
|
![]()
Hi,
Ray 2.3.1 is now available on http://denovoassembler.sourceforge.net/download.html. Significant changes: * This version includes "Surveyor" to compute similarity (or distance) matrices for hundreds or possibli thousands of samples. * fix compilation error on Apple OS X Mavericks * fix infinite loop when running on 2 CPU cores * fix a bug when the number of ranks is a prime number All changes in Ray: Rob Egan (1): fix compilation on NERSC's edison machine using PrgEnv-intel Sébastien Boisvert (30): SequencesLoader: fix bad automatic pairing of sequence files SequencesLoader: fix compilation warnings Surveyor: verify buffer size before getting producer Surveyor: add a variable to store the period Surveyor: run in actor-model-only mode spawn actors with spawn instead of spawnActor Documentation: add some documentation for Surveyor SeedExtender: add some assertions Searcher: disable verbose outputs Surveyor: skip invalid files coloring: added comments for coloring subsystem update release procedure next release will be 2.3.1 fix infinite loop when running on 2 CPU cores fix a bug when the number of ranks is a prime number print number of payloads add some code to test directed surveys with Surveyor fix reproducibility issue for similarity and distance matrices Surveyor: support nucleotides in lower case report invalid edges as warnings instead of errors documentation: add license in README Surveyor: report 0 hits when necessary SeedingData: provide prototypes for friend functions Surveyor: fix compilation issue without debug code seeds: add a parameter -minimum-seed-length (default 100) add option -graph-only to stop after graph building fix compilation error on Apple OS X Mavericks use CONFIG_ASSERT instead of ASSERT for optional code version 2.3.1 update releases Changes in RayPlatform: Rob Egan (1): fix compilation on NERSC's edison machine using PrgEnv-intel Sébastien Boisvert (15): communication: relay buffer bytes instead of buffer 64-bit integers core: add a actor-model-only mode actors: add playground status with -debug core: add buffer statistics with -debug actor model: change the method name from spawnActor to spawn fix the code for testing message integrity fix a regression introduced in a01f97eae41bcd759bfc521d84053552cf38d521 files: add method to check if a file is valid add mini-rank information in the message metadata fix mini-rank runtime engine print registered message tags in debug mode documentation: add LGPLv3 info in README communication: some routes don't require routing use CONFIG_ASSERT instead of ASSERT for optional code fix compilation warning |
![]() |
![]() |
![]() |
#274 |
Junior Member
Location: Nebraska Join Date: Nov 2012
Posts: 2
|
![]()
Hello,
When I look through some outputs generated from the amos file following assembly, many of the contigs were assigned 0 reads (used default bank2contig after seeing many contigs were not showing up in the generated sam file). Obviously, this does not make much sense, but I was wondering if anyone else has came across this? I was trying to avoid mapping by using the amos file and now I just want to confirm that the contigs I am getting are 'real' I suppose. I thought this may be due to read recycling at first, but reads show up under multiple contigs still. Anyone have other ideas what is causing this issue or how to correct it during assembly? Chris |
![]() |
![]() |
![]() |
#275 |
Member
Location: NJ Join Date: Oct 2012
Posts: 97
|
![]()
I am trying to assemble 275 paired end Illumina reads that I have interleaved together. Previously I was successfuly ran the interleaved files at the Kmer value 137. I compiled latest Ray version at Max Kmer size of 600 (technically 599).
That code was: Code:
mpiexec -n 30 Ray -k 137 -i interleaved.fastq -o Ray_K137 Now if I try a smaller Kmer value, I am running into a weird error Chunk Size error. I have tried: Code:
mpiexec -n 10 Ray -k 51 -i interleaved.fastq -o Ray_K51_try3 Code:
mpiexec -n 30 Ray -k 51 -i interleaved.fastq -o Ray_K51_try3 Code:
Rank 0 : VirtualCommunicator (service provided by VirtualCommunicator): 2957916 virtual messages generated 115295 real messages (3.89785%) Rank 0 freed 549453824 bytes from the path memory pool (chunks: 131) Rank 0: gossiping generated 0 messages (gossips: 0 ---> 0) Critical exception: The length of the requested memory exceeds the CHUNK_SIZE: 36423920 > 33554432 Ray: RayPlatform/memory/MyAllocator.cpp:97: void* MyAllocator::allocate(int): Assertion `false' failed. [BioLinux301:05209] *** Process received signal *** [BioLinux301:05209] Signal: Aborted (6) [BioLinux301:05209] Signal code: (-6) [BioLinux301:05209] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0x10340) [0x7f4a34d27340] [BioLinux301:05209] [ 1] /lib/x86_64-linux-gnu/libc.so.6(gsignal+0x39) [0x7f4a34987bb9] [BioLinux301:05209] [ 2] /lib/x86_64-linux-gnu/libc.so.6(abort+0x148) [0x7f4a3498afc8] [BioLinux301:05209] [ 3] /lib/x86_64-linux-gnu/libc.so.6(+0x2fa76) [0x7f4a34980a76] [BioLinux301:05209] [ 4] /lib/x86_64-linux-gnu/libc.so.6(+0x2fb22) [0x7f4a34980b22] [BioLinux301:05209] [ 5] Ray() [0x533b50] [BioLinux301:05209] [ 6] Ray() [0x4f7552] [BioLinux301:05209] [ 7] Ray() [0x551768] [BioLinux301:05209] [ 8] Ray() [0x5550ab] [BioLinux301:05209] [ 9] Ray() [0x5562ea] [BioLinux301:05209] [10] Ray() [0x413379] [BioLinux301:05209] [11] Ray() [0x40c5bf] [BioLinux301:05209] [12] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf5) [0x7f4a34972ec5] [BioLinux301:05209] [13] Ray() [0x40e0cf] [BioLinux301:05209] *** End of error message *** zsh: abort Ray -k 51 -i interleaved.fastq -o Ray_K51_try3 Really appreciate on how to proceed. Last edited by Zapages; 05-08-2015 at 05:22 AM. |
![]() |
![]() |
![]() |
#276 |
Member
Location: Netherlands Join Date: Apr 2011
Posts: 16
|
![]()
Maybe this question was already asked somewhere, but I can not find it:
Is there a way to set the maximum insert size for paired end assembly with Ray? If not, what is the maximum insert size considered? I have an assembly which uses both normal insert size Illumina reads ( ~ 250 bp) and some longer insert sizes ( ~ 500 bp). When adding this last library, the results do not improve, which I think is suspicious.. Any ideas? |
![]() |
![]() |
![]() |
#277 |
Member
Location: Germany/Netherlands Join Date: Feb 2014
Posts: 98
|
![]()
Contigs, or scaffolds?
Have you tried giving the a possible distance for the reads to the assembler? |
![]() |
![]() |
![]() |
#278 |
Junior Member
Location: United states Join Date: Aug 2015
Posts: 1
|
![]()
Hi Folks,
I have serious problem with Ray and open mpi I am using a cluster with 4 nodes each has 8 cores and surprisingly when I run ray on single node with mpirun -np 8 it takes shorter time than I use two nodes and so on for example for one node it takes 5mins and for two nodes mpirun -np16 it taks 8min and for 3 nodes mpirun -24 it takes 12 mins and so on can any body please help me to find out the problem |
![]() |
![]() |
![]() |
#279 |
Member
Location: Iowa Join Date: Oct 2012
Posts: 41
|
![]()
Can you explain Ray Surveyor in a bit more detail? I'm having a hard time understanding the documentation but I think this could be of use to me.
|
![]() |
![]() |
![]() |
Tags |
assembler, genome, illumina, mix |
Thread Tools | |
|
|