Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
ERROR: [bcf_sync] incorrect number of fields.. vinodhsri Bioinformatics 21 08-11-2017 02:42 PM
Error: number of labels must match number of conditions in cuffdiff bvk Bioinformatics 12 05-20-2015 12:41 PM
Error: number of labels must match number of conditions shpyang Bioinformatics 2 08-15-2013 02:45 PM
PubMed: Ray: simultaneous assembly of reads from a mix of high-throughput sequencing Newsbot! Literature Watch 0 03-01-2011 11:30 AM

Thread Tools
Old 09-21-2015, 05:54 AM   #1
Location: UK

Join Date: Nov 2014
Posts: 14
Default Ray Error: miscount of the number of reads

Dear all,

I am trying to run Ray to de novo assemble a nematode genome.
I run into the following error:

mpirun \
-n 32 \
/mnt/Programs/Ray-2.3.1/Ray \
-k 81 \
-o rayk81 \
-p ../../clean_reads/ ../../clean_reads/ \
-p ../../clean_reads/ ../../clean_reads/ \
-p ../../clean_reads/ ../../clean_reads/ \
-p ../../clean_reads/ ../../clean_reads/


Rank 7: File ../../clean_reads/ (Number 7) has 10233322 sequences
Rank 6: File ../../clean_reads/ (Number 6) has 10231913 sequences
Rank 5: File ../../clean_reads/ (Number 5) has 10722610 sequences
Rank 2: File ../../clean_reads/ (Number 2) has 14151655 sequences
Rank 4: File ../../clean_reads/ (Number 4) has 10722031 sequences
Rank 3: File ../../clean_reads/ (Number 3) has 14152522 sequences
Rank 0: File ../../clean_reads/ (Number 0) has 100860164 sequences
Rank 1: File ../../clean_reads/ (Number 1) has 100860164 sequences
Rank 0 wrote rayk81/NumberOfSequences.txt
Rank 0 wrote rayk81/SequencePartition.txt

Rank 0 : Error, ../../clean_reads/ contains 14151655 sequences and ../../clean_reads/ contains 14152522 sequences (must be the same)
The problem detected by Ray (not the same number of sequences in the left and right read files) is wrong. Actually Ray does not seem to correctly count the sequences:

grep -c '^>' ../../clean_reads/
grep -c '^>' ../../clean_reads/  
A head of my read files looks completely normal (regular multiline fasta).
I have also run other assemblers successfully on this data, so I know there is no format problem with the files.

Any insights on what could be causing this problem?

Best Wishes,
standonn is offline   Reply With Quote

de novo assemby, genome assembly, ray

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 04:05 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2018, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO