Seqanswers Leaderboard Ad

**javijevi** · 02-09-2010, 10:57 AM

Originally posted by javijevi View Post

Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.
Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.

Just to tell that I made a mistake in copying twice the last two lines of the output.

**nilshomer** · 02-09-2010, 11:57 AM

Originally posted by javijevi View Post

Hi all,

I successfully went along the first steps of BFAST pipeline, including the indexes creation, but got the below copied error when running 'bfast match' step with the following command for a fastq test file with 9 reads:

bfast match -f reference_genome.fa -A 1 -r test.fastq -i 1 -I 2-10 1> matches.bmf 2> match.log &

Contents of match.log:
(...)
Searching index file 1/1 (index #1, bin #1) complete...
Found 4 matches.
Found matches for 4 reads.
Copying unmatched reads for secondary index search.
Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.
Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.

Any idea?

Thanks in advance.

Any reason why you want to use secondary indexes? I would recommend using all the indexes in the primary search (no secondary indexes).

This may be a bug (with the secondary search). Please submit your report to [email protected] so we can resolve the issue quickly.

**nilshomer** · 02-09-2010, 12:54 PM

Originally posted by nilshomer View Post

Any reason why you want to use secondary indexes? I would recommend using all the indexes in the primary search (no secondary indexes).

This may be a bug (with the secondary search). Please submit your report to [email protected] so we can resolve the issue quickly.

I have found the bug and fixed the latest source code available via GIT. Let me know if you have any problems: )

**javijevi** · 02-09-2010, 02:43 PM

Originally posted by nilshomer View Post

Any reason why you want to use secondary indexes? I would recommend using all the indexes in the primary search (no secondary indexes).

In BFAST book, you can find the following: 'If you wish to have a secondary set of indexes, which are used if no matches are found in the main set of indexes, use the -I option'. So, I thought that it was more efficient to not use a mismatch-allowing index, e.g., 1110111110011111, for reads which were already mapped by using an all-matchs index, that is, 11111111111111.

Obviously, I missed something important in this issue because of the complexity of the index-based search algorithm for a biologist, and I therefore will blindly follow your recommendation about not using secondary indexes.

**nilshomer** · 02-09-2010, 03:57 PM

Originally posted by javijevi View Post

In BFAST book, you can find the following: 'If you wish to have a secondary set of indexes, which are used if no matches are found in the main set of indexes, use the -I option'. So, I thought that it was more efficient to not use a mismatch-allowing index, e.g., 1110111110011111, for reads which were already mapped by using an all-matchs index, that is, 11111111111111.

Obviously, I missed something important in this issue because of the complexity of the index-based search algorithm for a biologist, and I therefore will blindly follow your recommendation about not using secondary indexes.

I have spent a lot of time thinking about the indexing strategy and I would follow the strategy found in section 7.1 where we use 10 "main" indexes and no secondary indexes.

I apologize for the confusion but I tried to keep options for flexibility.

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 12 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

BFAST error in FindMatchesInIndexSet function

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News