
**javijevi** · 02-09-2010, 10:57 AM

Originally posted by javijevi

Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.
Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.

Just to note: I mistakenly copied the last two lines of the output twice.

**nilshomer** · 02-09-2010, 11:57 AM

Originally posted by javijevi

Hi all,

I successfully completed the first steps of the BFAST pipeline, including index creation, but got the error copied below when running the 'bfast match' step with the following command on a test FASTQ file with 9 reads:

bfast match -f reference_genome.fa -A 1 -r test.fastq -i 1 -I 2-10 1> matches.bmf 2> match.log &

Contents of match.log:
(...)
Searching index file 1/1 (index #1, bin #1) complete...
Found 4 matches.
Found matches for 4 reads.
Copying unmatched reads for secondary index search.
Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.
Splitting unmatched reads into temp files.
bfast: RunMatch.c:718: FindMatchesInIndexSet: Assertion `numReads == numWritten' failed.

Any idea?

Thanks in advance.

Any reason why you want to use secondary indexes? I would recommend using all the indexes in the primary search (no secondary indexes).

This may be a bug (with the secondary search). Please submit your report to [email protected] so we can resolve the issue quickly.
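For readers unfamiliar with the main/secondary split discussed here, the following is a minimal Python sketch of a two-tier search: reads are first looked up in the main indexes, and only reads with no hits are passed to the secondary ones. It is not BFAST's code. The function `two_tier_search`, the toy lookup tables, and the count check are all hypothetical, and the check is only loosely analogous to the kind of bookkeeping invariant named in the failed assertion; what the real check in RunMatch.c covers is not shown in this thread.

```python
from typing import Callable, Dict, List, Sequence

# A lookup takes a read and returns candidate positions (empty list = no match).
Lookup = Callable[[str], List[int]]


def two_tier_search(reads: Sequence[str], main: Lookup, secondary: Lookup) -> Dict[int, List[int]]:
    """Search the main lookup first; send only unmatched reads to the secondary one."""
    hits_by_read: Dict[int, List[int]] = {}
    unmatched: List[int] = []

    for i, read in enumerate(reads):
        hits = main(read)
        if hits:
            hits_by_read[i] = hits
        else:
            unmatched.append(i)

    # Bookkeeping check: every read must be either matched or queued, never lost.
    if len(hits_by_read) + len(unmatched) != len(reads):
        raise RuntimeError("read count mismatch while splitting unmatched reads")

    for i in unmatched:
        hits_by_read[i] = secondary(reads[i])

    return hits_by_read


# Toy stand-ins for real indexes (hypothetical data).
MAIN_TABLE = {"AAAA": [10]}
SECONDARY_TABLE = {"CCCC": [42]}


def main_lookup(read: str) -> List[int]:
    return MAIN_TABLE.get(read, [])


def secondary_lookup(read: str) -> List[int]:
    return SECONDARY_TABLE.get(read, [])


if __name__ == "__main__":
    print(two_tier_search(["AAAA", "CCCC", "GGGG"], main_lookup, secondary_lookup))
```

With all indexes placed in the main set, as recommended above, the second pass simply never runs.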

**nilshomer** · 02-09-2010, 12:54 PM

Originally posted by nilshomer

Any reason why you want to use secondary indexes? I would recommend using all the indexes in the primary search (no secondary indexes).

This may be a bug (with the secondary search). Please submit your report to [email protected] so we can resolve the issue quickly.

I have found the bug and fixed it in the latest source code, available via Git. Let me know if you have any problems :)

**javijevi** · 02-09-2010, 02:43 PM

Originally posted by nilshomer

Any reason why you want to use secondary indexes? I would recommend using all the indexes in the primary search (no secondary indexes).

In the BFAST book, you can find the following: 'If you wish to have a secondary set of indexes, which are used if no matches are found in the main set of indexes, use the -I option'. So I thought it was more efficient not to use a mismatch-allowing index, e.g., 1110111110011111, for reads that had already been mapped using an all-matches index, that is, 11111111111111.
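To illustrate what "mismatch-allowing" means for a mask like the one quoted, here is a small Python sketch of the general spaced-seed idea: only positions marked 1 are compared, so a difference at a 0 position does not prevent a match. This is not how BFAST builds or searches its indexes internally. The helper names, the example sequences, and the all-ones mask of equal length used for comparison are assumptions made for illustration.

```python
def mask_key(seq: str, mask: str) -> str:
    """Keep only the bases at positions where the mask has a '1'."""
    if len(seq) != len(mask):
        raise ValueError("sequence and mask must have the same length")
    return "".join(base for base, bit in zip(seq, mask) if bit == "1")


def matches_under_mask(read: str, ref: str, mask: str) -> bool:
    """Two sequences match under a mask if their masked keys are identical."""
    return mask_key(read, mask) == mask_key(ref, mask)


SPACED_MASK = "1110111110011111"          # mask quoted in the post
CONTIGUOUS_MASK = "1" * len(SPACED_MASK)  # hypothetical all-ones mask of the same length

REFERENCE_WINDOW = "ACGTACGTACGTACGT"     # made-up reference window
READ_DIFF_AT_ZERO = "ACGAACGTACGTACGT"    # differs at index 3, where the mask is 0
READ_DIFF_AT_ONE = "ACGTTCGTACGTACGT"     # differs at index 4, where the mask is 1

if __name__ == "__main__":
    # Mismatch hidden by a 0 position: the spaced mask still matches.
    print(matches_under_mask(READ_DIFF_AT_ZERO, REFERENCE_WINDOW, SPACED_MASK))      # True
    # The all-ones mask requires every base to agree.
    print(matches_under_mask(READ_DIFF_AT_ZERO, REFERENCE_WINDOW, CONTIGUOUS_MASK))  # False
    # Mismatch at a 1 position: the spaced mask no longer matches.
    print(matches_under_mask(READ_DIFF_AT_ONE, REFERENCE_WINDOW, SPACED_MASK))       # False
```

A single mask tolerates mismatches only at its own 0 positions, so different masks cover different mismatch positions.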

Obviously, I missed something important here, since the index-based search algorithm is complex for a biologist, so I will blindly follow your recommendation not to use secondary indexes.

**nilshomer** · 02-09-2010, 03:57 PM

Originally posted by javijevi

In the BFAST book, you can find the following: 'If you wish to have a secondary set of indexes, which are used if no matches are found in the main set of indexes, use the -I option'. So I thought it was more efficient not to use a mismatch-allowing index, e.g., 1110111110011111, for reads that had already been mapped using an all-matches index, that is, 11111111111111.

Obviously, I missed something important here, since the index-based search algorithm is complex for a biologist, so I will blindly follow your recommendation not to use secondary indexes.

I have spent a lot of time thinking about the indexing strategy, and I would follow the strategy in section 7.1, where we use 10 "main" indexes and no secondary indexes.

I apologize for the confusion; I tried to keep the options flexible.


BFAST error in FindMatchesInIndexSet function
