@KevinLam
Indeed, I started the development for color space using these datasets:
However, these data contain too many errors (in color space) to be assembled de novo (in color space), in my opinion. My estimation is that the error rate in color space ranges from 8% to 12% for these two datasets. That would explain the total lack of de novo assemblies performed so far with SOLiD technology.
So, you are free to try Ray with csfasta files, but it is not 100% tested yet.
Perhaps the last version of the SOLiD sequencer produces more reliable readouts, but that I don't know. And I am sure someone else is more aware of that than me on SeqAnswers.com.
Thank you, happy assembly!
***
The Ray Project Team
Indeed, I started the development for color space using these datasets:
However, these data contain too many errors (in color space) to be assembled de novo (in color space), in my opinion. My estimation is that the error rate in color space ranges from 8% to 12% for these two datasets. That would explain the total lack of de novo assemblies performed so far with SOLiD technology.
So, you are free to try Ray with csfasta files, but it is not 100% tested yet.
Perhaps the last version of the SOLiD sequencer produces more reliable readouts, but that I don't know. And I am sure someone else is more aware of that than me on SeqAnswers.com.
Thank you, happy assembly!
***
The Ray Project Team
Comment