Dear All,
I have ~ 100000 DNA sequences in a fasta file. I am interested in finding the overlap graph of these sequences. I just want a simple overlap finder (which takes minOverlapLength and percentIdentityIntheOverlapRegion as inputs). The overlap will be at the end of the sequences (similar to assembly)
I have tried looking into many assembly programs. All the programs I looked gave the contigs as output. Some assembly programs gave information about the reads in each contig. But none of the assembly programs gave any information about the overlap graph, which was used to find the contigs. In addition, since assembly programs work with reads, they look for certain depth of coverage during overlap finding. I do not want this.
Could anyone point me towards a simple overlap-graph finder.
Thanks,
Navin.
I have ~ 100000 DNA sequences in a fasta file. I am interested in finding the overlap graph of these sequences. I just want a simple overlap finder (which takes minOverlapLength and percentIdentityIntheOverlapRegion as inputs). The overlap will be at the end of the sequences (similar to assembly)
I have tried looking into many assembly programs. All the programs I looked gave the contigs as output. Some assembly programs gave information about the reads in each contig. But none of the assembly programs gave any information about the overlap graph, which was used to find the contigs. In addition, since assembly programs work with reads, they look for certain depth of coverage during overlap finding. I do not want this.
Could anyone point me towards a simple overlap-graph finder.
Thanks,
Navin.
Comment