Hi,
I have some genomes that I will be uploading to NCBI soon. I have been told that all N's need to be removed and the contigs split at this position.
I am new to command line interface so I was hoping someone could recommend a program and simple script that could do this for me. I would like to remove all N's and then split the contig at the location of the N's results in two new contigs. For example
Contig 1: ATCGGATAANNNNNNNNNATCGCCGAT
Contig 1.1: ATCGGATAA
Contig 1.2 ATCGCCGAT
Thanks!
I have some genomes that I will be uploading to NCBI soon. I have been told that all N's need to be removed and the contigs split at this position.
I am new to command line interface so I was hoping someone could recommend a program and simple script that could do this for me. I would like to remove all N's and then split the contig at the location of the N's results in two new contigs. For example
Contig 1: ATCGGATAANNNNNNNNNATCGCCGAT
Contig 1.1: ATCGGATAA
Contig 1.2 ATCGCCGAT
Thanks!
Comment