Hi all,
I have some pacbio long read data, about 10x coverage of a 120M genome. I already have the reference genome. However it is not complete and there are many gaps in it. What I am trying to do is to error correct my pacbio sequence and assemble the genome. Later on I will add more illumina data trying to close the gaps.
My question about he error correction is: Can I use the incomplete reference genome to error correct my pacbio data? My plan is to convert the genome fasta into pacBioToCA required frg format. And then feed my pacbio data and the genome frg data to the correction pipeline to output error corrected data. My concern is : will pacBioToCA accept relatively long genome scalfold data as high identity sequence to correct my pacbio data?
Suggestions and help is greatly appreciatedl
Stuart
I have some pacbio long read data, about 10x coverage of a 120M genome. I already have the reference genome. However it is not complete and there are many gaps in it. What I am trying to do is to error correct my pacbio sequence and assemble the genome. Later on I will add more illumina data trying to close the gaps.
My question about he error correction is: Can I use the incomplete reference genome to error correct my pacbio data? My plan is to convert the genome fasta into pacBioToCA required frg format. And then feed my pacbio data and the genome frg data to the correction pipeline to output error corrected data. My concern is : will pacBioToCA accept relatively long genome scalfold data as high identity sequence to correct my pacbio data?
Suggestions and help is greatly appreciatedl
Stuart
Comment