Hello,
I am using Mosaik to assemble reads against a reference. Here is my problem. The total data set consists of 18 million reads. After filtering the reads for certain criteria I am interested in, I am down to 6 million reads. To get these 6 million reads into a separate file, I converted the FASTQ to FASTA and then used xdget to retrieve the reads of interest. I now have a FASTA file containing the 6 million reads. For the "Build" portion of Mosaik, I need either a FASTQ file, or a FASTA file with an accompanying file of base quality scores.
Does anyone know of an equivalent way to retrieve specific sequences from a FASTQ file (as xdget does for FASTA)?
Thanks,
John
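One possible approach, sketched in Python with only the standard library: collect the read IDs from the filtered FASTA file, then stream through the original FASTQ and keep the records whose ID is in that set. This is a rough sketch and not an xdget equivalent. It assumes each FASTQ record is exactly four lines (no wrapped sequence or quality lines) and that the read ID is the first whitespace-delimited token of the header in both files. The function names `ids_from_fasta`, `read_fastq`, and `extract_reads` are made up for illustration.

```python
from typing import Iterator, Set, TextIO, Tuple


def ids_from_fasta(handle: TextIO) -> Set[str]:
    """Collect read IDs (first token after '>') from a FASTA file."""
    ids = set()
    for line in handle:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                ids.add(fields[0])
    return ids


def read_fastq(handle: TextIO) -> Iterator[Tuple[str, str, str, str]]:
    """Yield (header, sequence, separator, quality) for each record.

    Assumption: every record is exactly four lines.
    """
    while True:
        header = handle.readline()
        if not header:
            return  # end of file
        if not header.strip():
            continue  # tolerate stray blank lines between records
        seq = handle.readline()
        plus = handle.readline()
        qual = handle.readline()
        if not header.startswith("@") or not plus.startswith("+") or not qual:
            raise ValueError("malformed or truncated FASTQ record: %r" % header)
        yield (
            header.rstrip("\r\n"),
            seq.rstrip("\r\n"),
            plus.rstrip("\r\n"),
            qual.rstrip("\r\n"),
        )


def extract_reads(fastq_in: TextIO, wanted: Set[str], fastq_out: TextIO) -> int:
    """Write records whose ID is in `wanted`; return how many were kept."""
    kept = 0
    for record in read_fastq(fastq_in):
        fields = record[0][1:].split()  # drop the leading '@'
        if fields and fields[0] in wanted:
            fastq_out.write("\n".join(record) + "\n")
            kept += 1
    return kept
```

To use it, open the filtered FASTA, the original FASTQ, and an output file with `open(...)`, pass the FASTA handle to `ids_from_fasta`, and pass the resulting set to `extract_reads`. Only the ID set is held in memory, so the 18 million read FASTQ is streamed one record at a time.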