Hi, I'm not much of a bioinformatician, and I am trying to learn how to use biopieces, which our real bioinformatician left behind on our server. Mostly it seems straightforward, but I am stumped by how to generate a sequence file containing only sequences (forward or reverse) that contain a given primer sequence. The reason this is important is that multiple amplicons were mixed together and co-sequenced in lanes of the 454 run, and these amplicons cannot be separated by barcodes, but rather have to be separated based on the primer sequences.
So after I use read_sff, should I look for the primer pattern using patscan_seq? If so, then how do I generate a fastq file containing only sequences containing the primers of interest- do I use write_fastq, and that will collect all the sequences identified by patscan_seq if I pipe them together? I'm a little unsure of how biopieces works.
Thanks for your help,
Liz
So after I use read_sff, should I look for the primer pattern using patscan_seq? If so, then how do I generate a fastq file containing only sequences containing the primers of interest- do I use write_fastq, and that will collect all the sequences identified by patscan_seq if I pipe them together? I'm a little unsure of how biopieces works.
Thanks for your help,
Liz
Comment