Hi all,
My Data:
Illumina whole genome data from ~160 individuals. Reads were initially 100 nt but after quality trimming I have a distribution of read lengths between 50-100 nt.
My Problem:
I am interested in regions of the genome that are not annotated. The loci are very small (~80 bp). Because some of my reads are larger than my reference, I can not map them against my reference loci. My solution was to only map the first 50 nt of my reads. However, I can't figure out an efficient way to do this. Bowtie1 has an option to trim bases from the 5' or 3' end of the read (-5/--trim5 <int> and -3/--trim3 <int>), but instead of trimming off bases from a side, I'd rather just use the first 50 of each read.
Is there a way to do this using bowtie? I'm trying to avoid going back to the raw data...
Thanks,
John
My Data:
Illumina whole genome data from ~160 individuals. Reads were initially 100 nt but after quality trimming I have a distribution of read lengths between 50-100 nt.
My Problem:
I am interested in regions of the genome that are not annotated. The loci are very small (~80 bp). Because some of my reads are larger than my reference, I can not map them against my reference loci. My solution was to only map the first 50 nt of my reads. However, I can't figure out an efficient way to do this. Bowtie1 has an option to trim bases from the 5' or 3' end of the read (-5/--trim5 <int> and -3/--trim3 <int>), but instead of trimming off bases from a side, I'd rather just use the first 50 of each read.
Is there a way to do this using bowtie? I'm trying to avoid going back to the raw data...
Thanks,
John
Comment