Hello all
I have a genome that I am looking to parse a genome based on another file that has a name associated with the coordinates of the specific name in the genome of my species (for this purpose I will just use the first line of the fasta file of the genome)
My coordinate file looks like the lines below
The genome file looks like the one below
I am new to using bioperl but i know that you can implement
to extract that specific sequence, my problem is I am working on a much bigger scale and with many more names and locations. So I know I need to open one file (the one with the names and coordinates) to loop through and substitute for the 20, 35 each time but I don't know how to accomplish this through bioperl. I can extract one string based on one location but implementing it on a grander scale is giving me problems.
Cheers
I have a genome that I am looking to parse a genome based on another file that has a name associated with the coordinates of the specific name in the genome of my species (for this purpose I will just use the first line of the fasta file of the genome)
My coordinate file looks like the lines below
name1 20 35
name2 35 40
name2 1 4
name2 35 40
name2 1 4
GL849905.1 dna:scaffold:GL849905.1:1:5315331:1 REF
TTATATTCATATATTTATATATATATATATATATATTATATATACACACACATATCCAGATACAGGCTCATAAACATGCATACACGTGTGCATATACCATGCTAAATTCGCACAAATACATATACGAATACACATACCGCACTGTGCCTTTACATATATAGAAACGTACGCATATTTGACTCACGCGCATACACACAATGCCTGCACACACACATACCTACATAATAAACATAGACTGAGATTTAGGACCAGAACTAAAGGCTCAATAAGCATTTATTAAGCACCTACAGCATGCCAAAC
TTATATTCATATATTTATATATATATATATATATATTATATATACACACACATATCCAGATACAGGCTCATAAACATGCATACACGTGTGCATATACCATGCTAAATTCGCACAAATACATATACGAATACACATACCGCACTGTGCCTTTACATATATAGAAACGTACGCATATTTGACTCACGCGCATACACACAATGCCTGCACACACACATACCTACATAATAAACATAGACTGAGATTTAGGACCAGAACTAAAGGCTCAATAAGCATTTATTAAGCACCTACAGCATGCCAAAC
$string = $seq_obj->subseq(20,35)
Cheers
Comment