Hello,
I have a question that i need help. i have this seq in fasta file.
Qustions
1) How many genes are represented in this data and how many sequences are there for each sequenced gene.
2) What is the average read length before and after trimming (denoted by NOTRIM_LEN and Len respectively)
3) Are any of the DNA sequences in the file identical to each other, and if so what is the highest number of identical sequences? (Hint: sort isn’t just for numbers!)
I have a question that i need help. i have this seq in fasta file.
Code:
>HG2FEE201A723Q SAMPLE=USERID-19_JOBID-10_HG2FEE201_166281C_MID21 GENE=PR STRAND=- NOTRIM_LEN=512 Mean:33 Len:497 Trimmedat5':0 Trimmedat3':5 AlignmentScore: 21630 AmpliconCoverage: 402 FullCoverage: Y ---CTTGTCTCAAT-AAGGTAGGGGGCCA---GATAAGGGAGGCTCTCTTAGACACAGGAGCAGATGATACAGTATTAGAAGAAATAAGTTTGCCAGGAAAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAAGTACCTATAGAAATTTGTGGAAAAAAGGCTATAGGCACAGTATTAATAGGACCTACACCTATCAACATAATTGGAAGGAATATGTTGACTCAACTTGGATGCACACTAAATTTTCCAATTAGTCCCATTGAAACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAGGTCAAACAATGGCCATTGACAGAAGAGAAAATAAAAGCATTAACAGC---A---ATTTGTGAAGA---AATGGAGAAGGAA >HG2FEE201B2MWP SAMPLE=USERID-19_JOBID-10_HG2FEE201_166281C_MID21 GENE=PR STRAND=+ NOTRIM_LEN=544 Mean:31 Len:450 Trimmedat5':0 Trimmedat3':61 AlignmentScore: 19950 AmpliconCoverage: 402 FullCoverage: Y
1) How many genes are represented in this data and how many sequences are there for each sequenced gene.
2) What is the average read length before and after trimming (denoted by NOTRIM_LEN and Len respectively)
3) Are any of the DNA sequences in the file identical to each other, and if so what is the highest number of identical sequences? (Hint: sort isn’t just for numbers!)
Comment