I am a new comer in this field.recently,i have to finish a bioinformatics task.the contents are as follow:
basic data(from IIlumina GA II or GA IIx):
1.long pair reads form sequencing:1.fq and 2.fq;
2.the scaffold constructed by the reads;
task:
after the process of sequencing a long pair fragment,using perl to assess the distribution of the length of mate-pair inserts,and find out how many sequences can be assembled at last.
Can somebody explain what the task want me to do?As a beginer,I really don't know what the length of mate-pair inserts mean?and How to identify the sequences can be used to assembled.
Thanks very much!
basic data(from IIlumina GA II or GA IIx):
1.long pair reads form sequencing:1.fq and 2.fq;
2.the scaffold constructed by the reads;
task:
after the process of sequencing a long pair fragment,using perl to assess the distribution of the length of mate-pair inserts,and find out how many sequences can be assembled at last.
Can somebody explain what the task want me to do?As a beginer,I really don't know what the length of mate-pair inserts mean?and How to identify the sequences can be used to assembled.
Thanks very much!
Comment