**[email protected]** · 11-28-2012, 10:00 AM

Hello,

I want to do the same thing but could not find any software for it. Did you find an overlap assembler?

**sklages** · 11-29-2012, 04:39 AM

Have a look at Celera Assembler, http://sourceforge.net/apps/mediawik...itle=Main_Page

**amaurizio** · 03-19-2014, 03:02 AM

Hello,
I used PEAR to merge my Illumina (MiSeq) paired-end reads.
I'd like to know the overlap length for each merged read so that I can calculate the minimum, maximum, mean, mode, median, etc.
Can you suggest how to do this?
I tried a Perl script (length R1 + length R2 - length of merged R1R2), but I am not very good at programming.
Can anybody help me and tell me how to do this?
Thanks!

**krobison** · 03-24-2014, 05:11 AM

Code:

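#!/usr/bin/perl
# debugging left as exercise for student :-)
# Prints min, max, mean, mode and median read length for each FASTQ file given.
use strict;
use warnings;
use Bio::SeqIO;   # BioPerl; needed for the FASTQ reader below

foreach my $arg(@ARGV)
{
   my %lengths=();
   # read sequence file for lengths (histogram: length => number of reads)
   my $rdr=Bio::SeqIO->new(-file=>$arg,-format=>'fastq');
   my $sum=0; my $cnt=0;
   while (my $rec=$rdr->next_seq)
   {
      $lengths{$rec->length}++; $sum+=$rec->length; $cnt++;
   }
   # skip empty files to avoid dividing by zero
   unless ($cnt) { warn "$arg: no reads found\n"; next; }
   my @sortedLengths=sort {$a<=>$b} keys %lengths;
   my $minLen=$sortedLengths[0];
   my $maxLen=$sortedLengths[$#sortedLengths];
   my $meanLen=$sum/$cnt;
   my ($modeLen)=sort {$lengths{$b}<=>$lengths{$a}} keys %lengths;

   # median: walk the histogram until the middle read(s) are reached
   my ($lo,$hi)=(int(($cnt+1)/2),int($cnt/2)+1);   # 1-based ranks of the middle reads
   my ($medLo,$medHi); my $seen=0;
   foreach my $len(@sortedLengths)
   {
      $seen+=$lengths{$len};
      $medLo=$len if !defined $medLo && $seen>=$lo;
      if ($seen>=$hi) { $medHi=$len; last; }
   }
   my $medLen=($medLo+$medHi)/2;

   print join("\t",$arg,$minLen,$maxLen,$meanLen,$modeLen,$medLen),"\n";
}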
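
The script above summarizes read lengths per file. For the overlap itself, the formula from the question (length R1 + length R2 - length of merged read) can be applied per read pair. The following is only a rough sketch under several assumptions: plain 4-line FASTQ records, read IDs (the first token of the header line) that match across the R1, R2 and merged files, and lengths taken from the same reads that were given to the merger. The sub names and the three-file command line are made up for illustration, and it avoids BioPerl so it runs with core Perl only.

```perl
#!/usr/bin/perl
# Sketch: overlap per pair = length(R1) + length(R2) - length(merged).
# Assumes 4-line FASTQ records and matching read IDs across all three files.
use strict;
use warnings;

# Return a hash ref: read ID => sequence length, for one 4-line FASTQ file.
sub read_lengths {
    my ($path) = @_;
    my %len;
    open(my $fh, '<', $path) or die "Cannot open $path: $!";
    while (defined(my $header = <$fh>)) {
        my $seq  = <$fh>;
        my $plus = <$fh>;
        my $qual = <$fh>;
        last unless defined $qual;            # stop at a truncated record
        $seq =~ s/\s+\z//;                    # strip newline (and any \r)
        my ($id) = $header =~ /^\@(\S+)/ or next;
        $id =~ s{/[12]$}{};                   # drop old-style /1 or /2 suffix
        $len{$id} = length $seq;
    }
    close $fh;
    return \%len;
}

# Overlap for every merged read that has both mates present.
sub overlaps {
    my ($r1, $r2, $merged) = @_;              # three hash refs: ID => length
    my @ov;
    for my $id (keys %$merged) {
        next unless exists $r1->{$id} && exists $r2->{$id};
        push @ov, $r1->{$id} + $r2->{$id} - $merged->{$id};
    }
    return @ov;
}

# min, max, mean, median and mode of a list of numbers (empty list if no input).
sub summarize {
    my @v = sort { $a <=> $b } @_;
    return unless @v;
    my $n   = @v;
    my $sum = 0;
    $sum += $_ for @v;
    my %count;
    $count{$_}++ for @v;
    # most frequent value; ties go to the smaller value
    my ($mode) = sort { $count{$b} <=> $count{$a} || $a <=> $b } keys %count;
    my $median = $n % 2 ? $v[($n - 1) / 2]
                        : ($v[$n / 2 - 1] + $v[$n / 2]) / 2;
    return (min => $v[0], max => $v[-1], mean => $sum / $n,
            median => $median, mode => $mode);
}

# Usage (hypothetical file names): perl overlap_stats.pl R1.fastq R2.fastq merged.fastq
if (@ARGV == 3) {
    my ($r1, $r2, $merged) = map { read_lengths($_) } @ARGV;
    my %s = summarize(overlaps($r1, $r2, $merged));
    print join("\t", map { "$_=$s{$_}" } qw(min max mean median mode)), "\n" if %s;
}
```

One caveat: when the fragment is shorter than the reads, the merged read is shorter than either mate, so this formula may report a value larger than the read length. Whether that counts as "overlap" depends on what you want to measure.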

**amaurizio** · 03-25-2014, 07:27 AM

Thank you very much!


Need some suggestion for overlap assembler
