**Heisman** · 05-22-2012, 09:03 PM

I have not read that paper, but could you describe the wet lab steps in more detail, say how many lanes of data you have, and explain what your reservations are about following what is done in that paper?

**jimmybee** · 05-22-2012, 09:28 PM

It's a protocols paper, so nothing biologically relevant. I was asking for advice on whether merging reads from small-insert libraries is common, whether it is appropriate given the stats in my post, and whether it is standard for ancient DNA analysis. There aren't many good bioinformatics methods papers for ancient DNA out there, so it would be great to get advice on what others use.

I only have one lane, about 158 million pairs. Wet lab steps aren't my area, sorry.

**Heisman** · 05-23-2012, 04:41 AM

Oh, I see, you mean merging the two paired ends if they overlap to yield one single-end read? I thought you meant merging multiple lanes of data. I'll look at that paper when I get to work and can access it, but my gut instinct is that since some of your insert sizes are >200 bp there is no point in merging; I'd rather align all of the data identically. But I'll glance at that paper in a couple of hours.

**Heisman** · 05-23-2012, 05:47 AM

So glancing through the paper I think it's probably fine to follow it. If I had more time I'd read it in detail so perhaps somebody else will chime in.

**jimmybee** · 05-23-2012, 09:30 PM

Thanks mate. Looks like a good way to go, especially considering my lack of experience in the QC of small-insert libraries in paired-end sequencing.

**technical vault** · 05-24-2012, 08:28 AM

I actually wrote something similar for my masters project, though it wasn't as intelligent as this in choosing the most likely overlap. Alas, I don't have the final code (it's buried on a UCL fileserver which I no longer have access to). However, it was based on this: http://almlab.mit.edu/vibrioGenomes/SHERA_temp/ which might be worth a look.

**jimmybee** · 05-24-2012, 02:58 PM

Ok no problem, I'll have a look at it. All good ideas

**Bucky** · 05-24-2012, 07:40 PM

Check out PANDAseq, it works pretty well:

PANDAseq: paired-end assembler for illumina sequences - BMC Bioinformatics

http://www.biomedcentral.com/1471-2105/13/31/abstract

Background: Illumina paired-end reads are used to analyse microbial communities by targeting amplicons of the 16S rRNA gene. Publicly available tools are needed to assemble overlapping paired-end reads while correcting mismatches and uncalled bases; many errors could be corrected to obtain higher sequence yields using quality information.

Results: PANDAseq assembles paired-end reads rapidly and with the correction of most errors. Uncertain error corrections come from reads with many low-quality bases identified by upstream processing. Benchmarks were done using real error masks on simulated data, a pure source template, and a pooled template of genomic DNA from known organisms. PANDAseq assembled reads more rapidly and with reduced error incorporation compared to alternative methods.

Conclusions: PANDAseq rapidly assembles sequences and scales to billions of paired-end reads. Assembly of control libraries showed a 4-50% increase in the number of assembled sequences over naïve assembly with negligible loss of "good" sequence.
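For anyone unsure what "merging overlapping pairs" means in practice, here is a rough Python sketch of the naive idea. It is not how PANDAseq or the protocols paper does it; the function names, the `min_overlap` and `max_mismatch_frac` parameters, and the "higher quality base wins" rule are all assumptions made for illustration. It also assumes adapters have already been trimmed and that qualities are given as lists of Phred integers.

```python
def reverse_complement(seq):
    """Reverse-complement a DNA string (A/C/G/T/N only)."""
    return seq.translate(str.maketrans("ACGTN", "TGCAN"))[::-1]


def merge_pair(fwd, fwd_qual, rev, rev_qual,
               min_overlap=10, max_mismatch_frac=0.1):
    """Naively merge a read pair into one sequence.

    Returns (merged_seq, merged_qual), or None if no acceptable
    overlap is found. Hypothetical helper, not a real tool's API.
    """
    # Bring the reverse read onto the forward strand.
    rc = reverse_complement(rev)
    rc_qual = rev_qual[::-1]

    # Try the longest candidate overlap first: the end of the forward
    # read against the start of the reverse-complemented reverse read.
    for olen in range(min(len(fwd), len(rc)), min_overlap - 1, -1):
        a, b = fwd[-olen:], rc[:olen]
        mismatches = sum(1 for x, y in zip(a, b) if x != y)
        if mismatches > max_mismatch_frac * olen:
            continue

        # Inside the overlap, keep whichever base has the higher quality.
        aq, bq = fwd_qual[-olen:], rc_qual[:olen]
        seq, qual = [], []
        for x, y, qx, qy in zip(a, b, aq, bq):
            if qx >= qy:
                seq.append(x)
                qual.append(qx)
            else:
                seq.append(y)
                qual.append(qy)

        merged_seq = fwd[:-olen] + "".join(seq) + rc[olen:]
        merged_qual = fwd_qual[:-olen] + qual + rc_qual[olen:]
        return merged_seq, merged_qual

    return None  # leave the pair unmerged


if __name__ == "__main__":
    fragment = "ACGTTGCAAGGCTTAACCGGTACGATCG"      # 28 bp insert
    fwd = fragment[:20]                             # 20 bp forward read
    rev = reverse_complement(fragment[8:])          # 20 bp reverse read
    print(merge_pair(fwd, [30] * 20, rev, [30] * 20))
```

Pairs that return `None` would simply stay as ordinary paired-end reads. Real tools score candidate overlaps probabilistically rather than taking the first one that passes a mismatch threshold, which matters for repetitive sequence and very short inserts.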

Ancient DNA adaptor removal and read merging