**arolfe** · 07-03-2012, 04:59 AM

I'd start with the assumption that you'll change everything in the bioinformatics pipeline between the initial and final versions, and that you'll do lots and lots of testing and tweaking along the way. Make sure the whole thing is automated/scripted so that you can run the script with options to specify (1) the input file, (2) which programs to use (e.g., which aligner), and (3) which options to pass them. You don't need to write all of that on the first pass; just start with a simple initial version and work your way up. Scripting it all like this makes it easy to start 10 variations on the cluster over the weekend so you can come in Monday morning and compare results.
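As a rough illustration of the "script with options" idea, here is a minimal sketch of a driver function. The option letters, the default index name `ref`, the `-t` label and the output naming are all made up for the example; only the `bowtie`/`bowtie2` flags shown are real. The `-n` switch prints the command instead of running it, which is handy for checking a batch of variations before submitting them.

```shell
#!/usr/bin/env bash
# Sketch of a pipeline driver. File names, the index name and the -t label
# are hypothetical; only the bowtie/bowtie2 flags shown are real.

run_pipeline() {
    local OPTIND=1 opt
    local reads="" aligner="bowtie2" index="ref" extra="" tag="" dry_run=0
    while getopts "i:a:x:o:t:n" opt; do
        case "$opt" in
            i) reads=$OPTARG ;;     # (1) input FASTQ file
            a) aligner=$OPTARG ;;   # (2) which aligner to use
            x) index=$OPTARG ;;     # prefix of the aligner index
            o) extra=$OPTARG ;;     # (3) extra options passed to the aligner
            t) tag=$OPTARG ;;       # label so variations don't overwrite each other
            n) dry_run=1 ;;         # print the command instead of running it
            *) return 2 ;;
        esac
    done
    if [ -z "$reads" ]; then
        echo "usage: run_pipeline -i reads.fastq [-a aligner] [-x index] [-o 'opts'] [-t tag] [-n]" >&2
        return 2
    fi

    # Split the extra-options string into separate arguments.
    local -a extra_args=() cmd=()
    if [ -n "$extra" ]; then read -r -a extra_args <<< "$extra"; fi
    local out="${reads%.fastq}.${aligner}${tag:+.$tag}.sam"

    case "$aligner" in
        bowtie2) cmd=(bowtie2 "${extra_args[@]}" -x "$index" -U "$reads" -S "$out") ;;
        bowtie)  cmd=(bowtie "${extra_args[@]}" -S "$index" "$reads" "$out") ;;
        *) echo "unknown aligner: $aligner" >&2; return 1 ;;
    esac

    if [ "$dry_run" -eq 1 ]; then
        echo "${cmd[*]}"
    else
        "${cmd[@]}"
    fi
}

# Example: preview a few variations before launching them.
# for preset in very-fast sensitive very-sensitive; do
#     run_pipeline -n -i reads.fastq -x mouse_plus_virus -o "--$preset" -t "$preset"
# done
```

Later steps (sorting, SNP calling) could be added as further options in the same way once the first version works.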

I like shell scripts for this, since they make it easy to cut and paste commands when you're debugging. If you save intermediate results to disk at every step (rather than piping them from one command to the next with `|`), then you can run just part of your pipeline by hand when necessary.
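One way to get the "intermediate results on disk" behaviour is a small wrapper that writes each step's output to a named file. This is only a sketch: `run_step` is a hypothetical helper, and skipping a step whose output already exists is my own addition, not something the approach requires.

```shell
#!/usr/bin/env bash
# run_step OUTPUT COMMAND [ARGS...]
# Runs COMMAND with its stdout saved to OUTPUT. If OUTPUT already exists and
# is non-empty, the step is skipped, so a rerun picks up where it left off.
run_step() {
    local out=$1
    shift
    if [ -s "$out" ]; then
        echo "skip: $out already exists" >&2
        return 0
    fi
    echo "run: $*" >&2
    # Write to a temporary name first so a failed step never leaves a
    # half-written file that looks finished.
    "$@" > "$out.tmp" && mv "$out.tmp" "$out"
}

# Example usage (file and index names are hypothetical):
# run_step aligned.sam bowtie2 -x mouse_plus_virus -U reads.fastq
# run_step aligned.bam samtools view -b aligned.sam
```

To redo one step by hand, delete its output file and paste that one line into the terminal.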

If you weren't already planning on it, I'd generate a reference sequence input for your aligner that's the mouse + viral genomes. After you align, you can just take reads that map to the viral chromosome. This avoids some of the difficulty of deciding what's viral and what's mouse because the two genomes are competing for reads in the alignment.
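The combined-reference idea could look roughly like the sketch below. The file names and the contig name `virus` are placeholders, and `filter_to_contig` is a hypothetical helper that keeps SAM header lines plus alignments whose reference name (column 3) matches the viral contig. The optional MAPQ cutoff is an assumption on my part, as one way to drop reads that align about equally well to both genomes.

```shell
#!/usr/bin/env bash
# Build and align against a combined reference (commands shown as comments;
# file names are hypothetical):
#   cat mouse.fa virus.fa > mouse_plus_virus.fa
#   bowtie2-build mouse_plus_virus.fa mouse_plus_virus
#   bowtie2 -x mouse_plus_virus -U reads.fastq -S aligned.sam

# filter_to_contig CONTIG [MIN_MAPQ]
# Reads SAM on stdin; prints header lines plus alignments to CONTIG
# with mapping quality (column 5) of at least MIN_MAPQ (default 0).
filter_to_contig() {
    local contig=$1 min_mapq=${2:-0}
    awk -F '\t' -v c="$contig" -v q="$min_mapq" \
        '/^@/ || ($3 == c && $5 + 0 >= q)'
}

# Example usage:
# filter_to_contig virus 10 < aligned.sam > viral.sam
```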

I've had good luck with Bowtie and Bowtie2 for alignment and FreeBayes for SNP calling, though there are lots of options. One thing to watch out for in SNP calling is what assumptions the program makes: does it assume you're working on a diploid genome?
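To make the ploidy point concrete: FreeBayes takes the ploidy on the command line (`-p`/`--ploidy`) and also has a `--pooled-continuous` mode for pooled samples. The helper below is a hypothetical sketch that only prints the command for each model, so the difference is easy to see; which model actually suits a viral sample is something to check against your own data.

```shell
#!/usr/bin/env bash
# freebayes_cmd REF BAM [MODEL]
# Prints a freebayes command line for the chosen ploidy model.
# File names are hypothetical; -f, -p and --pooled-continuous are real flags.
freebayes_cmd() {
    local ref=$1 bam=$2 model=${3:-diploid}
    case "$model" in
        diploid) echo "freebayes -f $ref -p 2 $bam" ;;
        haploid) echo "freebayes -f $ref -p 1 $bam" ;;
        pooled)  echo "freebayes -f $ref --pooled-continuous $bam" ;;
        *) echo "unknown model: $model" >&2; return 1 ;;
    esac
}

# Example usage:
# freebayes_cmd virus.fa viral.sorted.bam haploid
```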

good luck!

Alex

**BurlEarl** · 07-05-2012, 09:02 AM

Thanks Alex.

I didn't really think to just use the endogenous sequences as a reference to compete them away from the viral genome. As for SNP calling for pooled sequences, I was told to check out SNVer. They even have a GUI for numbskulls like me! Hopefully I can manage without. I just got my server space up and running, so I have a whole new set of stuff to play with.

Thanks again,
Earl

**Geoffreyion** · 08-01-2012, 04:26 AM

Exactly. I also think that the post is too long, but it's quite informative as well. Right, it will also be boring to read. This post is all about virus errors causing your system to get encrypted. It should be read as a caution against further viruses.


Excited to get started on Viral sequencing
