Seqanswers Leaderboard Ad

**arolfe** · 07-03-2012, 04:59 AM

I'd start with the assumption that you'll change everything in the bioinformatics pipeline between the initial and final versions and that you'll do lots and lots of testing and tweaking along the way. Make sure the whole thing is automated/scripted such that you can run the script with options to specify (1) input file (2) which programs to use (eg, which aligner) and (3) which options. You don't need to write all that on the first pass, just start with a simple initial version and work your way up. Scripting it all like this then makes it easy to start 10 variations on the cluster over the weekened so you can come in Monday morning to compare results.

I like shell scripts for this, since they make it easy to cut and paste commands when you're debugging. If you save intermediate results to disk at every point (rather than piping | them from one command to the next) then you can run just part of your pipeline by hand when necessary.

If you weren't already planning on it, I'd generate a reference sequence input for your aligner that's the mouse + viral genomes. After you align, you can just take reads that map to the viral chromosome. This avoids some of the difficulty of deciding what's viral and what's mouse because the two genomes are competing for reads in the alignment.

I've had good luck with Bowtie, Bowtie2, and Freebayes for SNP calling, though there are lots of options. One thing to watch out for in SNP calling is what assumptions the program makes- does it assume you're working on a diploid genome?

good luck!

Alex

**BurlEarl** · 07-05-2012, 09:02 AM

Thanks Alex.

I didnt really think to just use the endogenous sequences as reference to compete them away from the viral genome. As for SNP calling for pooled sequences, I was told to check out SNVer. They even have a GUI for numbskulls like me! Hopefully I can manage without. I just got my server space up and running, so I have a whole new set of stuff to play with.

Thanks again,
Earl

**Geoffreyion** · 08-01-2012, 04:26 AM

Exactly I also think that the post is too long but it's quite informative also. Right. It will be also boring reading this post. This post is all about the virus errors causing your system to get encrypted. It should be read to be caution against further viruses. click here

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 12 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

Excited to get started on Viral sequencing

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News