Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
Demultiplexing HiSeq 2000 reads containing an N at the 5' end rtraborn Bioinformatics 4 09-25-2014 10:47 AM
What proportion of 'bad' quality reads are expected using HiSeq 2000 for RNA-Seq bob-loblaw Bioinformatics 2 07-02-2013 11:45 AM
Hiseq 2000 for Sale. kcacdna Illumina/Solexa 0 06-13-2013 06:14 AM
Concerns for combining data from HiSeq 2000 and HiSeq 2500 jaaker Illumina/Solexa 1 02-04-2013 03:56 PM
Help to analyze Illumina HiSeq 2000 Human data kiradi Bioinformatics 4 12-09-2011 05:30 AM

Thread Tools
Old 04-14-2015, 10:07 AM   #1
Junior Member
Location: Brazil

Join Date: Apr 2015
Posts: 5
Default How analyze DNA's reads by HiSeq-2000?

I need to analyze reads of viral DNA by deep sequencing using HiSeq-2000. I'll check the substitutions throughout the genome and identify the viral virants, but i don't know what package or scripts to use. Do you have any suggestions, friends?

vitor is offline   Reply With Quote
Old 04-14-2015, 10:29 AM   #2
Senior Member
Location: Stanford

Join Date: Jun 2009
Posts: 179

What kind of data do you have so far ("raw" FASTQ or is it already processed?) and what exactly do you want to achieve?
jwfoley is offline   Reply With Quote
Old 04-14-2015, 10:38 AM   #3
Junior Member
Location: Brazil

Join Date: Apr 2015
Posts: 5

jwfoley, the data's still being processed and I'll receive in FASTQ. I want to find mutation hot spots, nucleotide substituations and comparing the diversity of the genome at various times.
vitor is offline   Reply With Quote
Old 04-14-2015, 10:39 AM   #4
diego diaz
Location: Santiago, Chile

Join Date: Oct 2013
Posts: 62

Hi vitor,

First, you need to trim your reads, to discard low quality bases. You could use Trimmomatic, fastx_toolkit, prinseq, etc. Then, you have to map your sequences to the reference genome. If you want to analyze substitutions, I think it is more suitable to chose a more sensitive mapper, as SMALT, but your reads should be short (75 bp) , because they were sequenced with HiSeq, in this case I think bowtie 1 is good enough, Once you have your reads mapped, you have to perform a SNP calling. I recommend you freebayes, because it allows to analyze haploid organisms. With your VCF file generated with freeBayes you can perform a functional analysis with SNPeff.

I hope it helps!

Last edited by diego diaz; 04-14-2015 at 10:48 AM.
diego diaz is offline   Reply With Quote

bioinformatics help, dna, mutation/variant base, ngs analysis, virus

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 09:35 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO