![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
CNV analysis on Illumina / Solid genomes | rcorbett | Bioinformatics | 3 | 07-06-2012 07:27 AM |
Sequencing data analysis of pair-end sequencing | Smriti | Illumina/Solexa | 8 | 11-04-2011 07:51 AM |
retain multihit reads for CNV analysis | jorge | Bioinformatics | 0 | 08-30-2011 12:41 AM |
CNV variation effects on exome sequencing? | wrighth | Bioinformatics | 0 | 12-21-2010 11:43 AM |
Target enrichment and CNV analysis | gendxdoc | Sample Prep / Library Generation | 1 | 09-27-2010 04:47 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: US Join Date: May 2010
Posts: 54
|
![]()
Hi Guys,
I have exome sequencing data, I am interested in knowing the CNV information. Does anyone know the best tools for CNV analysis on sequencing data? Thanks |
![]() |
![]() |
![]() |
#2 |
Member
Location: Connecticut Join Date: Jun 2009
Posts: 74
|
![]()
I would think with exome data there might be too big a bias due to selection and pcr. A crude CNV info could be extracted though
|
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: Boston area Join Date: Nov 2007
Posts: 747
|
![]()
It has certainly been claimed that you can still get a CNV signal (similarly, selection of RNA is claimed to retain expression information); still, I would think you'd want a lot of samples enriched with the same strategy.
In any case, there are a bunch of tools in this space you might try out, though I don't have specific recommendations |
![]() |
![]() |
![]() |
#4 |
Member
Location: Connecticut Join Date: Jun 2009
Posts: 74
|
![]()
K,
You are right about the RNA, but the bait design should be different. Nonetheless, it always bothered me when expression profiling involves amplification steps... Wasn't there a protocol claiming no amplification procedure for RNA sequencing? I remember it was in Nature last year or a year before |
![]() |
![]() |
![]() |
#5 |
Member
Location: US Join Date: May 2010
Posts: 54
|
![]()
So is CNV most accurate when it is whole genome sequencing?
|
![]() |
![]() |
![]() |
#6 |
Member
Location: Milan Join Date: Mar 2010
Posts: 35
|
![]()
Hi,
the tools referred above seem mostly for the whole genome sequencing and not for the targeted capture and sequencing. Does any one have other suggestions? |
![]() |
![]() |
![]() |
#7 |
Member
Location: India Join Date: Aug 2010
Posts: 78
|
![]()
It really does not matter if its Exome data or WGS, a simple strategy is to ignore the Intronic regions(you would be having the coordinates)
On the other hand if you really want to be very accurate and you sample is something like, Tumor Normal pair then the job is far more easy. As the regions with zero Read count/depth will be common in both the samples so, no false positives that way. By far I have found CNVnator from the 1000 genomes group to be the best, very high sensitivity and less of false positives across a wide range of CNV size. |
![]() |
![]() |
![]() |
#8 |
Member
Location: cinci Join Date: Apr 2010
Posts: 66
|
![]()
I am also interested in doing this as well. What are yo referring to? There are generally paired read approach, read depth approach, split read approach, and sequence assembly approach for general structural variants.
|
![]() |
![]() |
![]() |
#9 | |
Member
Location: India Join Date: Aug 2010
Posts: 78
|
![]() Quote:
I have come across two ways of detecting CNV from NGS data. First is the individual sample approach which you were talking about initially and CNVnator uses. Take a window overlapping/non-overlapping of 'n' bp and based on the read count/read depth inside it, call CNV. The second approach is a Pair approach(not Paired end data). Where you compare/take ratio of a Sample vs Test. The main advantage is the CNV common in both are ignored(as ration will be roughly ~1). This can be useful if you have Exome only data. As using it with a Exome reference genome will take care of the Intron regions in the sample. eg. RDXplorer uses Read dept information, CNVseg read count in a static overlapping window, CNVnator can use read count in 100 base pair bins. So exactly what kind of sample/data do you have? |
|
![]() |
![]() |
![]() |
#10 | |
Junior Member
Location: harbin,china Join Date: Mar 2010
Posts: 9
|
![]() Quote:
![]() |
|
![]() |
![]() |
![]() |
#11 | |
Member
Location: India Join Date: Aug 2010
Posts: 78
|
![]() Quote:
Tools like CNVnator, CNVer and some others have other approaches so with a bit of manual intervention you could get the desired results. Additionally yesterday I read in a post that CLC bio has a CNV detection tool that works well with Exome data. Regards, pg |
|
![]() |
![]() |
![]() |
#12 | |
Junior Member
Location: NY, USA Join Date: Dec 2010
Posts: 9
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
#13 | ||
Member
Location: India Join Date: Aug 2010
Posts: 78
|
![]() Quote:
For eg. CNVnator uses a bin size of 100bp, for the high coverage trio data from the 1000 genomes project and 300bp bin size for low coverage data from the project. But as the author has mentioned in the paper, depending on the data it is variable. Quote:
Even for our yet to be published CNV detection program we had to do a sensitivity analysis to determine a the optimal bin size. Finally the bin size is determined depending on the read length and coverage, for our algorithm. Thanking you, -- pg |
||
![]() |
![]() |
![]() |
#14 |
Member
Location: Malmö Join Date: Sep 2008
Posts: 37
|
![]() |
![]() |
![]() |
![]() |
Thread Tools | |
|
|