Ah, that can be am indicator of a few things, the most likely of which is that there's just a high level of PCR duplication in those regions. You can most easily check for this by opening the affected files in IGV or another viewer and just seeing if a disproportionate number of reads (or pairs) have the same start/stop bounds.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Try increasing the --maxNumberOfReads argument for SomaticIndelDetector. That may be resolve your problem.
--maxNumberOfReads / -mnr ( int with default value 10000 )
Maximum number of reads to cache in the window; if number of reads exceeds this number, the window will be skipped and no calls will be made from it.
Comment
-
Thank you dpryan and id0,
Oh..However, I have used picard but still PCR duplicates ?
I will try use id0's recommendation.
There are two more problems I have after this:
1. I have converted .vcf output of SomaticIndelDetector using convert2annovar.pl. However, I am confused with MuTect .wig.txt out to convert so that I can use it as a In-put file for Annovar ?
2. MuTect can run without control samples using only tumor . SomaticIndelDetector does not work without control sample.bam. It asks minimum 2 samples. My case samples are from cell lines, there is no control or normal. How can I use SomaticIndelDetector ? I studied few articles and googled but can not understand. Can somebody explain me please ?
Thank you in Advance.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 11:49 AM
|
0 responses
15 views
0 likes
|
Last Post
by seqadmin
Yesterday, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
62 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Comment