The new blog series covers improvements over a 3 month time frame using the various data releases and includes in house PGM runs.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Yes, a gapped alignment was used. Using the following command line options
-g 20 -x5
which corresponds to a gap opening penalty of 20 and extension penalty of 5.
The default is -g 40, -x 15.
I may be wrong but I thought the predicted QV (or projected) is based on a prediction algorithm/method. For Ion Torrent, it reads in a phredTable of precalculated values for a range of QVs. I'll blog about this one day.
Therefore, the predicted QV is independent on what goes on with the empirical QVs based on alignment. FYI: Illumina MiSeq is pretty much spot on with their predicted vs actual. I think that's because the technology it's built on has been around for a while thus their predictive algorithms have had time to mature.... Some one correct me if I am wrong, could be just full of crap
Comment
-
Yes that sound right to me.
What I meant was that if you are calibrating your quality values based on agreement between a set of your reads and a known sequence, then you would need to do an alignment to that reference.
Then you check each base in a given QV bin and see if it is was right or wrong. If the proportion of wrong:right is correct, great. If not, adjust your QV table.
Right? But what if you used something other than the default Novoalign settings. Or used another aligner altogether? Without gaps in your alignment, every indel causes every base downstream to disagree with the reference. It would cause you to recalibrate your QV table and drastically reduce the quality of your bases.
You give a link to Novoalign -- is that the official alignment engine of the Ion Torrent, or is it possible they are using a more gap-draconian alignment methodology?
I don't have any particular expertise in calibrating quality values. But it just occurs to me that indels may be the cause in the descrepancies between the QVs assigned and the actual accuracy of the reads with your aligner.
What do you think?
--
Phillip
Comment
-
Novoalign is not the official alignment program. tmap is the official one created by Nils Homer so he would be the best to answer any questions on that.
Regarding QV calibration, I'm quite new to the concept so would need to brush up before I comment. I will look at the QV prediction code then ask on the Ion Community to make sure I've understood it before blogging what I've learned.
Comment
-
I will be interested to read what you find.
Seems like there is a real issue here caused by the high indel error frequencies of 454 and Ion Torrent. To a first approximation it is completely legitimate to allow indel miscalls resulting from incorrect estimations of the numbers of bases in a run to produce much lower quality values. But because utilizing gapped alignment procedures can side-step most of the issues caused by indels of this sort, one doesn't want to penalize the QVs too much.
--
Phillip
Comment
-
Broadly, this is how QVs are predicted using six metrics and a phred lookup table. This is detailed in my latest blog post.
Comment
-
Hi lek2k,
Would it be possible to generate the plots in the figures using a gapless alignment method? Just for comparison?
One possibility here is that Novoalign is calling correct bases that Ion Torrent would score as miscalls. Every base call after an indel is a miscall if you do not use a gapped aligner.
--
Phillip
Comment
-
Originally posted by pmiguel View PostHi lek2k,
Would it be possible to generate the plots in the figures using a gapless alignment method? Just for comparison?
One possibility here is that Novoalign is calling correct bases that Ion Torrent would score as miscalls. Every base call after an indel is a miscall if you do not use a gapped aligner.
--
Phillip
Comment
Latest Articles
Collapse
-
by seqadmin
The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...-
Channel: Articles
05-06-2024, 07:48 AM -
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 05-07-2024, 06:57 AM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
05-07-2024, 06:57 AM
|
||
Started by seqadmin, 05-06-2024, 07:17 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
05-06-2024, 07:17 AM
|
||
Started by seqadmin, 05-02-2024, 08:06 AM
|
0 responses
22 views
0 likes
|
Last Post
by seqadmin
05-02-2024, 08:06 AM
|
||
Started by seqadmin, 04-30-2024, 12:17 PM
|
0 responses
24 views
0 likes
|
Last Post
by seqadmin
04-30-2024, 12:17 PM
|
Comment