**swbarnes2** · 08-02-2011, 02:37 PM

You aren't going to be able to figure out which individual nucleotides are "new" and which are old. The question is meaningless. The mutation would have happened in one cell (either in an ancestor gamete stem cell, or in one cancer cell, or one cell of the developing zygote, or in one resistant bacterium, etc.), and you are looking at the DNA from many descendant cells that have undergone many rounds of DNA replication since the mutation event. If I had to guess, I'd say that the issue is polymerase slippage, but is it more likely for the polymerase to slip on a TCT as opposed to an AGA? I don't know, and I doubt anyone else knows either.

The net change can just be calculated from the variants. I'm not sure what summing that up over lots of variants would tell you.

Why would you treat a SNP as an insertion and deletion, unless you had evidence that the strain you were looking at was a revertant of a previous indel? Indels are rarer than simple SNPs, so how would you distinguish which SNPs were caused by a base change, and which were caused by the lightning strike of two sequential indels at the same locus?

**sulicon** · 08-02-2011, 03:14 PM

Thanks, swbarnes2.

These indels are called from the sequencing data of some clones. Each clone corresponds to one mRNA molecule, and there is at most one clone per locus in the library. What we are interested in is how the indels would influence the final protein products. So I hope to calculate the net change for each indel: if it's a loss/gain of 3, 6, 9, ... nts, then the reading frame is kept. Otherwise, the frame is shifted and, in most cases, a prematurely terminated protein would be produced.

It would be great if I could split all the indels into two categories, deletions and insertions, and the variant sites in each category could be further split into 'in-frame' and 'out-of-frame' ones. Although it's not strictly necessary to distinguish the direction of the net change (deletion or insertion) here, I think it would be easier for others to understand than just telling them they are indels.
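For what it's worth, the split described above can be sketched in a few lines of Python. This is only an illustration, assuming each variant is available as a pair of REF/ALT allele strings and lies entirely within the coding sequence; the function name `classify_indel` is made up for the example.

```python
def classify_indel(ref: str, alt: str) -> tuple[str, str]:
    """Return (kind, frame) for a variant given as REF and ALT allele strings.

    Assumption: the variant lies entirely inside the coding sequence, so the
    net length change alone decides whether the reading frame is kept.
    """
    net = len(alt) - len(ref)  # positive: net gain of nts; negative: net loss
    if net == 0:
        # Same length on both sides: a substitution, the frame is unchanged.
        return ("substitution", "in-frame")
    kind = "insertion" if net > 0 else "deletion"
    # A net change that is a multiple of 3 keeps the reading frame.
    frame = "in-frame" if net % 3 == 0 else "out-of-frame"
    return (kind, frame)
```

A loss of three nts, e.g. `classify_indel("ACTGA", "AA")`, comes out as a deletion that keeps the frame, while a gain of one nt comes out as an out-of-frame insertion.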

What I'm worried about is whether it's possible to get a complex indel. For example, the wild-type (reference) sequence is 'ACTG' and the alternative sequence observed is 'ACGGG'. In this case, the 'T' is deleted, and two 'G's are inserted. This indel could also be interpreted as a 'T'->'G' mutation plus an insertion of a 'G'. So I'm curious whether we can decide which interpretation is better.
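One way to look at the 'ACTG' -> 'ACGGG' example is to trim the bases shared at both ends of the two alleles and see what is left. A minimal sketch, where `trim_alleles` is a hypothetical helper and not something taken from any variant caller:

```python
def trim_alleles(ref: str, alt: str) -> tuple[int, str, str]:
    """Strip the shared prefix, then the shared suffix, from two alleles.

    Returns (offset, ref_core, alt_core), where offset is the number of
    leading bases removed. Trimming the prefix first is an arbitrary choice;
    in repeats it can change where the event is placed, but not its net length.
    """
    # Count the bases that match at the start of both alleles.
    start = 0
    while start < len(ref) and start < len(alt) and ref[start] == alt[start]:
        start += 1
    ref, alt = ref[start:], alt[start:]

    # Count the bases that match at the end of what remains.
    end = 0
    while end < len(ref) and end < len(alt) and ref[-1 - end] == alt[-1 - end]:
        end += 1
    if end:
        ref, alt = ref[:-end], alt[:-end]
    return start, ref, alt


offset, ref_core, alt_core = trim_alleles("ACTG", "ACGGG")
net_change = len(alt_core) - len(ref_core)
print(offset, ref_core, alt_core, net_change)
```

For this example the trimmed alleles are 'T' and 'GG' at offset 2, so the net change is +1 nt. Both interpretations (delete 'T' and insert 'GG', or mutate 'T'->'G' and insert one 'G') give that same net change, so the in-frame/out-of-frame call does not depend on which one you prefer.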

Can I always get the net gain/loss of nts from an indel?

Comment

Comment

Latest Articles

ad_right_rmr

News