Seqanswers Leaderboard Ad

**flxlex** · 09-22-2010, 10:55 PM

Unfortunately, the 454ISotigsLayout.txt part you copied doesn't align. Could you paste it with 'code' tags around?

Other than that, it does sound strange that you get more than one isotig for each transcript. You really handpicked clones, and sequenced them pooled? Could there be paralogs (multiple members of the same gene)?

You could try increasing the alignment stringency by going higher for -mi and -ml, this might get paralogs split into separate isogroups...

**cram** · 09-23-2010, 09:18 AM

The only difference between isotig00022 and isotig00023 is the former one has an additional contig, which is only 4 nt in length! I don't think there is an exon only with 4 nucleotides...

This can happen sometimes as a result of sequencing errors, especially in homopolymer regions - a few reads will have a few bogus nucs and newbler decides it's a mini-exon.

**sulicon** · 09-23-2010, 09:31 AM

Thanks for your suggestion. I have edited the alignment.

Our collaborators performed the experiments. It was said that clones from paralogs were put into different "deepwells". However, some isotigs indeed come from genes sharing some similarity with our target genes in both ends -- they should be amplified by non-specific primer-target interaction... And in some cases, more then one clones were picked into one "deepwell" -- we got this conclusion by Sanger sequencing some of the clones.

They suggested us that we should use "-genomic" (the default) option, rather than "-cdna", for this analysis, so that Newbler could make "isotigs of an isogroup into one contig if possible". And they also suggested higher -mi (94%) and -ml (50) parameters in assembly. I no longer get multiple isotags from single gene in this way, but I have to deal with "segment contigs" -- I think in cases where there are indeed multiple isoforms sequenced for one gene, Newbler has to split them into segments if "-genomic" option used.

Maybe I should just discard these isotigs (when -cnda option used) or segments (when -genomic option used). It would be fine if newbler could assemble all the isoforms correctly (using -cnda), but I'm afraid there would be many artificial ones and some irrelevant isotags (from different genes) have been grouped together due to some vector sequences remained (I don't know why these vector sequences are failed to be trimmed...)

**sulicon** · 09-23-2010, 09:32 AM

Originally posted by cram View Post

This can happen sometimes as a result of sequencing errors, especially in homopolymer regions - a few reads will have a few bogus nucs and newbler decides it's a mini-exon.

Thanks for your explanation

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 39 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 41 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Should I trust in the Isotigs assembled by newbler in our experiment?

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News