#1 |
Member
Location: Brazil Join Date: Oct 2013
Posts: 12
|
Hi everyone!
I have received thousands of transcripts generated by non-stranded RNA-seq and have just annotated them (I used an in-house bash script to extract the most informative and most frequent annotations from the BLAST hits in the XML results). However, I have found that many transcripts share the same annotation, e.g. two transcripts have both been annotated as 1-aminocyclopropane-1-carboxylate oxidase. Please find attached a file containing some examples of BLAST alignments of these transcripts. Would you consider these cases the result of misassembly? How can I distinguish assembly errors from true splice variants or isoforms in RNA-seq? Moreover, can I treat two highly similar reverse-complement transcripts as a single transcript, since this is non-stranded RNA-seq? Best regards, Marcio |
#2 |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
Many organisms have multiple copies of genes that may be slightly different. I would worry about classifying two sequences with only 89% or 95% identity as the same transcript. Assembly error rates should be far below that.
But it may depend on the organism... is it haploid or diploid/polyploid? |
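As a rough illustration of the identity figures mentioned above, here is a minimal sketch of how percent identity can be computed from a pair of already-aligned sequences, counting gap columns as mismatches (the same convention BLAST uses when it reports identities over alignment length). The function name and the example sequences are hypothetical, not taken from the attached file.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over all alignment columns.

    Both inputs must be rows of the same pairwise alignment, with '-'
    marking gaps. Gap columns count toward the alignment length but
    never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Hypothetical toy alignment: one mismatch in ten columns.
    print(percent_identity("ACGTACGTAC", "ACGTACGAAC"))
```

Two contigs that come out at 89% or 95% by this measure differ far more than assembly noise would normally explain, which is the point being made here.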
#3 |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,080
|
The examples you included in your file are good hits over most of the length of those contigs. What method did you use to assemble the contigs? What was the average depth that led to that consensus sequence? Since this is a non-stranded library, you have a 50-50 chance of sequencing either strand.
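Because either strand can be sequenced, a contig and its reverse complement can represent the same transcript. A minimal sketch of collapsing such pairs, assuming the sequences are exact reverse complements of each other, is below. All names here (`reverse_complement`, `canonical`, `collapse_strands`) and the toy sequences are hypothetical. Highly similar but non-identical pairs would still need an alignment step that this sketch does not cover.

```python
from typing import Dict, List

# Translation table for DNA complementation (N is left as N).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical(seq: str) -> str:
    """Strand-independent form: the lexicographically smaller of the
    sequence and its reverse complement."""
    forward = seq.upper()
    return min(forward, reverse_complement(forward))


def collapse_strands(transcripts: Dict[str, str]) -> Dict[str, List[str]]:
    """Group transcript names whose sequences are identical up to strand."""
    groups: Dict[str, List[str]] = {}
    for name, seq in transcripts.items():
        groups.setdefault(canonical(seq), []).append(name)
    return groups


if __name__ == "__main__":
    # Hypothetical toy input: t2 is the reverse complement of t1.
    toy = {"t1": "ATGCC", "t2": "GGCAT", "t3": "AAAAC"}
    for key, names in collapse_strands(toy).items():
        print(key, names)
```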
|
#4 |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
I may have misinterpreted something... are the BLAST alignments you posted of the transcriptome against itself, or against some other database?
|
#5 |
Member
Location: Brazil Join Date: Oct 2013
Posts: 12
|
Hi Brian and GenoMax!
The BLAST alignments I posted are of the transcriptome against itself. Also, the transcripts are from a tetraploid organism. GenoMax, the sequences were assembled by someone else. I will have to check with him how he assembled the reads. Best, Marcio |
#6 |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
If it's tetraploid, then depending on the organism's degree of heterozygosity, those may very well be the same transcript from different ploidies (which would not really be considered misassemblies)... or they could be two copies of the same gene with different genomic coordinates. I don't know that there's an easy way to tell. If you can, I'd try to inbreed the organism as much as possible before doing assemblies.
|
Tags |
assembly error, reverse complement, rna-seq |