Gene prediction of a draft assembly was done with Augustus, and some of my peptide sequences contain either single or strings of "X"s. From what I've read, "X"s could be caused by strings of N's in the assembly. However, I have also noticed that "X" may represent some internal stop codons in some cases of my transcripts.
This prediction was run without turning on the --noInFrameStop flag. How does one decide whether or not to report transcripts with internal stop codons? I'm curious to know whether these transcripts are of any use and how authetic they are.
I am getting errors flagged during submissions to public databases because of this. Is it advisable to just discard these transcripts? What would be the best way to handle transcripts with internal stop codons?
This prediction was run without turning on the --noInFrameStop flag. How does one decide whether or not to report transcripts with internal stop codons? I'm curious to know whether these transcripts are of any use and how authetic they are.
I am getting errors flagged during submissions to public databases because of this. Is it advisable to just discard these transcripts? What would be the best way to handle transcripts with internal stop codons?
Comment