Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Quality checking transcriptome assemblies

    Dear all,

    I am currently working on 454 transcriptome data. I have assembled the reads using the current newbler version, MIRA, and will try Velvet/Oases later on.
    However, I am still not sure how to decide on one assembly to use for all further analysis. Comparing the basic statistics like contig length, reads per contig, N50 and so on only gives hints to which may be the better assembly. From these stats I would have choosen the newbler assembly, but they do not really tell me how well the transcripts are assembled. Alternative splicing and gene copies can make for bad assemblies, for example.
    Some methods I thought of that may tell me more are looking at single genes that have a known sequence, and check how the transcripts for these genes are assembled. However, that has to be done manually, and is only possible for a few genes.
    Maybe one could also blast all contigs and check which assembly gives more or better hits, or how many hits one contig has on average. Too many hits per contig could maybe mean that multiple transcripts are assembled to one contig.

    How do you check assembly quality beyond the basic stats? Without a sequenced genome, of course.

    Cheers,

    Till

  • #2
    It is mentioned by Oases that the transcriptome assembly is quite different from genome assembly, due to the alternative splicing and different gene expression levels. Therefore, the old statistics may not be applicable to transcriptome assembly.
    Xi Wang

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 08:47 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    59 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    54 views
    0 likes
    Last Post seqadmin  
    Working...
    X