  • TopHat & Cufflinks failing to assemble full length transcripts

    Hi,

    First post on SeqAnswers. The discussions here are very useful.

    We are using TopHat (v1.0.13) and Cufflinks (v0.8.3) without a reference GTF, and then running Cuffcompare to identify the assembled transcripts.

    Many transcripts are reported as novel isoforms, but we suspect they are really just the main transcript split into two fragments, or the main transcript missing a few exons at the beginning that are clearly covered by reads.

    For example, for a gene with 34 exons, novel isoform j1 consists of the first 23 exons and j2 of the last 11 exons. In another example, a novel transcript is reported that begins at exon 2, even though an equal number of reads cover the first exon.

    Examination of the .wig file shows coverage of the complete transcript, but for some reason the full-length transcript is not being assembled. We've changed the number of bp on either side of the splice junction, to no avail. We also ran the butterfly search, and that option made no difference either.

    Does anyone have suggestions for us?

    Thanks,
    Jessica

  • #2
    Are you certain there are spliced reads connecting those exons as well? You want to visualize the read alignments in IGV to ensure that you have reads spanning all the junctions.
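
As a rough complement to inspecting the alignments in IGV, the junction check suggested above can also be scripted. The sketch below is only an illustration and not part of TopHat or Cufflinks: it assumes the alignments are available as plain SAM text, and the function names (`junctions_from_alignment`, `count_junction_reads`) are hypothetical. It counts, for each intron implied by an `N` operation in a read's CIGAR string, how many reads span it.

```python
import re
from collections import Counter

# One CIGAR operation: a length followed by an operation code.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def junctions_from_alignment(pos, cigar):
    """Yield (intron_start, intron_end) for every N operation in a CIGAR.

    pos is the 1-based leftmost reference position from the SAM POS field.
    Returned coordinates are 1-based and inclusive.
    """
    ref = pos
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op == "N":
            # Skipped reference region, i.e. the intron spanned by this read.
            yield (ref, ref + length - 1)
            ref += length
        elif op in "MD=X":
            # These operations consume reference bases.
            ref += length
        # I, S, H and P do not advance the reference position.


def count_junction_reads(sam_lines):
    """Count spliced reads per (chrom, intron_start, intron_end)."""
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # header line
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue  # not a valid alignment record
        flag = int(fields[1])
        if flag & 4:
            continue  # read is unmapped
        chrom, pos, cigar = fields[2], int(fields[3]), fields[5]
        if cigar == "*":
            continue  # no alignment information
        for start, end in junctions_from_alignment(pos, cigar):
            counts[(chrom, start, end)] += 1
    return counts


if __name__ == "__main__":
    # Tiny made-up example records, for illustration only.
    example = [
        "@HD\tVN:1.0",
        "r1\t0\tchr1\t100\t255\t20M80N16M\t*\t0\t0\t*\t*",
        "r2\t0\tchr1\t110\t255\t10M80N26M\t*\t0\t0\t*\t*",
        "r3\t0\tchr1\t300\t255\t36M\t*\t0\t0\t*\t*",
    ]
    for junction, n in sorted(count_junction_reads(example).items()):
        print(junction, n)
```

Any annotated exon-exon junction that is missing from the resulting counts, or supported by very few reads, is a candidate explanation for a transcript being split at that point.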


    • #3
      I'm also interested in assembling mRNA-seq reads. It seems that there is no good software for this job because of alternative splicing.


      • #4
        Try a newer version of Cufflinks instead of 0.8.3. I got greatly improved transcript assembly results with my single-read data. Many of the transcripts are "full" length when compared with existing annotations.
