  • Using TopHat with low-quality Illumina reads

    Hello,


    I know that TopHat is one of the best choices for finding exon-exon junctions with short reads, but my problem is that the sequencer had trouble with its laser while detecting the nucleotides. As a result, every read (82 bp in total) has an "N" at positions 54 and 55. My question is: can TopHat or even BLAT handle this?


    regards

    Philip

  • #2
    Originally posted by sphil
    Hello,
    So every read (82 bp in total) has an "N" at positions 54 and 55. My question is: can TopHat or even BLAT handle this?

    Philip

    Philip,
    I recently ran a comparison of TopHat and BLAT on my 75 nt data set, replacing certain positions with N. Both TopHat and BLAT handle this case well: with two Ns at positions 50 and 51, TopHat found 164,481 junctions, i.e. 1.3% fewer than with the original data set.

    Lifeng
    Last edited by lifeng.tian; 05-15-2010, 07:05 AM.
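
The masking experiment described above can be reproduced on your own reads before committing to a full run. The following is only a minimal sketch, not the script used in the post: the function names `mask_positions` and `mask_fastq` are hypothetical, positions are taken as 1-based, and the input is assumed to be plain 4-line FASTQ records. Quality strings are left untouched, which differs from a real instrument failure where the affected bases would normally also get low quality values.

```python
def mask_positions(seq, positions, mask_char="N"):
    """Return seq with the given 1-based positions replaced by mask_char."""
    bases = list(seq)
    for pos in positions:
        # Silently skip positions that fall outside this read.
        if 1 <= pos <= len(bases):
            bases[pos - 1] = mask_char
    return "".join(bases)


def mask_fastq(lines, positions):
    """Yield FASTQ lines with the sequence line of each 4-line record masked.

    Assumes strict 4-line records: header, sequence, '+', qualities.
    """
    for i, line in enumerate(lines):
        text = line.rstrip("\n")
        # The sequence is the second line of every 4-line record.
        yield mask_positions(text, positions) if i % 4 == 1 else text
```

For the case in this thread one would call `mask_fastq(handle, [54, 55])` on an open FASTQ file, write the result to a new file, and align both versions to compare junction counts.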


    • #3
      Hey,
      thanks for the fast reply. I will post my results in case...

      Edit:

      I know this should be a new thread, but I think it is almost a one-question, one-answer post, so:
      Can TopHat handle long 454 FLX reads (average length 260 bp), or is BLAT the state of the art for finding exon-exon structures with long reads?

      greets

      philip
      Last edited by sphil; 05-15-2010, 04:40 AM.


      • #4
        Originally posted by sphil
        hey...

        Can TopHat handle long 454 FLX reads (average length 260 bp), or is BLAT the state of the art for finding exon-exon structures with long reads?

        philip

        With my Illumina data (130 nt, paired), BLAT works better than TopHat: it found 10% more known RefSeq splice junctions.

        It will be interesting to compare both on your data. My bet is that BLAT will win.
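
A comparison like this comes down to intersecting each aligner's junction calls with a set of known junctions and comparing the counts. The sketch below is one hedged way to do that and is not the procedure used in the post. It assumes junctions have already been parsed into `(chromosome, donor, acceptor)` tuples using the same coordinate convention for both tools, and the function names are hypothetical.

```python
def known_junctions_found(predicted, known):
    """Return the predicted junctions that also appear in the known set."""
    return set(predicted) & set(known)


def percent_more(count_a, count_b):
    """Return how many percent larger count_a is than count_b."""
    if count_b == 0:
        raise ValueError("count_b must be non-zero")
    return 100.0 * (count_a - count_b) / count_b


def compare_aligners(junctions_a, junctions_b, known):
    """Compare two junction lists by their overlap with known junctions."""
    hits_a = known_junctions_found(junctions_a, known)
    hits_b = known_junctions_found(junctions_b, known)
    return len(hits_a), len(hits_b), percent_more(len(hits_a), len(hits_b))
```

Exact matching on coordinates is a deliberate simplification: the two tools may report junction positions with different offsets, so the tuples need to be normalised first.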


        • #5
          Hey, OK, I will let you know if anything unexpected happens!


          • #6
            Originally posted by sphil
            Hello,


            I know that TopHat is one of the best choices for finding exon-exon junctions with short reads, but my problem is that the sequencer had trouble with its laser while detecting the nucleotides. As a result, every read (82 bp in total) has an "N" at positions 54 and 55. My question is: can TopHat or even BLAT handle this?


            regards

            Philip

            Hi Philip,
            TopHat uses Bowtie internally and has a combination of parameters that you can adjust so that reads still map in the presence of n mismatches (I am assuming that an 'N' at any position is treated as a mismatch by the algorithm). Some of these parameters are:

            --initial-read-mismatches Reads are initially mapped, allowing up to this many mismatches in each read alignment. The default is 2.

            --segment-mismatches Read segments are mapped independently, allowing up to this many mismatches in each segment alignment. The default is 2.

            -m/--splice-mismatches <int> The maximum number of mismatches that may appear in the "anchor" region of a spliced alignment. The default is 0.
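
To see how these limits interact with Ns at fixed positions, it helps to check which read segment the Ns fall into and then assemble the command line. This is a rough sketch under stated assumptions: that reads are cut into consecutive 25 bp segments (how the leftover tail of an 82 bp read is handled is not modelled), that each N counts as one mismatch as assumed above, and that the invocation has the form `tophat [options] <index> <reads>`. The helper names, the index name `genome`, and the file name `reads.fq` are hypothetical. The script only builds the argument list and does not run TopHat.

```python
def segments_hit(read_length, positions, segment_length=25):
    """Map each 1-based read position to the 1-based segment containing it.

    segment_length=25 is an assumption about how reads are split.
    """
    return {
        pos: (pos - 1) // segment_length + 1
        for pos in positions
        if 1 <= pos <= read_length
    }


def tophat_command(index, reads, initial_read_mismatches=2,
                   segment_mismatches=2, splice_mismatches=0,
                   output_dir="tophat_out"):
    """Build (but do not execute) a TopHat argument list.

    Defaults mirror the values quoted in the post above.
    """
    return [
        "tophat",
        "--initial-read-mismatches", str(initial_read_mismatches),
        "--segment-mismatches", str(segment_mismatches),
        "--splice-mismatches", str(splice_mismatches),
        "-o", output_dir,
        index,
        reads,
    ]


if __name__ == "__main__":
    # Both Ns of an 82 bp read land in the same assumed segment.
    print(segments_hit(82, [54, 55]))
    print(" ".join(tophat_command("genome", "reads.fq",
                                  initial_read_mismatches=4)))
```

Under these assumptions, positions 54 and 55 share one segment, so the two Ns alone would use up the default of 2 segment mismatches and leave no room for a genuine mismatch in that segment. That may be a reason to raise the initial read limit and, if your TopHat version allows it, the segment limit.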

            Hope that helps,

            Thanks
