
  • Using TopHat with low-quality Illumina reads

    Hello,


    I know that TopHat is one of the best choices for finding exon-exon structures with short reads, but my problem is that the sequencer had trouble with its laser while detecting the nucleotides. As a result, all of the reads (82 bp in total) have an "N" at positions 54/55 of 82. My question is: can TopHat or even BLAT handle this?


    regards

    Philip

  • #2
    Originally posted by sphil
    Hello,
    As a result, all of the reads (82 bp in total) have an "N" at positions 54/55 of 82. My question is: can TopHat or even BLAT handle this?

    Philip

    Philip,
    I recently ran a comparison between TopHat and BLAT on my 75 nt data set, replacing certain positions with N. Both TopHat and BLAT handle this case well (with two Ns at positions 50 and 51, TopHat found 164,481 junctions, i.e. 1.3% fewer than with the original data set).

    Lifeng
    Last edited by lifeng.tian; 05-15-2010, 07:05 AM.
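
For anyone who wants to repeat this kind of test on their own reads, the masking step could be sketched roughly as follows. This is not Lifeng's actual script, just a minimal illustration; the function names `mask_read` and `mask_fastq` are made up for this example, and it assumes plain four-line FASTQ records with 1-based positions.

```python
def mask_read(seq, positions):
    """Return seq with the given 1-based positions replaced by 'N'."""
    bases = list(seq)
    for pos in positions:
        # Silently skip positions that fall outside the read.
        if 1 <= pos <= len(bases):
            bases[pos - 1] = "N"
    return "".join(bases)


def mask_fastq(lines, positions):
    """Yield FASTQ lines with every sequence line masked.

    Assumes strict four-line records (header, sequence, '+', quality),
    so the sequence is always the second line of each record.
    Quality strings are passed through unchanged.
    """
    for i, line in enumerate(lines):
        line = line.rstrip("\n")
        if i % 4 == 1:
            yield mask_read(line, positions)
        else:
            yield line


# Example: an 82 bp read with Ns forced at positions 54 and 55.
read = "A" * 82
masked = mask_read(read, [54, 55])
print(masked[52:56])  # prints ANNA
```

Mapping the original and the masked files with the same settings and comparing the junction counts would then give a number comparable to the 1.3% quoted above.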



    • #3
      Hey...
      Thanks for the fast reply. I will post my results in case....

      Edit:

      I know this should be a new thread, but I think it's almost a one-question/one-answer post, so:
      can TopHat handle long FLX-454 reads (avg. length of 260 bp)? Or is BLAT the state of the art for finding exon-exon structures with long reads?

      greets

      philip
      Last edited by sphil; 05-15-2010, 04:40 AM.



      • #4
        Originally posted by sphil
        Hey...

        can TopHat handle long FLX-454 reads (avg. length of 260 bp)? Or is BLAT the state of the art for finding exon-exon structures with long reads?

        philip

        With my Illumina data (130 nt, paired), BLAT works better than TopHat: it found 10% more known RefSeq splice junctions.

        It'll be interesting to compare both with your data. My bet is that BLAT will win.



        • #5
          Hey, OK, I will let you know if anything unexpected happens!



          • #6
            Originally posted by sphil
            Hello,


            I know that TopHat is one of the best choices for finding exon-exon structures with short reads, but my problem is that the sequencer had trouble with its laser while detecting the nucleotides. As a result, all of the reads (82 bp in total) have an "N" at positions 54/55 of 82. My question is: can TopHat or even BLAT handle this?


            regards

            Philip

            Hi Philip,
            TopHat uses Bowtie internally, and there is a combination of parameters you can adjust so that reads still map in the presence of n mismatches (I am assuming that an 'N' at any position is treated as a mismatch by the algorithm). Some of these parameters are:

            --initial-read-mismatches Reads are initially mapped, allowing up to this many mismatches in each read alignment. The default is 2.

            --segment-mismatches Read segments are mapped independently, allowing up to this many mismatches in each segment alignment. The default is 2.

            -m/--splice-mismatches <int> The maximum number of mismatches that may appear in the "anchor" region of a spliced alignment. The default is 0.

            Hope that helps,

            Thanks
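
As a rough illustration of how the three options above might be combined, here is a small sketch that assembles a TopHat command line. The option names are the ones quoted in this post and may differ between TopHat versions, so check `tophat --help` first. The helper name `build_tophat_command`, the index name `my_index`, and the file name `reads.fastq` are placeholders invented for this example, and raising the read mismatch limit to 4 is only a guess at what two Ns per read might need, not a tested recommendation.

```python
import shlex


def build_tophat_command(index, reads,
                         initial_read_mismatches=2,
                         segment_mismatches=2,
                         splice_mismatches=0):
    """Return an argument list for TopHat (hypothetical helper).

    Defaults mirror the defaults quoted in the post above.
    The command is only built here, not executed.
    """
    return [
        "tophat",
        "--initial-read-mismatches", str(initial_read_mismatches),
        "--segment-mismatches", str(segment_mismatches),
        "--splice-mismatches", str(splice_mismatches),
        index,   # Bowtie index base name (placeholder)
        reads,   # FASTQ file with the reads (placeholder)
    ]


# Assumed example values: allow two extra mismatches for the two Ns.
cmd = build_tophat_command("my_index", "reads.fastq",
                           initial_read_mismatches=4)
print(" ".join(shlex.quote(arg) for arg in cmd))
```

Given Lifeng's result earlier in the thread, the defaults may already be enough, so it is probably worth running once with and once without the relaxed settings.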
