
  • tophat pair end reads

    I just finished mapping my ABI paired-end colorspace reads with TopHat. The reads were 50 bp (F3) and 25 bp (F5).

    When I tried to convert the .bam file into .sam for Cufflinks assembly using Picard Tools, I got this error:

    Exception in thread "main" java.lang.RuntimeException: SAM validation error: ERROR: Record 127338, Read name 507_970_1560_F3, Mate Alignment start (542169) must be <= reference sequence length (531507) on reference Contig1

    It looks like the read's mate is being "mapped" past the end of the reference contig?
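To see which records trigger this kind of validation error, one option is to scan the SAM text directly. The following is a minimal sketch, not Picard's own check: it assumes a tab-delimited SAM stream with `@SQ` header lines, and the function name `find_out_of_range_records` and the toy records (flag, positions, CIGAR) are hypothetical. Only the read name, contig name, and the two numbers come from the error message above.

```python
def find_out_of_range_records(sam_lines):
    """Yield (read_name, field, ref_name, position, ref_length) for records whose
    alignment start (POS) or mate start (PNEXT) exceeds the @SQ reference length."""
    ref_lengths = {}
    for line in sam_lines:
        line = line.rstrip("\n")
        if not line:
            continue
        if line.startswith("@"):
            # Header line: collect reference lengths from @SQ SN:/LN: tags.
            if line.startswith("@SQ"):
                tags = dict(f.split(":", 1) for f in line.split("\t")[1:])
                ref_lengths[tags["SN"]] = int(tags["LN"])
            continue
        cols = line.split("\t")
        qname, rname, pos = cols[0], cols[2], int(cols[3])
        rnext, pnext = cols[6], int(cols[7])
        # "=" in RNEXT means the mate is on the same reference as the read.
        mate_ref = rname if rnext == "=" else rnext
        for field, ref, p in (("POS", rname, pos), ("PNEXT", mate_ref, pnext)):
            if ref in ref_lengths and p > ref_lengths[ref]:
                yield (qname, field, ref, p, ref_lengths[ref])


# Toy input: header length and mate start taken from the error message;
# every other field is made up for illustration.
toy_sam = [
    "@SQ\tSN:Contig1\tLN:531507",
    "507_970_1560_F3\t99\tContig1\t531400\t255\t50M\t=\t542169\t0\t*\t*",
    "ok_read\t0\tContig1\t100\t255\t50M\t*\t0\t0\t*\t*",
]
bad_records = list(find_out_of_range_records(toy_sam))
for record in bad_records:
    print(record)
```

On a real file, the same function could be fed the lines of a SAM file opened with `open(path)`; the offending records can then be inspected before deciding how to handle them.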

    Bowtie has --fr, --rf, and --ff options for aligning paired-end reads. Does TopHat take into account that SOLiD mate pairs are both in the forward direction, so that --ff would be used?
    Last edited by damiankao; 10-28-2010, 01:21 AM.

  • #2

    TopHat maps the left and right reads separately with Bowtie; that is, it does not use Bowtie's paired-end search modes (--fr, --rf, --ff). From the mapped reads, TopHat then calls two reads a pair only if they lie on different strands (reads on the same strand are ignored) and their inner distance is within the user-specified range.
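The pairing rule described in this reply can be sketched roughly as follows. This is an illustration of the rule as stated here, not TopHat's actual code: the `Hit` class, the function names, the 1-based coordinates, and the definition of inner distance as the gap between the two reads' facing ends are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    start: int   # 1-based leftmost aligned position (assumed convention)
    length: int  # aligned read length in bases
    strand: str  # "+" or "-"

    @property
    def end(self):
        # Last reference position covered by the read.
        return self.start + self.length - 1


def inner_distance(a, b):
    """Number of reference bases between the two reads (negative if they overlap)."""
    left, right = sorted((a, b), key=lambda h: h.start)
    return right.start - left.end - 1


def is_pair(a, b, min_inner, max_inner):
    """Accept two independently mapped reads as a pair under the rule described above."""
    if a.strand == b.strand:
        return False  # same-strand hits are ignored
    return min_inner <= inner_distance(a, b) <= max_inner


f3 = Hit(start=1000, length=50, strand="+")
f5_opposite = Hit(start=1200, length=25, strand="-")
f5_same = Hit(start=1200, length=25, strand="+")

print(inner_distance(f3, f5_opposite))     # 150
print(is_pair(f3, f5_opposite, 100, 200))  # True
print(is_pair(f3, f5_same, 100, 200))      # False
```

Under this rule, two mates that both map to the forward strand would never be reported as a pair, which bears on the --ff question above.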



    • #3
      TopHat mapping outside reference contig

      I also received this error:
      Mate Alignment start must be <= reference sequence length on reference
      when trying to run Picard to convert to BAM.
      Was this ever resolved?
      Thanks,



      • #4
        TopHat

        I was wondering why the inner distance matters in TopHat and how it relates to alignment. Has anyone tried varying the inner distance (-r/--mate-inner-dist <int>) and checked what effect, if any, it has on the alignments?
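For reference, the value passed to -r/--mate-inner-dist is the expected mean distance between the inner ends of the two mates, that is, the fragment length minus both read lengths. A small worked example follows; the 300 bp fragment size is a hypothetical value chosen for illustration, while the 50 bp and 25 bp read lengths are the ones from the first post.

```python
def expected_inner_distance(fragment_len, left_read_len, right_read_len):
    """Fragment length minus both read lengths.

    The result is negative when the two reads are expected to overlap.
    """
    return fragment_len - left_read_len - right_read_len


# Hypothetical 300 bp fragments with the 50 bp F3 and 25 bp F5 reads from this thread.
print(expected_inner_distance(300, 50, 25))  # 225
```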



        • #5
          I aligned the reads using TopHat version 2, but the error message remains. How can a position beyond the actual contig length exist, given that TopHat aligns the two reads separately?
