Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tophat chromosome position greater than chromosome size

    I use tophat to map the rna-seq reads (paired-end). It seems that some of the reads in the .sam do not make sense. Please see some examples below:

    Example line 1:
    NB500923:20:H7WCJBGXX:1:12109:13961:14748 385 chr1 11646 3 151M chr22 114359002 0 CTTTTGGATTTTTGCCAGTCTAACAGGTGAAGCCCTGGAGATTCTTATTAGTGATTTGGGCTGGGGCCTGGCCATGTGTATTTTTTTAAATTTCCACTGATGATTTTGCTGCATGGCCGGTGTTGAGAATGACTGCGCAAATTTGCCGGAT AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE<EEEEEEEEEEEEEEEEEEEEE/AEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEEEEAEA/EAA<EEEEEEAEA/<AEE XA:i:0 MD:Z:151 NM:i:0 NH:i:2 CC:Z:chr15 CP:i:102519374 HI:i:0

    In the above line, it says the start position of the mate on chr22 is 114359002, however, the size of the chr22 is only 51304566

    Example line 2:
    NB500923:20:H7WCJBGXX:1:23107:14442:20318 323 chr1 11696 0 151M chrUn_GL000249 155257832 0 GTGATTTGGGCTGGGGCCTGGCCATGTGTATTTTTTTAAATTTCCACTGATGATTTTGCTGCATGGCCGGTGTTGAGAATGACTGTGCAAATTTGCCGGATTTCCTTCGCTGTTCCTGCATGTAGTTTAAACGAGATTGCCAGCACCGGGT AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEEEEEEEAEEEEEAEEEEEEE<EEEEEEEEEEEEEEEAAEEA<EEEEEE XA:i:2 MD:Z:85C21T43 NM:i:2 NH:i:8 CC:Z:= CP:i:11696 HI:i:4

    It says the start position of the mate on chrUn_GL000249 is 155257832, however, the total size of chrUn_GL000249 is only 38502.

    Can anyone tell how this could happen? and how to fix the file?

    Thanks,

  • #2
    Can anyone help?

    Comment


    • #3
      Looks like this is a bug in the version of tophat2 that you're using. If you're using the most recent version, then please report this to the authors so they can fix it.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Recent Advances in Sequencing Analysis Tools
        by seqadmin


        The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
        Yesterday, 07:48 AM
      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 07:17 AM
      0 responses
      11 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 05-02-2024, 08:06 AM
      0 responses
      19 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-30-2024, 12:17 PM
      0 responses
      20 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-29-2024, 10:49 AM
      0 responses
      28 views
      0 likes
      Last Post seqadmin  
      Working...
      X