Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • strange SAM output

    Hi,

    I use tophat to run RNAseq project, the sequence is from solexa pair end .
    the below is part of one line in the result.

    ILLUMINA-7A0261_0001:3:77:8496:8665#0 161 chr1 554335 3 64M = 1395011 0

    from the SAM1 manual, I think the insert size is 840676(1395011-554335). but ISIZE(inferred Insert SIZE) is 0. how can explain it.




    zhidong

  • #2
    Can you try using other short read mapper such as bwa to see the insert size distribution? Or you can use the Bio:B::Sam perl modules to access the entire sam/bam, and infer the insert size distribution.

    Comment


    • #3
      thank you , macrowave.

      but I run RNAseq project. bwa seems not fit for mapping RNA sequence to Genome

      Comment


      • #4
        Can you provide more specific information, such as the fragment size, read length, the expected insert sizes, reference type (genome or transcriptome)? Form my experience, paired-end mapping with bowtie to predicted transcriptome yielded expected insert size distribution. By the way, BWA is perfectly fine for mRNA-Seq mapping to the genome, it's just harder to estimate insert size because of the introns, and you'll get weird inferred size distribution as the variable intron length. The Bio:B::Sam perl module has functions to access all proper mapped paired reads from sam/bam , from no matter which mapper you use. So it's a good idea to get all pairs in a region and see the real distribution.

        Comment


        • #5
          Just realized that your problem might be a bug in TopHat. In the newest TopHat release notes, they say 'TLEN field in SAM format is correctly output', which means you may be using an older release that doesn't output the isize correctly.

          Comment


          • #6
            and the sam flag 161 (1+32+128) means the paired reads mapped one forward, one reverse, but for some reason, the aligner thinks the pair isn't right (not properly aligned?), so that might be also a reason it returns a zero isize.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 08:47 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            60 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            59 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            54 views
            0 likes
            Last Post seqadmin  
            Working...
            X