Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Confusion about one flag in sam format

    Hello,

    I met one problem in the Tophat mapping. Does anyone know what the flag difference between 97 and 99 is?

    The annotation of 99 is: read paired, read mapped in proper pair, mate reverse strand, first in pair

    The annotation of 97 is: read paired, mate reverse strand, first in pair

    Seems that the only difference is “read mapped in proper pair”. Is it due to the estimated insertion length? Can 97 be counted as a concordant pair?

    Many thanks.

  • #2
    "Proper pair" likely means that both reads are on the same chromosome, point in towards each other, and maybe that their distance is some appropriate number (I don't know how bowtie determines that, maybe the settings you give it in the command line)

    So any of those things could be off, and give you a 97 flag.

    So if the reads point away from each other, or are on different chromosomes, no, you should not count them as concordant.

    Comment


    • #3
      The meaning of "read mapped in proper pair" is down to the aligner/assembler, and would consider the expected read pair orientation (typically --> <--, also <-- --> for mate pair, and --> --> or <-- <-- for native Roche 454 reads if not flipped to look like traditional paired reads) if mapped to the same reference, and the expected fragment/template length (i.e. separation of the reads, field TLEN aka ISIZE).
      Last edited by maubp; 11-09-2012, 03:19 AM. Reason: typo

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin


        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
        Yesterday, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      39 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      41 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      35 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      55 views
      0 likes
      Last Post seqadmin  
      Working...
      X