  • SAM files aligned by Bowtie - Mystified.

    Hi,

    This is one of my first forays into SOLiD data, so please bear with me if the question is basic.

    I aligned the SOLiD reads separately (forward and reverse) using Bowtie in colorspace mode:
    bowtie -t -p 4 -C --sam --chunkmbs 1000 /Bowtie-Reference/Bowtie-C/h_sapiens_37_asm_c -f read1.csfasta -Q read1.QV.qual read1.sam

    I ended up with a SAM file that appears to me to be malformed:

    1_2_1079_F3 4 * 0 0 * * 0 0 CCGTGATGGCTACCCCTGGGGTTACATATAAATT *?626A@@:@-@@@@2>>@6*5=**.;.--<@=3 XM:i:0

    1_3_10_F3 0 gi|224589818|ref|NC_000006.11| 33410152 255 33M * 0 0 ATTCCCGCCTCTCCTTTCATTTGTCCACATCTC \IFIAGPMOYZL>PRAN`a/#S30FHP?J/!!' XA:i:2 MD:Z:33 NM:i:0 CM:i:4

    I took the "*" in the third column, the "@" characters, and what looks like a CIGAR string in the next line to be malformed elements. I would very much appreciate it if anyone could shed some light on this and suggest a solution.

    Thanks

    Uma

  • #2
    I am not sure I understand your question. It appears to me that your first SAM line (starting with '1_2...') is a valid line where the read does not map, while your second SAM line (starting with '1_3...') is also a valid line, representing a read that does map.
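
To illustrate the reply above: in SAM, the second column is a bitwise FLAG, and bit 0x4 marks a read as unmapped, which is why the first line carries "*" for the reference name and the CIGAR. The following is a minimal Python sketch and not part of the original thread. The function name `describe_sam_line` is hypothetical, and it splits on whitespace only because the lines pasted here lost their tabs (real SAM is tab-delimited).

```python
def describe_sam_line(line):
    """Summarize the mandatory SAM fields that relate to mapping status."""
    # Assumption: fields are separated by whitespace, as in the pasted lines.
    # A real SAM file is tab-delimited, so line.split("\t") would be used there.
    fields = line.split()
    qname, flag, rname, pos, mapq, cigar = fields[:6]
    flag = int(flag)
    unmapped = bool(flag & 0x4)  # FLAG bit 0x4: segment unmapped
    return {
        "read": qname,
        "mapped": not unmapped,
        # FLAG bit 0x10: read aligned to the reverse strand
        "strand": None if unmapped else ("-" if flag & 0x10 else "+"),
        "reference": None if unmapped else rname,
        "position": None if unmapped else int(pos),
        "cigar": None if unmapped else cigar,
    }


# The two lines quoted in the first post (raw strings keep the backslash intact).
line1 = r"1_2_1079_F3 4 * 0 0 * * 0 0 CCGTGATGGCTACCCCTGGGGTTACATATAAATT *?626A@@:@-@@@@2>>@6*5=**.;.--<@=3 XM:i:0"
line2 = r"1_3_10_F3 0 gi|224589818|ref|NC_000006.11| 33410152 255 33M * 0 0 ATTCCCGCCTCTCCTTTCATTTGTCCACATCTC \IFIAGPMOYZL>PRAN`a/#S30FHP?J/!!' XA:i:2 MD:Z:33 NM:i:0 CM:i:4"

for line in (line1, line2):
    print(describe_sam_line(line))
```

Under these assumptions the first line is reported as unmapped and the second as mapped on the forward strand with a 33M CIGAR, which matches the reading given in the reply.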



    • #3
      Originally posted by westerman
      I am not sure I understand your question. It appears to me that your first SAM line (starting with '1_2...') is a valid line where the read does not map, while your second SAM line (starting with '1_3...') is also a valid line, representing a read that does map.
      One thing I noticed is that the Phred scores for the second read span too wide a range to fit any of the typical quality encodings.

      Edit: Although perhaps it's just an encoding I've never seen! I had written a reply similar to yours yesterday before noticing the Phred issue.
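
The observation about the quality range can be checked by hand. The sketch below is not from the thread: it assumes the Phred+33 encoding that the SAM specification prescribes for the QUAL column, and the helper name `phred_range` is hypothetical. It converts each quality character to a score and reports the lowest and highest values for the two reads quoted in the first post.

```python
def phred_range(qual, offset=33):
    """Return (min, max) Phred scores for a quality string.

    Assumption: scores are encoded as ASCII code minus `offset`
    (33 for the SAM QUAL column).
    """
    scores = [ord(ch) - offset for ch in qual]
    return min(scores), max(scores)


# Quality strings copied from the two SAM lines in the first post.
qual_unmapped = r"*?626A@@:@-@@@@2>>@6*5=**.;.--<@=3"
qual_mapped = r"\IFIAGPMOYZL>PRAN`a/#S30FHP?J/!!'"

print("unmapped read:", phred_range(qual_unmapped))
print("mapped read:  ", phred_range(qual_mapped))
# Reading the mapped read as Phred+64 instead yields negative scores,
# so that encoding does not fit either.
print("mapped read, offset 64:", phred_range(qual_mapped, offset=64))
```

With offset 33, the mapped read's characters run from "!" to "a", which is the wide spread the post refers to, while the unmapped read stays in a much narrower band.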
