Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Morgane_AUS
    Member
    • Aug 2014
    • 25

    SAM file after Bowtie is messed up

    Hi,



    I have chip-seq data from E. coli (51 bp). I mapped my fastq file to my reference genome (custom build) using Bowtie in Galaxy. In the SAM output file, some rows have the sequence in the quality score columns, and the quality scores in the OPT column. Some rows are fine.



    Anyone would know what is causing it and how to fix that ?



    Thanks
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    Are you sure that's actually the case? It's incredibly more likely that you're just miscounting the columns.

    Comment

    • Morgane_AUS
      Member
      • Aug 2014
      • 25

      #3
      See step 14

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Have you set the file format to "fastqsanger" for your original data files (I can't tell from the history you shared). Here is how you would do it: https://wiki.galaxyproject.org/Suppo...ognize_dataset Then you should not have to groom your data. If this is recent data correct so it should already be in sanger fastq format.

        It appears that part of illumina fastq header (1:N:0:18) is missing from the reads that appear to have an alignment (at least that is what it looks like in the web page).

        Comment

        • Morgane_AUS
          Member
          • Aug 2014
          • 25

          #5
          Hi,

          thank you for looking at my data.

          I have tried without grooming, just changing data type (my reads are illumina 1.9 encoding) and I have the exact same result.

          The illumina fastq header (1:N:0:18) is present for all reads in the fastq file.

          I have tried galaxy GVL instance and galaxy main. Same results.

          I don't have this problem when I use BWA mapping. But it's better to use Bowtie for E. coli reads since BWA looks for intron so better used for eukaryotes is that right ?

          Comment

          • blancha
            Senior Member
            • May 2013
            • 367

            #6
            I don't have this problem when I use BWA mapping. But it's better to use Bowtie for E. coli reads since BWA looks for intron so better used for eukaryotes is that right ?
            No, BWA, like Bowtie does not take into account the introns.
            Only splice-junction aware aligners, like TopHat and STAR do, in which case you have to provide them with the genome annotation indicating the location of the junctions.
            TopHat actually delegates the alignment to Bowtie1 or 2, and only handles the splicing.

            In the link to the Galaxy instance that you posted, you are using a version of Bowtie that dates back to 2010, version 0.12.7. It's not clear from your post if you've already tried this, but the first troubleshooting step I would take would be to upgrade to a more modern version of Bowtie. There is a long list of bugs that have been fixed in Bowtie since 2010.

            Comment

            • Morgane_AUS
              Member
              • Aug 2014
              • 25

              #7
              Thanks for that ! That's really helpful.

              I didn't check which version of Bowtie I was using thinking that the Galaxy main instance would display the most up to date version. I will have a look at that.

              Thanks a lot.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                Yesterday, 10:05 AM
              • SEQadmin2
                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                by SEQadmin2


                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                Introduction

                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                05-22-2026, 06:42 AM
              • SEQadmin2
                Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                by SEQadmin2

                Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                05-06-2026, 09:04 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Yesterday, 12:03 PM
              0 responses
              19 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, Yesterday, 11:40 AM
              0 responses
              14 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-28-2026, 11:40 AM
              0 responses
              29 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-26-2026, 10:12 AM
              0 responses
              31 views
              0 reactions
              Last Post SEQadmin2  
              Working...