Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • miRNA missed by Bowtie alignment

    Hi,

    I hope this question does not sound to stupid. I have the following problem. I have NGS data from miRNA sequencing. I trimm the reads, so that I have a minimum length of 16nt and a maximum length of 35nt. I used bowtie (not bowtie2) to align/map the reads against a reference. My reference is a bowtie index, based on the human mature miRNAs taken from miRBase. In order to build the index and map the sequences after I exchanged all "U"s by "T"s.

    My problem is that I have for example a read with the following sequence:
    TCTCCCAACCCTTGTACCAGTGT

    Bowtie tells me that this sequence does not match.If I blast it I find it to be mir-150-5p. The identitity is 100%.
    My read is 1nt LONGER than the actual reference sequence (TCTCCCAACCCTTGTACCAGTG). Therefore bowtie does not find it. I tried different parameters for bowtie, but only manage to get it, if I shorten my read by 1 nt, which is of course not an option, as this is just a single example we found by random.

    Do I make here any principle error? Is it impossible to map a read which is longer than the reference? Any suggestion? Do I need to "artifially" modify my mirBase sequences before I make an index? Like attaching 10Gs at both ends?

  • #2
    The best option is to use a different aligner which is able to map reads off the ends of your reference scaffolds. Why are you using bowtie1, anyway? Bowtie2 is much better.

    Comment


    • #3
      Thanks for the answer.

      I am using bowtie1 as it is normally sad to be better when you work with short reads. I already tried bowtie2 with the whole dataset, the results are pretty different and less good. But I did not invest too much time in bowtie2 with regard to miRNA. For other projects I am using bowtie2 as we have long reads normally. I tested the specific sequence I mentioned above with bowtie2 also just "found" it after I shortened it,

      Maybe I need to use a completely different aligner?

      Comment


      • #4
        You should work with bowtie1 mapping small reads; no doubt. Could you please specify the parameters you are using. And what kind of reference are you using, miRNA database or something similar?
        Best

        Comment


        • #5
          You would need local, rather than global, alignment for this, which bowtie1 doesn't do (at least if memory serves). Bowtie2 can do local alignment, but you might also follow Brian's advice and just use a different aligner (I suspect that he'd suggest BBMap ).

          Comment


          • #6
            Originally posted by dpryan View Post
            You would need local, rather than global, alignment for this, which bowtie1 doesn't do (at least if memory serves). Bowtie2 can do local alignment, but you might also follow Brian's advice and just use a different aligner (I suspect that he'd suggest BBMap ).

            I would have suggested that, but I've never actually tried it with 16bp reads. It works fine with 25bp reads, though. For short reads you can increase sensitivity by using a shorter kmer length and reducing max gap length, using flags like "k=10 maxindel=8". BBMap does global alignment, but it still extends off the ends of scaffolds up to around 44% of the read length by default.

            Comment


            • #7
              Thanks for the answers!

              I was also thinking about the local alignment. Therefore I at least tried bowtie2, but now way.

              I will try 2 things:
              a) new aligner - as you suggested BBMap, I would also try this SHRiMP
              b) modifying my miRNA-reference set

              I used bowtie with the following parameters: bowtie -f -l 16 -v 2 -t -a --best --strata

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              59 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              57 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              53 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              56 views
              0 likes
              Last Post seqadmin  
              Working...
              X