Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Can BLAST do the RNA-seq reads mapping efficiently?

    Hi all,

    I'm dealing with low-quality reads of a bacteria which are 100 base long. Thus, most of the software cannot map back most of the reads to the reference genome with no more than three mismatches. So can I use BLAST to do the mapping work? Will it to slow?

    Thanks

  • #2
    amount of reads? size of your reference genome? what software you tried already?

    Comment


    • #3
      if you think you have more than three mismatches, why do you limit your aligner to it? (Your questions sounds a bit as if you think that most short-read aligners had a hard-coded limit to the maximum number of mismatches allowed. They don't.)

      Comment


      • #4
        if you have illulmina reads you might want to read this review:
        Discussion of any scientific study related to high content or next generation genomics. Whole genome association, metagenomics, digital gene expression, etc.

        Comment


        • #5
          Originally posted by Thorondor View Post
          if you have illulmina reads you might want to read this review:
          http://seqanswers.com/forums/showthread.php?t=11045
          Can someone send me the paper via PM ?
          Thank you !

          Comment


          • #6
            Originally posted by Thorondor View Post
            amount of reads? size of your reference genome? what software you tried already?
            Hi Thorondor,

            I used bowtie to map32129789 reads (76bp) to the bacteria E.coli K12 with a tolerance of mismatches <=3. However, only 0.02% of the reads mapped uniquely to the genome.
            After using BLAST check the result I found for most of the reads, only the previous 35b of reads got hit in the genome. Thus, it's probably because of the sequencing error that I cannot use bowtie to map them back.

            So I want to try BLAST. But do you think it's too slow? Or does this program work accurately enough for reads mapping?

            Thanks for reply~

            Comment


            • #7
              Originally posted by Simon Anders View Post
              if you think you have more than three mismatches, why do you limit your aligner to it? (Your questions sounds a bit as if you think that most short-read aligners had a hard-coded limit to the maximum number of mismatches allowed. They don't.)
              Hi Simon,


              The problem is the quality of the reads, which you can see from the upper post. The sequencing error made only smaller than 35bp of most of the reads can map back to the reference genome. If I want to take advantage of the current reads, do you think it's good enough for me to use BLAST?

              Thanks~

              Comment


              • #8
                Actually, Bowtie should disregard mismatches on low-quality bases, but this does not work that well.

                However, why don't you simply trim off the bad-quality parts and give the trimmed reads to Bowtie? In a bacterial genome, 35 bp should be more than enough to get good matches.

                regarding BLAST: It is way too slow for millions of reads, and its model of scoring mismatches is based on assuming mutations rather than read errors to be the cause, which is inappropriate.

                Comment


                • #9
                  so why not trim the reads first and then try to map without quality values? You don't want to assemble bad reads anyway and they mostly won't overlap if you don't trim. Or you could just cut all your fastq reads to a length of 40 bp and then try to map with BWA.

                  Maybe you also want to have a look at Cufflinks for your assembly? http://cufflinks.cbcb.umd.edu/

                  edit: simon was faster :-/

                  Comment


                  • #10
                    Originally posted by Simon Anders View Post
                    Actually, Bowtie should disregard mismatches on low-quality bases, but this does not work that well.

                    However, why don't you simply trim off the bad-quality parts and give the trimmed reads to Bowtie? In a bacterial genome, 35 bp should be more than enough to get good matches.

                    regarding BLAST: It is way too slow for millions of reads, and its model of scoring mismatches is based on assuming mutations rather than read errors to be the cause, which is inappropriate.
                    Thanks Simon! That's very helpful.

                    Comment


                    • #11
                      Originally posted by Thorondor View Post
                      so why not trim the reads first and then try to map without quality values? You don't want to assemble bad reads anyway and they mostly won't overlap if you don't trim. Or you could just cut all your fastq reads to a length of 40 bp and then try to map with BWA.

                      Maybe you also want to have a look at Cufflinks for your assembly? http://cufflinks.cbcb.umd.edu/

                      edit: simon was faster :-/

                      Thanks Thorondor!

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Strategies for Sequencing Challenging Samples
                        by seqadmin


                        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                        03-22-2024, 06:39 AM
                      • seqadmin
                        Techniques and Challenges in Conservation Genomics
                        by seqadmin



                        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                        Avian Conservation
                        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                        03-08-2024, 10:41 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Yesterday, 06:37 PM
                      0 responses
                      10 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, Yesterday, 06:07 PM
                      0 responses
                      9 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 03-22-2024, 10:03 AM
                      0 responses
                      50 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 03-21-2024, 07:32 AM
                      0 responses
                      67 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X