Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How should I count the percentage of messenger RNA and ribosomal RNA in the samples

    Dear All

    We have some E coli. total RNA-seq data and my PI would like to count the percentage of messenger RNA and ribosomal RNA in the data. My idea is mapping the data to the transcriptome and rRNA data separately and then count the reads#.

    Do you think this is a doable plan? Actually, I have no idea where to find the transcriptome and rRNA reference for the E coli.

    Is there is an alternate way to do this analysis? Any suggestion will be appreciated.

    Thanks.

  • #2
    You can find the rRNA sequences for E. coli here. You can align rest of the data to the genome and count the reads using a GTF file. I don't recollect if E. coli has overlapping reading frames but otherwise it should be straight forward to do the counts.

    Comment


    • #3
      Originally posted by GenoMax View Post
      You can find the rRNA sequences for E. coli here. You can align rest of the data to the genome and count the reads using a GTF file. I don't recollect if E. coli has overlapping reading frames but otherwise it should be straight forward to do the counts.
      Thank you so much.

      Btw, how / where should I download the rRNA sequences? I explore the database for hours but still cannot figure out where to download the sequence.

      Thanks again.

      Al

      Comment


      • #4
        Use the links I included above. Then click on "nucleotide sequence" in the operations panel to the right.

        Comment


        • #5
          Originally posted by GenoMax View Post
          Use the links I included above. Then click on "nucleotide sequence" in the operations panel to the right.
          Great, Thanks,

          So what should I do is copy those sequences into a text file and build as a reference and map the sample sequencing data to this reference, right?

          One more question, can I just use the mappable reads number as the rRNA reads number?

          Thanks a lot again.

          AL

          Comment


          • #6
            Originally posted by GenoMax View Post
            Use the links I included above. Then click on "nucleotide sequence" in the operations panel to the right.

            Dear GenoMax

            I just finished the mapping to the rRNA reference and got 70% mapping rate. Is this a normal range for the mRNA seq? Can I say 70% reads are rRNA? how to understand the result? Could you please give me some pointers? Thank you so much.

            AL

            Comment


            • #7
              You say mRNA seq but had you done any ribosomal RNA depletion (e.g. https://www.illumina.com/products/by...-bacteria.html) on your samples? If not, it is not surprising to see a large fraction of your sample to be rRNA. Unless you are working with rRNA that part of the sequence data is wasted (reason to do ribo-depletion).

              Comment


              • #8
                Originally posted by GenoMax View Post
                You say mRNA seq but had you done any ribosomal RNA depletion (e.g. https://www.illumina.com/products/by...-bacteria.html) on your samples? If not, it is not surprising to see a large fraction of your sample to be rRNA. Unless you are working with rRNA that part of the sequence data is wasted (reason to do ribo-depletion).
                Oh no. I will confirm with the lab to see what kit they use. how should I remove those reads from the raw reads? I saw someone said it's not necessary to do that.

                Appreciate your help again.

                AL

                Comment


                • #9
                  You could extract the unmapped reads from the alignment you did (if you did include them in your alignment file) or redo the alignment and collect the unmapped reads in a separate file.

                  You could also ignore these reads when you do read counts. You would want to compare samples and make sure rRNA contamination levels are more or less the same across your pool of samples. You don't want one sample to have 70% rRNA and other 5% (if total number of reads are more or less similar).

                  Comment


                  • #10
                    Originally posted by GenoMax View Post
                    You could extract the unmapped reads from the alignment you did (if you did include them in your alignment file) or redo the alignment and collect the unmapped reads in a separate file.

                    You could also ignore these reads when you do read counts. You would want to compare samples and make sure rRNA contamination levels are more or less the same across your pool of samples. You don't want one sample to have 70% rRNA and other 5% (if total number of reads are more or less similar).
                    Thanks a lot. I mapped the reads by Tophat, one of the output files is unmapped.bam. Can I just convert this file to a fastq file and map again?

                    Another sample mapping is still running, I will compare the % of rRNA.

                    Thanks again for your help

                    AL

                    Comment


                    • #11
                      You should stop using TopHat for new projects. Use BBMap, STAR, HISAT2 etc.

                      Comment


                      • #12
                        Originally posted by GenoMax View Post
                        You should stop using TopHat for new projects. Use BBMap, STAR, HISAT2 etc.
                        Ok, Thanks a lot.

                        Is there an alternative software for cufflink? It seems very slow.

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Strategies for Sequencing Challenging Samples
                          by seqadmin


                          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                          03-22-2024, 06:39 AM
                        • seqadmin
                          Techniques and Challenges in Conservation Genomics
                          by seqadmin



                          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                          Avian Conservation
                          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                          03-08-2024, 10:41 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, Yesterday, 06:37 PM
                        0 responses
                        10 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, Yesterday, 06:07 PM
                        0 responses
                        9 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-22-2024, 10:03 AM
                        0 responses
                        51 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-21-2024, 07:32 AM
                        0 responses
                        67 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X