Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Error when getting rRNA and tRNA rmask table from UCSC

    Im trying to get a GTF with rRNA and tRNA for mm10, as input for --mask-file for Cufflinks.

    I chose the rmask table for mm10, filter for "repClass," type rRNA "OR" repClass = "tRNA"

    I get this error:

    ---------------------------------------------------------------------------
    Can't start query:
    select genoStart,genoEnd,repName,swScore,strand from rmsk where genoName='chr1' and (rmsk.repClass = 'tRNA' OR (rRNA))

    ---------------------------------------------------------------------------
    ---------------------------------------------------------------------------
    mySQL error 1054: Unknown column 'rRNA' in 'where clause' (profile=<noProfile>, host=mysqlrr, db=mm10)
    ---------------------------------------------------------------------------


    Any help?

  • #2
    Alternatively, get the mm10 GTF from Ensembl: ftp://ftp.ensembl.org/pub/release-75...Cm38.75.gtf.gz

    Grep out the sections you need

    Code:
    $ cat Mus_musculus.GRCm38.75.gtf | grep "rRNA"

    Comment


    • #3
      How would that work if I wanted the whole chrMT instead?

      Btw, I use the iGenomes UCSC mm10 GTF, but maybe its the same.

      Comment


      • #4
        Originally posted by sindrle View Post
        How would that work if I wanted the whole chrMT instead?

        Btw, I use the iGenomes UCSC mm10 GTF, but maybe its the same.
        Try this (should include MT). Any GTF file you wish to use should work. Adjust grep terms as needed.

        Code:
        $ grep -w 'rRNA\|Mt_tRNA\|MT' Mus_musculus.GRCm38.75.gtf > mask_file.gtf
        OR

        Code:
        $ egrep -wi 'rRNA|Mt_tRNA|MT' Mus_musculus.GRCm38.75.gtf > mask_file.gtf

        Comment


        • #5
          Thank you very much. Oddly I get empty outputs.

          When I open mm10 GTF in textedit I cant find chrM. Cant find any rRNA or tRNA as well.

          Comment


          • #6
            At least the Ensembl version of the GTF I linked above has "MT" (not chrM) as the designation. I also see rRNA and Mt_tRNA in that file.

            Are you using the GTF from iGenomes site?

            Comment


            • #7
              Yes I am using iGenomes.

              Comment


              • #8
                UCSC mm10 GTF from iGenomes appears to be very different than the Ensembl GRCm38 linked above. For one it does not seem to have any chrM annotations. I am getting the Ensembl one from iGenomes and will look at it later today.

                Comment


                • #9
                  Great! Thank you.

                  Comment


                  • #10
                    Ensembl mm10 GTF file from iGenomes has annotations for rRNA, Mt_tRNA, MT.

                    Comment


                    • #11
                      Thanks a lot. Ill try some AWK or grep and see if I can get them out.

                      Comment


                      • #12
                        Any chance you know what they are called in the iGenomes UCSC mm10?

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Essential Discoveries and Tools in Epitranscriptomics
                          by seqadmin




                          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                          04-22-2024, 07:01 AM
                        • seqadmin
                          Current Approaches to Protein Sequencing
                          by seqadmin


                          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                          04-04-2024, 04:25 PM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, Yesterday, 11:49 AM
                        0 responses
                        15 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 04-24-2024, 08:47 AM
                        0 responses
                        16 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 04-11-2024, 12:08 PM
                        0 responses
                        61 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 04-10-2024, 10:19 PM
                        0 responses
                        60 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X