Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bacterial ITS analysis

    Hello, I was wondering if anyone has done Bacterial ITS (not fungal ITS) community diversity analyses before? I do 16S rRNA all the time using Qiime or Mothur. However, the two software doesn't support bacterial ITS. If anyone has done it before, can you tell me your analysis work flow?

    Thanks
    Ben

  • #2
    Originally posted by SDPA_Pet View Post
    However, the two software doesn't support bacterial ITS.
    Sure they do, however, afaik there's no reference for bacterial ITS so you'd be doing just de novo clustering. No idea how you'd go about assigning taxonomy to your reads. Blastn against a very sparse collection perhaps?
    savetherhino.org

    Comment


    • #3
      HI rhinoceros,

      Thanks. "Blastn against a very sparse collection". Do you mean BLASTn against a small and specific database. I know ncbi has mircrobial rRNA database. I am not sure if this database only has 16S or includes ITS regions. It is obvious that ncbi NT database is too big for me.

      Any suggestions?

      Ben

      Comment


      • #4
        Originally posted by SDPA_Pet View Post
        HI rhinoceros,

        Thanks. "Blastn against a very sparse collection". Do you mean BLASTn against a small and specific database. I know ncbi has mircrobial rRNA database. I am not sure if this database only has 16S or includes ITS regions. It is obvious that ncbi NT database is too big for me.

        Any suggestions?

        Ben
        NCBI hosts a 16S db, which doesn't include ITS seqs. Bacterial ITS sequencing has never been a thing, so a bacterial ITS reference db would be limited to the few fully sequenced bacterial genomes, hence "sparse" and pretty much useless..
        Last edited by rhinoceros; 02-17-2014, 12:40 AM.
        savetherhino.org

        Comment


        • #5
          Originally posted by rhinoceros View Post
          NCBI hosts a 16S db, which doesn't include ITS seqs. Bacterial ITS sequencing has never been a thing, so a bacterial ITS reference db would be limited to the few fully sequenced bacterial genomes, hence "sparse" and pretty much useless..
          Do you mean if I do BLASTn search, I should use NCBI nr database instead of nt database?

          I know NCBI BLASTn web portal can search nr and nt database at the same time? Can BLAST command line BLAST against two databases at the same time?

          Comment


          • #6
            Originally posted by SDPA_Pet View Post
            Do you mean if I do BLASTn search, I should use NCBI nr database instead of nt database?

            I know NCBI BLASTn web portal can search nr and nt database at the same time? Can BLAST command line BLAST against two databases at the same time?
            What I mean is that you should forget about the whole thing. Anyway, nr is a protein db. The default NCBI's web blastn is against non-redundant version of nt. But many sequences in nt (or nr for that matter) don't have any species annotation other than "uncultured bacterium/archaeon/organism". What good would that information be to you? Optimally, you'd build an ITS subset of e.g. refseq_genomic, but again, this db wouldn't have that many sequences, so it would be a rather miserable reference db.
            savetherhino.org

            Comment


            • #7
              Originally posted by rhinoceros View Post
              What I mean is that you should forget about the whole thing. Anyway, nr is a protein db. The default NCBI's web blastn is against non-redundant version of nt. But many sequences in nt (or nr for that matter) don't have any species annotation other than "uncultured bacterium/archaeon/organism". What good would that information be to you? Optimally, you'd build an ITS subset of e.g. refseq_genomic, but again, this db wouldn't have that many sequences, so it would be a rather miserable reference db.
              OK. Thanks. Just double check with you this,

              The default NCBI BLASTn database on the web is nucleotide collection (nt/nr).

              So the nt means nt database
              and nr means non-redundant version of nt?

              Yes, I understand it is very hard to get exact species name for microbes even using 16S rRNA gene marker.

              Comment


              • #8
                Originally posted by SDPA_Pet View Post
                OK. Thanks. Just double check with you this,

                The default NCBI BLASTn database on the web is nucleotide collection (nt/nr).

                So the nt means nt database
                and nr means non-redundant version of nt?
                Yeah, it's a special non-redundant version of the nt db which they distribute via their ftp. In their ftp, the "main" protein db is called just "nr".

                Yes, I understand it is very hard to get exact species name for microbes even using 16S rRNA gene marker.
                Well, yeah, that's usually the case. However, as I'm sure you're aware, there's no consensus of what makes a prokaryote species, and anyway, genus level annotation is usually good enough. If you want more specific than that, you'd be looking into phylogenomics instead..
                savetherhino.org

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  04-22-2024, 07:01 AM
                • seqadmin
                  Current Approaches to Protein Sequencing
                  by seqadmin


                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                  04-04-2024, 04:25 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Today, 11:49 AM
                0 responses
                11 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, Yesterday, 08:47 AM
                0 responses
                16 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-11-2024, 12:08 PM
                0 responses
                61 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 10:19 PM
                0 responses
                60 views
                0 likes
                Last Post seqadmin  
                Working...
                X