Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Using Trinity package

    Hi everyone

    I have some doubts about how to make trinity work for me.

    I am currently working on RNA seq data from the Illumina. The reads are single end.
    I am interested in 80 Ribosomal protein genes. I want to do transcriptome assembly for my 80 ribosomal protein genes. I came across this TRINITY package. It does de novo assembly.

    Now here are the command line options i used for running my single end data using trinity

    Trinity.pl --seqType fq --CPU 4 --single ERR030898.fq

    This will generate several output files, 2 of which are single.fa and Trinity.fa.

    Can someone explain me what are single .fa and Trinity.fa


    After getting trinity.fa step what analysis should i do so that i get transcriptome assembly for only my ribosomal protein genes(80).

  • #2
    Suggest reading the downstream analysis part of Trinity.

    Compare the best free open source Bio-Informatics Software at SourceForge. Free, secure and fast Bio-Informatics Software downloads from the largest Open Source applications and software directory


    As for 'Trinity.fa' -- those are your assembled transcripts. In other words this is the file you want. 'single.fa' is an intermediate file generated from your initial data. You can ignore it.

    Comment


    • #3
      Trinity

      Hey Rick
      Thanks for replying. I am doing the downstream analysis of Trinity now, but still what i want to ask is how would this downstream analysis give me result of my 80 ribosomal protein genes i am interested in because Trinity.fa file have transcripts for everything , how to get the results for my interest i.e ribosomal protein genes.

      Regards
      Varun

      Comment


      • #4
        Not to be too of a wisecrack here, but what do you already know about your 80 ribosomal proteins? Use that information to pull out your desired transcripts in Trinity.fasta.

        In other words if you know the sequences of your proteins then you can use blast, blat, etc. to match the transcripts. If you know only that they are ribosomal then blasting the transcripts against 'nr' will give you keywords. Perhaps using a tool like blast2go will give even better refinement. Or perhaps you know something else about those 80 proteins that will help in the selection -- CG%? Length? But the point here is that only you know what information you have that can differentiate between ribosomal proteins and all of the other transcripts your organism. I can not help much aside from giving general ideas.

        To use an analogy, say you are in a physical library full of books. You want all 80 books in the library which were written by Issac Asimov. How would you go about finding those books? You could look at the covers -- maybe his name is written on them. You could look inside the books in order to see if they are written in his style. Maybe you know that all of his books have a pink stripe on. Or perhaps you know nothing about his books and thus will have to wade through all of the books making an intelligent decision as to which books you want.

        Comment


        • #5
          Hey Rick
          Thanks for the reply.
          The only information which i have about my 80 ribosomal protein genes is genomic sequences for each gene combined in a single fasta format.

          Can i use blast/blat to match with transcrips?

          Regards
          Varun

          Comment


          • #6
            Originally posted by aevgup View Post
            Hey Rick
            Thanks for the reply.
            The only information which i have about my 80 ribosomal protein genes is genomic sequences for each gene combined in a single fasta format.

            Can i use blast/blat to match with transcrips?
            Yes you can. It will be interesting to see what you pull out.

            While Trinity is nice for de-novo work, when I have known reference then I use a program such as Tophat which can map reads versus the known. This is much more accurate (assuming a closely related and quality reference) than relying on de-novo methods.

            Comment


            • #7
              Hi
              I used Tophat since i had the refernce but somehow the junctions.bed file produced was giving some weird results and so i thought of trying trinity

              Comment


              • #8
                Trinity may certainly give you interesting results. For example, it may be entirely possible that you do not have sequence results for your organism. Perhaps you sequenced some strange fungus. It happens. By doing de-novo work you can find this out. Good luck!

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Advancing Precision Medicine for Rare Diseases in Children
                  by seqadmin




                  Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                  12-16-2024, 07:57 AM
                • seqadmin
                  Recent Advances in Sequencing Technologies
                  by seqadmin



                  Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                  Long-Read Sequencing
                  Long-read sequencing has seen remarkable advancements,...
                  12-02-2024, 01:49 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 12-17-2024, 10:28 AM
                0 responses
                33 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-13-2024, 08:24 AM
                0 responses
                49 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-12-2024, 07:41 AM
                0 responses
                34 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-11-2024, 07:45 AM
                0 responses
                46 views
                0 likes
                Last Post seqadmin  
                Working...
                X