Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Wish to do differential analysis, complete newbie.

    When it comes to finding/using established tools I am completely untrained.

    We are starting with illumina paired-end reads (we don't necessarily care if the analysis is done with tools supporting paired-ends per se, the sequencing just happened to be done this way). The paired ends have already been separated by the sequencer pipeline.

    In an earlier post I asked how one would trim for quality, and I found a few tools to do that.

    Now I'm supposed to perform an alignment and use the alignment information to get expression profiles. (Eventually we would like to use these expression profiles to compare different samples.)

    It seems like cufflinks should be able to do this sort of thing (I'm assuming it would be working in partnership with bowtie or tophat for this.). Are there any other pipelines and how would I best find the sequence of commands that would prepare the data and produce the mapping files that I need to run these analyses?

  • #2
    Hi Kotoro,

    The online manuals of Tophat and Cufflinks should be sufficient for you to carry out the analysis pipeline.

    Douglas

    Comment


    • #3
      what produces the gtf file that cufflinks is looking for?

      Comment


      • #4
        Hi Kotoro,

        For the gtf file, one potential solution is to use the Table Browser from the UCSC genomes website

        If you're working with human or mouse samples, you can choose the mRNA track in the "groups" selection, then mouse mRNAs in the "track" selection and refGene as the "table"

        Then, in the output format field, you'd be able to choose GTF which should get you going.

        For expression analysis, the best workflow would be to use TopHat followed by Cufflinks, you cannot use Bowtie -> Cufflinks, as Bowtie would not match reads to splice junctions.

        Vineeth

        Comment


        • #5
          Our current goal is to compare quality of frozen vs paraffin embedded tissue samples as to their usefulness for sequencing and expression analysis with the hypothesis being that they are sufficiently similar so as not to significantly bias the resulting sequence data in a damaging or misleading manner.

          As I am still a student early in my training I am unfamiliar with the tools and methods required to test this hypothesis. Is the tophat/cufflinks combination even appropriate for this or am I moving in the wrong direction?

          --edit:

          Forgot to mention it, but we are working with human normal/cancer tissues.
          Last edited by Kotoro; 07-07-2011, 10:13 AM.

          Comment


          • #6
            Yes, the tophat/cufflinks will serve you fine for this purpose.

            Though if you want to be really sure, I am guessing you do, otherwise you(your lab) would not have gone in for NGS just to confirm this, I would use the more conservative DESeq.

            What you can then do is ... map to the genome with bowtie, and have HTSeq do the counting for you, you can then give these to DESeq for differential expression analysis.

            You will need some basic knowledge of python and R to do this, so if you are not familiar with these two, then it is better to stick to tophat/cufflinks

            Comment


            • #7
              python and R are the languages i've been meaning to learn but haven't had the chance to learn yet.

              (I know c/c++, java, perl and batch languages (like windows batch files and Bash scripting), so its not outside my capability to understand new languages. I am just not intimately familiar with python & R yet.)

              Comment


              • #8
                The manuals for the tuxedo suite are fairly specific to use and they do tell you the output format, but they're not particularly informative to how to interpret the output.

                What exactly does the output tell me other than which differences exist (as in cuffcompare and cuffdiff. I'm not sure what to do with these results to get a meaningful comparison and declare whether the samples are similar enough or too different.) How can I relate the output to the overall hypothesis?

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  04-22-2024, 07:01 AM
                • seqadmin
                  Current Approaches to Protein Sequencing
                  by seqadmin


                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                  04-04-2024, 04:25 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Yesterday, 11:49 AM
                0 responses
                15 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-24-2024, 08:47 AM
                0 responses
                16 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-11-2024, 12:08 PM
                0 responses
                61 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 10:19 PM
                0 responses
                60 views
                0 likes
                Last Post seqadmin  
                Working...
                X