Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Qualitative comparison of different RNA-Seq experiments

    Hi,

    I'm trying to do a qualitative comparison of publicly available RNA-Seq datasets from different bacteria. Given that the growth conditions are comparable is it possible, e.g. based on RPKM, to make a qualitative comparison of expression levels of conserved genes?

    Cheers,
    Tim

  • #2
    What do you think the problems might be?

    Comment


    • #3
      I'm not sure how to compare them. Usually I'd go with something like edgeR or similar for DE analysis. But in this case I'm not sure if that's suitable because it's different bacteria, different experimental setups etc. Therefore, I guess it's more a "gene A of orgA relatively highly expressed, in organismB it's not" ... but what normalization/approach to us? Does that make my problem more clear?

      Tim

      Comment


      • #4
        You are, indeed, going to have the problems you mentioned.
        The data is from various sources, prepared differently, using different methods, on different machines analyzing expression for sequences that , though conserved, are different sequences.

        Comparing them with deep certainty about the results is going to be challenging.

        Back in the day, I remember some guys using all the CEL files for Affymetrix expression chips and comparing expression between the various expressions from various sources in NIH's GEO (gene expression omnibus) database. A lot easier with a single chip type and single manufacturer's recommended procedure.

        This EdegeR or DESEQ type approach are good for getting a measure of expression differences, that's not your problem.

        The noise and biases of the various methods used to create the reads is your problem.

        Perhaps limiting data to the best/most used protocols and sequencing machine might a good approach.

        Comment


        • #5
          Maybe you should compare pairs

          You're going to stumble into many normalization issues here. I'm having trouble comparing experiments from the same lab in two different experimental settings, let alone different labs and bacteria.
          I would try another approach using simple statistics. I assume you have enough replicates so you can pair the experiments, taking the two conditions in each bacteria, growth phase, lab etc. and in each pair determine, for each gene, if it was up or down regulated (you'll have to normalize them, using DESeq normalization or other method). Then, if you have about 15 (or even less, depends on the number of genes you're testing) such pairs you can run the sign test on each gene. You should get p-values with sufficient power even after FDR correction.

          Comment


          • #6
            If it were me, I would not even try to merge the two datasets. Analyze each independently just as you would any other cross-species comparison. Pull out your two independent gene lists and look for the overlap between them. Then you can make some comparisons of the commonly detected differential expression in each. First thing I'd want to see is do the two gene lists even rank order correlate at all. You can then do some stats if you wished to compare the independently derived gene lists.
            Michael Black, Ph.D.
            ScitoVation LLC. RTP, N.C.

            Comment


            • #7
              Thanks everybody.
              The "simple stats" solution is what I was thinking of, too. I will have a look into sign test and rank order conservation. Thanks for the help so far.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Advancing Precision Medicine for Rare Diseases in Children
                by seqadmin




                Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                12-16-2024, 07:57 AM
              • seqadmin
                Recent Advances in Sequencing Technologies
                by seqadmin



                Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                Long-Read Sequencing
                Long-read sequencing has seen remarkable advancements,...
                12-02-2024, 01:49 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 12-17-2024, 10:28 AM
              0 responses
              22 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-13-2024, 08:24 AM
              0 responses
              42 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-12-2024, 07:41 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-11-2024, 07:45 AM
              0 responses
              42 views
              0 likes
              Last Post seqadmin  
              Working...
              X