Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • francisb
    Junior Member
    • Aug 2014
    • 2

    Classify sample based on RNAseq data and micro-array data

    I have a sample on which I will perform RNAseq. I would like to compare its transcriptome to the transcriptomes of several cell types and find the cell type it resembles most. The cell type data is, however, micro-array data.

    Does anyone have an idea how I can correlate my RNAseq data with the micro-array data? How can I include biological replicates of my RNAseq experiment, and how many replicates would I need? (I do not need any statistical significance, I just need to pick the best match)

    Thanks a lot in advance!
  • khughitt
    Junior Member
    • Dec 2012
    • 7

    #2
    Hi Francis,

    Assuming there aren't any publicly available microarray datasets for your cell type of interest, you may be able to make your RNA-Seq data look more like microarray data by log-transforming it and applying voom.

    Then, an easy place to start to look for relationships between the cell types would be to perform look at a PCA plot of all of the samples. In this case, the more closely related samples should group together in the plot.

    HTH,
    Keith

    Comment

    • mbblack
      Senior Member
      • Aug 2009
      • 245

      #3
      Since you need to compare to microarray data, why are you not performing microarray experiments with your data? If you read through the literature you will find that actual relative expression or estimated fold change does not correlate well at all between the two technologies. Nor, at least much of the time, do the two technologies even detect the same sets of significantly differentially expressed genes (the overlap in genes can often be less than 50%).

      If the intent is to try to identify the cell type by matching expression patterns, then I'd say you should at least be comparing directly similar data. Go ahead and do RNAseq with your samples if you want that data for other types of analysis, but don't try and compare RNAseq from an unknown cell type to microarray data of known cell types. I think any conclusions based on that sort of comparison will be highly criticized and far too ambiguous. Expression as measured by the two technologies is not really comparable across different cell types (or tissues or species, etc), not for pattern matching as you are proposing.

      As for how many replicates. how many were used in the microarray experiments? What was the variance in expression observed across those (i.e. how variable are these cell types when it comes to gene expression)? Are these different cell strains, cell lines, or actual wild type cell samples?
      Last edited by mbblack; 08-28-2014, 05:05 AM.
      Michael Black, Ph.D.
      ScitoVation LLC. RTP, N.C.

      Comment

      • francisb
        Junior Member
        • Aug 2014
        • 2

        #4
        Thanks for your replies!

        @mbblack Good question. The plan was to do RNAseq because we want to do other analyses as well, and we prefer RNAseq for those. If micro-array is needed to address this question, then I will consider doing that as well.

        @khughhitt Thanks for this suggestion! How should I include variability across replicates in this type of analysis?

        Comment

        • khughitt
          Junior Member
          • Dec 2012
          • 7

          #5
          When using voom, create a factor for each of your cell types, and possibly another one if you have differing batches across the samples, e.g.:

          R code
          Code:
          celltype = c("a", "a", "a", "b", "b", "b")
          model = model.matrix(~0+celltype) 
          
          # Or, if you have multiple batches, 
          # model = model.matrix(~0+celltype+batch)
          Next, you will want to perform some normalization (e.g. size factor or quantile normalization) on the counts, and call voom passing in the above model:

          Code:
          voom_result = voom(exprs(normed_counts), model, plot=TRUE)
          
          # Your transformed counts
          head(voom_result$E)
          Just to be clear, I haven't actually made this kind of comparison yet, so please take everything with a grain of salt and spend some time reading up on Voom and the differences between microarray/RNA-Seq. I have used the above approach to transform RNA-Seq data for use with Microarray methods with good success, so it seems like a reasonable place to start for your goals.
          Keith

          Good luck!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Pathogen Surveillance with Advanced Genomic Tools
            by seqadmin




            The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
            03-24-2025, 11:48 AM
          • seqadmin
            New Genomics Tools and Methods Shared at AGBT 2025
            by seqadmin


            This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

            The Headliner
            The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
            03-03-2025, 01:39 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 03-20-2025, 05:03 AM
          0 responses
          49 views
          0 reactions
          Last Post seqadmin  
          Started by seqadmin, 03-19-2025, 07:27 AM
          0 responses
          57 views
          0 reactions
          Last Post seqadmin  
          Started by seqadmin, 03-18-2025, 12:50 PM
          0 responses
          50 views
          0 reactions
          Last Post seqadmin  
          Started by seqadmin, 03-03-2025, 01:15 PM
          0 responses
          201 views
          0 reactions
          Last Post seqadmin  
          Working...