Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Overlap/comparison microarray and RNAseq

    Hi all,

    Not sure if this is the proper forum, but here goes. We did our first RNAseq analysis of a fungal genome. We had 2 conditions (stress and non-stressed) and 2 time points with three biological replicates of each condition and time point. One of the time points was also represented on a microarray experiment (Aglilent arrays) done with the same genome under the same conditions.

    I used edgeR to determine the DE genes. There were ~1200 genes DE on the microarray and ~2000 DE for the same time point in the RNAseq experiment. The overlap of DE genes was 590/1200. However the direction of the change was often different between them.

    My questions:
    1) If anyone done a comparison between an Agilient microarray and RNAseq, what sort of concordance was observed?

    2) The edgeR was done after RPKM normalization. How much does that affect the number of DE genes? I have the BAM files from the Bowtie alignment so I can redo the edgeR starting from non-normalized counts.

    3) Is it even worth spending time trying to determine the level of concordance because these technologies are too different to compare?

    I did see a recent paper in BMC Genomics comparing Affy exon arrays with RNAseq and they found a fairly high level of concordance.

    Thanks in advance for any thoughts, comments or advice.

    Regards,
    Maureen

  • #2
    1. Do NOT feed edgeR or DESeq with normalized data. Both tools need raw counts. Otherwise, you just get nonsense. (I guess we need to put this in flashing red letters somewhere, given the number of people who do it wrong.) So, please redo everything with raw counts. Use, e.g., htseq-count to get the raw counts.

    2. When checking for concordance, it is not very helpful to look at number of hits, because genes which are borderline significant will let the concordance look worse than it is. The better idea is to make a scatter plot of log fold change according to microarrays vs log fold change according to RNA-Seq. Plot all genes and use colour to mark significant ones.

    Comment


    • #3
      Originally posted by MDonlin View Post
      3) Is it even worth spending time trying to determine the level of concordance because these technologies are too different to compare?
      Well, technologies might be very different, but you are interested in the biological answer and that should be independent of the technology. If the two techniques give you very different results it would suggest there is something wrong with at least one of them.

      I wouldn't be to much worried about the small overlap, because these sort of experiments (few replicates, multiple test correction) give you a lot of false negative. A study that used 72 replicated suggested that most genes are regulated. The problem is that the genes that you call differentially express should be concordant on the "direction" among the different technologies...

      Comment


      • #4
        Hi Stefanoberrl,
        I totally agree with you. I believe that's a very important step to compare RNA-Seq and Microarray. Is there any work done in this regard? or any comment from anyone who has done this or know this? Thanks.

        Comment


        • #5
          Originally posted by Simon Anders View Post
          1. Do NOT feed edgeR or DESeq with normalized data. Both tools need raw counts. Otherwise, you just get nonsense. (I guess we need to put this in flashing red letters somewhere, given the number of people who do it wrong.) So, please redo everything with raw counts. Use, e.g., htseq-count to get the raw counts.

          2. When checking for concordance, it is not very helpful to look at number of hits, because genes which are borderline significant will let the concordance look worse than it is. The better idea is to make a scatter plot of log fold change according to microarrays vs log fold change according to RNA-Seq. Plot all genes and use colour to mark significant ones.
          Hi Simon,

          Already got the raw data count from RNA-seq which I used htseq-count. But, how to make a scatter plot to compare both RNA-seq and microarray data? Thank you

          Comment


          • #6
            What scatter plot are you talking about? This is a two year old thread! You can't seriously expect to get a useful answer if you post one-sentence questions without any context.

            Comment


            • #7
              Originally posted by Simon Anders View Post
              What scatter plot are you talking about? This is a two year old thread! You can't seriously expect to get a useful answer if you post one-sentence questions without any context.
              Hi Simon,

              Sorry for the silly question. Alright let straight forward. I got 2 raw data from two different platform (microarray and rna-seq). As we all know that RNA-seq data perform better than microarray which gave more sensitivity and deep coverage. So, I want to create a scatter-plot to compare between RNA-seq and Microarray data. So the data should be log2-transformed in oder to get the data more easy to read? I would say that the y-axis (rna-seq) and x-axis (microarray) where the dot scattered around the straight line shows the agreement between 2 data. I have around 500 number of gene from each data. What would you suggest to create this scatterplot? Using R? Please help, thank you!

              Comment


              • #8
                I have the same question. I have my DEG after rnaseq and microarray analysis (DESeq & Affy). I want to compare the 2 datasets in R (scatterplot, Venndiagramm). Any tips or scripts available?

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Advancing Precision Medicine for Rare Diseases in Children
                  by seqadmin




                  Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                  12-16-2024, 07:57 AM
                • seqadmin
                  Recent Advances in Sequencing Technologies
                  by seqadmin



                  Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                  Long-Read Sequencing
                  Long-read sequencing has seen remarkable advancements,...
                  12-02-2024, 01:49 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 12-17-2024, 10:28 AM
                0 responses
                22 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-13-2024, 08:24 AM
                0 responses
                42 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-12-2024, 07:41 AM
                0 responses
                28 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-11-2024, 07:45 AM
                0 responses
                42 views
                0 likes
                Last Post seqadmin  
                Working...
                X