Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • RNASeq pipeline

    Hi,

    I was wanting to know if it is alright to normalize my RNA seq data before mapping the reads to the genome? I ask this because I have the raw file, however the mapping was performed by a bioinformatician who did not provide us the mapped reads file but just the output from after running all the pipelines.

    Also, can I run a differential gene expression analysis on the raw reads - even before mapping? Would these results be accepted as valid fold changes if performed on the original files?

    Thank you

  • #2
    Originally posted by Sow View Post
    Hi,

    I was wanting to know if it is alright to normalize my RNA seq data before mapping the reads to the genome?
    No.

    I ask this because I have the raw file, however the mapping was performed by a bioinformatician who did not provide us the mapped reads file but just the output from after running all the pipelines.
    He should provide you the mapped reads upon request.

    Also, can I run a differential gene expression analysis on the raw reads - even before mapping?
    No, that's impossible. How can you know which read goes to which gene without mapping? That's the definition of mapping.

    Comment


    • #3
      Thanks so much Brain,
      I should have mentioned that this is an small RNA Seq data. So mapping tells us if the reads come from the genome or if it's a random degraded transcript. However it is right not to run differential expression analysis on the raw reads - isn't that what deseq requires us to do?
      Also- I was hoping I could obtain the fold change of the miRNAs ( obtained after running the pipeline) from the differential expression Analysis of the raw reads?

      Comment


      • #4
        Note that raw reads and raw read counts (what DESeq requires) have a different meaning. Raw reads suggests your data as it came from the sequencer. Raw read counts is the number of reads that map to the genes / features of interest without normalization for factors like library size. You cannot get raw read counts without mapping first.

        Comment


        • #5
          Originally posted by Brian Bushnell View Post
          No, that's impossible. How can you know which read goes to which gene without mapping? That's the definition of mapping.
          Not completely true. K-mer counting based tools like "salmon" or "kallisto" can perform differential gene expression analysis without the "classical" mapping. But you'll need to have well curated transcriptome sequences and identification of novel transcripts or isoforms is obviously also not possible.
          Nevertheless, the tools work with "raw reads" as the op requested

          Comment


          • #6
            Originally posted by WhatsOEver View Post
            Not completely true. K-mer counting based tools like "salmon" or "kallisto" can perform differential gene expression analysis without the "classical" mapping.
            Or Seal

            But I would argue that what those tools do is mapping, just without an alignment phase.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 11:49 AM
            0 responses
            15 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-24-2024, 08:47 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            62 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            60 views
            0 likes
            Last Post seqadmin  
            Working...
            X