  • different replicate numbers

    Dear all,

    I would like to know if it is possible to do a differential expression analysis on samples that have different replicate numbers?

    I recently received my Illumina HiSeq data for bacterial samples. I had 3 replicates per sample, but when I ran the alignment with Bowtie2, one sample had a replicate with only a 49% alignment rate.

    Does anyone have a suggestion on how to proceed? I am thinking of just comparing 3 replicates of sample A against 2 replicates of sample B. Is this acceptable, or should I reduce it to 2 replicates per sample?

    Thank you for your answer.

    Jason

  • #2
    Originally posted by pandapiggie
    Dear all,

    I would like to know if it is possible to do a differential expression analysis on samples that have different replicate numbers?
    Yes it is possible to do this.


    I recently received my Illumina HiSeq data for bacterial samples. I had 3 replicates per sample, but when I ran the alignment with Bowtie2, one sample had a replicate with only a 49% alignment rate.
    I would not automatically discard this replicate based on the mapping rate; I would investigate the cause of the low mapping first. Possible causes include adapter sequences left on the reads (which prevent mapping), sample contamination, and ribosomal RNA contamination. Did you do any preprocessing of the reads? You might find FastQC and FastQ Screen useful for investigating this. If you captured the unmapped reads, run a BLAST search on some of them to see what hits you get.
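
    As a rough first check before running FastQC, the adapter hypothesis can be sketched in a few lines of Python. This is only an illustration: the adapter string below is the commonly seen Illumina adapter prefix and is an assumption about the library prep, and the function names are hypothetical.

```python
import gzip
import io

# Assumption: common Illumina adapter prefix; replace with the adapter
# actually used in your library preparation.
ADAPTER = "AGATCGGAAGAGC"

def open_fastq(path):
    """Open a plain or gzip-compressed FASTQ file as text."""
    return gzip.open(path, "rt") if path.endswith(".gz") else open(path)

def adapter_fraction(handle, adapter=ADAPTER, max_reads=100_000):
    """Fraction of the first max_reads reads that contain the adapter."""
    total = hits = 0
    for i, line in enumerate(handle):
        if i % 4 != 1:  # the sequence is line 2 of each 4-line FASTQ record
            continue
        total += 1
        if adapter in line:
            hits += 1
        if total >= max_reads:
            break
    return hits / total if total else 0.0

# Tiny in-memory demo: one read with the adapter, one without.
demo = io.StringIO(
    "@r1\nACGTACGTAGATCGGAAGAGCACAC\n+\n" + "I" * 25 + "\n"
    "@r2\nACGTACGTACGTACGTACGTACGTA\n+\n" + "I" * 25 + "\n"
)
print(adapter_fraction(demo))
```

    For a real file, something like `adapter_fraction(open_fastq("replicate3.fastq.gz"))` (hypothetical file name) would give a quick comparison between the low-mapping replicate and the others.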

    Does anyone have a suggestion on how to proceed? I am thinking of just comparing 3 replicates of sample A against 2 replicates of sample B. Is this acceptable, or should I reduce it to 2 replicates per sample?
    Do not adjust to 2 replicates per sample. Keep as much data as possible.
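
    To illustrate why unequal group sizes are not a problem in principle, here is a minimal sketch of Welch's t statistic for a single gene with 3 versus 2 replicates. This is not the model that DESeq2 or any other differential expression tool uses; it only shows that the arithmetic does not require balanced groups. The counts are made-up numbers and `welch_t` is a hypothetical helper.

```python
import math
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t statistic and degrees of freedom; groups may differ in size (each >= 2)."""
    va = variance(a) / len(a)  # squared standard error of group a
    vb = variance(b) / len(b)  # squared standard error of group b
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation of the degrees of freedom
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

# Hypothetical normalized counts for one gene (illustration only)
sample_a = [120, 135, 128]  # 3 replicates
sample_b = [60, 72]         # 2 replicates

log_a = [math.log2(x + 1) for x in sample_a]
log_b = [math.log2(x + 1) for x in sample_b]

t, df = welch_t(log_a, log_b)
print(f"t = {t:.2f}, df = {df:.2f}")
```

    Dropping a good replicate from sample A would only shrink the information available for estimating its variance.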


    • #3
      @pandapiggie you may want to use bowtie v.1 (or turn off splice detection in other aligners) to see if that improves alignment (since this is bacterial data).

      There is always the possibility that this one replicate is bad. Have you scanned or trimmed these sequences for the presence of adapters?


      • #4
        Originally posted by GenoMax
        @pandapiggie you may want to use bowtie v.1 (or turn off splice detection in other aligners) to see if that improves alignment (since this is bacterial data).

        There is always the possibility that this one replicate is bad. Have you scanned or trimmed these sequences for the presence of adapters?

        Dear GenoMax,

        Thanks for the suggestion. If I use bowtie v1 for that sample, do I need to redo the alignment for all the other samples as well?

        Sorry for the beginner question.


        • #5
          Sorry, another question has come up.

          Which pipeline would be more accurate in this case? I was thinking of using Cufflinks, but from what I understand, DESeq2 is a good fit for my case because there is an uneven number of replicates between samples.

          Can anyone help me?

          Thanks
