Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Joining single end and paired end data in RNA seq

    Hi,

    I've been trying to join one single end and one paired end data set (both Illumina HiSeq) in an RNA seq experiment. The PE data has been trimmed to the same length as the SE and I used only R1. 12 libraries were sequenced by both SE and PE. The read depth of the PE data was substantially lower than the SE.

    MDS plotting of TMM normalized cpms of the replicates shows a batch effect between SE and PE. The pearson correlations of normalized cpms are also quite poor, ranging from 0.95 to 0.99.

    So, is it even possible to join SE and PE data for RNA seq? Or can the difference I'm seeing be due difference in sequence chemistry?

    The ComBat function in the sva package removes the batch effect and the replicates cluster perfectly afterwards. However, I've seen threads saying that batch removed data should only be used for clustering purposes and is not meant to be continued working with.

    The RNA data is meant for pattern recognition, not DEG analysis...

  • #2
    Have you tried analyzing the PE dataset as a PE dataset (don't trim them to the same length as SE and use both reads) to obtain expression data?

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    57 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    53 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    45 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X