Seqanswers Leaderboard Ad

**dpryan** · 03-27-2014, 01:50 AM

You might also consider just keeping the single-end and paired-end separate and then using that as a blocking factor in your experimental design. Having said that, if the library-type effect is minimal (as indicated by PCA, clustering, etc.), then you might as well go ahead and sum things...but I'd check that the results are similar enough first.

**RocheKermit** · 03-27-2014, 03:26 AM

Right.
I need to update my question, and actually the experiment has been conducted with only paired-end BUT the read quality filtering and trimming downstream steps conducted to the removal of some mate pairs (approx. 10% of pairs become single/orphan).
So the mixture of single and paired read doesn't come from the biochemistry, but rather from QC filtering.
We can state that it's healthy to merge back the paired and single in HTSeqc-count, isn't it? Qualitatively speaking et least.
About the counting, on one hand 1 mapped single read conduct to one count, on the other hand, a paired read will also be counted only once. Don't we overweight the single-end reads by simply summing single + paired end reads?

**dpryan** · 03-27-2014, 04:10 AM

A single read and a pair both describe the position of the fragment that was sequenced. In both cases, you can consider that it's actually the fragment that's getting counted, so then nothing is being given undue weight. The only real objection to that is that single-end reads don't give you the full bounds, so there are cases where they'll lead to slightly inflated counts (e.g., when the other end of the fragment actually overlaps a different feature, but you have no way of knowing this), but the effect of that is likely quite small (again, you could judge this by clustering things).

**RocheKermit** · 03-27-2014, 06:35 AM

Great answer!
I agree with that and I'll go on with the proposed strategy, which is the following:
1. QC of fastq files, trimming ..
2. Alignment of single read, alignment of paired reads
3. HTSeq-count of single reads, HTSeq-count of paired reads
4. Sum of counts for each gene of single reads + paired reads
5. Happy EdgeR or DESeq or whatever...

Thank you so much for your help!!

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 18 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 17 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 49 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Counts on single and paired ends reads merged bam file

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News