![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Galaxy: filtering for unique/reliable alignments | onson001 | Bioinformatics | 1 | 10-20-2011 05:47 PM |
BWA:high amount of unique alignments despite high mismatch tolerance | moritzhess | Bioinformatics | 2 | 09-05-2011 12:31 PM |
Help with Bowtie, only unique alignments | khb | General | 1 | 12-16-2010 01:35 AM |
More Unique ELAND Alignments Than Reads? | seq7 | Bioinformatics | 2 | 10-06-2010 08:16 AM |
Unique VS Non-Unique read analysis | samt | Bioinformatics | 2 | 09-29-2009 10:44 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Institute, WV Join Date: May 2010
Posts: 24
|
![]()
I have few questions regarding the best practices that are adopted, in dealing with multiple alignments from a single read and presence of identical reads in the data (from Biology stand point for the later meaningful analysis of data) :
I am curious, how important it is to deal with identical reads. Having many identical reads in data means something wrong with the experiment? What could be considered as max. cutoff value for the number of identical reads in the data, so as to not consider those reads? In the other case of a single read aligning at multiple places in a genome, what should be the cutoff value for number of multiple alignments, so as to not consider those reads? |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: WashU Join Date: Aug 2010
Posts: 117
|
![]()
Your first question regarding duplicate/identical reads is an interesting one that has been discussed at great length in this forum. Among the issues discussed are how to identify duplicate reads, where they come from, how many to expect in particular types of libraries, if they should removed, and ways to remove them.
One discussion of read duplicates and links to other posts discussing them further can be found here: Removing duplicates... What type of libraries are you sequencing (whole genome, exome, ChIP-Seq, RNA-seq, miRNA, mitochondrial genomes, pooled PCR amplicons, etc.)?? The type of library and the corresponding analysis goals (measure expression, identify mutations, peak finding, etc.) can a have major impact on the way duplicates are handled. If you provide specific details of you libraries, someone with similar data may respond... |
![]() |
![]() |
![]() |
#3 |
Member
Location: Institute, WV Join Date: May 2010
Posts: 24
|
![]()
malachig,
Thanks for the reply. I'll go through those discussions. In my case, I am studying 3 types of experiments. I suppose the criteria to handle duplicate reads and handling of reads aligning to multiple places would be different for these three cases. 1. ChIP sequencing to analyze and quantify chromatin modifications 2. Whole transcriptome sequencing to quantify differential gene expression, between different conditions 3. mi-RNA sequencing . To quantify the differential expression of the micro-RNAs. Last edited by sridharacharya; 09-20-2010 at 07:11 AM. |
![]() |
![]() |
![]() |
Tags |
unique alignments, unique reads |
Thread Tools | |
|
|