![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
merge BAM files from 2 SOLiD PE runs and remove duplicates | epigen | Bioinformatics | 12 | 03-23-2012 02:55 AM |
Problem to use Bedtools after filtering uniquely mapped reads with samtools | eilosei | Bioinformatics | 2 | 12-21-2011 04:51 PM |
Can pre-filtering reads affect your analysis results? | PFS | Bioinformatics | 0 | 03-24-2011 11:08 AM |
Picard - MakeDuplicates (remove pcr duplicates) | dmb | Bioinformatics | 2 | 03-16-2011 08:56 AM |
how critical is the filtering of potential PCR duplicates? | julien | General | 3 | 03-26-2010 10:24 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Scandinavia Join Date: Dec 2009
Posts: 20
|
![]()
Hello,
Does is a must to preform pre-filtering for color space reads before mapping? Could it has a big differences in downstream analysis when I mapped the SOLiD color space reads without pre-filtering but removed the PCR duplicates with Picard tool from mapped reads? Could anyone kindly please share with me your opinion? Thank you. Have a nice day. |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: Stockholm, Sweden Join Date: Feb 2008
Posts: 319
|
![]()
I think you first have to clarify more precisely what you mean by pre-filtering, before we can answer.
|
![]() |
![]() |
![]() |
#3 |
Member
Location: Scandinavia Join Date: Dec 2009
Posts: 20
|
![]()
Sorry for the unclear question.
The pre-filtering I refer here is view the reads with tools for example FastQC or Fastx and then trim out the so-called bad bases before mapping the reads. The problem is the FastQC and Fastx are develop to handle reads generated from Illumina and 454 platform. The color space (csfasta) reads can't be imported directly to these tools. Some Perl conversation scripts also having problem when convert the csfasta to fastq. Thanks. |
![]() |
![]() |
![]() |
#4 |
Senior Member
Location: Stockholm, Sweden Join Date: Feb 2008
Posts: 319
|
![]()
OK. So, pre-filtering (I would call it quality filtering) is distinct from duplicate removal and you can think of them as independent filtering steps.
For SOLiD specific quality filtering, and looking at the data in a somewhat similar way to FastQC, I have used this toolkit: http://hts.rutgers.edu/filter/ Then you don't need to convert to FASTQ. For some types of analysis, you may not need to do quality filtering (e g ChIP-seq, RNA-seq). The bad reads will (in general) simply fail to map. For de novo assembly, or resequencing where variant calling is important, you should do quality filtering. |
![]() |
![]() |
![]() |
#5 | |
Member
Location: Scandinavia Join Date: Dec 2009
Posts: 20
|
![]() Quote:
I'm doing RNA-seq analysis with SOLiD platform. I read people mentioned carry out quality filtering (someone also called it as pre-filtering) before mapping is recommended. However, not much about how to deal with SOLiD csfasta but Illumina and 454 reads. Thanks for the information. It is useful! |
|
![]() |
![]() |
![]() |
Tags |
color space, mapping, pcr duplicates, pre-filtering, solid |
Thread Tools | |
|
|