![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
pre-filtering before running alignment: help needed | angerusso | Bioinformatics | 2 | 11-15-2011 07:49 PM |
pre-built indexes | biofreak | RNA Sequencing | 2 | 07-26-2011 03:52 PM |
Filtering SOLiD reads before mapping?? Conflicting advice | hlwright | SOLiD | 5 | 06-27-2011 06:10 AM |
Can pre-filtering reads affect your analysis results? | PFS | Bioinformatics | 0 | 03-24-2011 11:08 AM |
filtering pre-mRNA | chrisbala | Bioinformatics | 0 | 11-19-2010 09:15 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: USA Join Date: Apr 2011
Posts: 21
|
![]()
Hi,
We have some mated paired end sequences (50bp) to be mapped to the reference genome for SNP (and maybe indel) discovery. The data I have are csfasta and qual files. Could any one let me know if I need to do pre-filtering for the sequences before I use any software to map them? If I use bowtie, I should remove the orphan reads (and maybe try to map the orphan reads using a different parameter set). If I use BFAST, should I do the same? If I do need to filter the sequences based on the quality score, what's the cut-off threshold people normally use? Average of Q10? How to translate the quality score to the % error rate like the Phred score? Thanks! Nan |
![]() |
![]() |
![]() |
#2 |
Member
Location: China Join Date: Jan 2011
Posts: 11
|
![]()
csfasta_quality_filter.pl this script will be used about qulity contorl in solid.
you can find this script by google |
![]() |
![]() |
![]() |
#3 |
Junior Member
Location: bogota, colombia Join Date: Apr 2011
Posts: 5
|
![]()
Hi,
I am on index step, but apparently there is an error: In function "RGIndexLayoutCreate": Fatal Error[OutOfRange]. Message: Layout must begin with a one. Could someone help me with this problem??? |
![]() |
![]() |
![]() |
#4 | |
Member
Location: USA Join Date: Apr 2011
Posts: 21
|
![]() Quote:
you mean the indexing of the reference genome using bowtie? Below is the command I use (assuming bowtie-build and all_reference.fa are in the same folder) ./bowtie-build -C -f all_reference.fa reference_Color |
|
![]() |
![]() |
![]() |
#5 | |
Member
Location: USA Join Date: Apr 2011
Posts: 21
|
![]() Quote:
I downloaded the program. It has many parameters to use for trimming. Can anyone tell me normally what value they use for filtering (if any)? Thanks! 1. num_colors_to_hard_trim 2. min_median_qv 3. max_bad_colors_in_first_ten 4. max_number_bad_colors 5. num_consec_colors_to_trim 6. trim_terminal_bad_colors 7. min_read_length |
|
![]() |
![]() |
![]() |
#6 |
Member
Location: China Join Date: Jan 2011
Posts: 11
|
![]()
solid not supply the parameters, choose them by yourself, i advice you can statistics raw data before using last script.
|
![]() |
![]() |
![]() |
#7 | |
Member
Location: USA Join Date: Apr 2011
Posts: 21
|
![]() Quote:
As to the statistics, besides median/average quality score, what else should I look into? Thanks! |
|
![]() |
![]() |
![]() |
#8 |
Member
Location: China Join Date: Jan 2011
Posts: 11
|
![]()
there is a paper called :Analysis of quality raw data of second generation sequencers with Quality Assessment Software. you can find it by google. it is very simple ,it wil help you .
some parameters contains: min \max\Q20\mean\median you should consider |
![]() |
![]() |
![]() |
#9 |
Member
Location: Milano, Italy Join Date: Aug 2011
Posts: 93
|
![]()
I downloaded and executet csfasta_quality_filter.pl script on my SOLiD 5500 csfasta (and qual) data to trim a fixed number of colors. ( code below)
perl csfasta_quality_filter.pl -f F5.cs fasta -q F5.QV.qual -o 5_bases_trimmed_F5.csfasta --num_colors_to_hard_trim 5 I was wondering about the output file, from the manual (Filtered and trimmed reads are output in csfasta format to a user-specified filename). And where is my associated QV.qual file trimmed? Any ideas? I cannot supply TopHat or any other alignment software with a trimmed csfasta and a full lenght associated quality file.. |
![]() |
![]() |
![]() |
#10 | |
Member
Location: China Join Date: Jan 2011
Posts: 11
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|