![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Non-commercial SNP calling soft | *#1* | Bioinformatics | 4 | 07-08-2012 03:16 AM |
Clipping adapters | nsl | RNA Sequencing | 6 | 05-27-2011 03:08 PM |
Basic Seq Soft | ednot | General | 1 | 05-02-2011 06:42 PM |
controlling clipping behavior in bwa 'aln' and 'bwtsw' aligners | jnfass | Bioinformatics | 1 | 01-10-2010 02:49 PM |
what's clipping point? | jordi | 454 Pyrosequencing | 1 | 05-26-2009 02:47 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Europe Join Date: Apr 2010
Posts: 46
|
![]()
Hi,
When I run BWA without specifying a "q" value (which defaults to 0 as I understand it from the manual), I would not expect any trimming to occur. However, the resulting alignments have lots of soft-clippings at the edges. Aren't these trimmings? Thanks! |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: 45°30'25.22"N / 9°15'53.00"E Join Date: Apr 2009
Posts: 258
|
![]()
Which value have you specified? Why would you expect trimming not to occur?
Also, if you specify a q value, you should see information about trimming while bwa is running. d |
![]() |
![]() |
![]() |
#3 |
Member
Location: Europe Join Date: Apr 2010
Posts: 46
|
![]()
Hi, I didn't specify a "q" value, and the BWA manual implies that this means a default value of "0" is used.
The official description of "q" is a bit cryptic for a non-mathematician, but I thought that the default value of "0" would lead to no trimming? If this isn't the case, how can I prevent trimming? Thanks. |
![]() |
![]() |
![]() |
#4 | |
Senior Member
Location: 45°30'25.22"N / 9°15'53.00"E Join Date: Apr 2009
Posts: 258
|
![]() Quote:
Can you post a soft-clipped entry? Could it be some effect of SW alignment instead? d |
|
![]() |
![]() |
![]() |
#5 |
Member
Location: Europe Join Date: Apr 2010
Posts: 46
|
![]()
Hi,
Below is an example (both ends shown). I'm not sure what you mean by this being an artefact of SW alignment? I would have thought that trimming would either (a) be allowed or (b) not allowed. Thanks for your help! SRR018256.13099683 83 RN28S1|NR_003287.2 4925 29 51M 4550 -426 CCCCCCGTCACGCACCGCACGTTCGTGGGGAACCTGGCGCTAAACCATTCG #%#&&$($($&'%$,#&+%+'+&)((0,**.0++,+1)65.7C+II<@II. XT:A:U NM:i:2 SM:i:29 AM:i:29X0:i:1 X1:i:0 XM:i:2 XO:i:0 XG:i:0 MD:Z:0T1G48 SRR018256.13099683 163 RN28S1|NR_003287.2 4550 29 45M6S 4925 426 GTTAGTTTTACCCTACTGATGATGTGTTGTTGCCATAGTAATCCTNTNTAG I+I;-77I=,10>9/55I)*;%1+%*++%0+))&$%#'$&"'%))!#!$"% XT:A:M NM:i:1 SM:i:29 AM:i:29XM:i:1 XO:i:0 XG:i:0 MD:Z:36G8 |
![]() |
![]() |
![]() |
#6 | |
Senior Member
Location: 45°30'25.22"N / 9°15'53.00"E Join Date: Apr 2009
Posts: 258
|
![]() Quote:
Trimming is quite different, as it is performed at alignment time evaluating the read qualities. d |
|
![]() |
![]() |
![]() |
#7 |
Member
Location: NY Join Date: Aug 2008
Posts: 19
|
![]()
How do I know that –q INT option I set has taken effect? I have 51-nt pair-end reads too. It seems to me that nothing has been trimmed, as the read lengths indicated by CIGAR string column are all 51. is there any other column to check on whether quality trimming has occured?
A related question is the same as Bio.X2Y’s: the resulting alignments have lots of soft-clippings at the edges. Aren't these trimmings? Based on the description of –q INT option in BWA documentation, I would expect soft-clippings (due to trimming) only occur at the right end of sequences, instead of both ends. But I see soft-clippings occur at both ends frequently. Thanks a lot for any inputs! It would also be great if anyone could clarify BWA quality trimming issue a little bit as quite a few people here have similar questions. |
![]() |
![]() |
![]() |
#8 |
Senior Member
Location: Boston Join Date: Feb 2008
Posts: 693
|
![]()
bwa may do smith-waterman alignment, which produces soft clipping.
|
![]() |
![]() |
![]() |
#9 |
Member
Location: NY Join Date: Aug 2008
Posts: 19
|
![]()
What about the quality trimming? Does it actually happen, or it produces soft-clippings too? Thanks!
|
![]() |
![]() |
![]() |
#10 | |
Member
Location: boston Join Date: Jun 2011
Posts: 27
|
![]() Quote:
This may be a late answer. To my understanding, you guys are confused about "-q opiton(quality trimming)" and " soft-clipped". -q option is to trim those crappy ends of reads with very low Phred score, ie. bad quality, which can be due to sequencing errors. Such trimming serves as pre-processing before running BWA. While "soft-clipped" refers to the reads whose certain part may find nowhere to align to, say, for those split-read covering breakpoints. BWA still preserves those "unmapped" part for downstream analysis because it could be caused by say translocation, deletion blablabla. So basically you are talking about two different things. |
|
![]() |
![]() |
![]() |
#11 | |
Member
Location: ireland Join Date: Apr 2014
Posts: 18
|
![]() Quote:
The problem here is: why does bwa clipped/trimmed reads when -q option is not specified? is soft-clipping its part of bwa's nature? I have also noticed that lots alignment tools do the soft-clipping, even it is not an option stated in the manual or parameters. On one side, soft-clipping would generate more alignments, or maybe 'higher' alignment rate, but what about if we want the alignment results with exactly 1 mismatch? I think the soft-clipping is a bit collision to the mismatch option. For "4S26M", would the '4' also count as mismatch allowed = 4? |
|
![]() |
![]() |
![]() |
#12 |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
![]()
I don't know if this is that case for that specific read, since you didn't post the whole line, but the sam specification requires clipping if a read goes of the end of a reference sequence.
|
![]() |
![]() |
![]() |
Tags |
bwa, soft-clipping |
Thread Tools | |
|
|