Hi there,
I'm evaluating CRAM compression by comparing the same BAM sample before and after a CRAM compression/decompression round trip.
I found that in my data the TLEN (template length) field is systematically changed: the value is increased by 1.
For instance, an original read like:
Code:
SRR107049.155163702 163 1 69171 44 76M = 69404 308 CTATGGAGGAATCGTGTTTGGAAACCTTCTTATTGTCATAACAGTGGTATCTGACTCCCACCTTCACTCTCCCATG S?;<CC>D=?@=C<>D?A@F?@CCD3A?<C?>=>G?CA=:AE>E@HC><??CDAEABD??:9=<AAFCBCABE<>S RG:Z:SRR107049 NM:i:0 OQ:Z:GGG@AFDE@E@BC@CBFGFFAFEDD4>B9DBD??EDBC?;AD<ADEC>?C=DBDE@AD>A;:;=>EGEBE>BE<A9
After compression/decompression looks like:
Code:
2 163 1 69171 44 76M = 69404 309 CTATGGAGGAATCGTGTTTGGAAACCTTCTTATTGTCATAACAGTGGTATCTGACTCCCACCTTCACTCTCCCATG S?;<CC>D=?@=C<>D?A@F?@CCD3A?<C?>=>G?CA=:AE>E@HC><??CDAEABD??:9=<AAFCBCABE<>S RG:Z:SRR107049
Does anybody know anything about this issue? I have read some posts that discuss differences among aligners in how TLEN is calculated.
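To make the off-by-one concrete, here is a minimal Python sketch of two ways the template length could be computed for the read above. It rests on assumptions that are not in my data dump: the mate's CIGAR is taken to be 76M (I did not paste the mate record), and the function names `tlen_inclusive` and `tlen_exclusive` are just labels I made up for illustration. As I read the SAM specification, TLEN counts the bases from the leftmost mapped base to the rightmost mapped base, which corresponds to the inclusive variant.

```python
import re

def reference_span(cigar: str) -> int:
    """Count the reference bases consumed by a CIGAR string (ops M, D, N, =, X)."""
    ops = re.findall(r"(\d+)([MIDNSHP=X])", cigar)
    return sum(int(length) for length, op in ops if op in "MDN=X")

def tlen_inclusive(pos: int, pnext: int, mate_cigar: str) -> int:
    """Template length counting both end bases (leftmost to rightmost mapped base)."""
    mate_end = pnext + reference_span(mate_cigar) - 1  # last reference base of the mate
    return mate_end - pos + 1

def tlen_exclusive(pos: int, pnext: int, mate_cigar: str) -> int:
    """Hypothetical variant that omits the final +1, giving a value one smaller."""
    mate_end = pnext + reference_span(mate_cigar) - 1
    return mate_end - pos

# POS and PNEXT come from the records above; the mate CIGAR "76M" is an assumption.
print(tlen_inclusive(69171, 69404, "76M"))  # 309
print(tlen_exclusive(69171, 69404, "76M"))  # 308
```

Under that assumption, 309 (the value after the CRAM round trip) matches the inclusive count and 308 (the original value) matches the variant without the final +1. Both functions only cover the case where this read is the leftmost one of the pair, as it is here.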
Other changes, which are fine for me, are that the read names are replaced by numbers and that all tags except the read group tag are removed.
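For anyone who wants to repeat the comparison, this is roughly how the differences between a record before and after the round trip can be listed. It is only a sketch in plain Python: the helper names `parse_sam_line` and `diff_records` are hypothetical, it expects tab-separated SAM text lines, and in the example SEQ and QUAL are replaced by `*` and the OQ value is abbreviated to keep the lines short.

```python
# The 11 mandatory SAM columns, in order.
SAM_FIELDS = ["QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
              "RNEXT", "PNEXT", "TLEN", "SEQ", "QUAL"]

def parse_sam_line(line: str) -> dict:
    """Split one tab-separated SAM line into mandatory fields plus a tag dictionary."""
    cols = line.rstrip("\n").split("\t")
    record = dict(zip(SAM_FIELDS, cols[:11]))
    # Key each optional field by its two-letter tag name, e.g. "RG".
    record["TAGS"] = {col.split(":", 2)[0]: col for col in cols[11:]}
    return record

def diff_records(before: dict, after: dict):
    """Return (mandatory fields whose value changed, tags missing afterwards)."""
    changed = [name for name in SAM_FIELDS if before[name] != after[name]]
    dropped = sorted(set(before["TAGS"]) - set(after["TAGS"]))
    return changed, dropped

# Abbreviated versions of the two records shown above (SEQ/QUAL set to "*").
original = "SRR107049.155163702\t163\t1\t69171\t44\t76M\t=\t69404\t308\t*\t*\tRG:Z:SRR107049\tNM:i:0\tOQ:Z:GGG"
roundtrip = "2\t163\t1\t69171\t44\t76M\t=\t69404\t309\t*\t*\tRG:Z:SRR107049"

changed, dropped = diff_records(parse_sam_line(original), parse_sam_line(roundtrip))
print(changed, dropped)  # ['QNAME', 'TLEN'] ['NM', 'OQ']
```

For this pair it reports QNAME and TLEN as changed and NM and OQ as dropped, which is the pattern I described.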
Thanks!
Pablo.