![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
sam to bam conversion error, no @SQ lines in the header, missing header? | efoss | Bioinformatics | 17 | 12-03-2015 04:28 AM |
BWA sam and Samtools sam->bam conversion problem | maasha | Bioinformatics | 6 | 06-05-2013 07:39 AM |
samtools: parse error in SAM to BAM conversion | chrisW | Bioinformatics | 12 | 01-14-2013 06:16 PM |
Bowtie2 Sam output | Derek-C | Bioinformatics | 3 | 11-13-2012 11:10 AM |
Tophat v1.1.4 potential error with sam to bam conversion? | jb2 | Bioinformatics | 6 | 11-17-2011 12:52 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: C:/Program files/Google/Chrome Join Date: Jul 2012
Posts: 34
|
![]()
Hi all,
I am trying convert a sam file (PE bowtie2 default fr output) to bam file using samtools view command. But it's throwing the following error. Code:
$ samtools view -bS -o aln3.bam aln3.sam [samopen] SAM header is present: 17 sequences [sam_read1] reference '451450' is recognized as '*' Parse error at line 2026299: sequence and quality are inconsistent Code:
sed -n 2026299p aln3.sam 133 chr12 451450 0 * = 451450 0 * * YT:Z:UP YF:Z:LN When I checked for such lines Code:
$ sed -n '/^[^H]/=' aln3.sam | wc -l 5422 How do I proceed now?? Do I delete these 5422 lines from sam file?? This thread here had the same problem but it was with bwa aligner. Also my bowtie output after aligning says (many such lines are produced before showing final alignment stats): Code:
Warning: skipping mate #2 of read '' because length (0) <= # seed mismatches (0) Warning: skipping mate #2 of read '' because it was < 2 characters long Code:
1018542 reads; of these: 1018542 (100.00%) were paired; of these: 1018249 (99.97%) aligned concordantly 0 times 188 (0.02%) aligned concordantly exactly 1 time 105 (0.01%) aligned concordantly >1 times Thank you. Between I am new to analysis and I hope you people dont mind if my questions are too simple. Last edited by a_mt; 11-16-2012 at 10:41 PM. |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: Los Angeles Join Date: Nov 2013
Posts: 142
|
![]()
Hello. I have the same issue
Warning: skipping mate #1 of read 'SN860:381:H80WNADXX:1:1101:18171:2118 1:N:0:1' because length (0) <= # seed mismatches (0) Warning: skipping mate #1 of read 'SN860:381:H80WNADXX:1:1101:18171:2118 1:N:0:1' because it was < 2 characters long |
![]() |
![]() |
![]() |
#3 |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
![]()
It sounds like you have a 0bp read, possibly due to quality trimming. You can try removing such reads (and their mates) with a tool that allows you to specify a minimum read length.
|
![]() |
![]() |
![]() |
#4 |
Junior Member
Location: RUSSIA MOSCOW Join Date: Mar 2015
Posts: 3
|
![]()
I solved this problem after trimming fastaq files with fastx_trimmer -f and fastx_trimmer -t operations with no 0bp read in output files. So the solution for me was to remove artifacts with fastx_artifacts_filter and then synchronize pairs with fastqCombinePairedEnd.py
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|