Seqanswers Leaderboard Ad

**sdriscoll** · 06-18-2013, 12:29 PM

This comes down to how you built the index for BWA. What FASTA file(s) did you use? If you didn't build the index from FASTA sequences that are full chromosome references then you won't get alignments in terms of chromosomes.

Also that last line of idxstats is probably just the number of unaligned reads. Typically unmapped reads have an '*' in the third column.

**swbarnes2** · 06-18-2013, 08:39 PM

Did you read SAM format description?

Yes, the third column of a sam file has the chromosome name.

You've done something very wrong, though.

MD:Z:44C34G22T0G0G0G0C0G0C0C0G0C0C0G0G0G0G0C0G0C0T0G0G0C0C0G0C0T0T0C0G0C0G0C0G0C0C0G0G3T14T2A5G0T4C20G0T1A26A5

Means that you used the wrong fastq file in the sampe step.

**sdriscoll** · 06-19-2013, 07:23 AM

Also I recommend mem over the aln/sampe pipeline. It's simpler and it works better.

**prs321** · 06-19-2013, 08:17 AM

Originally posted by sdriscoll View Post

This comes down to how you built the index for BWA. What FASTA file(s) did you use? If you didn't build the index from FASTA sequences that are full chromosome references then you won't get alignments in terms of chromosomes.

Also that last line of idxstats is probably just the number of unaligned reads. Typically unmapped reads have an '*' in the third column.

I used db11.fasta

I did build the index.

And the last part of what you said makes no sense because the first row describes the name, sequences, # of mapped reads, and # of unmapped reads. How does the second row (* 0 0 32694) describe the # of unmapped reads when the first row already lists the # of unmapped reads?

**prs321** · 06-19-2013, 08:18 AM

Originally posted by swbarnes2 View Post

Did you read SAM format description?

Yes, the third column of a sam file has the chromosome name.

You've done something very wrong, though.

Means that you used the wrong fastq file in the sampe step.

Could this have anything to do with the fact that Serratia marcescens is a bacteria with only 1 chromosome?

**swbarnes2** · 06-19-2013, 08:43 AM

A read can be unmapped, and associated with a chromosome, if it hangs off the edge. You have 2900 such reads. The rest of the unmapped reads didn't map at all, that's the 155004.

I used bwa and samtools on single chromosome bacterial references all the time. You messed up your sampe command, that's why you have that nonsense MD part. That's the only mistake you appear to have made, everything else looks normal, so I'm not sure what you think the problem is.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 55 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 51 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 45 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Why don't my SAM files list the chromosomes?

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News