Seqanswers Leaderboard Ad

**Tally** · 01-23-2012, 02:57 PM

When importing sam to bam, I use 'view' rather than 'import' :

@SQ header lines present in SAM file:
samtools view –bS alignment.sam > alignment.bam

@SQ header lines absent from SAM file:
samtools view –bt reference.fasta.fai alignment.sam > alignment.bam

Not sure if that will help at all but might be worth a try
Good luck

**HeidiJTP** · 01-24-2012, 05:55 AM

Thanks for the suggestion Tally, I just reran everything and used the command view instead of import, but still no luck

**HeidiJTP** · 01-24-2012, 07:02 AM

I think there are two problems here:

1) Display of NNNNs instead of sequence
This seems to be related in part to the actual terminal window. I thought it was weird that the NNNs don't appear until exactly after I start scrolling across the terminal. If I resize the terminal before running the 'tview' command, the position where the NNNs begin also changes. It may not be important, as according to mpileup output the NNNs are only occurring in between aligned regions.

2) Incomplete alignment
My fault!!! Helps when you use the correct reference sequence...

**sudeep** · 02-12-2012, 03:58 AM

Originally posted by HeidiJTP View Post

I think there are two problems here:

1) Display of NNNNs instead of sequence
This seems to be related in part to the actual terminal window. I thought it was weird that the NNNs don't appear until exactly after I start scrolling across the terminal. If I resize the terminal before running the 'tview' command, the position where the NNNs begin also changes. It may not be important, as according to mpileup output the NNNs are only occurring in between aligned regions.

I am also facing this problem. Does anybody know what is the work around ?

**Crypticfortune** · 02-14-2012, 04:42 AM

Originally posted by sudeep View Post

I am also facing this problem. Does anybody know what is the work around?

To summarize, I think there's 3 main problems that trip up users:

Forgot to specify the reference on the command line (eg. "samtools tview foo.bam" => "samtool tview foo.bam foo.fa")
Fasta file has different names for sequences. This is painful to fix, but you'll have to either rewrite all the sequence names (e.g. ">chr1" lines in foo.fa) to match the bam file sequence names, or rewrite the sequence references in the SAM/BAM file. The former's probably easier, but definitely the "right" way to go is to use the same fasta files when building the alignment to begin with
Corrupt fasta files? I can't confirm this, but I suspect samtools might choke on reading FASTA files with dos/windows CR/LF linebreak codes (shows up as ^M in unix terminals a lot). This would explain HeidiJTP and naluru's 80 character problem (as 80 characters per line is common). You can normalize your dos/windows ASCII files to unix with the dos2unix command (e.g. dos2unix foo.fa).

Also, it may not be what you're looking for if you care about the reference outside of mapped areas, but as an alternative, Samscope infers and displays the reference from BAM data alone (MD + CIGAR tags) without relying on FASTA reference files.

**kristina.gagalova** · 05-02-2016, 12:37 AM

I have solved this issue with renaming the fai (fasta index file). I had FileName.fa.fai and rename it FileName.fai. I think the program expects it like that.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 37 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 41 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News