Seqanswers Leaderboard Ad

**drio** · 12-12-2010, 01:39 PM

Try passing the actual fasta file instead of the of the index (fai) in the last command (samtools pileup).

**mindlessbrain** · 12-13-2010, 12:17 AM

Tried this:

Code:

samtools pileup -vcf [B]/home/jill/bwa-0.5.8c/Project/yeast.nt[/B] /home/jill/bwa-0.5.8c/Project/yeastbwaoutsort.bam | tee /home/jill/bwa-0.5.8c/Project/raw.txt | /home/jill/samtools-0.1.11/misc/samtools.pl varFilter -D100 > /home/jill/bwa-0.5.8c/Project/flt.txt
awk '($3=="*"&&$6>=50)||($3!="*"&&$6>=20)' /home/jill/bwa-0.5.8c/Project/flt.txt > /home/jill/bwa-0.5.8c/Project/final.txt

Without the .fai . Still got empty pileup files.

**mindlessbrain** · 12-13-2010, 04:40 AM

I've been trying to do the same thing with MAQ, that is get SNPS, and I'm getting an error there too. I'm not sure if the two are related or not though.

Code:

maq fasta2bfa /home/jill/maq/Project/yeast.nt /home/jill/maq/Project/yeast.nt.bfa
maq fasta2bfa /home/jill/maq/Project/yeast.fasta /home/jill/maq/Project/yeast.fasta.bfa
#-- 1 sequences have been converted.
maq match /home/jill/maq/Project/yeast.fasta.map /home/jill/maq/Project/yeast.nt.bfa /home/jill/maq/Project/yeast.fasta.bfa

#[ma_load_reads] loading reads...
#[ma_load_reads] set length of the first read as 4624380.
#[ma_load_reads] 1*2 reads loaded.
#[ma_longread2read] encoding reads... 2 sequences processed.
#[ma_match] set the minimum insert size as 4624381.
#[match_core] Total length of the reference: 12155026
#[match_core] round 1/3...
#[match_core] making index...
#[match_index_sorted] no reasonable reads are available. Exit!

ETA: I also tried the bam/sam script I have above with completely different files. Still, nothing.

**swbarnes2** · 12-13-2010, 09:54 AM

Why only one read processed?

**mindlessbrain** · 12-13-2010, 11:04 PM

Originally posted by swbarnes2 View Post

Why only one read processed?

No idea. I was hoping someone could tell me. This is my first time using bwa/sam tools.

**drio** · 12-14-2010, 06:00 AM

Originally posted by mindlessbrain View Post

Tried this:

Code:

samtools pileup -vcf [B]/home/jill/bwa-0.5.8c/Project/yeast.nt[/B] /home/jill/bwa-0.5.8c/Project/yeastbwaoutsort.bam | tee /home/jill/bwa-0.5.8c/Project/raw.txt | /home/jill/samtools-0.1.11/misc/samtools.pl varFilter -D100 > /home/jill/bwa-0.5.8c/Project/flt.txt
awk '($3=="*"&&$6>=50)||($3!="*"&&$6>=20)' /home/jill/bwa-0.5.8c/Project/flt.txt > /home/jill/bwa-0.5.8c/Project/final.txt

Without the .fai . Still got empty pileup files.

Try with /home/jill/maq/Project/yeast.fasta, instead of yeast.nt

**swbarnes2** · 12-14-2010, 10:25 AM

Well, this obviously won't work if you only have one read processed. Does your .sam file only have one non-header line in it? Something must be wrong with your fastq.

**mindlessbrain** · 12-16-2010, 01:35 AM

I got it working! Somewhere along the line my fastq had been corrupted. So the commands I had above were fine, with the exception of swapping the reference for the .fai file.

Now I have an SNP file, from my last command:

Code:

samtools pileup -vcf /home/jill/bwa-0.5.8c/Project/yeast_mrna_genes.fasta /home/jill/bwa-0.5.8c/Project/yeastbwaoutsort.bam.bam | tee /home/jill/bwa-0.5.8c/Project/raw.txt | /home/jill/samtools-0.1.11/misc/samtools.pl varFilter -D100 > /home/jill/bwa-0.5.8c/Project/flt.txt
awk '($3=="*"&&$6>=50)||($3!="*"&&$6>=20)' /home/jill/bwa-0.5.8c/Project/flt.txt > /home/jill/bwa-0.5.8c/Project/final.txt

I've been trying to figure out what each column corresponds to. I know the first is the reference sequence name, then position of SNP, then SNPreference, SNPquery(?), after that though, no idea. The documentation has a slightly different format, except for the one they don't explain. =)

AB016599 1118 G R 215 215 36 81 a,,,a,,,.a,,a,,.,,,,,,,,,.aa,,,A.A.,.A....aA.AAAaAAa.,a,,a,,,,,,aaaaa,,,a,,a,,,.. CCACA0ACCBCC%CCCC%CC>CD%CCC@%CCCCCCCACCC?CCC%CACBBC:CCC%CCCCCCCBC<CCCC%CCC%C%CCAC
EF123147 124 T G 42 42 35 5 ^Fg^:g^Fg^Fg^Fg CC;@C
X84042 4 T A 36 36 25 3 ^:A^:A^:A ACC
X84042 5 C A 36 36 25 3 AAA ACC

**drio** · 12-16-2010, 06:29 AM

Explained here.
Also, since you are on it, try mpileup instead of pileup and compare results.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 29 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Trouble getting SNPS from bwa/samtools

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News