Seqanswers Leaderboard Ad

**keebs42** · 06-25-2009, 09:56 AM

Going to try setting the -f flag when running samtools pileup..

**JackyH** · 06-30-2009, 04:36 AM

Hi,

have you managed to solve this problem at all? I am seeing exactly the same and can't quite figure out where I've gone wrong.

Thanks,

Jacky

**keebs42** · 06-30-2009, 05:48 AM

I ended up finding downloading the NCBI human reference build 36 in fasta format (split by chromosome), and then using the -f flag when creating the pileup. The reference column seems to be correct after doing this.

samtools pileup -f ref.fasta alignment.bam > alignment.pu

Hope that helps!
Jonathan

**JackyH** · 06-30-2009, 06:14 AM

Mh, that's curious. I have tried that with RefSeq as a reference but I still don't see the reference base. Maybe it's an issue with the format of the Fasta header (contains a colon in my case).

Thanks for your fast reply anyhow!
Jacky

**RAS** · 07-13-2010, 12:22 PM

Still can't get correct reference sequence column

Hi,

I still can't get a pileup file where the reference sequence shows bases instead of "N"s. I'd like to create pileup files of sequences from the 1,000 genomes project aligned with NCBI human reference sequences. I am using the -f flag--indicating that the reference sequence is in FASTA format--and also need to use the -c flag--indicating that the pileup file should have the consensus sequence for the original .bam file. In the main samtools-0.1.7a folder of a GNU/Linux computer, I've typed many variants of the following:

./samtools pileup -cf /ifs/scratch/.../humanReferenceGenome/UCSCBuild36/chrX.fa /ifs/scratch/.../fatherAlignment/NA12891.chromX.ILLUMINA.bwa.CEU.high_coverage.20100517.bam > NA12891.ChrX.UCSC36.pileup

The resulting pileup file contains the NA12891 consensus sequence. I've tried using a number of reference sequences, including builds 36.1, 36.2, 36.3 and 37 of the NCBI reference genome and build 36 of the UCSC reference genome, in the hope that one of these reference sequences would also appear in the pileup file. I would very much appreciate any suggestions.

Thanks,
Rebecca

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

What is the reference sequence? ( can I find it in a .bai index?)

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News