**pbluescript** · 02-09-2012, 05:13 AM

You should check out bedtools.

http://code.google.com/p/bedtools/

**shawpa** · 02-09-2012, 05:16 AM

I was looking into bedtools but I don't know which command to use.

**dpryan** · 02-09-2012, 06:38 AM

The VCF file is a text file with that information in it (largely separated into columns, in fact). You can just open it in your favorite text viewer or editor (MS Word, Notepad, gvim, Preview.app). Frankly, the easiest way to split the calls by chromosome is probably just to use grep (assuming you're using Linux or a Mac). So "grep chrX file.vcf > chrX.vcf" for each chromosome. I'm sure someone can think of a short one-line command with awk and sed to avoid having to grep for each chromosome, but you probably have a small number of chromosomes, so this won't be very labor-intensive. Note that the result of the grep command won't really be a valid VCF file, since you'll miss much of the header, but that won't matter if you just want to go through it by hand.
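For reference, here is a rough sketch of the kind of awk one-liner mentioned above. It assumes a tab-separated VCF whose header lines all start with "#", and it copies that header into every per-chromosome file so the results stay closer to valid VCF. The scratch directory, the sample records, and the variable names (`outdir`, `vcf`) are made up for illustration; point `vcf` at your own file instead.

```shell
# Hypothetical sample input in a scratch directory (illustration only).
outdir=$(mktemp -d)
vcf="$outdir/file.vcf"
{
    printf '##fileformat=VCFv4.1\n'
    printf '#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n'
    printf 'chr1\t100\t.\tA\tG\t50\tPASS\t.\n'
    printf 'chr11\t200\t.\tC\tT\t50\tPASS\t.\n'
    printf 'chrX\t300\t.\tG\tA\t50\tPASS\t.\n'
} > "$vcf"

# Lines starting with '#' are header lines: remember them.
# Every other line goes to <column 1>.vcf, with the header written first.
awk -F'\t' -v outdir="$outdir" '
    /^#/ { header = header $0 "\n"; next }
    {
        out = outdir "/" $1 ".vcf"
        if (!($1 in seen)) { seen[$1] = 1; printf "%s", header > out }
        print > out
    }
' "$vcf"

# List the per-chromosome files that were written.
ls "$outdir"
```

Because the output file is chosen by comparing the whole first column, chr1 and chr11 end up in separate files. awk keeps one output file open per chromosome, which should be fine for a typical genome but could hit the open-file limit on assemblies with thousands of contigs.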

**shawpa** · 02-09-2012, 07:06 AM

Originally posted by dpryan:

> The VCF file is a text file with that information in it (largely separated into columns, in fact). You can just open it in your favorite text viewer or editor (MS Word, Notepad, gvim, Preview.app). Frankly, the easiest way to split the calls by chromosome is probably just to use grep (assuming you're using Linux or a Mac). So "grep chrX file.vcf > chrX.vcf" for each chromosome. I'm sure someone can think of a short one-line command with awk and sed to avoid having to grep for each chromosome, but you probably have a small number of chromosomes, so this won't be very labor-intensive. Note that the result of the grep command won't really be a valid VCF file, since you'll miss much of the header, but that won't matter if you just want to go through it by hand.

Thanks! I get that this is probably a stupid question, but if I type grep chr1, it pulls chr1, 11, 12, 13 etc. What am I doing wrong?

**pbluescript** · 02-09-2012, 07:25 AM

Originally posted by shawpa:

> Thanks! I get that this is probably a stupid question, but if I type grep chr1, it pulls chr1, 11, 12, 13 etc. What am I doing wrong?

Use grep -w chr1

**dpryan** · 02-09-2012, 07:27 AM

Originally posted by shawpa:

> Thanks! I get that this is probably a stupid question, but if I type grep chr1, it pulls chr1, 11, 12, 13 etc. What am I doing wrong?

Nothing, I should have foreseen that! Mea culpa. There's a "word boundary" switch you can give to grep (at least on my computer). So try "grep -w chrX file.vcf > chrX.vcf". That should work better!
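To see the difference on a toy file, here is a small sketch comparing plain grep, grep -w, and an exact match on the first column. The sample records, the contig header line, and the variable names (`tmp`, `sample`, `plain`, `word`, `exact`) are invented for illustration, and the awk comparison is an alternative not suggested in the thread itself.

```shell
# Hypothetical sample VCF in a scratch directory (illustration only).
tmp=$(mktemp -d)
sample="$tmp/file.vcf"
{
    printf '##fileformat=VCFv4.1\n'
    printf '##contig=<ID=chr1,length=1000>\n'
    printf '#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n'
    printf 'chr1\t100\t.\tA\tG\t50\tPASS\t.\n'
    printf 'chr11\t200\t.\tC\tT\t50\tPASS\t.\n'
    printf 'chr12\t300\t.\tG\tA\t50\tPASS\t.\n'
} > "$sample"

# Plain grep matches "chr1" anywhere, so chr11 and chr12 lines are counted too.
plain=$(grep -c chr1 "$sample")

# -w only matches "chr1" as a whole word, anywhere on the line.
word=$(grep -c -w chr1 "$sample")

# Exact comparison against column 1 only (tab-separated).
exact=$(awk -F'\t' '$1 == "chr1" { n++ } END { print n + 0 }' "$sample")

echo "grep chr1: $plain   grep -w chr1: $word   column 1 == chr1: $exact"
```

On this sample the counts come out as 4, 2 and 1. grep -w drops chr11 and chr12 but still picks up the "##contig" header line, because "chr1" appears there as a whole word. That is harmless when reading through the file by hand, but comparing the first column is the stricter test if only the records for that chromosome are wanted.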

getting meaningful allele out of vcf file
