SEQanswers

09-12-2012, 09:21 AM   #1
shyam_la
Member
 
Location: California

Join Date: Mar 2012
Posts: 97
Question of retrieving nucleotides from a list of genomic coordinates

Let's say I have an Excel file with one column holding the chromosome number and the next holding genomic coordinates, running to several thousand rows. Is there an online or offline tool into which I can input this information and get as output the nucleotides at these loci in hg19?

For example:
Input
1 23354
2 345344
3 43543553

Output
1 23354 T
2 345344 C
3 43543553 A
09-12-2012, 10:01 AM   #2
dariober
Senior Member
 
Location: Cambridge, UK

Join Date: May 2010
Posts: 311
If you can reformat your Excel file to BED format (even within Excel) and save it as plain text (say, as mypositions.bed), then you can use bedtools along these lines (assuming you already have the FASTA file for hg19):

Code:
bedtools getfasta -fi hg19.fa -bed mypositions.bed -tab
Dario
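
The reformatting step above could be sketched roughly as follows. This is only an illustration under some assumptions that are not stated in the thread: the coordinates are 1-based (as shown in genome browsers), the spreadsheet has been exported as whitespace- or tab-separated text, and the hg19 FASTA names its sequences "chr1", "chr2", and so on. BED uses a 0-based start and an exclusive end, so a single position p becomes the interval (p-1, p). The function name and file names are hypothetical.

```python
import sys


def positions_to_bed(lines, chrom_prefix="chr"):
    """Convert 'chrom position' lines (1-based) into single-base BED lines."""
    bed_lines = []
    for line in lines:
        fields = line.split()
        # Skip blank lines, header rows, and anything without a numeric position.
        if len(fields) < 2 or not fields[1].isdigit():
            continue
        chrom, pos = fields[0], int(fields[1])
        # Assumption: the FASTA uses UCSC-style names such as "chr1".
        if not chrom.startswith(chrom_prefix):
            chrom = chrom_prefix + chrom
        # BED start is 0-based, end is exclusive: position p -> [p-1, p).
        bed_lines.append(f"{chrom}\t{pos - 1}\t{pos}")
    return bed_lines


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python positions_to_bed.py positions.txt > mypositions.bed
    with open(sys.argv[1]) as handle:
        for bed_line in positions_to_bed(handle):
            print(bed_line)
```

If the FASTA file uses bare chromosome names ("1", "2", ...), passing chrom_prefix="" leaves the names unchanged. The resulting file can then be given to the bedtools command above as mypositions.bed.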
09-12-2012, 12:53 PM   #3
shyam_la
Member
 
Location: California

Join Date: Mar 2012
Posts: 97
Thank you, I will try that out. Is it possible to do a similar thing with an aligned, sorted BAM file?
09-12-2012, 01:49 PM   #4
jparsons
Member
 
Location: SF Bay Area

Join Date: Feb 2012
Posts: 62
It's trivial to convert a sorted BAM file into a BED file.
Look at the bedtools documentation (bamtobed, in particular).
09-12-2012, 02:21 PM   #5
shyam_la
Member
 
Location: California

Join Date: Mar 2012
Posts: 97
Quote:
Originally Posted by jparsons View Post
It's trivial to convert a sorted BAM file into a bed file.
Look at bedtools documentation. (bamtobed, in particular)
No, I meant using a BAM file in place of the FASTA file.
09-13-2012, 07:12 AM   #6
westerman
Rick Westerman
 
Location: Purdue University, Indiana, USA

Join Date: Jun 2008
Posts: 1,104
A BAM file contains overlapping reads that may or may not agree at any particular base, rather than a single sequence, so the answer to your question is not straightforward. First you'll need to generate a consensus sequence via 'samtools', 'bcftools' and 'vcfutils' (see http://samtools.sourceforge.net/mpileup.shtml). Once you have that, you can pull out the bases.

There may be easier ways but that is how I would do it.
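
As a rough illustration of why overlapping reads complicate things, here is a minimal sketch that takes a simple majority vote over the read bases at each position. It is not the samtools/bcftools consensus model described on the linked page, just a simpler stand-in. It assumes the input is the text pileup printed by something like `samtools mpileup -f hg19.fa -l mypositions.bed aln.bam` (file names are placeholders), where column 3 is the reference base and column 5 holds the read bases. The function names are hypothetical.

```python
import re
from collections import Counter

_DIGITS = re.compile(r"\d+")


def count_bases(ref_base, read_bases):
    """Count base calls in the read-bases column of one pileup line."""
    counts = Counter()
    i = 0
    while i < len(read_bases):
        c = read_bases[i]
        if c == "^":
            i += 2  # '^' starts a read and is followed by one mapping-quality char
        elif c in "+-":
            # Indel: sign, length digits, then that many inserted/deleted bases.
            m = _DIGITS.match(read_bases, i + 1)
            i += 1 + (len(m.group()) + int(m.group()) if m else 0)
        elif c in ".,":
            counts[ref_base.upper()] += 1  # match to reference (either strand)
            i += 1
        elif c.upper() in "ACGTN":
            counts[c.upper()] += 1  # mismatch (either strand)
            i += 1
        else:
            i += 1  # '$' (read end), '*' (deleted base), and similar markers
    return counts


def majority_base(pileup_line):
    """Return (chrom, position, most frequent base) for one pileup line."""
    chrom, pos, ref, _depth, bases = pileup_line.rstrip("\n").split("\t")[:5]
    counts = count_bases(ref, bases)
    # Report 'N' when no read covers the position.
    base = counts.most_common(1)[0][0] if counts else "N"
    return chrom, int(pos), base
```

A majority vote ignores base and mapping qualities and cannot represent heterozygous sites, so it is best treated as a quick check and not as a replacement for proper variant calling.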