![]() |
Question of retrieving nucleotides from a list of genomic coordinates..
Lets say I have an excel file, one column with chr number and the next with genomic coordinates, running into several thousands in numbers. Is there some online / offline tool into which I can input this information and get as output the nucleotides at these loci in hg19?
For eg. Input 1 23354 2 345344 3 43543553 Output 1 23354 T 2 345344 C 3 43543553 A |
Quote:
Code:
bedtools getfasta -fi hg19.fa -bed mypositions.bed -tab |
Thank you. Will try that out.. Is it possible to do a similar thing with an aligned sorted BAM file?
|
It's trivial to convert a sorted BAM file into a bed file.
Look at bedtools documentation. (bamtobed, in particular) |
Quote:
|
Since a BAM should contain overlapping reads that may or may not agree at any particular base instead of a single sequence then the answer to your question is not straight-forward. First you'll need to generate a consensus sequence via 'samtools', 'bcftools' and 'vcfutils' ... see: http://samtools.sourceforge.net/mpileup.shtml. Having gotten that then you can pull out the bases.
There may be easier ways but that is how I would do it. |
All times are GMT -8. The time now is 02:25 AM. |
Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.