Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ewhitt
    Junior Member
    • Nov 2015
    • 2

    extracting specific fasta sequences from whole genome file?

    Sorry if this seems like a naive and simple question. I have the whole genome of my species as a fasta file and a file containing a list of protein IDs. What is the easiest way to get a file that contains the fasta sequence for each of the proteins in the second file?
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    What species?

    If it is one of the common genomes NCBI is likely to have the protein fasta file available.

    Comment

    • ewhitt
      Junior Member
      • Nov 2015
      • 2

      #3
      I can get the protein fasta file. It's more the extracting the sequence for my specific list of protein IDs that I'm struggling with.
      It is the monarch butterfly so not a particularly common species.

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Look into faSomeRecords utility from Jim Kent: http://hgdownload.soe.ucsc.edu/admin.../faSomeRecords I have provided linux link but OS X/Source code is also available at the site.

        Provide the list of ID's to faSomeRecords and it will extract the fasta sequence for your ID's, which you can save to a new file.

        Comment

        Latest Articles

        Collapse

        • GATTACAT
          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by GATTACAT
          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
          07-01-2026, 11:43 AM
        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 07-02-2026, 11:08 AM
        0 responses
        17 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-30-2026, 05:37 AM
        0 responses
        18 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        21 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        54 views
        0 reactions
        Last Post SEQadmin2  
        Working...