Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • .fasta and .fa are the same file?

    I'm already google but I'm not sure.
    Since in BWA tool use .fa as a reference ,and in GATK tool use .fasta .
    I already download hs37d5.fa file ,so can I use it to generate .dict and .fai file
    to use in gatk or not?

    I'm new guy in this field, please help me ,and sorry for my bad english

    Thank you,
    Silenus

  • #2
    Files ending with ".fa" and ".fasta" (and others, I've also come across ".fas" or ".fna") are typically FASTA files. You should be able to use that file with both BWA and the GATK.

    Comment


    • #3
      If in any doubt, just open the file in any ASCII text editor and look at it. The FASTA file format is about as simple a text format as you can get, and easily checked:

      Michael Black, Ph.D.
      ScitoVation LLC. RTP, N.C.

      Comment


      • #4
        Originally posted by mbblack View Post
        If in any doubt, just open the file in any ASCII text editor and look at it. The FASTA file format is about as simple a text format as you can get, and easily checked:

        http://blast.ncbi.nlm.nih.gov/blastcgihelp.shtml
        Opening 3+ GB text files (like human genome FASTA file the TO mentioned) with a standard text editor will most certainly use up all the RAM and typically render the computer inresponsive - not a very good idea! If you working in a command line environment, use "less" or something similar to look at the file.

        Comment


        • #5
          Originally posted by sarvidsson View Post
          Files ending with ".fa" and ".fasta" (and others, I've also come across ".fas" or ".fna") are typically FASTA files. You should be able to use that file with both BWA and the GATK.
          Oh I got it ,thank you sir @sarvidsson

          Comment


          • #6
            Originally posted by mbblack View Post
            If in any doubt, just open the file in any ASCII text editor and look at it. The FASTA file format is about as simple a text format as you can get, and easily checked:

            http://blast.ncbi.nlm.nih.gov/blastcgihelp.shtml
            haha , by the way thanks for your comment @mbblack

            Comment


            • #7
              Originally posted by sarvidsson View Post
              Opening 3+ GB text files (like human genome FASTA file the TO mentioned) with a standard text editor will most certainly use up all the RAM and typically render the computer inresponsive - not a very good idea! If you working in a command line environment, use "less" or something similar to look at the file.
              Yes, I wasn't specifically thinking of the size of the specific file in question, but my point was that if you simply look at the first few lines of any text file you'd know whether it was in FASTA format or not. The extension is a meaningless reference, as I have had people send me files named "foo.fa" that were not at all actual FASTA files.
              Michael Black, Ph.D.
              ScitoVation LLC. RTP, N.C.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM
              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:37 PM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Yesterday, 06:07 PM
              0 responses
              9 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2024, 10:03 AM
              0 responses
              49 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-21-2024, 07:32 AM
              0 responses
              67 views
              0 likes
              Last Post seqadmin  
              Working...
              X