  • Convert .fna file from NCBI to .fa or .fasta file

    Hello,
    I am totally new to this (I am a student following a course in bio-informatics) and I wanted to use a genome found on NCBI (in .fna or genbank format with .gff annotation) as a reference genome in STAR as an exercise but I cannot find a way to convert the .fna file so the genome can be read by STAR in --genomeFastaFiles. It is a genome not found on normal genome database sites (UCSC e.g.) since it is from a copepod and not much genomic work is done on copepods...
    Is this even possible to use such a genome as a reference genome or is this a bad idea from the start?
    Thank you in advance,
    kind regards,
    Josefien

  • #2
    As far as I know, .fna just means fasta nucleic acid (as opposed to .faa, fasta amino acid, for protein sequences), so the file is actually in fasta format.


    • #3
      The problem is that STAR does not recognize this FASTA file (.fna): I get an error saying it is impossible to read the file. That is why I wondered whether I could convert it from .fna to .fa. Or do you think STAR can read .fna files and the problem is with the file itself?


      • #4
        Just rename the .fna extension to .fa (as long as the file is in fasta format). That should work.

        Code:
        $ cp file.fna file.fa
        If you are not sure about the format of the file, post the output of this command:
        Code:
        $ head -10 file.fna
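
The same check can be done without reading the output by eye. The sketch below is only a rough test, under the assumption that a FASTA file's first non-blank line is a header starting with `>`; the function name `looks_like_fasta` is made up for illustration, and it does not validate the sequence lines themselves.

```shell
# looks_like_fasta FILE   (hypothetical helper, not part of STAR or any toolkit)
# Returns 0 if the first non-blank line of FILE starts with '>', otherwise non-zero.
# This is a quick sanity check only; it does not inspect the sequence lines.
looks_like_fasta() {
    awk 'NF { found = 1; ok = ($0 ~ /^>/); exit }
         END { exit !(found && ok) }' "$1"
}
```

For example, `looks_like_fasta file.fna && cp file.fna file.fa` would make the copy only when the header check passes.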


        • #5
          Originally posted by GenoMax
          Just rename the .fna extension to .fa (as long as the file is in fasta format). That should work.

          Code:
          $ cp file.fna file.fa
          If you are not sure about the format of the file, post the output of this command:
          Code:
          $ head -10 file.fna

          Thank you very much, it is working now!


          • #6
            Could you please explain whether changing the file extension of the reference genome will affect the mapping results for RNA-seq with HISAT2?


            • #7
              No, the results will not be affected, since we are not changing the sequence/content of any data files. We are only renaming the file.


              • #8
                Will the content be the same in both files?


                • #9
                  Yes. As long as you only change the file name.
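
If you want to confirm this yourself, one option is a byte-for-byte comparison of the two files. The sketch below wraps the standard `cmp` utility; the function name `same_content` is an assumption made for illustration, and it only applies when both the original and the copy are still on disk (that is, after `cp`, not after a rename).

```shell
# same_content FILE_A FILE_B   (hypothetical helper name)
# Returns 0 when the two files are byte-for-byte identical, non-zero otherwise.
# cmp -s compares silently and reports the result through its exit status.
same_content() {
    cmp -s "$1" "$2"
}
```

For example, `same_content file.fna file.fa && echo "identical"` prints the message only when the copy matches the original.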


                  • #10
                    Thank you for your help.
