Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Convert .fna file from NCBI to .fa or .fasta file

    Hello,
    I am totally new to this (I am a student following a course in bio-informatics) and I wanted to use a genome found on NCBI (in .fna or genbank format with .gff annotation) as a reference genome in STAR as an exercise but I cannot find a way to convert the .fna file so the genome can be read by STAR in --genomeFastaFiles. It is a genome not found on normal genome database sites (UCSC e.g.) since it is from a copepod and not much genomic work is done on copepods...
    Is this even possible to use such a genome as a reference genome or is this a bad idea from the start?
    Thank you in advance,
    kind regards,
    Josefien

  • #2
    As far as I know, .fna just means fasta nucleic acid (as opposed to .faa, fasta amino acid, for protein sequences), so the file is actually in fasta format.

    Comment


    • #3
      The problem is; STAR is not recognizing this fasta format (.fna), I am getting an error that is impossible to read this fasta file, that why I wondered if it was not possible to convert from .fna to .fa. Or do you think it is a problem with the file itself and STAR is able to read/load .fna files?

      Comment


      • #4
        Just rename the .fna extension to .fa (as long as the file is in fasta format). That should work.

        Code:
        $ cp file.fna file.fa
        If you are not sure about the format of the file post the output of this command
        Code:
        $ head -10 file.fna

        Comment


        • #5
          Originally posted by GenoMax View Post
          Just rename the .fna extension to .fa (as long as the file is in fasta format). That should work.

          Code:
          $ cp file.fna file.fa
          If you are not sure about the format of the file post the output of this command
          Code:
          $ head -10 file.fna
          thank you very much it is working now !

          Comment


          • #6
            could you please explain that changing the file extension won't effect the results for which we are mapping with a reference genome in case of RNA-seq for HISAT2 software?

            Comment


            • #7
              No the results will not be affected since we are not changing sequence/content of any data files. We are only renaming the file.

              Comment


              • #8
                will the content is same in both files?

                Comment


                • #9
                  Yes. As long as you only change the file name.

                  Comment


                  • #10
                    Thank you for your help

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Recent Advances in Sequencing Analysis Tools
                      by seqadmin


                      The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                      Today, 07:48 AM
                    • seqadmin
                      Essential Discoveries and Tools in Epitranscriptomics
                      by seqadmin




                      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                      04-22-2024, 07:01 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Today, 07:17 AM
                    0 responses
                    11 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 05-02-2024, 08:06 AM
                    0 responses
                    19 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-30-2024, 12:17 PM
                    0 responses
                    20 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-29-2024, 10:49 AM
                    0 responses
                    28 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X