Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • phylip to fasta

    Hi,

    Can anyone provide me a script in python or perl that converts phylip interleaved into fasta? I already found some scripts but it seems that they all work with phylip sequential and not with interleaved.

    Thanks a lot!!

  • #2
    What a coincidence, I have a script which does just that in the BBMap package!

    Code:
    phylip2fasta.sh in=file.phylip out=file.fasta

    Comment


    • #3
      Thanks so much!

      I already saw that there are others useful scripts in bbmap.

      Comment


      • #4
        Originally posted by Brian Bushnell View Post
        What a coincidence, I have a script which does just that in the BBMap package!

        Code:
        phylip2fasta.sh in=file.phylip out=file.fasta
        Sorry Brian, but seems that your script does not work well with my files. Can you tell me why?

        For example:
        file.phy
        3 120
        seq1 ATGTGTTCAGCTCTAGTCGACAAGTATTTCAACTACCAGGAACCTCCGAATCGACAACTT
        seq2 ATGTGTTCGGCTCTTGTAGAGAGACATTTCAGCTTCATAGCGTCTTCG------------
        seq3 ATGTGTTCAGCTCTAGTCGACAAATATTTTAACTACCAGGAACCTCCTGATCGACAACAC

        CAACTTTCGTTTGAAGACTCCAAAATGGCCTACGACAACGGGTCGGACAGCCAGGGCTCG
        ------AGGTTTGAAGACTCGAGAATGGCTTACGACAACGGGTCGGACAGCCAGGGTTCG
        CAGCTTCAATTTGAGGCCACCAAAATGGCCTACGACAACGGCTCAGATAGCCAAGGCTCC

        file.fas
        >seq1
        ATGTGTTCAGCTCTAGTCGACAAGTATTTCAACTACCAGGAACCTCCGAATCGACAACTT
        >seq2
        ATGTGTTCGGCTCTTGTAGAGAGACATTTCAGCTTCATAGCGTCTTCG
        >seq3
        ATGTGTTCAGCTCTAGTCGACAAATATTTTAACTACCAGGAACCTCCTGATCGACAACAC
        >CAACTTTCGTTTGAAGACTCCAAAATGGCCTACGACAACGGGTCGGACAGCCAGGGCTCG

        >------AGGTTTGAAGACTCGAGAATGGCTTACGACAACGGGTCGGACAGCCAGGGTTCG

        >CAGCTTCAATTTGAGGCCACCAAAATGGCCTACGACAACGGCTCAGATAGCCAAGGCTCC

        Comment


        • #5
          I only had a chance to try it on one file, and it worked fine in that case, but that's not a very robust test... can you send me the file you're using?

          What you have posted might be enough to replicate the problem, but to ensure it works correctly in entirety, the full file is probably essential.

          Comment


          • #6
            If you haven't already found a solution, I've got a simple script that can interconvert between any of the formats supported in BioPerl's Bio::AlignIO, which you'd need to have installed. For Phylip to Fasta:

            Code:
            convertAlignment.pl -if phylip -of fasta < file.phy > file.fa
            It's available in my bioinfo/scripts repository.

            Comment


            • #7
              Originally posted by Brian Bushnell View Post
              I only had a chance to try it on one file, and it worked fine in that case, but that's not a very robust test... can you send me the file you're using?

              What you have posted might be enough to replicate the problem, but to ensure it works correctly in entirety, the full file is probably essential.

              Sorry for the late reply.
              You can find my files attached, but I changed the fasta and phylip to .txt because is impossible to attach files with that extensions.
              Attached Files

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              51 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              45 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X