  • phylip to fasta

    Hi,

    Can anyone provide a script in Python or Perl that converts interleaved PHYLIP to FASTA? I have already found some scripts, but they all seem to work with sequential PHYLIP and not with interleaved.

    Thanks a lot!!

  • #2
    What a coincidence, I have a script which does just that in the BBMap package!

    Code:
    phylip2fasta.sh in=file.phylip out=file.fasta

    • #3
      Thanks so much!

      I see that there are other useful scripts in BBMap as well.

      • #4
        Originally posted by Brian Bushnell
        What a coincidence, I have a script which does just that in the BBMap package!

        Code:
        phylip2fasta.sh in=file.phylip out=file.fasta
        Sorry Brian, but it seems that your script does not work well with my files. Can you tell me why?

        For example:
        file.phy
        3 120
        seq1 ATGTGTTCAGCTCTAGTCGACAAGTATTTCAACTACCAGGAACCTCCGAATCGACAACTT
        seq2 ATGTGTTCGGCTCTTGTAGAGAGACATTTCAGCTTCATAGCGTCTTCG------------
        seq3 ATGTGTTCAGCTCTAGTCGACAAATATTTTAACTACCAGGAACCTCCTGATCGACAACAC

        CAACTTTCGTTTGAAGACTCCAAAATGGCCTACGACAACGGGTCGGACAGCCAGGGCTCG
        ------AGGTTTGAAGACTCGAGAATGGCTTACGACAACGGGTCGGACAGCCAGGGTTCG
        CAGCTTCAATTTGAGGCCACCAAAATGGCCTACGACAACGGCTCAGATAGCCAAGGCTCC

        file.fas
        >seq1
        ATGTGTTCAGCTCTAGTCGACAAGTATTTCAACTACCAGGAACCTCCGAATCGACAACTT
        >seq2
        ATGTGTTCGGCTCTTGTAGAGAGACATTTCAGCTTCATAGCGTCTTCG
        >seq3
        ATGTGTTCAGCTCTAGTCGACAAATATTTTAACTACCAGGAACCTCCTGATCGACAACAC
        >CAACTTTCGTTTGAAGACTCCAAAATGGCCTACGACAACGGGTCGGACAGCCAGGGCTCG

        >------AGGTTTGAAGACTCGAGAATGGCTTACGACAACGGGTCGGACAGCCAGGGTTCG

        >CAGCTTCAATTTGAGGCCACCAAAATGGCCTACGACAACGGCTCAGATAGCCAAGGCTCC
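
For reference, the output above shows the second interleaved block being written as three extra records instead of being appended to seq1, seq2 and seq3. Below is a minimal Python sketch of the conversion. It is not the BBMap or BioPerl implementation; the function name `phylip_interleaved_to_fasta` and the `strict_names` parameter are made up for this illustration. It assumes the first non-blank line holds the sequence count and alignment length, that names appear only in the first block, and that blocks are separated by blank lines. By default it assumes "relaxed" names (a name without spaces followed by whitespace); `strict_names=True` assumes the classic fixed 10-character name field instead. Gap characters are kept, since the input is an alignment.

```python
def phylip_interleaved_to_fasta(text, strict_names=False):
    """Convert interleaved PHYLIP text to FASTA text (illustrative sketch)."""
    lines = text.splitlines()

    # Find the header line: "<number of sequences> <alignment length>".
    start = 0
    while start < len(lines) and not lines[start].strip():
        start += 1
    if start == len(lines):
        raise ValueError("empty PHYLIP input")
    header = lines[start].split()
    if len(header) < 2:
        raise ValueError("header must contain two integers")
    n_seqs, n_sites = int(header[0]), int(header[1])
    if n_seqs < 1:
        raise ValueError("header reports no sequences")

    # Drop blank separator lines; what remains is n_seqs lines per block.
    body = [ln for ln in lines[start + 1:] if ln.strip()]
    if not body or len(body) % n_seqs != 0:
        raise ValueError("line count is not a multiple of the sequence count")

    names = []
    chunks = [[] for _ in range(n_seqs)]
    for i, line in enumerate(body):
        if i < n_seqs:
            # First block: each line starts with the sequence name.
            if strict_names:
                name, residues = line[:10].strip(), line[10:]
            else:
                parts = line.split(None, 1)
                name = parts[0]
                residues = parts[1] if len(parts) > 1 else ""
            names.append(name)
        else:
            # Later blocks: residues only, in the same sequence order.
            residues = line
        # Remove any internal whitespace (some files group residues in tens).
        chunks[i % n_seqs].append("".join(residues.split()))

    records = []
    for name, parts in zip(names, chunks):
        seq = "".join(parts)
        if len(seq) != n_sites:
            raise ValueError(
                f"{name}: expected {n_sites} sites, found {len(seq)}"
            )
        records.append(f">{name}\n{seq}\n")
    return "".join(records)
```

To convert a file, read it with `open("file.phy").read()`, pass the text to the function, and write the returned string to the output file. Each sequence is written on a single line, and the length check against the header catches files whose blocks were mis-assembled.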

        • #5
          I only had a chance to try it on one file, and it worked fine in that case, but that's not a very robust test... can you send me the file you're using?

          What you have posted might be enough to replicate the problem, but to ensure it works correctly in entirety, the full file is probably essential.

          • #6
            If you haven't already found a solution, I've got a simple script that can interconvert between any of the formats supported in BioPerl's Bio::AlignIO, which you'd need to have installed. For Phylip to Fasta:

            Code:
            convertAlignment.pl -if phylip -of fasta < file.phy > file.fa
            It's available in my bioinfo/scripts repository.

            • #7
              Originally posted by Brian Bushnell
              I only had a chance to try it on one file, and it worked fine in that case, but that's not a very robust test... can you send me the file you're using?

              What you have posted might be enough to replicate the problem, but to ensure it works correctly in entirety, the full file is probably essential.

              Sorry for the late reply.
              You can find my files attached. I changed the FASTA and PHYLIP file extensions to .txt because it is impossible to attach files with those extensions.